It Takes Two Flints to Make a Fire: Multitask Learning of Neural Relation and Explanation Classifiers
Affiliation
Department of Computer Science, University of ArizonaIssue Date
2023-03-01
Metadata
Show full item recordPublisher
MIT Press JournalsCitation
Zheng Tang, Mihai Surdeanu; It Takes Two Flints to Make a Fire: Multitask Learning of Neural Relation and Explanation Classifiers. Computational Linguistics 2023; 49 (1): 117–156. doi: https://doi.org/10.1162/coli_a_00463Journal
Computational LinguisticsRights
© 2022 Association for Computational Linguistics. Published under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International license.Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.Abstract
We propose an explainable approach for relation extraction that mitigates the tension between generalization and explainability by jointly training for the two goals. Our approach uses a multi-task learning architecture, which jointly trains a classifier for relation extraction, and a sequence model that labels words in the context of the relations that explain the decisions of the relation classifier. We also convert the model outputs to rules to bring global explanations to this approach. This sequence model is trained using a hybrid strategy: supervised, when supervision from pre-existing patterns is available, and semi-supervised otherwise. In the latter situation, we treat the sequence model’s labels as latent variables, and learn the best assignment that maximizes the performance of the relation classifier. We evaluate the proposed approach on the two datasets and show that the sequence model provides labels that serve as accurate explanations for the relation classifier’s decisions, and, importantly, that the joint training generally improves the performance of the relation classifier. We also evaluate the performance of the generated rules and show that the new rules are a great add-on to the manual rules and bring the rule-based system much closer to the neural models. © 2022 Association for Computational Linguistics.Note
Open access journalISSN
0891-2017Version
Final Published Versionae974a485f413a2113503eed53cd6c53
10.1162/coli_a_00463
Scopus Count
Collections
Except where otherwise noted, this item's license is described as © 2022 Association for Computational Linguistics. Published under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International license.