Students Who Study Together Learn Better: On the Importance of Collective Knowledge Distillation for Domain Transfer in Fact Verification
| dc.contributor.author | Mithun, M.P. | |
| dc.contributor.author | Suntwal, S. | |
| dc.contributor.author | Surdeanu, M. | |
| dc.date.accessioned | 2022-05-19T23:19:49Z | |
| dc.date.available | 2022-05-19T23:19:49Z | |
| dc.date.issued | 2021 | |
| dc.identifier.citation | Mithun, M. P., Suntwal, S., & Surdeanu, M. (2021, November). Students Who Study Together Learn Better: On the Importance of Collective Knowledge Distillation for Domain Transfer in Fact Verification. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (pp. 6968-6973). | |
| dc.identifier.isbn | 9781955917094 | |
| dc.identifier.doi | 10.18653/v1/2021.emnlp-main.558 | |
| dc.identifier.uri | http://hdl.handle.net/10150/664432 | |
| dc.description.abstract | While neural networks produce state-of-the-art performance in several NLP tasks, they depend heavily on lexicalized information, which transfers poorly between domains. Previous work (Suntwal et al., 2019) proposed delexicalization as a form of knowledge distillation to reduce dependency on such lexical artifacts. However, a critical unsolved issue remains: how much delexicalization should be applied? A little helps reduce over-fitting, but too much discards useful information. We propose Group Learning (GL), a knowledge and model distillation approach for fact verification. In our method, multiple student models have access to different delexicalized data views, but they are encouraged to learn from each other through pair-wise consistency losses. In several cross-domain experiments between the FEVER and FNC fact verification datasets, we show that our approach learns the best delexicalization strategy for the given training dataset and outperforms state-of-the-art classifiers that rely on the original data. © 2021 Association for Computational Linguistics | |
| dc.language.iso | en | |
| dc.publisher | Association for Computational Linguistics (ACL) | |
| dc.rights | Copyright © 2021 Association for Computational Linguistics, licensed on a Creative Commons Attribution 4.0 International License. | |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
| dc.title | Students Who Study Together Learn Better: On the Importance of Collective Knowledge Distillation for Domain Transfer in Fact Verification | |
| dc.type | Proceedings | |
| dc.type | text | |
| dc.contributor.department | University of Arizona | |
| dc.identifier.journal | EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings | |
| dc.description.note | Open access journal | |
| dc.description.collectioninformation | This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu. | |
| dc.eprint.version | Final published version | |
| dc.source.journaltitle | EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings | |
| refterms.dateFOA | 2022-05-19T23:19:49Z | |

