General Benefits of Mono-Lingual Pre-Training in Transformers
dc.contributor.advisor | Bethard, Steven | |
dc.contributor.author | Zhang, Jiacheng | |
dc.creator | Zhang, Jiacheng | |
dc.date.accessioned | 2021-06-22T03:12:07Z | |
dc.date.available | 2021-06-22T03:12:07Z | |
dc.date.issued | 2021 | |
dc.identifier.citation | Zhang, Jiacheng. (2021). General Benefits of Mono-Lingual Pre-Training in Transformers (Master's thesis, University of Arizona, Tucson, USA). | |
dc.identifier.uri | http://hdl.handle.net/10150/660173 | |
dc.description.abstract | Pre-trained transformers are a class of neural networks behind many recent natural language processing systems. Their success is often attributed to linguistic knowledge injected during the pre-training process. In this work, we make multiple attempts to surgically remove language-specific knowledge from BERT. Surprisingly, these interventions often do little damage to BERT's performance on GLUE tasks. By contrasting against non-pre-trained transformers with oracle initialization, we argue that, when it comes to explaining how BERT works, there is a sizable void below linguistic probing and above model initialization. | |
dc.language.iso | en | |
dc.publisher | The University of Arizona. | |
dc.rights | Copyright © is held by the author. Digital access to this material is made possible by the University Libraries, University of Arizona. Further transmission, reproduction, presentation (such as public display or performance) of protected items is prohibited except with permission of the author. | |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | |
dc.subject | BERT | |
dc.subject | pre-training | |
dc.subject | transformers | |
dc.title | General Benefits of Mono-Lingual Pre-Training in Transformers | |
dc.type | text | |
dc.type | Electronic Thesis | |
thesis.degree.grantor | University of Arizona | |
thesis.degree.level | masters | |
dc.contributor.committeemember | Surdeanu, Mihai | |
dc.contributor.committeemember | Barnard, Kobus | |
thesis.degree.discipline | Graduate College | |
thesis.degree.discipline | Computer Science | |
thesis.degree.name | M.S. | |
refterms.dateFOA | 2021-06-22T03:12:08Z |