Show simple item record

dc.contributor.authorLi, Huayu
dc.contributor.authorChen, Xiwen
dc.contributor.authorDitzler, Gregory
dc.contributor.authorRoveda, Janet
dc.contributor.authorLi, Ao
dc.date.accessioned2024-03-13T16:57:17Z
dc.date.available2024-03-13T16:57:17Z
dc.date.issued2024-02-03
dc.identifier.citationLi, H., Chen, X., Ditzler, G., Roveda, J., & Li, A. (2024). Knowledge distillation under ideal joint classifier assumption. Neural Networks, 106160.en_US
dc.identifier.pmid38330746
dc.identifier.doi10.1016/j.neunet.2024.106160
dc.identifier.urihttp://hdl.handle.net/10150/671224
dc.description.abstractKnowledge distillation constitutes a potent methodology for condensing substantial neural networks into more compact and efficient counterparts. Within this context, softmax regression representation learning serves as a widely embraced approach, leveraging a pre-established teacher network to guide the learning process of a diminutive student network. Notably, despite the extensive inquiry into the efficacy of softmax regression representation learning, the intricate underpinnings governing the knowledge transfer mechanism remain inadequately elucidated. This study introduces the ‘Ideal Joint Classifier Knowledge Distillation’ (IJCKD) framework, an overarching paradigm that not only furnishes a lucid and exhaustive comprehension of prevailing knowledge distillation techniques but also establishes a theoretical underpinning for prospective investigations. Employing mathematical methodologies derived from domain adaptation theory, this investigation conducts a comprehensive examination of the error boundary of the student network contingent upon the teacher network. Consequently, our framework facilitates efficient knowledge transference between teacher and student networks, thereby accommodating a diverse spectrum of applications.en_US
dc.language.isoenen_US
dc.publisherElsevier Ltden_US
dc.rights© 2024 Elsevier Ltd. All rights reserved.en_US
dc.rights.urihttps://rightsstatements.org/vocab/InC/1.0/en_US
dc.subjectdeep learningen_US
dc.subjectknowledge distillationen_US
dc.subjectTeacher-student learningen_US
dc.titleKnowledge distillation under ideal joint classifier assumptionen_US
dc.typeArticleen_US
dc.identifier.eissn1879-2782
dc.contributor.departmentDepartment of Electrical & Computer Engineering at the University of Arizonaen_US
dc.contributor.departmentDepartment of Biomedical Engineering, The University of Arizonaen_US
dc.contributor.departmentBIO5 Institute, The University of Arizonaen_US
dc.identifier.journalNeural networks : the official journal of the International Neural Network Societyen_US
dc.description.note24 month embargo; first published 03 February 2024en_US
dc.description.collectioninformationThis item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.en_US
dc.eprint.versionFinal accepted manuscripten_US
dc.source.journaltitleNeural networks : the official journal of the International Neural Network Society
dc.source.volume173
dc.source.beginpage106160
dc.source.endpage
dc.source.countryUnited States


Files in this item

Thumbnail
Name:
softmax_regression_representat ...
Size:
2.431Mb
Format:
PDF
Description:
Final Accepted Manuscript

This item appears in the following Collection(s)

Show simple item record