Interpretable and Reliable Oral Cancer Classifier with Attention Mechanism and Expert Knowledge Embedding via Attention Map
Author
Song, B.Zhang, C.
Sunny, S.
KC, D.R.
Li, S.
Gurushanth, K.
Mendonca, P.
Mukhia, N.
Patrick, S.
Gurudath, S.
Raghavan, S.
Tsusennaro, I.
Leivon, S.T.
Kolur, T.
Shetty, V.
Bushan, V.
Ramesh, R.
Pillai, V.
Wilder-Smith, P.
Suresh, A.
Kuriakose, M.A.
Birur, P.
Liang, R.
Affiliation
Wyant College of Optical Sciences, The University of ArizonaComputer Science Department, The University of Arizona
Issue Date
2023-02-23Keywords
attention branch networkattention map
attention mechanism
expert knowledge embedding
human-in-the-loop deep learning
visual explanation
Metadata
Show full item recordPublisher
MDPICitation
Song, B.; Zhang, C.; Sunny, S.; KC, D.R.; Li, S.; Gurushanth, K.; Mendonca, P.; Mukhia, N.; Patrick, S.; Gurudath, S.; et al. Interpretable and Reliable Oral Cancer Classifier with Attention Mechanism and Expert Knowledge Embedding via Attention Map. Cancers 2023, 15, 1421. https://doi.org/10.3390/cancers15051421Journal
CancersRights
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license.Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.Abstract
Convolutional neural networks have demonstrated excellent performance in oral cancer detection and classification. However, the end-to-end learning strategy makes CNNs hard to interpret, and it can be challenging to fully understand the decision-making procedure. Additionally, reliability is also a significant challenge for CNN based approaches. In this study, we proposed a neural network called the attention branch network (ABN), which combines the visual explanation and attention mechanisms to improve the recognition performance and interpret the decision-making simultaneously. We also embedded expert knowledge into the network by having human experts manually edit the attention maps for the attention mechanism. Our experiments have shown that ABN performs better than the original baseline network. By introducing the Squeeze-and-Excitation (SE) blocks to the network, the cross-validation accuracy increased further. Furthermore, we observed that some previously misclassified cases were correctly recognized after updating by manually editing the attention maps. The cross-validation accuracy increased from 0.846 to 0.875 with the ABN (Resnet18 as baseline), 0.877 with SE-ABN, and 0.903 after embedding expert knowledge. The proposed method provides an accurate, interpretable, and reliable oral cancer computer-aided diagnosis system through visual explanation, attention mechanisms, and expert knowledge embedding. © 2023 by the authors.Note
Open access journalISSN
2072-6694Version
Final Published Versionae974a485f413a2113503eed53cd6c53
10.3390/cancers15051421
Scopus Count
Collections
Except where otherwise noted, this item's license is described as © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license.

