Asymmetric decoder design for efficient convolutional encoder-decoder architectures in medical image reconstruction
Affiliation
Biomedical Engineering/Electrical and Computer Engineering, University of ArizonaIssue Date
2022
Metadata
Show full item recordPublisher
SPIECitation
Rahman, T., Bilgin, A., & Cabrera, S. (2022). Asymmetric decoder design for efficient convolutional encoder-decoder architectures in medical image reconstruction. Progress in Biomedical Optics and Imaging - Proceedings of SPIE, 11952.Rights
Copyright © 2022 SPIE.Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.Abstract
In recent years, significant research has been performed on developing powerful and efficient Convolutional Neural Network (CNN) architectures. To utilize these architectures in pixel-level regression tasks such as tomographic image reconstruction, a feature extraction encoder is often combined with a symmetrical decoder to generate an encoder-decoder structure, such as a U-Net. However, a more powerful decoder focusing on high-frequency features can provide higher pixel-level accuracy. In this work, we investigate the use of asymmetrical encoder-decoder architectures in medical image reconstruction tasks. The state-of-the-art EfficientNet architecture utilizes depthwise convolutions and channel attention within inverted residual bottleneck blocks to generate highly compressed features while maintaining a significant FLOPS efficiency advantage compared to regular convolutional encoders. We develop an asymmetric encoder-decoder architecture, which uses the EfficientNet as an encoder. The proposed decoder architecture combines the multi-resolution features generated by the EfficientNet encoder using an incremental feature expansion strategy, which leads to better preservation of the structural details in reconstructed images. We have tested our asymmetrical encoder-decoder approach on undersampled MRI reconstruction tasks using the Calgary Campinas multi-channel brain MR dataset. Results demonstrate that the proposed asymmetric approach vastly outperforms a symmetric Efficient U-Net, achieving a 3dB improvement in PSNR. SSIM was also improved, and the asymmetric network was found to recover small structural details more effectively. Furthermore, the proposed asymmetric Efficient U-Net provides a four-fold reduction in inference time when compared to the conventional U-Net architecture. © COPYRIGHT SPIE. Downloading of the abstract is permitted for personal use only.Note
Immediate accessISSN
1605-7422ISBN
9781510647756Version
Final published versionae974a485f413a2113503eed53cd6c53
10.1117/12.2610084
