Paper
7 June 2023 How certain are tansformers in image classification: uncertainty analysis with Monte Carlo dropout
Md. Farhadul Islam, Sarah Zabeen, Md. Azharul Islam, Fardin Bin Rahman, Anushua Ahmed, Dewan Ziaul Karim, Annajiat Alim Rasel, Meem Arafat Manab
Author Affiliations +
Proceedings Volume 12701, Fifteenth International Conference on Machine Vision (ICMV 2022); 127010K (2023) https://doi.org/10.1117/12.2679442
Event: Fifteenth International Conference on Machine Vision (ICMV 2022), 2022, Rome, Italy
Abstract
Researchers have been inspired to implement transformer models in solving machine vision problems after their tremendous success with natural language tasks. Using the straightforward architecture and swift performance of transformers, a variety of computer vision problems can be solved with more ease and effectiveness. However, a comparative evaluation of their uncertainty in prediction has not been done yet. As we know, real world applications require a measure of uncertainty to produce accurate predictions, which allows researchers to handle uncertain inputs and special cases, in order to successfully prevent overfitting. Our study approaches the unexplored issue of uncertainty estimation among three popular and effective transformer models employed in computer vision, such as Vision Transformers (ViT), Swin Transformers (SWT), and Compact Convolutional Transformers (CCT). We conduct a comparative experiment to determine which particular architecture is the most reliable in image classification. We use dropouts at the inference phase in order to measure the uncertainty of these transformer models. This approach, commonly known as Monte Carlo Dropout (MCD), works well as a low-complexity estimation to compute uncertainty. The MCD-based CCT model is the least uncertain architecture in this classification task. Our proposed MCD-infused CCT model also yields the best results with 78.4% accuracy, while the SWT model with embedded MCD exhibits Pmaximum performance gain where the accuracy increased by almost 3% with the final result being 71.4%.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Md. Farhadul Islam, Sarah Zabeen, Md. Azharul Islam, Fardin Bin Rahman, Anushua Ahmed, Dewan Ziaul Karim, Annajiat Alim Rasel, and Meem Arafat Manab "How certain are tansformers in image classification: uncertainty analysis with Monte Carlo dropout", Proc. SPIE 12701, Fifteenth International Conference on Machine Vision (ICMV 2022), 127010K (7 June 2023); https://doi.org/10.1117/12.2679442
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Transformers

Monte Carlo methods

Performance modeling

Statistical modeling

Image classification

Artificial neural networks

Deep learning

Back to Top