Paper
28 February 2024 Aircraft detection based on improved Vision Transformer
Yunong Xiong, Xiaobo Guo, Xiaolei Liu
Author Affiliations +
Proceedings Volume 13071, International Conference on Mechatronic Engineering and Artificial Intelligence (MEAI 2023); 130713Q (2024) https://doi.org/10.1117/12.3025629
Event: International Conference on Mechatronic Engineering and Artificial Intelligence (MEAI 2023), 2023, Shenyang, China
Abstract
In the aviation field, accurate aircraft detection is of great significance for traffic monitoring, safety assurance, and military reconnaissance. Although traditional convolutional neural networks have achieved significant success in image recognition and object detection, they still face challenges in processing aerial images containing multi-scale and complex backgrounds. To address these issues, this study proposes a Vision Transformer based model that incorporates SKAttention (Selective Kernel Attention) and MSCAM (Multi Scale Channel Attention Module) technologies to improve the accuracy and efficiency of aircraft detection.

SKAttention technology effectively enhances the flexibility and accuracy of the model in processing aircraft images of different scales by adaptively selecting the most suitable convolution kernel size. MSCAM, on the other hand, optimizes the model's ability to process aircraft details and background information by enhancing channel attention at different scales. By combining these two methods into the Vision Transformer architecture, our model achieved accuracy and recall of 98.7% and 98.2%, respectively. These results validate the effectiveness of SKAttention and MSCAM in improving the performance of aviation aircraft detection based on Vision Transformer, providing new technological approaches and research directions for aviation image processing and object detection.
© (2024) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Yunong Xiong, Xiaobo Guo, and Xiaolei Liu "Aircraft detection based on improved Vision Transformer", Proc. SPIE 13071, International Conference on Mechatronic Engineering and Artificial Intelligence (MEAI 2023), 130713Q (28 February 2024); https://doi.org/10.1117/12.3025629
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Object detection

Visual process modeling

Image processing

Target detection

Transformers

Performance modeling

Convolution

RELATED CONTENT


Back to Top