Enhanced multihead self-attention block network for remote sensing image scene classification

Yijin Li; Jiaxin Wang; Sibao Chen; Jin Tang; Bin Luo

doi:10.1117/1.JRS.17.016517

Journal of Applied Remote Sensing, Vol. 17, Issue 1, 016517 (March 2023). https://doi.org/10.1117/1.JRS.17.016517

Abstract

Remote sensing image scene classification has been widely researched with the aim of assigning semantic labels to land cover. Although convolutional neural networks (CNNs), such as VGGNet and ResNet, have achieved good performance, the complex backgrounds and redundant information of remote sensing images limit further gains in accuracy. We propose an enhanced multihead self-attention block network, which effectively reduces the adverse impact of the background and emphasizes the main information. In this model, because the high-level information of a CNN may be redundant, we replace only the final three bottleneck blocks of ResNet50 with the enhanced multihead self-attention layer so that the network focuses on the salient region of each image more effectively. Our enhanced multihead self-attention layer provides the following improvements over the classical module. First, we construct a triple-way convolution to deal with the arbitrary directionality of remote sensing images and obtain more stable attention information. Second, improved relative position encodings are used to account for the relative distance between features at different locations. Finally, we use depthwise convolution and an InstanceNorm operation to restore the diversity of the multiple heads. Comparison and ablation experiments carried out on three public datasets show that our approach improves significantly upon the baseline and achieves remarkable performance compared with some state-of-the-art methods.
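As a rough illustration of the mechanism the abstract builds on, the following is a minimal sketch of single-head self-attention over a 2D feature grid with a position term that depends only on the relative offset between locations. It is not the authors' layer: the function name `self_attention_2d`, the scalar per-offset bias table `rel_bias`, and the use of identity query/key/value projections are all assumptions made for brevity, and the triple-way convolution, depthwise convolution, and InstanceNorm steps are omitted.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention_2d(features, height, width, rel_bias):
    """Single-head self-attention over a height x width grid (hypothetical helper).

    features: list of height*width vectors in row-major order.
    rel_bias: dict mapping a relative offset (dy, dx) to a scalar added to the
              attention logit; missing offsets contribute 0.0. This scalar bias
              is an illustrative stand-in, not the paper's encoding.
    Returns (outputs, weights): attended vectors and per-query attention rows.
    """
    if len(features) != height * width:
        raise ValueError("features must contain height * width vectors")
    dim = len(features[0])
    scale = 1.0 / math.sqrt(dim)
    outputs, weights = [], []
    for i, q in enumerate(features):
        qy, qx = divmod(i, width)
        logits = []
        for j, k in enumerate(features):
            ky, kx = divmod(j, width)
            # Content term: scaled dot product (identity Q/K projections assumed).
            content = sum(a * b for a, b in zip(q, k)) * scale
            # Position term: depends only on the offset between the two cells.
            position = rel_bias.get((ky - qy, kx - qx), 0.0)
            logits.append(content + position)
        row = softmax(logits)
        # Weighted sum of values (identity V projection assumed).
        out = [sum(row[j] * features[j][d] for j in range(len(features)))
               for d in range(dim)]
        outputs.append(out)
        weights.append(row)
    return outputs, weights


# Small demo on a 2 x 2 grid with a bias that favours each cell's own position.
grid = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5]]
attended, attn = self_attention_2d(grid, 2, 2, {(0, 0): 1.0})
print(attn[0])
```

Because the position term is keyed on the offset `(dy, dx)` rather than on absolute coordinates, two pairs of locations separated by the same displacement receive the same positional contribution, which is the general idea behind relative position encodings.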

Citation

Yijin Li, Jiaxin Wang, Sibao Chen, Jin Tang, and Bin Luo "Enhanced multihead self-attention block network for remote sensing image scene classification," Journal of Applied Remote Sensing 17(1), 016517 (28 March 2023). https://doi.org/10.1117/1.JRS.17.016517

Received: 28 November 2022; Accepted: 7 March 2023; Published: 28 March 2023

JOURNAL ARTICLE
21 PAGES

KEYWORDS

Remote sensing

Image enhancement

Education and training

Convolution

Scene classification

Feature extraction

Ablation
