Chuan Xu, Yinging Hou, Wenying Du, Wei Cao, Ying Wang, Zhiwei Ye, Ting Bai, Wei Yang, Liye Mei
Journal of Applied Remote Sensing, Vol. 18, Issue 04, 044508, (October 2024) https://doi.org/10.1117/1.JRS.18.044508
TOPICS: Feature extraction, Feature fusion, Buildings, Remote sensing, Transformers, Performance modeling, Image fusion, Visualization, Panoramic photography, Education and training
Cropland is crucial for national food security, maintaining agricultural product quality, and ensuring environmental safety. Consequently, there is an urgent need for cropland change detection (CD) using high-resolution remote sensing images to accurately track cropland distribution and changes. However, the irregular shapes of cultivated areas and challenges in feature fusion complicate boundary localization, posing a risk of losing critical change features. To address these issues, we introduce the dynamic sparse hierarchical attention-driven cropland CD network, which combines a vision transformer with dynamic sparse hierarchical attention (DSHA-Former) and holistic complementation fusion (HCF) modules. DSHA-Former effectively detects targets and refines edge features in images of various sizes. Concurrently, HCF preserves key details by integrating core, setting, margin, and panorama data, supplementing global image-level content comprehensively. This significantly improves the definition of changed areas in the process of merging features from different scales, thus enhancing cropland CD. We evaluate our approach on the CL-CD dataset, achieving an F1-score of 79.49%. In addition, the network demonstrates strong generalization capabilities on both the LEVIR-CD and WHU-CD datasets, with F1-scores of 92.42% and 92.18%, respectively. Our method is effective and demonstrates broad applicability in detecting cropland changes.