16 August 2024 Swin UNet: a memory-efficient and accurate deep learning model for medical image segmentation
Jiachen Pan
Proceedings Volume 13230, Third International Conference on Machine Vision, Automatic Identification, and Detection (MVAID 2024); 132300J (2024) https://doi.org/10.1117/12.3035725
Event: Third International Conference on Machine Vision, Automatic Identification and Detection, 2024, Kunming, China
Abstract
Medical image segmentation is an important and challenging task that aims to identify and separate anatomical structures or pathological regions in complex, noisy image data. Most existing deep learning models for this task are based on convolutional neural networks (CNNs), which have high memory consumption and limited spatial reasoning capabilities. In this paper, we propose a deep learning model for medical image segmentation based on Swin UNet, which combines the self-attention mechanism of the Swin Transformer with the encoder-decoder architecture of U-Net. We also propose a memory management strategy that optimizes the number of heads in the multi-head self-attention mechanism using probabilistic mirror flipping and grid search. Extensive experiments on a challenging medical image segmentation dataset show that our model and strategy achieve accuracy comparable to or better than state-of-the-art models while significantly reducing memory usage. The model and strategy are robust and generalizable, as they can handle arbitrary input resolutions, scales, and modalities. Our study advances research on medical image segmentation and provides a practical, scalable solution for real-world applications with limited resources.
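The abstract names two ingredients of the memory management strategy, probabilistic mirror flipping and a grid search over the number of attention heads, but gives no implementation details. The following is a minimal sketch of how such a pairing might look, not the author's method. The function names, the flip probability `p`, the candidate head counts, and the `evaluate` callback (which could, for example, trade validation accuracy against peak memory) are all assumptions introduced for illustration.

```python
import numpy as np


def mirror_flip(image, mask, p=0.5, rng=None):
    """Flip image and mask left-right together with probability p.

    Hypothetical helper: the abstract does not specify the flip axis
    or the probability, so both are assumptions here.
    """
    if rng is None:
        rng = np.random.default_rng()
    if rng.random() < p:
        # Reverse the last (width) axis of both arrays so that the
        # segmentation mask stays aligned with the image.
        return image[..., ::-1].copy(), mask[..., ::-1].copy()
    return image, mask


def grid_search_heads(candidates, embed_dim, evaluate):
    """Return (best_heads, best_score) over the valid head counts.

    `evaluate` is an assumed user-supplied callable mapping a head
    count to a score where higher is better.
    """
    best = None
    for heads in candidates:
        # Multi-head attention splits the embedding evenly across heads,
        # so head counts that do not divide embed_dim are skipped.
        if embed_dim % heads != 0:
            continue
        score = evaluate(heads)
        if best is None or score > best[1]:
            best = (heads, score)
    return best
```

In use, `evaluate` would train or validate a model configured with the given head count on data augmented by `mirror_flip`, then return a single score. Keeping the scoring behind a callback leaves the accuracy and memory trade-off to the caller, since the abstract does not say how the two are weighed.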
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Jiachen Pan "Swin UNet: a memory-efficient and accurate deep learning model for medical image segmentation", Proc. SPIE 13230, Third International Conference on Machine Vision, Automatic Identification, and Detection (MVAID 2024), 132300J (16 August 2024); https://doi.org/10.1117/12.3035725
KEYWORDS
Image segmentation
Medical imaging
Transformers
Deep learning
Data modeling
Mathematical optimization
Image processing