Open Access Paper
12 November 2024 Multilabel semantic segmentation algorithm for stroke extraction in calligraphy teaching
Jiayun Yu, Dingyu Li, Zhanyang Xu, Jinghong Wang, Wei Lin
Author Affiliations +
Proceedings Volume 13395, International Conference on Optics, Electronics, and Communication Engineering (OECE 2024) ; 133953R (2024) https://doi.org/10.1117/12.3048589
Event: International Conference on Optics, Electronics, and Communication Engineering, 2024, Wuhan, China
Abstract
Chinese characters, as the fundamental medium of communication within Chinese culture, stand out due to their intricate structures. Strokes, the basic elements of Chinese characters, are crucial for assessing Chinese handwriting. Accurate stroke extraction is essential and serves as the initial step in this evaluation. Traditional stroke extraction methods typically rely on specific rules that often fail to capture the full complexity of Chinese characters and cannot align the strokes according to the sequence used in template characters during assessments. To address these challenges, this paper redefines stroke extraction as a multi-label semantic segmentation task and introduces a new model, M-TransUnet. This model utilizes a deep convolutional approach to train individual Chinese characters, maintaining the integrity of stroke structures and resolving ambiguities in stroke segment combinations. It also accurately determines the order of strokes, aiding in subsequent tasks such as stroke evaluation. Furthermore, since handwriting images are only segmented into foreground and background without additional color cues, they are prone to false positive (FP) segmentation noise. To mitigate this issue, we propose a Local Smooth Strategy on Strokes (LSSS) that diminishes noise impacts on the segmentation results.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Jiayun Yu, Dingyu Li, Zhanyang Xu, Jinghong Wang, and Wei Lin "Multilabel semantic segmentation algorithm for stroke extraction in calligraphy teaching", Proc. SPIE 13395, International Conference on Optics, Electronics, and Communication Engineering (OECE 2024) , 133953R (12 November 2024); https://doi.org/10.1117/12.3048589
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image segmentation

Semantics

Education and training

Binary data

Classification systems

Neural networks

Transformers

Back to Top