8 November 2023 Structured Fourier contour embedding for arbitrary-shaped slender text detection
Jianyong Chen, Lingyu Liang, Wocheng Xiao
Proceedings Volume 12923, Third International Conference on Artificial Intelligence, Virtual Reality, and Visualization (AIVRV 2023); 1292311 (2023) https://doi.org/10.1117/12.3011781
Event: 3rd International Conference on Artificial Intelligence, Virtual Reality and Visualization (AIVRV 2023), 2023, Chongqing, China
Abstract
Arbitrary-shaped text detection is a challenging task in which the text instance representation must be designed to cover diverse variations in text geometry. Most current methods represent text instances as image masks or contour point sequences, which can lead to expensive post-processing or restricted shape-modeling capability. Most recently, Zhu et al. proposed the Fourier Contour Embedding (FCE) method to represent arbitrary-shaped text contours in the Fourier domain, and constructed FCENet to obtain state-of-the-art (SOTA) performance. However, like most other text detectors, even the FCE signature and FCENet may suffer significant performance degradation on slender text instances because of the mismatch between square-shaped receptive fields and the large aspect ratio of arbitrary-shaped slender text. To tackle this problem, we designed Structured Fourier Contour Embedding (Structured-FCE) to encode both instance-level features and local geometric features of the boundaries. Then, we constructed Structured-FCENet to predict and merge Structured-FCE signatures to construct the boundaries of arbitrary-shaped slender texts. Qualitative and quantitative results illustrate that Structured-FCENet obtains SOTA performance on the CTW1500, Total-Text and DAST1500 benchmark datasets, especially for challenging slender text instances of various shapes.
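To illustrate what representing a contour in the Fourier domain can look like, the following is a minimal sketch of the general idea only, not the authors' Structured-FCE or the FCENet implementation. It assumes the contour is a closed curve sampled at N points and treated as the complex signal x + iy; the function names `fourier_embed` and `fourier_reconstruct` and the truncation degree `k` are hypothetical names chosen for this example.

```python
import numpy as np

def fourier_embed(contour_xy, k):
    """Encode a closed contour (N x 2 array) as 2k+1 complex Fourier coefficients.

    Assumption for this sketch: points are ordered along the contour and N >= 2k+1.
    """
    z = contour_xy[:, 0] + 1j * contour_xy[:, 1]   # complex signal x + iy
    n = len(z)
    all_coeffs = np.fft.fft(z) / n                 # normalized DFT coefficients
    freqs = np.arange(-k, k + 1)                   # keep frequencies -k..k
    return freqs, all_coeffs[freqs % n]            # negative freqs wrap around

def fourier_reconstruct(freqs, coeffs, num_points):
    """Rebuild contour points from the truncated Fourier coefficients."""
    t = np.arange(num_points) / num_points         # curve parameter in [0, 1)
    basis = np.exp(2j * np.pi * t[:, None] * freqs[None, :])
    z = (coeffs[None, :] * basis).sum(axis=1)
    return np.stack([z.real, z.imag], axis=1)

# A slender ellipse (aspect ratio 10:1) as a toy stand-in for a slender text contour.
theta = 2 * np.pi * np.arange(64) / 64
ellipse = np.stack([10 * np.cos(theta), np.sin(theta)], axis=1)

freqs, coeffs = fourier_embed(ellipse, k=1)
recon = fourier_reconstruct(freqs, coeffs, 64)
```

In this toy case the ellipse is captured exactly by three coefficients, which shows why a compact Fourier signature can describe a smooth closed boundary. Real text contours generally need a higher degree, and the sketch says nothing about how a network would predict such coefficients.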
(2023) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Jianyong Chen, Lingyu Liang, and Wocheng Xiao "Structured Fourier contour embedding for arbitrary-shaped slender text detection", Proc. SPIE 12923, Third International Conference on Artificial Intelligence, Virtual Reality, and Visualization (AIVRV 2023), 1292311 (8 November 2023); https://doi.org/10.1117/12.3011781
KEYWORDS
Modeling
Feature extraction
Information fusion
Structural design
Computer vision technology
Industrial applications
Target detection
