Presentation + Paper
9 October 2018
Scene text detection and recognition system for visually impaired people in real world
Proceedings Volume 10794, Target and Background Signatures IV; 107940S (2018) https://doi.org/10.1117/12.2325523
Event: SPIE Security + Defence, 2018, Berlin, Germany
Abstract
Visually impaired (VI) people around the world have difficulty socializing and traveling because of the limitations of traditional assistive tools. In recent years, practical assistance systems for scene text detection and recognition have allowed VI people to obtain text information from surrounding scenes. However, real-world scene text features complex backgrounds, low resolution, variable fonts, and irregular arrangement, all of which make robust scene text detection and recognition difficult. In this paper, a scene text recognition system to help VI people is proposed. First, we propose a high-performance neural network to detect and track objects, which is applied to specific scenes to obtain Regions of Interest (ROI). To achieve real-time detection, a lightweight deep neural network is built using depth-wise separable convolutions, which enables the system to be integrated into mobile devices with limited computational resources. Second, we train the neural network using textural features to improve the precision of text detection. Our algorithm suppresses the effects of spatial transformations (including translation, scaling, rotation, and other geometric transformations) using spatial transformer networks. An open-source optical character recognition (OCR) engine is trained on scene texts individually to improve the accuracy of text recognition. The interactive system ultimately conveys the number and distance information of inbound buses to visually impaired people. Finally, a comprehensive set of experiments on several benchmark datasets demonstrates that our algorithm achieves an excellent trade-off between precision and resource usage.
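The abstract credits depth-wise separable convolutions for making the network light enough for mobile devices. As a rough illustration of why that helps, and not the authors' implementation, the sketch below compares the parameter and multiply-accumulate counts of a standard convolution with those of its depth-wise separable counterpart. The layer sizes are hypothetical, and the counts assume stride 1, "same" padding, and no bias terms.

```python
def standard_conv_cost(k, c_in, c_out, h, w):
    """Parameters and multiply-accumulates (MACs) of a k x k standard convolution.

    Assumes stride 1, 'same' padding (output is h x w), and no bias.
    """
    params = k * k * c_in * c_out
    macs = params * h * w
    return params, macs


def depthwise_separable_cost(k, c_in, c_out, h, w):
    """Cost of a k x k depth-wise convolution followed by a 1 x 1 point-wise one.

    Same assumptions as standard_conv_cost.
    """
    depthwise = k * k * c_in      # one k x k filter per input channel
    pointwise = c_in * c_out      # 1 x 1 convolution that mixes channels
    params = depthwise + pointwise
    macs = params * h * w
    return params, macs


# Hypothetical layer: 3 x 3 kernel, 64 -> 128 channels, 56 x 56 feature map.
k, c_in, c_out, h, w = 3, 64, 128, 56, 56
std_params, std_macs = standard_conv_cost(k, c_in, c_out, h, w)
sep_params, sep_macs = depthwise_separable_cost(k, c_in, c_out, h, w)

# The cost ratio simplifies to 1 / c_out + 1 / k**2.
ratio = sep_params / std_params
print(f"standard: {std_params} params, separable: {sep_params} params")
print(f"separable / standard = {ratio:.4f}")
```

For a 3 x 3 kernel the ratio approaches 1/9 as the number of output channels grows, which is the usual source of the savings in lightweight architectures of this kind.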
© (2018) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Lei Fei, Kaiwei Wang, Shufei Lin, Kailun Yang, Ruiqi Cheng, and Hao Chen "Scene text detection and recognition system for visually impaired people in real world", Proc. SPIE 10794, Target and Background Signatures IV, 107940S (9 October 2018); https://doi.org/10.1117/12.2325523
KEYWORDS
Visualization
Neural networks
Detection and tracking algorithms
Optical character recognition
Network architectures
Navigation systems
Object recognition