With the rapid development of information technology, text recognition in natural scenes has become an active research topic. To identify the box number in container images taken in natural scenes both accurately and quickly, this paper proposes a deep learning-based image text recognition model, Faster-RCNN and CNN with Attention (FRCA), which consists of two stages: box number area detection and box number character recognition. We use an improved Faster-RCNN network to locate the box number; it adds an attention mechanism to the region proposal network (RPN) to speed up detection while preserving accuracy. We then use an improved CNN to recognize the box number characters. Experiments on a benchmark dataset and a real dataset compare FRCA with three alternatives: a connected region detection method, a combined Faster-RCNN and VGG-16 method, and a combined Faster-RCNN and ResNet-101 method. The results show that FRCA is more accurate than the first two methods and, owing to the added attention mechanism, detects faster than the second and third.
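The two-stage structure described above (detect the box number area, then recognize its characters) can be illustrated with a minimal sketch. This is not the paper's implementation: the names `Detection`, `crop`, and `recognize_box_number`, the `(x0, y0, x1, y1)` box convention, and the `min_score` threshold are all hypothetical, and the detector and recognizer are passed in as plain callables standing in for the improved Faster-RCNN and the improved CNN.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence, Tuple

# Hypothetical conventions, not taken from the paper:
# a box is (x0, y0, x1, y1) in pixels, an image is a 2D grid of pixel values.
Box = Tuple[int, int, int, int]
Image = Sequence[Sequence[int]]


@dataclass
class Detection:
    box: Box      # candidate box number region
    score: float  # detector confidence in [0, 1]


def crop(image: Image, box: Box) -> List[List[int]]:
    """Cut the region given by box out of the image."""
    x0, y0, x1, y1 = box
    return [list(row[x0:x1]) for row in image[y0:y1]]


def recognize_box_number(
    image: Image,
    detect: Callable[[Image], List[Detection]],  # stage 1 stand-in
    recognize: Callable[[Image], str],           # stage 2 stand-in
    min_score: float = 0.5,                      # assumed threshold
) -> Optional[str]:
    """Run stage 1, keep the best-scoring region, then run stage 2 on it."""
    candidates = [d for d in detect(image) if d.score >= min_score]
    if not candidates:
        return None  # no box number area found
    best = max(candidates, key=lambda d: d.score)
    return recognize(crop(image, best.box))
```

Keeping the two stages behind separate callables mirrors the comparison in the abstract, where the detection backbone is varied (for example VGG-16 or ResNet-101) independently of the rest of the pipeline.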