Context-driven text recognition by means of dictionary support
23 March 1994
Josua Boon, Frank Hoenes, Majdi Ben Hadj Ali
Proceedings Volume 2181, Document Recognition; (1994) https://doi.org/10.1117/12.171124
Event: IS&T/SPIE 1994 International Symposium on Electronic Imaging: Science and Technology, 1994, San Jose, CA, United States
Abstract
This paper presents an alternative method for recognizing typed characters by way of the textual context. The approach is word-oriented and uses no a priori knowledge about the typical appearance of characters. It goes back to an approach suggested by R. G. Casey, in which text recognition is treated as solving a substitution cipher, or cryptogram. Character images are considered only in order to distinguish or group (cluster) them. The recognition information itself is provided by dictionaries. The overall procedure can be divided into three principal steps: (1) a ciphertext-like symbolic representation of the text is generated; (2) in an initialization phase, only a few but reliable word recognitions are sought; (3) the resulting partial symbol-character assignments are sufficient to initiate the subsequent relaxation of the recognition process. Whereas Casey uses several ambiguous alternatives for word recognition, the approach here is based on acquiring a few, but reliable, recognition alternatives. Thus, instead of a spell-check program, a dictionary with heuristic-driven look-up control combined with an appropriate access mechanism is used.
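The three steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the clustering, the look-up heuristics, or the access mechanism, so the sketch assumes that character images have already been clustered into integer symbol IDs, that the dictionary is indexed by letter-repetition pattern, and that a word counts as "reliable" when exactly one dictionary entry is compatible with it. All function names (`pattern`, `build_index`, `extend`, `solve`) are hypothetical.

```python
from collections import defaultdict


def pattern(seq):
    """Repetition pattern of a sequence, e.g. 'that' -> (0, 1, 2, 0)."""
    first_seen = {}
    return tuple(first_seen.setdefault(x, len(first_seen)) for x in seq)


def build_index(dictionary):
    """Group dictionary words by repetition pattern for direct look-up."""
    index = defaultdict(list)
    for word in dictionary:
        index[pattern(word)].append(word)
    return index


def extend(assignment, symbols, word):
    """Return the assignment extended by symbols -> word, or None on conflict."""
    trial = dict(assignment)
    for sym, ch in zip(symbols, word):
        if sym in trial:
            if trial[sym] != ch:
                return None  # contradicts an earlier symbol-character assignment
        elif ch in trial.values():
            return None  # character already taken; keep the mapping one-to-one
        else:
            trial[sym] = ch
    return trial


def solve(cipher_words, dictionary):
    """Accept only unambiguous word matches and repeat until nothing changes."""
    index = build_index(dictionary)
    assignment = {}
    progress = True
    while progress:
        progress = False
        for symbols in cipher_words:
            if all(sym in assignment for sym in symbols):
                continue  # word is already fully decoded
            trials = [extend(assignment, symbols, w)
                      for w in index.get(pattern(symbols), [])]
            trials = [t for t in trials if t is not None]
            if len(trials) == 1:  # assumed reliability criterion: a unique match
                assignment = trials[0]
                progress = True
    return assignment


# Toy data: symbol IDs stand in for clusters of character images.
dictionary = ["that", "hat", "the", "tea", "eat"]
cipher_words = [(1, 2, 4), (4, 3, 1), (1, 2, 3, 1)]
assignment = solve(cipher_words, dictionary)
decoded = ["".join(assignment.get(s, "?") for s in w) for w in cipher_words]
print(decoded)  # ['the', 'eat', 'that']
```

In this toy run, the two three-symbol words are ambiguous at first (four dictionary entries share their pattern), while the four-symbol word matches only "that". That single reliable recognition fixes three symbols, which then leaves "the" as the only compatible reading of the first word. This mirrors the initialization-then-relaxation idea in a very reduced form.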
© (1994) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Josua Boon, Frank Hoenes, and Majdi Ben Hadj Ali "Context-driven text recognition by means of dictionary support", Proc. SPIE 2181, Document Recognition, (23 March 1994); https://doi.org/10.1117/12.171124
KEYWORDS
Associative arrays
Optical character recognition
Detection and tracking algorithms
Image segmentation
Reliability
Information operations
Lead