KEYWORDS: Optical character recognition, Feature extraction, Data modeling, Databases, Computing systems, Systems modeling, Pattern recognition, Current controlled current source, Quantization, Image classification
Recognizing old documents is highly desirable since the demand for quickly searching millions of archived documents
has recently increased. Using Hidden Markov Models (HMMs) has been proven to be a good solution to tackle the
main problems of recognizing typewritten Arabic characters. These attempts however achieved a remarkable success for
omnifont OCR under very favorable conditions, they didn't achieve the same performance in practical conditions, i.e. noisy
documents. In this paper we present an omnifont, large-vocabulary Arabic OCR system using Pseudo Two Dimensional
Hidden Markov Model (P2DHMM), which is a generalization of the HMM. P2DHMM offers a more efficient way to
model the Arabic characters, such model offer both minimal dependency on the font size/style (omnifont), and high level
of robustness against noise. The evaluation results of this system are very promising compared to a baseline HMM system
and best OCRs available in the market (Sakhr and NovoDynamics). The recognition accuracy of the P2DHMM classifier
is measured against the classic HMM classifier, the average word accuracy rates for P2DHMM and HMM classifiers are
79% and 66% respectively. The overall system accuracy is measured against Sakhr and NovoDynamics OCR systems, the
average word accuracy rates for P2DHMM, NovoDynamics, and Sakhr are 74%, 71%, and 61% respectively.
Automatic blotch removal in old movies is important in film restoration. Blotches are black or white spots randomly
occurring along the movie frames. Removing these spots are obtained by first automatically detecting the blotches then
interpolating them using the spatial and temporal information in current, succeeding, and preceding frames. In this paper,
simplified Rank Order Detector (sROD) is used with tweaked parameters to over detect the blotches, Epitome Analysis is
used for interpolating the detected blotches.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.