The problem of form classification is to assign a single-page form image to one of a set of predefined form types or classes.
We classify the form images using low level pixel density information from the binary images of the documents. In this
paper, we solve the form classification problem with a classifier based on the k-means algorithm, supported by adaptive
boosting. Our classification method is tested on the NIST scanned tax forms data bases (special forms databases 2 and 6)
which include machine-typed and handwritten documents. Our method improves the performance over published results
on the same databases, while still using a simple set of image features.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.