Paper
23 February 2005 Content-based document enhancement by fuzzy clustering with spatial constraints
Author Affiliations +
Abstract
In this paper, we present a new system to segment and label the contents of scanned documents as either text or image, using a modified fuzzy c-means (FCM) algorithm. Each pixel is assigned a feature pattern extracted from the gray level distribution and computed at different scales. The invariant feature pattern is then assigned to a specific region using fuzzy logic. Our algorithm is formulated by modifying the objective function of the standard FCM algorithm to allow the labeling of a pixel to be influenced by the labels in its immediate neighborhood. The neighborhood effect acts as a regularizer and biases the solution towards piecewise-homogeneous labelings. Such a regularization is useful in segmenting scans corrupted by scanner noise.
© (2005) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Mohamed Nooman Ahmed and Brian E. Cooper "Content-based document enhancement by fuzzy clustering with spatial constraints", Proc. SPIE 5673, Applications of Neural Networks and Machine Learning in Image Processing IX, (23 February 2005); https://doi.org/10.1117/12.585543
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image segmentation

Fuzzy logic

Image processing algorithms and systems

Scanners

Feature extraction

Image enhancement

Halftones

RELATED CONTENT

A robust segmentation of scanned documents
Proceedings of SPIE (February 08 2015)
High recall document content extraction
Proceedings of SPIE (January 24 2011)

Back to Top