Page Segmentation and Zone Classification: The State of the Art

Title: Page Segmentation and Zone Classification: The State of the Art
Publication Type: Reports
Year of Publication: 1999
Authors: Okun O, Doermann D, Pietikainen M
Date Published: 1999/11
Institution: University of Maryland, College Park
Abstract

Page segmentation and zone classification are key areas of research in document image processing because they occupy an intermediate position between document preprocessing and higher-level document understanding, such as logical page analysis and OCR. This analysis of the page relies heavily on an appropriate document model and results in a representation of the physical structure of the document. The purpose of this review is to analyze the progress made in page segmentation and zone classification and to suggest what needs to be done to advance the field.