Content Features for Logical Document Labeling

TitleContent Features for Logical Document Labeling
Publication TypeConference Papers
Year of Publication2003
AuthorsLiang J, Doermann D
Conference NameProc. SPIE Conference on Document Recognition and Retrieval X
Date Published2003///
Abstract

The use of content features extracted from recognized text is valuable in labeling logical elements in documents withoutrigid layout structure, like business letters. This paper discusses a model-based approach to combining content features
with other geometrical and presentation features for logical labeling. Models are automatically initialized and adaptively
improved using training samples. Satisfactory experimental results are presented.