Table-Form Structure Analysis Based on Box-Driven Reasoning

Title: Table-Form Structure Analysis Based on Box-Driven Reasoning
Publication Type: Journal Articles
Year of Publication: 1996
Authors: Hori O, Doermann D
Journal: IEICE TRANSACTIONS on Information and Systems
Pagination: 542 - 547
Date Published: 1996/05/20
ISSN: 0916-8532

Table-form document structure analysis is an important problem in the document processing domain. This paper presents a new method, Box-Driven Reasoning (BDR), that robustly analyzes the structure of table-form documents containing touching characters and broken lines. Real documents are copied repeatedly and overlaid with printed data, which produces characters that touch cells and lines that are broken. Most previous methods take a line-oriented approach, so touching characters and broken lines cause them to fail at an early stage. In contrast, BDR deals with regions directly, and it introduces a reduced-resolution image to supplement information degraded by noise. Experimental tests show that BDR reliably recognizes cells and strings in document images with touching characters and broken lines.
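
The abstract does not say how the reduced-resolution image is built, so the following is only an illustrative sketch of one common way such an image can help with broken lines, not the procedure used in the paper. It assumes a binary image stored as a list of rows of 0/1 values and downsamples it by marking a block as ink if any pixel inside it is ink. The function name `reduce_resolution` and the parameter `factor` are hypothetical.

```python
def reduce_resolution(image, factor):
    """Downsample a binary image by OR-ing each factor x factor block.

    `image` is a list of equal-length rows holding 0 (background) or 1 (ink).
    This block-OR rule is an assumption made for illustration; the abstract
    does not specify how the reduced-resolution image is computed.
    """
    height = len(image)
    width = len(image[0]) if height else 0
    reduced = []
    for top in range(0, height, factor):
        row = []
        for left in range(0, width, factor):
            # A reduced pixel is ink if any source pixel in its block is ink.
            block_has_ink = any(
                image[y][x]
                for y in range(top, min(top + factor, height))
                for x in range(left, min(left + factor, width))
            )
            row.append(1 if block_has_ink else 0)
        reduced.append(row)
    return reduced


# A horizontal ruled line with a two-pixel break at columns 4 and 5.
broken_line = [
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
    [1, 1, 1, 1, 0, 0, 1, 1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
]
reduced = reduce_resolution(broken_line, 3)
print(reduced)  # [[1, 1, 1, 1]]
```

In this sketch the break disappears in the reduced image because a gap shorter than `factor` cannot cover an entire block along the line. The trade-off is that nearby strokes, such as characters touching a cell border, also merge at the lower resolution, which is presumably why such an image would supplement the full-resolution analysis and not replace it.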