David Doermann

Research Scientist Emeritus

Education:

Ph.D., University of Maryland (Computer Science)

Biography:

David Doermann is a research scientist emeritus in UMIACS.

In April 2018, Doermann relocated to the University at Buffalo, where he is a professor of computer science and engineering and director of the University at Buffalo Artificial Intelligence Institute.

At the University of Maryland, he served as co-director of the Laboratory for Language and Media Processing in UMIACS and as an adjunct member of the graduate faculty.

Doermann's team of researchers focuses on topics related to document image analysis and multimedia information processing. Recent intelligent document image analysis projects include page decomposition, structural analysis and classification, page segmentation, logo recognition, document image compression, duplicate document image detection, image-based retrieval, character recognition, generation of synthetic OCR data, and signature verification. In video processing, projects have centered on the segmentation of compressed domain video sequences, structural representation and classification of video, detection of reformatted video sequences, and the performance evaluation of automated video analysis algorithms.

In 2002 he received an Honorary Doctorate of Technology Sciences from the University of Oulu for his contributions to digital media processing and document analysis research. Doermann is a founding co-editor of the International Journal on Document Analysis and Recognition and has served as the general chair or co-chair of more than a half dozen international conferences and workshops, including the International Conference on Document Analysis and Recognition (ICDAR), which was held in Washington, D.C., in 2013. He has authored more than 30 journal publications and more than 125 refereed conference papers.

He received a B.Sc. degree in computer science and mathematics from Bloomsburg University in 1987 and an M.Sc. degree in 1989 from the Department of Computer Science at the University of Maryland, College Park. He continued his studies in the Computer Vision Laboratory, where he earned a doctorate in 1993.

Publications

2007

Yu X, Yi L, Fermüller C, Doermann D. 2007. Object detection using a shape codebook. British Machine Vision Conference. 4

Farrell R, Doermann D, Davis LS. 2007. Learning Higher-order Transition Models in Medium-scale Camera Networks. Computer Vision, 2007. ICCV 2007. IEEE 11th International Conference on. :1-8.

Lin Z, Davis LS, Doermann D, DeMenthon D. 2007. Simultaneous appearance modeling and segmentation for matching people under occlusion. Proceedings of the 8th Asian conference on Computer vision - Volume Part II. :404-413.

Lin Z, Davis LS, Doermann D, DeMenthon D. 2007. Poster Session 5: Matching and Registration-Simultaneous Appearance Modeling and Segmentation for Matching People Under Occlusion. Lecture Notes in Computer Science. 4844:404-413.

2006

Liang J, DeMenthon D, Doermann D. 2006. Mosaicing of Camera-captured Documents Without Pose Restriction. Computer Vision and Image Understanding.

Li Y, Zheng Y, Doermann D, Jaeger S. 2006. Script-Independent Text Line Segmentation in Freestyle Handwritten Documents. LAMP-TR-136, CS-TR-4836, UMIACS-TR-2006-51, CFAR-TR-1017

Jaeger S, Zhu G, Doermann D, Chen K, Sampat S. 2006. DOCLIB: a Software Library for Document Processing. International Conference on Document Recognition and Retrieval XIII. :1-9.

DeMenthon D, Doermann D. 2006. Video Retrieval of Near-Duplicates using k-Nearest Neighbor Retrieval of Spatio-Temporal Descriptors. Multimedia Tools and Applications (MTAP). 30

Liu X, Li H, Doermann D. 2006. Imaging as an Alternative Data Channel for Camera Phones. ACM International Conference Proceeding Series; Proceedings of the 5th International Conference on Mobile and Ubiquitous Multimedia. :No. 5.

Li Y, Zheng Y, Doermann D, Jaeger S. 2006. A New Algorithm for Detecting Text Line in Handwritten Documents. 10th International Workshop on Frontiers in Handwriting Recognition. :35-40.

Luo M, DeMenthon D, Yu X, Doermann D. 2006. SOFTCBIR: Object Searching in Videos Combining Keypoint Matching and Graduated Assignment. LAMP-TR-132,CAR-TR-1013,CS-TR-4804,UMIACS-TR-2006-25

Liang J, DeMenthon D, Doermann D. 2006. Camera-Based Document Image Mosaicing. ICPR'06. :476-479.

Li Y, Zheng Y, Doermann D. 2006. Detecting Text Line in Handwritten Documents. ICPR'06. :1030-1033.

Zheng Y, Doermann D. 2006. Robust Point Matching for Nonrigid Shapes By Preserving Local Neighborhood Structures. IEEE Transactions on Pattern Analysis and Machine Intelligence. 28(4):643-649.

Zhu G, Jaeger S, Doermann D. 2006. A Robust Stamp Detection Framework on Degraded Documents. International Conference on Document Recognition and Retrieval XIII. :1-9.

Karagol-Ayan B, Doermann D, Weinberg A. 2006. Morphology Induction from Limited Noisy Data Using Approximate String Matching. Proceedings of the Eighth Meeting of the ACL Special Interest Group in Computational Phonology (SIGPHON 2006). :60-68.

2005

Chen K, Jaeger S, Zhu G, Doermann D. 2005. DOCLIB: A document processing research tool. Symposium on Document Image Understanding Technology. :159-163.

Liang J, DeMenthon D, Doermann D. 2005. Flattening curved documents in images. Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on. 2:338-345.

Ma H, Doermann D. 2005. Font identification using the grating cell texture operator. SPIE Conference on Document Recognition and Retrieval XXII. :148-156.

Ma H, Doermann D. 2005. Adaptive OCR with Limited User Feedback. 8th Int. Conf. on Document Analysis and Recognition (ICDAR'05). :814-818.

Huang M, DeMenthon D, Doermann D, Golebiowski L. 2005. Document Ranking by Layout Relevance. ICDAR. :362-366.

Zheng Y, Doermann D. 2005. Handwriting matching and its application to handwriting synthesis. Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on. 2:861-865.

Zheng Y, Li H, Doermann D. 2005. A Parallel-Line Detection Algorithm Based on HMM Decoding. IEEE Transactions on Pattern Analysis and Machine Intelligence. 27(5):777-792.

Liang J, DeMenthon D, Doermann D. 2005. Unwarping Images of Curved Documents Using Global Shape Optimization. Proc. First International Workshop on Camera-based Document Analysis and Recognition. :25-29.

Liang J, Doermann D, Li H. 2005. Camera-Based Analysis of Text and Documents: A Survey. International Journal on Document Analysis and Recognition. 7(2+3):83-104.

Liu X, Doermann D. 2005. Using Computer Vision to Detect Web Browser Display Errors. 3rd Web Document Analysis Workshop (on ICDAR 2005). :5-9.

Yu Y, Doermann D. 2005. Model of Object-Based Coding for Surveillance Video. Proceedings in the ICASSP'04 IEEE International Conference on Computer Vision. :693-696.

Liu X, Doermann D, Li H. 2005. Fast Camera Motion Estimation for Hand-Devices and Applications. 4th International Conference on Mobile and Ubiquitous Multimedia. :103-108.

Jaeger S, Ma H, Doermann D. 2005. Identifying Script on Word-Level with Informational Confidence. 8th Int. Conf. on Document Analysis and Recognition. :416-420.

Zheng Y, Doermann D. 2005. Robust Point Matching for Two-Dimensional Nonrigid Shapes. Proceedings in the ICASSP'04 IEEE International Conference on Computer Vision. :1561-1566.

2004

Byrne W, Doermann D, Franz M, Gustman S, Hajic J, Oard D, Picheny M, Psutka J, Ramabhadran B. 2004. Automatic recognition of spontaneous speech for access to multilingual oral history archives. IEEE Transactions on Speech and Audio Processing, Special Issue on Spontaneous Speech Processing. 12(4):420-435.

Ma H, Doermann D. 2004. Adaptive Hindi OCR Using Generalized Hausdorff Image Comparison. ACM Transactions on Asian Language Information Processing. 26(2):198-213.

Zheng Y, Li H, Doermann D. 2004. Machine printed text and handwriting identification in noisy document images. Pattern Analysis and Machine Intelligence, IEEE Transactions on. 26(3):337-353.

Kang H-J, Doermann D. 2004. Product approximation by minimizing the upper bound of Bayes error rate for Bayesian combination of classifiers. Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on. 1:252-255.

Luo M, DeMenthon D, Doermann D. 2004. Shot boundary detection based on Image correlation features in video. TREC-VID: Text Retrieval and Evaluation Conference.

Balcells-Capellades M, DeMenthon D, Doermann D. 2004. An Appearance-based Approach for Consistent Labeling of Humans and Objects in Video. Pattern Analysis and Applications. :1433-7541.

Ma H, Doermann D. 2004. Word Level Script Identification on Scanned Document Images. SPIE Conference on Document Recognition and Retrieval. :124-135.

Ghanem N, Doermann D, Davis LS, DeMenthon D. 2004. Mining Tools for Surveillance Video. Proceedings in SPIE 16th International Symposium on Electronic Imaging. 5307:259-270.

Ghanem N, DeMenthon D, Doermann D, Davis LS. 2004. Representation and Recognition of Events in Surveillance Video Using Petri Nets. Second IEEE Workshop on Event Mining 2004, CVPR 2004. :112.

Zheng Y, Doermann D. 2004. Robust Point Matching for Non-Rigid Shapes: A Relaxation Labeling Based Approach. LAMP-TR-117,CAR-TR-1005,CS-TR-4633,UMIACS-TR-2004-75

Oard D, Soergel D, Doermann D, Huang X, Murray CG, Wang J, Ramabhadran B, Franz M, Gustman S, Mayfield J et al.. 2004. Building an information retrieval test collection for spontaneous conversational speech. Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval. :41-48.

Ghanem NM, Doermann D, Davis LS, DeMenthon D. 2004. Mining tool for surveillance video. SPIE 16th International Symposium on Electronic Imaging, Storage and Retrieval Methods and Applications for Multimedia. 5307:259-270.

2003

Ma H, Karagol-Ayan B, Doermann D, Oard D, Wang J. 2003. Parsing and Tagging of Bilingual Dictionaries. TAL Traitement Automatique Des Langues. 44(2):125-150.

Rosenfeld A, Doermann D, DeMenthon D. 2003. Video Mining.

Karagol-Ayan B, Doermann D, Dorr BJ. 2003. Acquisition of Bilingual MT Lexicons from OCRed Dictionaries. Proceedings of the Ninth Machine Translation Summit. :208-215.

Ma H, Doermann D. 2003. Adaptive Hindi OCR Using Generalized Hausdorff Image Comparison. LAMP-TR-105,CFAR-TR-987,CS-TR-4519,UMIACS-TR-2003-87

Ma H, Karagol-Ayan B, Doermann D, Oard D, Wang J. 2003. Tagging and Parsing of Bilingual Dictionary. LAMP-TR-106,CFAR-TR-991,CS-TR-4529,UMIACS-TR-2003-97

DeMenthon D, Doermann D. 2003. Video Retrieval using Spatio-Temporal Descriptors. ACM Multimedia '03. :508-517.

Ma H, Doermann D. 2003. Gabor Filter Based Multi-class Classifier for Scanned Document Images. 7th International Conference on Document Analysis and Recognition (ICDAR). :968-972.

Liang J, Doermann D. 2003. Content Features for Logical Document Labeling. Proc. SPIE Conference on Document Recognition and Retrieval X. :189-196.

Zheng Y, Li H, Doermann D. 2003. Machine Printed Text and Handwriting Identification in Noisy Document Images. LAMP-TR-107,CFAR-TR-992,CS-TR-4531,UMIACS-TR-2003-99

Zheng Y, Li H, Doermann D. 2003. A Model-based Line Detection Algorithm in Documents. ICDAR. :44-48.

Capellades MB, Doermann D, DeMenthon D, Chellappa R. 2003. An appearance based approach for human and object tracking. Image Processing, 2003. ICIP 2003. Proceedings. 2003 International Conference on. 2:II-85-8.

Doermann D, Zi G. 2003. Groundtruth Image Generation from Electronic Text (Demonstration). Symposium on Document Image Understanding Technology. :309-312.

Shin C, Doermann D, Rosenfeld A. 2003. Measuring Structural Similarity of Document Pages for Searching Document Image Databases. 5th IASTED International Conference on Signal and Image Processing. :320-325.

Karagol-Ayan B, Doermann D, Dorr BJ. 2003. Acquisition of bilingual MT lexicons from OCRed dictionaries. Proceedings of the 9th MT Summit. :208-215.

Zheng Y, Li H, Doermann D. 2003. A Parallel Line Detection Algorithm Based on HMM Decoding. LAMP-TR-109,CAR-TR-994,CS-TR-4545,UMIACS-TR-2003-1113

Oard D, Doermann D, Dorr BJ, He D, Resnik P, Weinberg A, Byrne W, Khudanpur S, Yarowsky D, Leuski A et al. 2003. Desperately seeking Cebuano. Third Conference on Human Language Technologies.

Zheng Y, Li H, Doermann D. 2003. Text Identification in Noisy Document Images Using Markov Random Field. ICDAR. :599-603.

Gilbert X, Li H, Doermann D. 2003. Sports Video Classification Using HMM. ICME. 2:345-348.

Karagol-Ayan B, Doermann D, Dorr BJ. 2003. Use of OCR for Rapid Construction of Bilingual Lexicons. LAMP-TR-104,CFAR-TR-986,CS-TR-4510,UMIACS-TR-2003-78

Doermann D, Karunanidhi A. 2003. Video Analysis for Pervasive Environments. ICME. 2:161-164.

2002

Mariano VY, Min J, Park J-H, Kasturi R, Mihalcik D, Li H, Doermann D, Drayer T. 2002. Performance evaluation of object detection algorithms. Pattern Recognition, 2002. Proceedings. 16th International Conference on. 3:965-969.

Zheng Y, Li H, Doermann D. 2002. Segmentation and Identification of Handwriting in Noisy Documents. IAPR Conference on Document Analysis System. :95-105.

Rautiainen M, Doermann D. 2002. Temporal Color Correlograms in Video Retrieval. International Conference on Pattern Recognition. :267-270.

Gupta P, Doermann D, DeMenthon D. 2002. Beam Search for Feature Selection in Automatic SVM Defect Classification. International Conference on Pattern Recognition. :212-215.

Wolf C, Doermann D. 2002. Binarization of low quality text using a Markov random field model. Pattern Recognition, 2002. Proceedings. 16th International Conference on. 3:160-163.

Doermann D, Ma H, Karagol-Ayan B, Oard D. 2002. Lexicon Acquisition from Bilingual Dictionaries. SPIE Photonic West Electronic Imaging Conference. :37-48.

Doermann D, Intrator N, Rivin E, Steinherz T. 2002. Hidden loop recovery for handwriting recognition. Frontiers in Handwriting Recognition, 2002. Proceedings. Eighth International Workshop on. :375-380.

Li H, Doermann D. 2002. Text Quality Estimation in Digital Video. SPIE Conf. on Document Recognition and Information Retrieval. :232-243.