Evaluating Basic Text Meaning Representations Produced by the OntoSem Semantic Analyzer

 

 Sergei Nirenburg

Department of Computer Science and

Institute for Language and Information Technologies

University of Maryland, Baltimore County


UMIACS Computational Linguistics Colloquium

May 12, 2004, 11:00am, UMIACS Room 2120


 

I will describe an evaluation regimen developed for the text meaning representations (TMRs) produced by the general purpose text analyzer OntoSem. The TMRs are formulated in a metalanguage informed by an ontology developed specifically to support natural language processing. OntoSem uses a combination of knowledge-based and stochastic constraints for making heuristic preferences in the space of potential textual analyses. Static constraints are stored in Ontosem lexicons, the ontology and the fact repository of remembered instances of ontological concepts. Dynamic constraints are generated by pre-semantic processing stages, such as morphology and syntax, as well as by semantic processing itself. The initial evaluation has concentrated on a subset of material produced by the analyzer and recorded in the TMRs, specifically, on the “who did what to whom” basic propositional semantic content. Therefore, at the semantic level we measured only the quality of word sense disambiguation and the correctness of semantic dependency structures produced by OntoSem. The goal of evaluation is to determine the quality of TMRs but even more importantly to assign blame for various classes of errors, thus suggesting directions for improvement of both knowledge resources and processors. I will describe the OntoSem processing environment, the evaluation regime itself and results from our first evaluation effort.


 About the Speaker:

Sergei Nirenburg is Professor in the CSEE Department of the University of Maryland, Baltimore County and Director of its Institute for Language and Information Technologies (ILIT). Before coming to UMBC, Dr. Nirenburg was Director of the Computing Research Laboratory and Professor of Computer Science at New Mexico State University. Dr. Nirenburg has written or edited seven books and published over 130 articles in various areas of computational linguistics and artificial intelligence.
 For the colloquium series schedule, see the UMD Computational http://www.umiacs.umd.edu/research/CLIP/colloq/.  If you are interested in meeting with the speaker, please contact Doug <http://www.glue.umd.edu/~oard/>  Oard (oard@umiacs.umd.edu <mailto:oard@umiacs.umd.edu> ).