UMIACS Computational Linguistics Colloquium,
April 29, 1998
Probabilistic Best-First Chart Parsing, or
Parsing Fast by Parsing Smart
Eugene Charniak
Brown University
UMIACS Computational Linguistics Colloquium
April 29, 1998,
4pm, AVW Room 4406
We consider the use of context-free grammars (CFGs) for parsing
(assigning syntactic structure to) English sentences. There are
well-known CFG parsing algorithms that can parse in n-cubed time,
where n is the length of the sentence. Unfortunately, this is not
nearly efficient enough when dealing with large grammars and longish
sentences. In this talk we describe some new techniques devised by a
group of us at Brown for dramatically reducing the amount of work
necessary to find a parse for a sentence using a probabilistic CFG (a
CFG in which each rule is assigned a probability). The key idea is to
use the probabilistic information to order the parsing steps so as to
guide the parser toward a parse without most of the usual hunting and
pecking. We describe three such parsers, each using more information
and achieving correspondingly better results than the last.
For the colloquium series schedule, see the UMD
Computational Linguistics Colloquium Series web page at
http://umiacs.umd.edu/~resnik/cl_colloquium/. If you are interested
in meeting with the speaker, please contact Mari Broman Olsen (molsen@umiacs.umd.edu) or Philip Resnik (resnik@umiacs.umd.edu).