edu.mit.csail.brill
Class BrillTagger

java.lang.Object
  |
  +--edu.mit.csail.brill.BrillTagger

public class BrillTagger
extends Object

The Java interface to Eric Brill's part-of-speech tagger. It is written as a wrapper around BTagger, the raw JNI interface around the original tagger.

Here is the complete tagset:

  1. CC Coordinating conjunction
  2. CD Cardinal number
  3. DT Determiner
  4. EX Existential there
  5. FW Foreign word
  6. IN Preposition/subord. conjunction
  7. JJ Adjective
  8. JJR Adjective, comparative
  9. JJS Adjective, superlative
  10. LS List item marker
  11. MD Modal
  12. NN Noun, singular or mass
  13. NNS Noun, plural
  14. NNP Proper noun, singular
  15. NNPS Proper noun, plural
  16. PDT Predeterminer
  17. POS Possessive ending
  18. PRP Personal pronoun
  19. PRP$ Possessive pronoun
  20. RB Adverb
  21. RBR Adverb, comparative
  22. RBS Adverb, superlative
  23. RP Particle
  24. SYM Symbol (mathematical or scientific)
  25. TO to
  26. UH Interjection
  27. VB Verb, base form
  28. VBD Verb, past tense
  29. VBG Verb, gerund/present participle
  30. VBN Verb, past participle
  31. VBP Verb, non-3rd ps. sing. present
  32. VBZ Verb, 3rd ps. sing. present
  33. WDT wh-determiner
  34. WP wh-pronoun
  35. WP$ Possessive wh-pronoun
  36. WRB wh-adverb
  37. # Pound sign
  38. $ Dollar sign
  39. . Sentence-final punctuation
  40. , Comma
  41. : Colon, semi-colon
  42. ( Left bracket character
  43. ) Right bracket character
  44. " Straight double quote
  45. ` Left open single quote
  46. " Left open double quote
  47. ' Right close single quote
  48. " Right close double quote

Author:
Jimmy Lin

Constructor Summary
BrillTagger()
          Creates an instance of Brill's part-of-speech tagger.
 
Method Summary
 void destroy()
          Destroys the tagger.
static void main(String[] arg)
           
 void setup()
          Initializes the tagger.
 edu.mit.csail.brill.Sentence tag(String sentence)
          Part-of-speech tags a sentence.
static String tokenize(String s)
           
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

BrillTagger

public BrillTagger()
Creates an instance of Brill's part-of-speech tagger.

Method Detail

setup

public void setup()
Initializes the tagger.


destroy

public void destroy()
Destroys the tagger.


tag

public edu.mit.csail.brill.Sentence tag(String sentence)
Part-of-speech tags a sentence.

Parameters:
sentence - the sentence to tag
Returns:
TaggedToken[] an array containing the tag output

tokenize

public static String tokenize(String s)

main

public static void main(String[] arg)
                 throws IOException
IOException