Package LBJ2.nlp

This package implements parsers, data structures, pre-processing algorithms, and common feature-extracting classifiers (written in LBJ) that are useful for natural language processing.

Class Summary
Affixes This class implements a classifier that takes a Word as input and generates features representing the prefixes and suffixes of the input word.
Capitalization This class implements a classifier that takes a Word as input and generates Boolean features representing the capitalizations of the words in a [-2, +2] window around the input word.
ColumnFormat This parser returns arrays of Strings representing the rows of a file in column format.
Forms This class implements a classifier that takes a Word as input and generates features representing the forms of the words in a [-2, +2] window around the input word.
NLDocument Use this class to represent a natural language document.
POS This class converts the String names of POS tags into discrete integer values.
POSBracketToVector Use this parser to return LinkedVector objects representing sentences given file names of POS bracket form files to parse.
POSBracketToWord Deprecated. As of LBJ release 2.0.4, the functionality of this class has been superseded by the ChildrenFromVectors parser used in conjunction with POSBracketToVector.
Sentence This representation of a sentence simply stores the entire text of the sentence in a string.
SentenceSplitter Use this class to extract sentences from plain text.
Word Implementation of a word for natural language processing.
WordSplitter This parser takes the plain, unannotated Sentences returned by another parser (e.g., SentenceSplitter) and splits them into Word objects.
WordTypeInformation This class implements a classifier that takes a Word as input and generates Boolean features representing interesting information about the forms of the words in a [-2, +2] window around the input word.
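
Several of the classifiers above (Capitalization, Forms, WordTypeInformation) generate features over a [-2, +2] window around the input word. The following is a minimal, self-contained sketch of that windowing idea only. It does not use the LBJ2.nlp classes: the class name WindowCapitalizationSketch, the method capitalizationFeatures, the "offset:value" feature naming, and the choice to skip positions outside the sentence are all assumptions made for illustration, and the actual classifiers may name and handle these features differently.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative only: a hypothetical stand-in, not part of LBJ2.nlp.
class WindowCapitalizationSketch {
    /**
     * Returns one Boolean-valued feature per word in the [-2, +2] window
     * around words[index], encoded as "offset:isCapitalized".
     * Assumption: positions outside the sentence yield no feature.
     */
    static List<String> capitalizationFeatures(String[] words, int index) {
        List<String> features = new ArrayList<>();
        for (int offset = -2; offset <= 2; offset++) {
            int position = index + offset;
            if (position < 0 || position >= words.length) {
                continue; // window extends past the sentence boundary
            }
            String word = words[position];
            boolean capitalized =
                !word.isEmpty() && Character.isUpperCase(word.charAt(0));
            features.add(offset + ":" + capitalized);
        }
        return features;
    }

    public static void main(String[] args) {
        String[] sentence = {"Yesterday", "Dan", "visited", "Urbana", "again"};
        // Features for the middle word, "visited".
        System.out.println(capitalizationFeatures(sentence, 2));
        // prints [-2:true, -1:true, 0:false, 1:true, 2:false]
    }
}
```

Keeping the offset in the feature name lets a learner distinguish "the previous word is capitalized" from "the next word is capitalized", which is presumably why window-based classifiers of this kind emit one feature per position.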
 
