Package LBJ2.nlp.seg

The segmentation of sequences of words into semantically meaningful groups is a common NLP paradigm; this package aims to support such tasks in a general way.

See:
          Description

Class Summary
BIOTester This class may be used to produce a detailed report of the segment by segment performance of a given classifier on given labeled testing data.
PlainToTokenParser This parser takes the Words in the representation created by another Parser and creates a new representation consisting of Tokens.
SegmentTagPlain Use this command line program to produce textual segment annotations on an input text using a learned Word classifier.
Token Simple extension of the Word class from LBJ's NLP library.
 

Package LBJ2.nlp.seg Description

The segmentation of sequences of words into semantically meaningful groups is a common NLP paradigm; this package aims to support such tasks in a general way. Two examples of such a task, a chunker and a named entity tagger, have been implemented using the tools in this package and the classifiers in the LBJ2.nlp package.