Package edu.mit.jmwe.data

Provides the basic data structures used by the library and their default implementations.


Interface Summary
IHasForm Items that have a textual surface form implement this interface.
IHasID An object with a unique id.
IHasMWEPOS An object that has one of the parts of speech enumerated in MWEPOS.
IInfMWEDesc A description of an inflected form of a multi-word expression (MWE).
IInfMWEDesc.IInfPart A part of an inflected multi-word expression.
IMarkedSentence<T extends IToken> A marked sentence is a sentence (i.e., a list of tokens) that has been tagged with a unique id.
IMWE<T extends IToken> A multi-word expression found in a list of tokens.
IMWEDesc An MWE description consisting of an IMWEDescID, list of parts, and counts relating to the MWE's appearance in a reference concordance.
IMWEDesc.IPart A part of a multi-word expression.
IMWEDescID An MWE description ID that can be used to retrieve the MWE's lemma, part of speech, and inflected form.
IRootMWEDesc A description of a multi-word expression (MWE).
IRootMWEDesc.IRootPart A part of a root multi-word expression.
IToken A token that is a constituent of a sentence.
 

Class Summary
AbstractMWEDesc<P extends IMWEDesc.IPart> A base class for MWE descriptions that can be used to construct a description from some combination of: a surface form, a list of parts, and counts relating to the MWE's appearance in a reference concordance.
InfMWEDesc Default implementation of the IInfMWEDesc interface.
InfMWEDesc.InfMWEDescBuilder A builder for inflected MWE description objects.
MWE<T extends IToken> Default implementation of the IMWE interface.
MWEComparator<T extends IToken> A comparator that compares IMWEs by checking which MWE starts earlier in the list of tokens used to construct this comparator.
MWEDescID Default implementation of IMWEDescID.
RootMWEDesc Default implementation of the IRootMWEDesc interface.
StopWords Standard list of stop words, taken from the Apache Lucene library.
Token Default implementation of the IToken interface.
 

Enum Summary
MWEPOS Represents the part of speech of a multi-word expression.
 

Package edu.mit.jmwe.data Description

Provides the basic data structures used by the library and their default implementations.
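
The ordering idea behind MWEComparator (an MWE that starts earlier in the token list sorts first) can be illustrated with a minimal, self-contained sketch. The sketch does not use the library's classes: SketchToken, SketchMwe, byStartPosition, and orderedLabels are hypothetical names introduced only for this illustration, and the real IToken, IMWE, and MWEComparator signatures may differ. It assumes that an MWE holds references to the same token objects that make up the sentence.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.IdentityHashMap;
import java.util.List;
import java.util.Map;

public class MweOrderingSketch {

    /** Hypothetical stand-in for a token: a surface form plus a tag. */
    static final class SketchToken {
        final String form;
        final String tag;

        SketchToken(String form, String tag) {
            this.form = form;
            this.tag = tag;
        }
    }

    /** Hypothetical stand-in for an MWE: a label plus the sentence tokens it covers. */
    static final class SketchMwe {
        final String label;
        final List<SketchToken> tokens;

        SketchMwe(String label, SketchToken... tokens) {
            this.label = label;
            this.tokens = Arrays.asList(tokens);
        }
    }

    /** Orders MWEs by the sentence position of their first token. */
    static Comparator<SketchMwe> byStartPosition(List<SketchToken> sentence) {
        // Index tokens by identity so that repeated surface forms stay distinct.
        Map<SketchToken, Integer> position = new IdentityHashMap<>();
        for (int i = 0; i < sentence.size(); i++) {
            position.put(sentence.get(i), i);
        }
        return Comparator.comparingInt(
                (SketchMwe mwe) -> position.get(mwe.tokens.get(0)));
    }

    /** Builds a small example sentence and returns the MWE labels in sentence order. */
    static List<String> orderedLabels() {
        SketchToken she = new SketchToken("She", "PRP");
        SketchToken looked = new SketchToken("looked", "VBD");
        SketchToken up = new SketchToken("up", "RP");
        SketchToken the = new SketchToken("the", "DT");
        SketchToken word = new SketchToken("word", "NN");
        SketchToken and = new SketchToken("and", "CC");
        SketchToken wrote = new SketchToken("wrote", "VBD");
        SketchToken it = new SketchToken("it", "PRP");
        SketchToken down = new SketchToken("down", "RP");
        List<SketchToken> sentence =
                Arrays.asList(she, looked, up, the, word, and, wrote, it, down);

        // Deliberately listed out of order; "write_down" is discontinuous.
        List<SketchMwe> mwes = new ArrayList<>();
        mwes.add(new SketchMwe("write_down", wrote, down));
        mwes.add(new SketchMwe("look_up", looked, up));

        mwes.sort(byStartPosition(sentence));

        List<String> labels = new ArrayList<>();
        for (SketchMwe mwe : mwes) {
            labels.add(mwe.label);
        }
        return labels;
    }

    public static void main(String[] args) {
        System.out.println(orderedLabels()); // prints [look_up, write_down]
    }
}
```

Keying the position map by object identity rather than by surface form is a design choice of this sketch: a sentence can contain the same word twice, and only identity distinguishes the two occurrences.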



Copyright © 2011 Massachusetts Institute of Technology. All Rights Reserved.