Package | Description |
---|---|
edu.mit.jmwe.data |
Provides the basic data structures used by the library and their default implementations.
|
edu.mit.jmwe.data.concordance |
Provides interfaces and classes for accessing data taken Semcor-formatted concordances, useful for benchmarking detectors.
|
edu.mit.jmwe.harness.result.error |
Provides error detectors to evaluate the results of a test harness run
|
Modifier and Type | Interface and Description |
---|---|
interface |
IMarkedSentence<T extends IToken>
A marked sentence is a sentence (i.e., a list of tokens) that has been tagged
with a unique id.
|
Modifier and Type | Interface and Description |
---|---|
interface |
IConcordanceSentence
A sentence drawn from the Semcor corpus.
|
Modifier and Type | Class and Description |
---|---|
class |
ConcordanceSentence
Default implementation of
ISemcorSentence |
Modifier and Type | Interface and Description |
---|---|
interface |
IErrorDetector
Error detectors look for errors made by an
IMWEDetector by analyzing the ISentenceResult obtained by
running the detector over a marked sentence using a test harness. |
Modifier and Type | Class and Description |
---|---|
class |
AbstractErrorDetector
Abstract base class implementation of the error detector interface that takes
care of the ID boilerplate.
|
class |
AllStopWords
Finds the multi-word expressions the detector failed to identify whose parts
are all stop words.
|
class |
CommonErrorDetector
This error detector looks for several types of common errors made by MWE detectors.
|
class |
CompositeErrorDetector
Error detectors extending this class are composed of multiple simpler error
detectors.
|
class |
DetectorDisagreement
Counts and stores those MWEs that were not found by the backing detector.
|
class |
ExtraPOS
Finds the proper noun multi-word expressions that were truncated incorrectly
because of a token with a possessive ending tag.
|
class |
ExtraPrep
Finds the proper noun multi-word expressions that were truncated incorrectly
because of a token with a prepositional tag.
|
class |
InflectionError
Finds the MWEs that do not have the same form as the MWE description they are
marked with.
|
class |
InflectionPatternError
Identifies those multi-word expressions that do not follow the inflection
patterns listed in the
InflectionRule enum. |
class |
InterstitialTokens
Counts and stores the multi-word expressions that have tokens that are separated
by one or more tokens in the sentence that are not also a part of the MWE.
|
class |
MissingFromIndex
Finds those MWEs that were not found because they do not appear in the given
index.
|
class |
OutOfOrder
Finds the proper noun multi-word expressions with one or more parts tagged as
something other than a proper noun that the
IMWEDetector failed to
identify |
class |
PNounLong
Finds the proper noun multi-word expressions detected by the MWE Detector
that are longer than they should be.
|
class |
PNounShort
Finds the proper noun multi-word expressions detected by the MWE detector
that are shorter than they should be.
|
class |
UntaggedPNoun
Finds the proper noun multi-word expressions with one or more parts tagged as
something other than a proper noun that the
IMWEDetector failed to
identify |
class |
VBDVBN
Finds the two-token MWEs that have the tag sequence VBD_VBN.
|
class |
WrongPOS
Finds the multi-word expressions detected by the
IMWEDetector that
have the right tokens but the wrong part of speech. |
Copyright © 2011 Massachusetts Institute of Technology. All Rights Reserved.