public class NameFinderME extends java.lang.Object implements TokenNameFinder
Modifier and Type | Field and Description |
---|---|
static java.lang.String | CONTINUE |
static int | DEFAULT_BEAM_SIZE |
static java.lang.String | OTHER |
static java.lang.String | START |
Constructor and Description |
---|
NameFinderME(TokenNameFinderModel model) |
NameFinderME(TokenNameFinderModel model, AdaptiveFeatureGenerator generator, int beamSize) Deprecated. The beam size is now configured at training time in the trainer parameter file via beamSearch.beamSize. |
NameFinderME(TokenNameFinderModel model, AdaptiveFeatureGenerator generator, int beamSize, SequenceValidator<java.lang.String> sequenceValidator) Deprecated. The beam size is now configured at training time in the trainer parameter file via beamSearch.beamSize. |
NameFinderME(TokenNameFinderModel model, int beamSize) Deprecated. The beam size is now configured at training time in the trainer parameter file via beamSearch.beamSize. |
Modifier and Type | Method and Description |
---|---|
void | clearAdaptiveData() Forgets all adaptive data which was collected during previous calls to one of the find methods. |
static Span[] | dropOverlappingSpans(Span[] spans) Removes spans which intersect or cross in any way. |
Span[] | find(java.lang.String[] tokens) Generates name tags for the given sequence, typically a sentence, returning token spans for any identified names. |
Span[] | find(java.lang.String[] tokens, java.lang.String[][] additionalContext) Generates name tags for the given sequence, typically a sentence, returning token spans for any identified names. |
double[] | probs() Returns an array with the probabilities of the last decoded sequence. |
void | probs(double[] probs) Populates the specified array with the probabilities of the last decoded sequence. |
double[] | probs(Span[] spans) Returns an array with one probability per specified span: the arithmetic mean of the probabilities of the outcomes that make up the span. |
static TokenNameFinderModel | train(java.lang.String languageCode, java.lang.String type, ObjectStream<NameSample> samples, java.util.Map<java.lang.String,java.lang.Object> resources) Deprecated. |
static TokenNameFinderModel | train(java.lang.String languageCode, java.lang.String type, ObjectStream<NameSample> samples, TrainingParameters trainParams, AdaptiveFeatureGenerator generator, java.util.Map<java.lang.String,java.lang.Object> resources) Deprecated. |
static TokenNameFinderModel | train(java.lang.String languageCode, java.lang.String type, ObjectStream<NameSample> samples, TrainingParameters trainParams, byte[] featureGeneratorBytes, java.util.Map<java.lang.String,java.lang.Object> resources) Deprecated. |
static TokenNameFinderModel | train(java.lang.String languageCode, java.lang.String type, ObjectStream<NameSample> samples, TrainingParameters trainParams, TokenNameFinderFactory factory) |
public static final int DEFAULT_BEAM_SIZE
public static final java.lang.String START
public static final java.lang.String CONTINUE
public static final java.lang.String OTHER
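The page does not say how START, CONTINUE and OTHER are used. The sketch below assumes they are outcome-label suffixes ("start", "cont", "other") in a scheme such as "person-start", "person-cont", "other", and shows how such a tag sequence could be turned into token spans. The class name, the literal values and the {start, end} pair representation are illustrative assumptions, not the library's implementation.

```java
import java.util.ArrayList;
import java.util.List;

class OutcomeDecodingSketch {
    // Assumed label suffixes; they stand in for the START, CONTINUE and OTHER constants.
    static final String START = "start";
    static final String CONTINUE = "cont";
    static final String OTHER = "other";

    /** Returns {start, end} token index pairs, with end exclusive. */
    static List<int[]> decode(String[] outcomes) {
        List<int[]> spans = new ArrayList<>();
        int begin = -1; // index where the open name began, or -1 if no name is open
        for (int i = 0; i < outcomes.length; i++) {
            String outcome = outcomes[i];
            if (outcome.endsWith(START)) {
                // A new name starts here; close the previous one if it is still open.
                if (begin != -1) {
                    spans.add(new int[] {begin, i});
                }
                begin = i;
            } else if (outcome.endsWith(OTHER)) {
                // A non-name token closes any open name.
                if (begin != -1) {
                    spans.add(new int[] {begin, i});
                }
                begin = -1;
            }
            // A CONTINUE outcome extends the open name, so there is nothing to do.
        }
        if (begin != -1) {
            spans.add(new int[] {begin, outcomes.length});
        }
        return spans;
    }
}
```

In this sketch a CONTINUE outcome with no open name is silently ignored; a stricter decoder could reject such a sequence instead.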
public NameFinderME(TokenNameFinderModel model)
@Deprecated public NameFinderME(TokenNameFinderModel model, AdaptiveFeatureGenerator generator, int beamSize, SequenceValidator<java.lang.String> sequenceValidator)
Parameters:
model -
beamSize -
@Deprecated public NameFinderME(TokenNameFinderModel model, AdaptiveFeatureGenerator generator, int beamSize)
@Deprecated public NameFinderME(TokenNameFinderModel model, int beamSize)
public Span[] find(java.lang.String[] tokens)
Description copied from interface: TokenNameFinder
Generates name tags for the given sequence, typically a sentence, returning token spans for any identified names.
Specified by: find in interface TokenNameFinder
Parameters:
tokens - an array of the tokens or words of the sequence, typically a sentence.
public Span[] find(java.lang.String[] tokens, java.lang.String[][] additionalContext)
Parameters:
tokens - an array of the tokens or words of the sequence, typically a sentence.
additionalContext - features which are based on context outside of the sentence but which should also be used.
public void clearAdaptiveData()
Specified by: clearAdaptiveData in interface TokenNameFinder
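Because clearAdaptiveData() forgets what earlier find calls collected, one plausible calling pattern is to tag a document sentence by sentence and reset once the document is finished. The sketch below illustrates that loop against a hypothetical Finder interface that only mirrors the two method names; the per-document reset is an assumption about typical usage, not something this page states.

```java
import java.util.ArrayList;
import java.util.List;

class AdaptiveDataSketch {
    /** Hypothetical stand-in exposing only the two methods discussed here. */
    interface Finder {
        int[][] find(String[] tokens); // {start, end} pairs instead of Span[]
        void clearAdaptiveData();
    }

    /** Tags every sentence of every document and resets adaptive state after each document. */
    static List<int[][]> tagDocuments(Finder finder, List<List<String[]>> documents) {
        List<int[][]> results = new ArrayList<>();
        for (List<String[]> document : documents) {
            for (String[] sentence : document) {
                // Adaptive data from earlier sentences of the same document stays available.
                results.add(finder.find(sentence));
            }
            // Assumed policy: nothing learned from one document carries over to the next.
            finder.clearAdaptiveData();
        }
        return results;
    }
}
```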
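public void probs(double[] probs)
Populates the specified array with the probabilities of the last decoded sequence, as determined by the previous call to find. The specified array should be at least as large as the number of tokens in the previous call to find.
Parameters:
probs - An array used to hold the probabilities of the last decoded sequence.
public double[] probs()
Returns an array with the probabilities of the last decoded sequence, as determined by find when it was last called.
public double[] probs(Span[] spans)
Returns, for each of the specified spans, the arithmetic mean of the probabilities of the outcomes that make up the span.
Parameters:
spans - The spans of the names for which probabilities are desired.
public static TokenNameFinderModel train(java.lang.String languageCode, java.lang.String type, ObjectStream<NameSample> samples, TrainingParameters trainParams, TokenNameFinderFactory factory) throws java.io.IOException
Throws:
java.io.IOException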
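For probs(Span[] spans), the per-span value is described as the arithmetic mean of the probabilities of the outcomes in the span. A minimal sketch of that arithmetic follows; it assumes spans are {start, end} token index pairs with an exclusive end and that each span covers at least one token. The class and method names are hypothetical.

```java
class SpanProbabilitySketch {
    /**
     * tokenProbs[i] is the probability of the outcome chosen for token i.
     * Each span is a {start, end} pair with end exclusive and must be non-empty.
     */
    static double[] spanProbs(double[] tokenProbs, int[][] spans) {
        double[] result = new double[spans.length];
        for (int s = 0; s < spans.length; s++) {
            int start = spans[s][0];
            int end = spans[s][1];
            double sum = 0.0;
            for (int i = start; i < end; i++) {
                sum += tokenProbs[i];
            }
            // Arithmetic mean over the tokens covered by the span.
            result[s] = sum / (end - start);
        }
        return result;
    }
}
```

For token probabilities {0.9, 0.8, 0.6, 0.5}, a span covering the first two tokens would average 0.9 and 0.8.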
@Deprecated public static TokenNameFinderModel train(java.lang.String languageCode, java.lang.String type, ObjectStream<NameSample> samples, TrainingParameters trainParams, AdaptiveFeatureGenerator generator, java.util.Map<java.lang.String,java.lang.Object> resources) throws java.io.IOException
Deprecated. Use train(String, String, ObjectStream, TrainingParameters, TokenNameFinderFactory) instead.
Parameters:
languageCode - the language of the training data
type - null or an override type for all types in the training data
samples - the training data
trainParams - machine learning train parameters
generator - null or the feature generator
resources - the resources for the name finder or null if none
Throws:
java.io.IOException
@Deprecated public static TokenNameFinderModel train(java.lang.String languageCode, java.lang.String type, ObjectStream<NameSample> samples, TrainingParameters trainParams, byte[] featureGeneratorBytes, java.util.Map<java.lang.String,java.lang.Object> resources) throws java.io.IOException
Deprecated. Use train(String, String, ObjectStream, TrainingParameters, TokenNameFinderFactory) instead.
Parameters:
languageCode - the language of the training data
type - null or an override type for all types in the training data
samples - the training data
trainParams - machine learning train parameters
featureGeneratorBytes - descriptor to configure the feature generation or null
resources - the resources for the name finder or null if none
Throws:
java.io.IOException
@Deprecated public static TokenNameFinderModel train(java.lang.String languageCode, java.lang.String type, ObjectStream<NameSample> samples, java.util.Map<java.lang.String,java.lang.Object> resources) throws java.io.IOException
Deprecated. Use train(String, String, ObjectStream, TrainingParameters, TokenNameFinderFactory) instead.
Throws:
java.io.IOException
public static Span[] dropOverlappingSpans(Span[] spans)
The following rules are used to remove the spans:
Identical spans: The first span in the array after sorting it remains
Intersecting spans: The first span after sorting remains
Contained spans: All spans which are contained by another are removed
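The three rules can be illustrated with a sort followed by a single pass that keeps a span only if it does not intersect the last kept one. This is a sketch under assumptions: TokenSpan is a hypothetical stand-in for Span with an exclusive end index, and the sort order (start ascending, longer span first on ties) is assumed rather than taken from this page.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class SpanOverlapSketch {
    /** Hypothetical stand-in for Span: token indices, end exclusive. */
    static final class TokenSpan {
        final int start;
        final int end;

        TokenSpan(int start, int end) {
            this.start = start;
            this.end = end;
        }

        /** True if the two half-open ranges share at least one token. */
        boolean intersects(TokenSpan other) {
            return start < other.end && other.start < end;
        }
    }

    static TokenSpan[] dropOverlapping(TokenSpan[] spans) {
        TokenSpan[] sorted = spans.clone();
        // Assumed ordering: by start ascending, longer span first when starts are equal.
        Arrays.sort(sorted, (a, b) -> a.start != b.start
                ? Integer.compare(a.start, b.start)
                : Integer.compare(b.end, a.end));

        List<TokenSpan> kept = new ArrayList<>();
        TokenSpan last = null;
        for (TokenSpan span : sorted) {
            if (last != null && last.intersects(span)) {
                // Identical, intersecting or contained: the earlier span in sort order remains.
                continue;
            }
            kept.add(span);
            last = span;
        }
        return kept.toArray(new TokenSpan[0]);
    }
}
```

Comparing each span only against the last kept span is enough here because the input is sorted by start index, so any span that overlaps an earlier kept span also overlaps the most recent one.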
Parameters:
spans -
Copyright © 2010 - 2020 Adobe. All Rights Reserved