Package opennlp.tools.dictionary
Class Dictionary
- java.lang.Object
-
- opennlp.tools.dictionary.Dictionary
-
- All Implemented Interfaces:
java.lang.Iterable<StringList>, SerializableArtifact
public class Dictionary extends java.lang.Object implements java.lang.Iterable<StringList>, SerializableArtifact
This class is a dictionary of entries, where each entry is a StringList of one or more tokens.
-
-
Constructor Summary
Constructors
Constructor, Description
Dictionary()
    Initializes an empty Dictionary.
Dictionary(boolean caseSensitive)
Dictionary(java.io.InputStream in)
    Initializes the Dictionary from an existing dictionary resource.
-
Method Summary
Modifier and Type, Method, Description
java.util.Set<java.lang.String> asStringSet()
    Gets this dictionary as a Set<String>.
boolean contains(StringList tokens)
    Checks if this dictionary has the given entry.
boolean equals(java.lang.Object obj)
java.lang.Class<?> getArtifactSerializerClass()
    Gets the Serializer Class for Dictionary.
int getMaxTokenCount()
int getMinTokenCount()
int hashCode()
java.util.Iterator<StringList> iterator()
    Retrieves an Iterator over all entries.
static Dictionary parseOneEntryPerLine(java.io.Reader in)
    Reads a dictionary which has one entry per line.
void put(StringList tokens)
    Adds the tokens to the dictionary as one new entry.
void remove(StringList tokens)
    Removes the given tokens from the current instance.
void serialize(java.io.OutputStream out)
    Writes the current instance to the given OutputStream.
int size()
    Retrieves the number of entries in the current instance.
java.lang.String toString()
-
-
-
Constructor Detail
-
Dictionary
public Dictionary()
Initializes an empty Dictionary.
-
Dictionary
public Dictionary(boolean caseSensitive)
-
Dictionary
public Dictionary(java.io.InputStream in) throws java.io.IOException
Initializes the Dictionary from an existing dictionary resource.
- Parameters:
in - InputStream
- Throws:
java.io.IOException
-
-
Method Detail
-
put
public void put(StringList tokens)
Adds the tokens to the dictionary as one new entry.
- Parameters:
tokens - the new entry
-
getMinTokenCount
public int getMinTokenCount()
- Returns:
- minimum token count in the dictionary
-
getMaxTokenCount
public int getMaxTokenCount()
- Returns:
- maximum token count in the dictionary
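The two token-count getters can be pictured with a small stdlib-only sketch. It does not use OpenNLP: entries are modelled as plain lists of strings, `TokenCountSketch` and its methods are hypothetical names, and returning 0 for an empty collection is an assumption of the sketch, not documented behavior of `Dictionary`.

```java
import java.util.Arrays;
import java.util.List;

class TokenCountSketch {

    // Hypothetical helper: length of the shortest entry.
    // Returning 0 when there are no entries is an assumption of this sketch.
    static int minTokenCount(List<List<String>> entries) {
        return entries.stream().mapToInt(List::size).min().orElse(0);
    }

    // Hypothetical helper: length of the longest entry (0 if there are none).
    static int maxTokenCount(List<List<String>> entries) {
        return entries.stream().mapToInt(List::size).max().orElse(0);
    }

    public static void main(String[] args) {
        List<List<String>> entries = Arrays.asList(
                Arrays.asList("New", "York"),
                Arrays.asList("Berlin"),
                Arrays.asList("San", "Francisco", "Bay"));
        // prints "min=1, max=3"
        System.out.println("min=" + minTokenCount(entries)
                + ", max=" + maxTokenCount(entries));
    }
}
```

A caller that scans text for dictionary matches could use these two bounds to limit the window sizes it tries.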
-
contains
public boolean contains(StringList tokens)
Checks if this dictionary has the given entry.
- Parameters:
tokens - query
- Returns:
- true if it contains the entry, otherwise false
-
remove
public void remove(StringList tokens)
Removes the given tokens from the current instance.
- Parameters:
tokens - filter tokens
-
iterator
public java.util.Iterator<StringList> iterator()
Retrieves an Iterator over all entries. Each element is the StringList of tokens that makes up one entry.
- Specified by:
iterator
in interface java.lang.Iterable<StringList>
- Returns:
- entry Iterator
-
size
public int size()
Retrieves the number of entries in the current instance.
- Returns:
- number of entries
-
serialize
public void serialize(java.io.OutputStream out) throws java.io.IOException
Writes the current instance to the given OutputStream.
- Parameters:
out - OutputStream
- Throws:
java.io.IOException
-
equals
public boolean equals(java.lang.Object obj)
- Overrides:
equals
in class java.lang.Object
-
hashCode
public int hashCode()
- Overrides:
hashCode
in class java.lang.Object
-
toString
public java.lang.String toString()
- Overrides:
toString
in class java.lang.Object
-
parseOneEntryPerLine
public static Dictionary parseOneEntryPerLine(java.io.Reader in) throws java.io.IOException
Reads a dictionary which has one entry per line. The tokens inside an entry are whitespace delimited.
- Parameters:
in - Reader
- Returns:
- the parsed dictionary
- Throws:
java.io.IOException
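The one-entry-per-line format can be illustrated with a stdlib-only sketch that splits each line into whitespace-delimited tokens. This is not the library's implementation: `OneEntryPerLineSketch` and `parseEntries` are hypothetical names, the result is a plain list of token lists instead of a Dictionary, and skipping blank lines is an assumption of the sketch.

```java
import java.io.BufferedReader;
import java.io.Reader;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class OneEntryPerLineSketch {

    // Hypothetical helper, not part of OpenNLP: one entry per line,
    // tokens inside an entry separated by whitespace.
    static List<List<String>> parseEntries(Reader in) {
        List<List<String>> entries = new ArrayList<>();
        // BufferedReader.lines() reports read failures as UncheckedIOException.
        new BufferedReader(in).lines().forEach(line -> {
            String trimmed = line.trim();
            if (!trimmed.isEmpty()) { // assumption: blank lines carry no entry
                entries.add(Arrays.asList(trimmed.split("\\s+")));
            }
        });
        return entries;
    }

    public static void main(String[] args) {
        String text = "New York\nBerlin\nSan Francisco Bay\n";
        for (List<String> entry : parseEntries(new StringReader(text))) {
            System.out.println(entry.size() + " token(s): " + entry);
        }
    }
}
```

With this input the sketch yields three entries of two, one, and three tokens, which is the shape of data the method description implies.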
-
asStringSet
public java.util.Set<java.lang.String> asStringSet()
Gets this dictionary as a Set<String>. Only the iterator(), size() and contains(Object) methods are implemented. If the entries of this dictionary consist of multiple tokens, only the first token of each entry will be part of the Set.
- Returns:
- a Set containing the entries of this dictionary
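The first-token rule can be sketched without OpenNLP as follows. `FirstTokenSetSketch` and `firstTokens` are hypothetical names, and the sketch builds an independent copy, whereas the method above returns a Set that supports only a few operations. How the library's Set treats entries that share a first token is not stated above; this copy simply collapses them.

```java
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

class FirstTokenSetSketch {

    // Hypothetical helper: keeps only the first token of every entry.
    static Set<String> firstTokens(List<List<String>> entries) {
        Set<String> result = new LinkedHashSet<>();
        for (List<String> entry : entries) {
            if (!entry.isEmpty()) {
                result.add(entry.get(0));
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<List<String>> entries = Arrays.asList(
                Arrays.asList("New", "York"),
                Arrays.asList("Berlin"),
                Arrays.asList("New", "Jersey"));
        Set<String> view = firstTokens(entries);
        // "York" and "Jersey" are not first tokens, so they are absent.
        System.out.println(view.contains("New") + " " + view.contains("York"));
    }
}
```

The practical consequence is that a Set obtained this way is mainly useful for dictionaries whose entries are single tokens.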
-
getArtifactSerializerClass
public java.lang.Class<?> getArtifactSerializerClass()
Gets the Serializer Class for Dictionary.
- Specified by:
getArtifactSerializerClass
in interface SerializableArtifact
- Returns:
DictionarySerializer
-
-