java.lang.Object
- org.apache.lucene.facet.taxonomy.directory.DirectoryTaxonomyWriter

All Implemented Interfaces:

java.io.Closeable, java.lang.AutoCloseable, TaxonomyWriter, TwoPhaseCommit
```
public class DirectoryTaxonomyWriter
extends java.lang.Object
implements TaxonomyWriter
```
TaxonomyWriter which uses a Directory to store the taxonomy information on disk, and keeps an additional in-memory cache of some or all categories.
In addition to the permanently-stored information in the Directory, efficiency dictates that we also keep an in-memory cache of recently seen or all categories, so that we do not need to go back to disk for every category addition to see which ordinal this category already has, if any. A TaxonomyWriterCache object determines the specific caching algorithm used.
This class offers some hooks for extending classes to control the IndexWriter instance that is used. See openIndexWriter(org.apache.lucene.store.Directory, org.apache.lucene.index.IndexWriterConfig).

Nested Class Summary

Nested Classes
Modifier and Type	Class	Description
`static class`	`DirectoryTaxonomyWriter.DiskOrdinalMap`	`DirectoryTaxonomyWriter.OrdinalMap` maintained on file system
`static class`	`DirectoryTaxonomyWriter.MemoryOrdinalMap`	`DirectoryTaxonomyWriter.OrdinalMap` maintained in memory
`static interface`	`DirectoryTaxonomyWriter.OrdinalMap`	Mapping from old ordinal to new ordinals, used when merging indexes wit separate taxonomies.

Field Summary

Fields
Modifier and Type Field Description

static java.lang.String INDEX_EPOCH
Property name of user commit data that contains the index epoch.

Constructor Summary

Constructors
Constructor	Description
`DirectoryTaxonomyWriter(Directory d)`	Create this with `OpenMode.CREATE_OR_APPEND`.
`DirectoryTaxonomyWriter(Directory directory, IndexWriterConfig.OpenMode openMode)`	Creates a new instance with a default cache as defined by `defaultTaxonomyWriterCache()`.
`DirectoryTaxonomyWriter(Directory directory, IndexWriterConfig.OpenMode openMode, TaxonomyWriterCache cache)`	Construct a Taxonomy writer.

Method Summary

All Methods Static Methods Instance Methods Concrete Methods
Modifier and Type	Method	Description
`int`	`addCategory(FacetLabel categoryPath)`	addCategory() adds a category with a given path name to the taxonomy, and returns its ordinal.
`void`	`addTaxonomy(Directory taxoDir, DirectoryTaxonomyWriter.OrdinalMap map)`	Takes the categories from the given taxonomy directory, and adds the missing ones to this taxonomy.
`void`	`close()`	Frees used resources as well as closes the underlying `IndexWriter`, which commits whatever changes made to it to the underlying `Directory`.
`void`	`commit()`	The second phase of a 2-phase commit.
`static TaxonomyWriterCache`	`defaultTaxonomyWriterCache()`	Defines the default `TaxonomyWriterCache` to use in constructors which do not specify one.
`java.util.Map<java.lang.String,java.lang.String>`	`getCommitData()`	Returns the commit user data map that was set on `TaxonomyWriter.setCommitData(Map)`.
`Directory`	`getDirectory()`	Returns the `Directory` of this taxonomy writer.
`int`	`getParent(int ordinal)`	getParent() returns the ordinal of the parent category of the category with the given ordinal.
`int`	`getSize()`	getSize() returns the number of categories in the taxonomy.
`long`	`getTaxonomyEpoch()`	Expert: returns current index epoch, if this is a near-real-time reader.
`void`	`prepareCommit()`	prepare most of the work needed for a two-phase commit.
`void`	`replaceTaxonomy(Directory taxoDir)`	Replaces the current taxonomy with the given one.
`void`	`rollback()`	Rollback changes to the taxonomy writer and closes the instance.
`void`	`setCacheMissesUntilFill(int i)`	Set the number of cache misses before an attempt is made to read the entire taxonomy into the in-memory cache.
`void`	`setCommitData(java.util.Map<java.lang.String,java.lang.String> commitUserData)`	Sets the commit user data map.
`static void`	`unlock(Directory directory)`	Forcibly unlocks the taxonomy in the named directory.

Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

- Field Detail
  - INDEX_EPOCH
```
public static final java.lang.String INDEX_EPOCH
```
    Property name of user commit data that contains the index epoch. The epoch changes whenever the taxonomy is recreated (i.e. opened with IndexWriterConfig.OpenMode.CREATE.
    Applications should not use this property in their commit data because it will be overridden by this taxonomy writer.
    
    See Also:
    
    Constant Field Values
- Constructor Detail
  - DirectoryTaxonomyWriter
```
public DirectoryTaxonomyWriter(Directory directory,
                               IndexWriterConfig.OpenMode openMode,
                               TaxonomyWriterCache cache)
                        throws java.io.IOException
```
    Construct a Taxonomy writer.
    
    Parameters:
    
    directory - The Directory in which to store the taxonomy. Note that the taxonomy is written directly to that directory (not to a subdirectory of it).
    
    openMode - Specifies how to open a taxonomy for writing: APPEND means open an existing index for append (failing if the index does not yet exist). CREATE means create a new index (first deleting the old one if it already existed). APPEND_OR_CREATE appends to an existing index if there is one, otherwise it creates a new index.
    
    cache - A TaxonomyWriterCache implementation which determines the in-memory caching policy. See for example LruTaxonomyWriterCache and Cl2oTaxonomyWriterCache. If null or missing, defaultTaxonomyWriterCache() is used.
    
    Throws:
    
    CorruptIndexException - if the taxonomy is corrupted.
    
    LockObtainFailedException - if the taxonomy is locked by another writer. If it is known that no other concurrent writer is active, the lock might have been left around by an old dead process, and should be removed using unlock(Directory).
    
    java.io.IOException - if another error occurred.
  - DirectoryTaxonomyWriter
```
public DirectoryTaxonomyWriter(Directory directory,
                               IndexWriterConfig.OpenMode openMode)
                        throws java.io.IOException
```
    Creates a new instance with a default cache as defined by defaultTaxonomyWriterCache().
    
    Throws:
    
    java.io.IOException
  - DirectoryTaxonomyWriter
```
public DirectoryTaxonomyWriter(Directory d)
                        throws java.io.IOException
```
    Create this with OpenMode.CREATE_OR_APPEND.
    
    Throws:
    
    java.io.IOException
- Method Detail
  - unlock
```
public static void unlock(Directory directory)
                   throws java.io.IOException
```
    Forcibly unlocks the taxonomy in the named directory.
    Caution: this should only be used by failure recovery code, when it is known that no other process nor thread is in fact currently accessing this taxonomy.
    This method is unnecessary if your Directory uses a NativeFSLockFactory instead of the default SimpleFSLockFactory. When the "native" lock is used, a lock does not stay behind forever when the process using it dies.
    
    Throws:
    
    java.io.IOException
  - defaultTaxonomyWriterCache
```
public static TaxonomyWriterCache defaultTaxonomyWriterCache()
```
    Defines the default TaxonomyWriterCache to use in constructors which do not specify one.
    The current default is Cl2oTaxonomyWriterCache constructed with the parameters (1024, 0.15f, 3), i.e., the entire taxonomy is cached in memory while building it.
  - close
```
public void close()
           throws java.io.IOException
```
    Frees used resources as well as closes the underlying IndexWriter, which commits whatever changes made to it to the underlying Directory.
    
    Specified by:
    
    close in interface java.lang.AutoCloseable
    
    Specified by:
    
    close in interface java.io.Closeable
    
    Throws:
    
    java.io.IOException
  - addCategory
```
public int addCategory(FacetLabel categoryPath)
                throws java.io.IOException
```
    Description copied from interface: TaxonomyWriter
    
    addCategory() adds a category with a given path name to the taxonomy, and returns its ordinal. If the category was already present in the taxonomy, its existing ordinal is returned.
    Before adding a category, addCategory() makes sure that all its ancestor categories exist in the taxonomy as well. As result, the ordinal of a category is guaranteed to be smaller then the ordinal of any of its descendants.
    
    Specified by:
    
    addCategory in interface TaxonomyWriter
    
    Throws:
    
    java.io.IOException
  - commit
```
public void commit()
            throws java.io.IOException
```
    Description copied from interface: TwoPhaseCommit
    
    The second phase of a 2-phase commit. Implementations should ideally do very little work in this method (following TwoPhaseCommit.prepareCommit(), and after it returns, the caller can assume that the changes were successfully committed to the underlying storage.
    
    Specified by:
    
    commit in interface TwoPhaseCommit
    
    Throws:
    
    java.io.IOException
  - setCommitData
```
public void setCommitData(java.util.Map<java.lang.String,java.lang.String> commitUserData)
```
    Description copied from interface: TaxonomyWriter
    
    Sets the commit user data map. That method is considered a transaction and will be committed even if no other changes were made to the writer instance.
    NOTE: the map is cloned internally, therefore altering the map's contents after calling this method has no effect.
    
    Specified by:
    
    setCommitData in interface TaxonomyWriter
  - getCommitData
```
public java.util.Map<java.lang.String,java.lang.String> getCommitData()
```
    Description copied from interface: TaxonomyWriter
    
    Returns the commit user data map that was set on TaxonomyWriter.setCommitData(Map).
    
    Specified by:
    
    getCommitData in interface TaxonomyWriter
  - prepareCommit
```
public void prepareCommit()
                   throws java.io.IOException
```
    prepare most of the work needed for a two-phase commit. See IndexWriter.prepareCommit().
    
    Specified by:
    
    prepareCommit in interface TwoPhaseCommit
    
    Throws:
    
    java.io.IOException
  - getSize
```
public int getSize()
```
    Description copied from interface: TaxonomyWriter
    
    getSize() returns the number of categories in the taxonomy.
    Because categories are numbered consecutively starting with 0, it means the taxonomy contains ordinals 0 through getSize()-1.
    Note that the number returned by getSize() is often slightly higher than the number of categories inserted into the taxonomy; This is because when a category is added to the taxonomy, its ancestors are also added automatically (including the root, which always get ordinal 0).
    
    Specified by:
    
    getSize in interface TaxonomyWriter
  - setCacheMissesUntilFill
```
public void setCacheMissesUntilFill(int i)
```
    Set the number of cache misses before an attempt is made to read the entire taxonomy into the in-memory cache.
    This taxonomy writer holds an in-memory cache of recently seen categories to speed up operation. On each cache-miss, the on-disk index needs to be consulted. When an existing taxonomy is opened, a lot of slow disk reads like that are needed until the cache is filled, so it is more efficient to read the entire taxonomy into memory at once. We do this complete read after a certain number (defined by this method) of cache misses.
    If the number is set to 0, the entire taxonomy is read into the cache on first use, without fetching individual categories first.
    NOTE: it is assumed that this method is called immediately after the taxonomy writer has been created.
  - getParent
```
public int getParent(int ordinal)
              throws java.io.IOException
```
    Description copied from interface: TaxonomyWriter
    
    getParent() returns the ordinal of the parent category of the category with the given ordinal.
    When a category is specified as a path name, finding the path of its parent is as trivial as dropping the last component of the path. getParent() is functionally equivalent to calling getPath() on the given ordinal, dropping the last component of the path, and then calling getOrdinal() to get an ordinal back.
    If the given ordinal is the ROOT_ORDINAL, an INVALID_ORDINAL is returned. If the given ordinal is a top-level category, the ROOT_ORDINAL is returned. If an invalid ordinal is given (negative or beyond the last available ordinal), an ArrayIndexOutOfBoundsException is thrown. However, it is expected that getParent will only be called for ordinals which are already known to be in the taxonomy. TODO (Facet): instead of a getParent(ordinal) method, consider having a
    getCategory(categorypath, prefixlen) which is similar to addCategory except it doesn't add new categories; This method can be used to get the ordinals of all prefixes of the given category, and it can use exactly the same code and cache used by addCategory() so it means less code.
    
    Specified by:
    
    getParent in interface TaxonomyWriter
    
    Throws:
    
    java.io.IOException
  - addTaxonomy
```
public void addTaxonomy(Directory taxoDir,
                        DirectoryTaxonomyWriter.OrdinalMap map)
                 throws java.io.IOException
```
    Takes the categories from the given taxonomy directory, and adds the missing ones to this taxonomy. Additionally, it fills the given DirectoryTaxonomyWriter.OrdinalMap with a mapping from the original ordinal to the new ordinal.
    
    Throws:
    
    java.io.IOException
  - rollback
```
public void rollback()
              throws java.io.IOException
```
    Rollback changes to the taxonomy writer and closes the instance. Following this method the instance becomes unusable (calling any of its API methods will yield an AlreadyClosedException).
    
    Specified by:
    
    rollback in interface TwoPhaseCommit
    
    Throws:
    
    java.io.IOException
  - replaceTaxonomy
```
public void replaceTaxonomy(Directory taxoDir)
                     throws java.io.IOException
```
    Replaces the current taxonomy with the given one. This method should generally be called in conjunction with IndexWriter.addIndexes(Directory...) to replace both the taxonomy as well as the search index content.
    
    Throws:
    
    java.io.IOException
  - getDirectory
```
public Directory getDirectory()
```
    Returns the Directory of this taxonomy writer.
  - getTaxonomyEpoch
```
public final long getTaxonomyEpoch()
```
    Expert: returns current index epoch, if this is a near-real-time reader. Used by DirectoryTaxonomyReader to support NRT.

Class DirectoryTaxonomyWriter

Nested Class Summary

Field Summary

Constructor Summary

Method Summary

Methods inherited from class java.lang.Object

Field Detail

INDEX_EPOCH

Constructor Detail

DirectoryTaxonomyWriter

DirectoryTaxonomyWriter

DirectoryTaxonomyWriter

Method Detail

unlock

defaultTaxonomyWriterCache

close

addCategory

commit

setCommitData

getCommitData

prepareCommit

getSize

setCacheMissesUntilFill

getParent

addTaxonomy

rollback

replaceTaxonomy

getDirectory

getTaxonomyEpoch