Class KStemFilter

  • All Implemented Interfaces:
    java.io.Closeable, java.lang.AutoCloseable

    public final class KStemFilter
    extends TokenFilter
    A high-performance kstem filter for english.

    See "Viewing Morphology as an Inference Process" (Krovetz, R., Proceedings of the Sixteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 191-203, 1993).

    All terms must already be lowercased for this filter to work correctly.

    Note: This filter is aware of the KeywordAttribute. To prevent certain terms from being passed to the stemmer KeywordAttribute.isKeyword() should be set to true in a previous TokenStream. Note: For including the original term as well as the stemmed version, see KeywordRepeatFilterFactory

    • Constructor Detail

    • Method Detail

      • incrementToken

        public boolean incrementToken()
                               throws java.io.IOException
        Returns the next, stemmed, input Token.
        Specified by:
        incrementToken in class TokenStream
        Returns:
        The stemmed form of a token.
        Throws:
        java.io.IOException - If there is a low-level I/O error.