Class CJKAnalyzer

    • Field Detail

      • DEFAULT_STOPWORD_FILE

        public static final java.lang.String DEFAULT_STOPWORD_FILE
        File containing default CJK stopwords.

        Currently it contains some common English words that are not usually useful for searching and some double-byte interpunctions.

        See Also:
        Constant Field Values
    • Constructor Detail

      • CJKAnalyzer

        public CJKAnalyzer​(Version matchVersion,
                           CharArraySet stopwords)
        Builds an analyzer with the given stop words
        Parameters:
        matchVersion - lucene compatibility version
        stopwords - a stopword set
    • Method Detail

      • getDefaultStopSet

        public static CharArraySet getDefaultStopSet()
        Returns an unmodifiable instance of the default stop-words set.
        Returns:
        an unmodifiable instance of the default stop-words set.