Package org.apache.lucene.analysis.cn
Analyzer for Chinese, which indexes unigrams (individual chinese characters).
Three analyzers are provided for Chinese, each of which treats Chinese text in a different way.
- StandardAnalyzer: Index unigrams (individual Chinese characters) as a token.
 - CJKAnalyzer (in the analyzers/cjk package): Index bigrams (overlapping groups of two adjacent Chinese characters) as tokens.
 - SmartChineseAnalyzer (in the analyzers/smartcn package): Index words (attempt to segment Chinese text into words) as tokens.
 
- StandardAnalyzer: 我-是-中-国-人
 - CJKAnalyzer: 我是-是中-中国-国人
 - SmartChineseAnalyzer: 我-是-中国-人
 
- 
Class Summary Class Description ChineseAnalyzer Deprecated. (3.1) UseStandardAnalyzerinstead, which has the same functionality.ChineseFilter Deprecated. (3.1) UseStopFilterinstead, which has the same functionality.ChineseFilterFactory Deprecated. UseStopFilterFactoryinstead.ChineseTokenizer Deprecated. (3.1) UseStandardTokenizerinstead, which has the same functionality.ChineseTokenizerFactory Deprecated. UseStandardTokenizerFactoryinstead.