public class CJKAnalyzer extends Analyzer

| Modifier and Type | Field and Description |
|---|---|
| static java.lang.String[] | STOP_WORDS<br>An array containing some common English words that are not usually useful for searching and some double-byte interpunctions. |

Fields inherited from class Analyzer: overridesTokenStreamMethod

| Constructor and Description |
|---|
| CJKAnalyzer()<br>Deprecated. Use CJKAnalyzer(Version) instead |
| CJKAnalyzer(java.lang.String[] stopWords)<br>Deprecated. Use CJKAnalyzer(Version, String[]) instead |
| CJKAnalyzer(Version matchVersion)<br>Builds an analyzer which removes words in STOP_WORDS. |
| CJKAnalyzer(Version matchVersion, java.lang.String[] stopWords)<br>Builds an analyzer which removes words in the provided array. |

| Modifier and Type | Method and Description |
|---|---|
| TokenStream | reusableTokenStream(java.lang.String fieldName, java.io.Reader reader)<br>Returns a (possibly reused) TokenStream which tokenizes all the text in the provided Reader. |
| TokenStream | tokenStream(java.lang.String fieldName, java.io.Reader reader)<br>Creates a TokenStream which tokenizes all the text in the provided Reader. |
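
Both methods return a TokenStream built from CJKTokenizer and filtered with StopFilter. As a rough picture of what such a pipeline does, the sketch below tokenizes text and then drops stop words using only the Java standard library. It is not Lucene code. The class `CjkBigramSketch`, its methods, and `SAMPLE_STOP_WORDS` are hypothetical names invented for illustration, and the tokenization rule (overlapping two-character tokens for CJK runs, lowercased whole words for other letters and digits) is a simplified assumption about CJKTokenizer's behavior, not a reproduction of it. The sample input uses Unicode escapes for "nihongo" (U+65E5 U+672C U+8A9E) and "Tokyo" (U+6771 U+4EAC).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Hypothetical stand-in for a tokenizer-plus-stop-filter chain; not part of Lucene. */
final class CjkBigramSketch {

    /** Hypothetical stop list; the real STOP_WORDS contents are not reproduced here. */
    static final String[] SAMPLE_STOP_WORDS = {"a", "and", "the", "of"};

    /** Treats Han, Hiragana, Katakana and Hangul code points as CJK (an assumption). */
    static boolean isCjk(int codePoint) {
        Character.UnicodeScript script = Character.UnicodeScript.of(codePoint);
        return script == Character.UnicodeScript.HAN
                || script == Character.UnicodeScript.HIRAGANA
                || script == Character.UnicodeScript.KATAKANA
                || script == Character.UnicodeScript.HANGUL;
    }

    /** Splits text into lowercased words and overlapping CJK bigrams. */
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        List<String> cjkRun = new ArrayList<>();   // one entry per CJK code point
        StringBuilder word = new StringBuilder();  // current non-CJK word
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            i += Character.charCount(cp);
            if (isCjk(cp)) {
                flushWord(word, tokens);
                cjkRun.add(new String(Character.toChars(cp)));
            } else if (Character.isLetterOrDigit(cp)) {
                flushCjk(cjkRun, tokens);
                word.appendCodePoint(Character.toLowerCase(cp));
            } else {
                // Whitespace and punctuation end whichever run is open.
                flushWord(word, tokens);
                flushCjk(cjkRun, tokens);
            }
        }
        flushWord(word, tokens);
        flushCjk(cjkRun, tokens);
        return tokens;
    }

    private static void flushWord(StringBuilder word, List<String> tokens) {
        if (word.length() > 0) {
            tokens.add(word.toString());
            word.setLength(0);
        }
    }

    private static void flushCjk(List<String> run, List<String> tokens) {
        if (run.size() == 1) {
            tokens.add(run.get(0));                    // a lone character stays a unigram
        }
        for (int i = 0; i + 1 < run.size(); i++) {
            tokens.add(run.get(i) + run.get(i + 1));   // overlapping bigrams
        }
        run.clear();
    }

    /** Drops every token that appears in the stop word array. */
    static List<String> removeStopWords(List<String> tokens, String[] stopWords) {
        Set<String> stops = new HashSet<>(Arrays.asList(stopWords));
        List<String> kept = new ArrayList<>();
        for (String token : tokens) {
            if (!stops.contains(token)) {
                kept.add(token);
            }
        }
        return kept;
    }

    /** Tokenize, then filter: the same two-stage shape as tokenizer plus stop filter. */
    static List<String> analyze(String text, String[] stopWords) {
        return removeStopWords(tokenize(text), stopWords);
    }

    public static void main(String[] args) {
        String text = "The history of \u65E5\u672C\u8A9E and \u6771\u4EAC";
        System.out.println(analyze(text, SAMPLE_STOP_WORDS));
    }
}
```

Passing a different array as the second argument to `analyze` mirrors the choice between the constructors that use STOP_WORDS and those that take a caller-supplied stop word array.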

Methods inherited from class Analyzer: close, getOffsetGap, getPositionIncrementGap, getPreviousTokenStream, setOverridesTokenStreamMethod, setPreviousTokenStream

public static final java.lang.String[] STOP_WORDS

public CJKAnalyzer()
Deprecated. Use CJKAnalyzer(Version) instead
Builds an analyzer which removes words in STOP_WORDS.

public CJKAnalyzer(Version matchVersion)
Builds an analyzer which removes words in STOP_WORDS.

public CJKAnalyzer(java.lang.String[] stopWords)
Deprecated. Use CJKAnalyzer(Version, String[]) instead
Builds an analyzer which removes words in the provided array.
Parameters: stopWords - stop word array

public CJKAnalyzer(Version matchVersion, java.lang.String[] stopWords)
Builds an analyzer which removes words in the provided array.
Parameters: stopWords - stop word array

public final TokenStream tokenStream(java.lang.String fieldName, java.io.Reader reader)
Creates a TokenStream which tokenizes all the text in the provided Reader.
Specified by: tokenStream in class Analyzer
Parameters: fieldName - lucene field name
reader - input Reader
Returns: A TokenStream built from CJKTokenizer, filtered with StopFilter

public final TokenStream reusableTokenStream(java.lang.String fieldName, java.io.Reader reader) throws java.io.IOException
Returns a (possibly reused) TokenStream which tokenizes all the text in the provided Reader.
Overrides: reusableTokenStream in class Analyzer
Parameters: fieldName - lucene field name
reader - Input Reader
Returns: A TokenStream built from CJKTokenizer, filtered with StopFilter
Throws: java.io.IOException

Copyright © 2000-2016 Apache Software Foundation. All Rights Reserved.