XML Configuration Reference : Linguistic : ChineseWordFinder
 
ChineseWordFinder
com.exalead.linguistic.v10.ChineseWordFinder
This class performs words detection for Chinese. Use with a Standard tokenizer.
Parent elements:
com.exalead.linguistic.v10.StandardTokenizer (as StandardTokenizer)
Attributes:
Name
Type
Default value
Description
resourceDir
string
URL for the resource (data://, file;// or resource://) if null, use standard resource of product.
addSimplified
boolean
Annotates each token with its simplified Chinese transliteration, when applicable.
addNGrams
boolean
Annotates with NGrams
minNGramLength
int
1
Minimum length of NGrams (1: monograms, 2: bigrams...)
maxNGramLength
int
2
Maximum length of NGrams (1: monograms, 2: bigrams...)