XML Configuration Reference : Linguistic : JapaneseWordFinder
 
JapaneseWordFinder
com.exalead.linguistic.v10.JapaneseWordFinder
Performs words detection for Japanese. Use with a Standard tokenizer.
Parent elements:
com.exalead.linguistic.v10.StandardTokenizer (as StandardTokenizer)
Attributes:
Name
Type
Default value
Description
resourceDir
string
URL for the resource (data://, file;// or resource://) if null, use standard resource of product.
processRomaji
boolean
Processes romaji tokens
addRomaji
boolean
Annotates each token with its romaji transliteration, when applicable.
addCharset
boolean
Annotates with charset
addNGrams
boolean
Annotates with NGrams
minNGramLength
int
1
Minimum length of NGrams (1: monograms, 2: bigrams...)
maxNGramLength
int
2
Maximum length of NGrams (1: monograms, 2: bigrams...)
addTransliterations
boolean
Annotates with transliterations (kanji, katakana, hiragana, romaji).
addPartOfSpeech
boolean
Annotates with part of speech.