JapaneseTokenizer
com.exalead.linguistic.v10.JapaneseTokenizer
When set in the configuration, tokenizes Japanese documents.
Parent elements:
com.exalead.linguistic.v10.TokenizationConfig (as TokenizationConfig)
Attributes:
| Name | Type | Default value | Description |
|---|---|---|---|
| addRomanji | boolean | | Annotates each token with its romaji transliteration, when applicable. |
| addMorphology | boolean | | Annotates each token with its part of speech, when applicable. |
| resourceDir | string | | URL of the resource (data://, file:// or resource://). If null, the product's standard resource is used. |
| favor | enum(precision, recall) | recall | Favoring precision reduces the number of irrelevant documents returned by queries; favoring recall reduces the number of relevant documents that queries miss. |
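
As an illustration of the attributes listed above, the fragment below sketches how the element might appear inside its parent. It is a minimal sketch, not a verified configuration: the unprefixed element names, the omission of namespace declarations, and the "true" spelling of boolean values are assumptions, and any other content of TokenizationConfig is left out.

```xml
<!-- Hypothetical fragment: namespace declarations and other
     TokenizationConfig content are omitted. -->
<TokenizationConfig>
  <!-- resourceDir is not set, so the product's standard resource is used.
       favor="precision" overrides the documented default (recall). -->
  <JapaneseTokenizer addRomanji="true" addMorphology="true" favor="precision"/>
</TokenizationConfig>
```

Leaving out favor would keep the default, recall, which is generally the safer choice when missing a relevant document costs more than returning an irrelevant one.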