XML Configuration Reference : Index : LanguageConfiguration
 
LanguageConfiguration
com.exalead.indexing.analysis.v10.LanguageConfiguration
Configuration of the linguistic extraction for a given language.
Parent elements:
com.exalead.indexing.analysis.v10.AnalysisPipeline (as AnalysisPipeline)
Attributes:
Name
Type
Default value
Description
language
iso code
The language iso code
generateWordDict
boolean
Extracts words for the global dictionary.
wordDictModulo
int
1
Word extraction modulo, by default extract all words.
maxWordDictWordsPerDocument
long
-1
Maximum number of words extracted per document.
maxExtractedWordLength
int
64
Maximum length a word must have to be extracted.
spellCheckNGramMaxSize
int
3
Maximum number of consecutive words for spellchecking. If the value is set to '-1', spellcheck data is not generated for this language. 0 and 1 values are illegal, default is 3.
spellCheckNGramsDictModulo
int
5
NGrams extraction modulo. It extracts 1 ngram out of 5 by default.
maxSpellCheckNGramsPerDocument
long
-1
Maximum number of ngrams extracted per document.
maxExtractedSpellCheckNGramLength
int
256
Maximum length an ngram must have to be extracted.
relatedTermsDictModulo
int
1
Submits 1 out of X documents for related terms generation. If the value is set to 0, related terms are not generated for this language.
maxRelatedTermsDictContextsPerDocument
long
-1
Maximum number of related terms extracted per document.