Name | Type | Default value | Description |
---|---|---|---|
name | string | The tokenization configuration's name. |
Name | Type | Description |
---|---|---|
FormIndexingConfig | com.exalead.linguistic.v10.FormIndexingConfig | How to index different word forms (exact, normalized, lemmatized) for each language. |
NormalizerConfig | com.exalead.linguistic.v10.NormalizerConfig | How to normalize words. |
Tokenizer | com.exalead.linguistic.v10.Tokenizer* | List of enabled tokenizers. Tokenizer choice depends on the document's language. |