Name | Type | Default value | Description |
---|---|---|---|
language | iso code | The language handled by this tokenizer. This can be null for all unhandled languages. | |
concatAlphaNum | boolean | True | Concat alpha and num chars |
concatNumAlpha | boolean | True | Concat num and alpha chars |
Name | Type | Description |
---|---|---|
charOverrides | com.exalead.linguistic.v10.StandardTokenizerOverride* | Set of rules that allows the forcing of a specific character type. |
patternOverrides | com.exalead.linguistic.v10.StandardTokenizerOverride* | Set of rules that allows the forcing of specific regexp type. |
TokenizerPlugin | com.exalead.linguistic.v10.TokenizerPlugin* |