XML Configuration Reference : Linguistic : StandardTokenizerPatternOverride
 
StandardTokenizerPatternOverride
com.exalead.linguistic.v10.StandardTokenizerPatternOverride
No documentation for this element.
Parent elements:
com.exalead.linguistic.v10.StandardTokenizer (as charOverrides)
com.exalead.linguistic.v10.StandardTokenizer (as patternOverrides)
Attributes:
Name
Type
Default value
Description
type
enum(token, separator, sentence, ignore, punct)
token
Values = "token", "separator", "sentence" (will break related terms extraction, named entities, ...), "ignore" or "punct" (sentence is considered as a separator but it is also considered as an entity separator for semantic extractors)
toOverride
string
separated
boolean
True
Pattern must be separated to match.