BasisTechTokenizationCompatibility
com.exalead.linguistic.v10.BasisTechTokenizationCompatibility
No documentation for this element.
Parent elements:
com.exalead.linguistic.v10.StandardTokenizer (as StandardTokenizer)
Attributes:
Name: languages
Type: string
Default value: en,de,fr,sv,es,it,nl,pt,no,fi,da,bg,ca,cs,el,hr,hu,pl,ru,sk,sl,sr
Description: Comma-separated list of language codes. For these languages, the output of the BasisTech analyzer is post-processed so that the resulting tokenization is as close as possible to the one produced by the standard tokenizer.
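
Example:
The fragment below is a minimal sketch of how this element might be declared, based only on the parent element and attribute listed above. The short element names, the surrounding configuration file, and any other attributes or children that StandardTokenizer may require are assumptions and are not shown here. The three-language list is an arbitrary subset of the default value, chosen for illustration.

```xml
<!-- Illustrative sketch only: nesting and element names are inferred from
     the "Parent elements" and "Attributes" entries of this page. -->
<StandardTokenizer>
  <!-- Restrict BasisTech compatibility post-processing to English,
       German and French (a subset of the default language list). -->
  <BasisTechTokenizationCompatibility languages="en,de,fr" />
</StandardTokenizer>
```

If the languages attribute is omitted, the default value listed above applies.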