Name | Type | Default value | Description |
---|---|---|---|
languages | string | en,de,fr,sv,es,it,nl,pt,no,fi,da,bg,ca,cs,el,hr,hu,pl,ru,sk,sl,sr | Comma-separated list of language codes. For these languages, BasisTech's analyzer output is post-processed so that the resulting tokenization is as close as possible to that of the standard tokenizer. |
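
As a rough illustration of how a value of this form could be interpreted, the sketch below splits the `languages` string into individual codes and checks whether a given language is covered. It is not the plugin's actual implementation: `parse_languages` and `needs_postprocessing` are hypothetical helper names, and the assumption that post-processing applies only to the listed languages is inferred from the description above.

```python
# Default value copied from the table above.
DEFAULT_LANGUAGES = (
    "en,de,fr,sv,es,it,nl,pt,no,fi,da,bg,ca,cs,el,hr,hu,pl,ru,sk,sl,sr"
)


def parse_languages(value: str) -> frozenset:
    """Split a comma-separated list into normalized language codes.

    Hypothetical helper: whitespace is trimmed, codes are lowercased,
    and empty entries (e.g. from a trailing comma) are ignored.
    """
    return frozenset(
        code.strip().lower() for code in value.split(",") if code.strip()
    )


def needs_postprocessing(lang: str, languages: str = DEFAULT_LANGUAGES) -> bool:
    """Return True if `lang` appears in the configured list.

    Assumption: post-processing is applied only to listed languages.
    """
    return lang.strip().lower() in parse_languages(languages)


if __name__ == "__main__":
    for lang in ("de", "ja"):
        print(lang, needs_postprocessing(lang))
```

Under these assumptions, a language that is absent from the list (such as `ja` with the default value) would keep the analyzer's unmodified output.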