Name | Type | Default value | Description |
---|---|---|---|
maxApprox | int | 40 | Maximum number of alternate spellings to retrieve for each word. |
insertion | double | 0.8 | Damerau-Levenshtein distance incurred for inserting a letter. |
deletion | double | 0.9 | Damerau-Levenshtein distance incurred for deleting a letter. |
substitution | double | 0.9 | Damerau-Levenshtein distance incurred for substituting a letter. |
transposition | double | 0.9 | Damerau-Levenshtein distance incurred for transposing letters. |
first | double | 0.1 | Additional Damerau-Levenshtein distance incurred for any operation on the first letter of a word. |
blank | double | 0.5 | Damerau-Levenshtein distance incurred for inserting or deleting a space. |
phoneticDist | double | 0.1 | Damerau-Levenshtein distance incurred for phonetic matches. |
precision | double | 1.0 | The higher the precision, the less often alternatives are proposed; the lower, the more recall. |
minLengthCandidate | int | 4 | Minimum length for a word to trigger a distance 1 spell check. |
minLengthCandidate2 | int | 8 | Minimum length for a word to trigger a distance 2 spell check. |
minLengthMultiWordCandidate | int | 2 | Minimum length for a word anywhere in a multi-word expression to trigger a distance 1 spell check. |
minLengthMultiWordCandidate2 | int | 8 | Minimum length for a word anywhere in a multi-word expression to trigger a distance 2 spell check. |
minLengthInsideMultiWordCandidate | int | 1 | Minimum length for a word in the middle of a multi-word expression to trigger a distance 1 spell check. |
minLengthInsideMultiWordCandidate2 | int | 6 | Minimum length for a word in the middle of a multi-word expression to trigger a distance 2 spell check. |
minNGramLengthForDist2 | int | 12 | Minimum length for a multi-word expression to trigger a distance 2 spell check. |
filterWhiteList | boolean | Applies minimum length criteria on white list entries. | |
filterPhonetic | boolean | True | Applies the minimum length criteria on phonetic matches. |
tokenizeResults | boolean | Tokenizes the suggested expression, and annotates each token with the corresponding word from the original query. This allows for additional processing after spellchecking, such as lemmatization, and normalization. For example, for a suggested expression of 3 tokens: - If true: each token has an annotation representing the associated word from the original user query. - If false: The first token is annotated with 3 tokens and a length of 3. | |
maxSuggestions | int | 1 | Suggests no more than N alternate expressions |
allSuggestionsMustAlwaysBeRetrieved | boolean | Suggests alternate expressions even when they have a lower score than the corresponding expression from the original query. The score is determined by an algorithm that considers both the Damerau-Levenshtein distance and frequency in the corpus. | |
whitelist | string | Reference a dictionary's spellcheck whitelist resource. Used to always suggest expressions for the listed words. | |
blacklist | string | Reference a dictionary's spellcheck blacklist resource. Used to never suggest expressions for the listed words. |
Name | Type | Description |
---|---|---|
SpellCheckBlackList | com.exalead.linguistic.v10.SpellCheckBlackList | Never suggest expressions for these words. |
SpellCheckWhiteList | com.exalead.linguistic.v10.SpellCheckWhiteList | Always suggest expressions for these words. |