A related term is a sequence of nouns or adjectives, separated by link words, that is shared by at least N documents of your corpus (N=5 by default). This setting can be configured in Index > Linguistics > Dictionaries > Your_Search_Logic > Related Terms.
The link words are identified by an internal, language-specific resource file, which cannot be edited.
Related terms are flagged at index time as semantic annotations, based on the configuration of the Related Terms Extractor semantic processor.
Note: You can also add text directly to the dictionary using the dedicated annotation relatedTermCustom when defining annotations (Kind or Name field).
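The document-frequency rule described above (a term must be shared by at least N documents) can be illustrated with a minimal sketch. This is not the product's implementation; the function name and the input shape (one list of candidate terms per document) are assumptions made for illustration only.

```python
from collections import Counter


def terms_shared_by_min_docs(docs_terms, min_docs=5):
    """Return candidate terms that occur in at least `min_docs` documents.

    `docs_terms` is a hypothetical input: one iterable of candidate
    terms per document. `min_docs` plays the role of N (5 by default).
    """
    doc_freq = Counter()
    for terms in docs_terms:
        # Count each term once per document, however often it repeats.
        doc_freq.update(set(terms))
    return {term for term, n in doc_freq.items() if n >= min_docs}
```

For example, a term found in five documents passes the default threshold, while a term found in only four does not.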
Required Settings
For the Related Terms Extractor you must specify:
Setting | Description
Allow list | Sends the specified expression to the dictionary as a possible related term.
Block list | Blocks the specified expression from displaying as a related term in the Refinements panel.
Related terms min. span | The minimum number of words (excluding stop words) in a generated related term. This parameter is used only when Extract new related terms is enabled. The default is 2.
Related terms max. span | The maximum number of words (excluding stop words) in a generated related term. This parameter is used only when Extract new related terms is enabled. The default is 3.
Max. related terms per doc | The maximum number of related terms per doc; the default is 64.
Keep longest match | Keeps only the longest match. For example, if you have 5 tokens ('a', 'b', 'c', 'd', 'e') and 4 related terms 'a', 'a-c', 'b-c-d' and 'd-e', this option only keeps 'b-c-d' and removes all other related terms; the default is true.
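As a rough illustration of how the span settings and the per-document cap interact, here is a minimal sketch. The stop-word list, the function names, and the choice to truncate in input order when the cap is reached are all assumptions; the product's actual stop words are internal and its selection order is not documented here.

```python
# Hypothetical stop-word list, for illustration only.
STOP_WORDS = {"of", "the", "and", "in", "for"}


def span(term, stop_words=STOP_WORDS):
    """Number of words in `term`, excluding stop words."""
    return sum(1 for word in term.lower().split() if word not in stop_words)


def filter_generated_terms(candidates, min_span=2, max_span=3, max_per_doc=64):
    """Keep candidates whose span lies in [min_span, max_span].

    The result is capped at `max_per_doc` terms; keeping the first
    ones in input order is an assumption, not documented behavior.
    """
    kept = [t for t in candidates if min_span <= span(t) <= max_span]
    return kept[:max_per_doc]
```

With the defaults, "cost of energy" has a span of 2 (the stop word "of" is not counted) and is kept, while the single word "energy" is dropped.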
Optional Settings
You can also specify these optional settings:
• Input from (optional): a comma-separated list of context names of the document chunks to which this processor is applied.
• Additional attributes that do not appear in the Administration Console. For a full list and descriptions, see "RelatedTermsSynthesisConfig" in the Exalead CloudView XML Configuration Reference Guide.
To set these options, do one of the following:
◦ Edit <DATADIR>/config/analysis.xml, OR
◦ In the API Console, go to Indexing > setAnalysisConfigList.
Search-Time Configuration
For related terms to display on the Refinements panel at search time, they must meet the following criteria:
• Not be shared by more than X% of your hits (X=25 by default).
• Be in at least Y hits (Y=3 by default).
• Have a corpus frequency of at least Z (Z=0 by default).
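The three criteria above can be read as a simple filter over per-term statistics. The sketch below is illustrative only: the function name and the input shape (a mapping from each term to its hit count and corpus frequency) are assumptions, not part of the product's API.

```python
def displayable_related_terms(term_stats, total_hits,
                              max_hit_pct=25.0, min_hits=3, min_corpus_freq=0):
    """Return the terms that satisfy all three search-time criteria.

    `term_stats` is a hypothetical mapping:
        term -> (number of hits containing the term, corpus frequency)
    The defaults mirror X=25, Y=3 and Z=0.
    """
    if total_hits <= 0:
        return []
    shown = []
    for term, (hit_count, corpus_freq) in term_stats.items():
        hit_pct = 100.0 * hit_count / total_hits
        if hit_pct > max_hit_pct:
            continue  # shared by more than X% of the hits
        if hit_count < min_hits:
            continue  # present in fewer than Y hits
        if corpus_freq < min_corpus_freq:
            continue  # corpus frequency below Z
        shown.append(term)
    return shown
```

Under this reading, with 100 hits and the default thresholds, a term present in 30 hits is too common to display, and a term present in only 2 hits is too rare.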
To configure related terms behavior at search time
• In the Administration Console, select Search > Search Logics > Your_Search_Logic > Facets.
• Under Related terms (at the bottom), select Enable.
• Adjust the options that appear to control which related terms are displayed in the Refinements panel at search time.