XML Configuration Reference : Search : Classifier
 
Classifier
com.exalead.indexing.analysis.v10.Classifier
A Classifier classifies a whole document according to the existing annotations on selected Document Chunks. The annotations are matched against a learning resource.
Parent elements:
com.exalead.mercury.mami.search.v20.SemanticProcessorModule (as SemanticProcessorModule)
com.exalead.mercury.mami.search.v20.SemanticQueryAnalysisConfig (as SemanticQueryAnalysisConfig)
Attributes:
Name
Type
Default value
Description
name
string
Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts
string
Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState
string
Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass
string
If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty
string
If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled
boolean
Disables the DocumentProcessor
resourceDir
string
URL for the vocabulary resource (data://, file;// or resource://)
annotationName
string
Name of the annotation to add.
language
iso code
Language for which the vocabulary classifier is activated.
excludedLanguages
string
Language for which the vocabulary classifier is deactivated (works only if language=xx, comma-separated).
addAnnotationsOnKeywords
boolean
If true, it adds annotations to all matching tokens.
maxAnnotations
int
-1
Maximum number of annotations per document.
minTrustLevel
int
The minimum trust level of categories to keep.
maxKeywords
int
-1
The maximum number of keywords to keep.
minKeywords
int
1
The minimum number of keywords per class.
collapseToken
boolean
If true, all identical tokens are collapsed.
extraPrefixAnnotations
string
The optional list of prefix annotations to keep (comma-separated).
extraAnnotationsMinTrustLevel
int
100
The minimum trust level to keep an extra annotation.
Nested elements:
Name
Type
Description
fromDataModel
com.exalead.indexing.analysis.v10.SemanticProcessor
If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor