CloudView Configuration Reference

DataModel com.exalead.datamodel.v10.DataModel

CloudView Data Model

Attributes

Name	Type	Default value	Description
name	string	-	Name of this Data Model.
defaultClass	string	-	Default class to push documents when there is no "datamodel_class" papi directive, or no "TYPE" meta. Usually, all data model classes generate type-prefixed field names, like 'myclass_myproperty', except the defaultClass which has unprefixed fields in the expansion. The analysis pipeline deduces the data model class of a document by applying the following rules with this specific order: Value of papi custom directive "datamodel_class" if any. Value of papi meta "TYPE" if any. Value of the default data model class of the connector pushing the document if any. Use the data model class marked by the 'defaultClass' property.
possibleLanguages	string	-	The list of languages that can appear in the corpus. Language detection will be restricted to this set. A null value means that all languages are possible.
storeAllMetas (UI: Store all unprocessed metas)	boolean	False	Saves all metas that were not mapped to an index field or category as "csv encoded metas". These metas are retrievable only. To make a meta searchable, create a data model property for it.
traceAllMetas (UI: Trace all metas)	boolean	False	Select this option before indexing, then click "Add Properties from Traced Metas" to simplify creating new properties. The traced metas are stored in a separate database. Once you have created properties based on traced metas, de-select this option to prevent the database from growing unnecessarily over time. Trace all metas pushed by connectors in order to ease property configuration.
expandNamedEntitiesLikeIn2014	boolean	False

Nested elements

DataModelClass*
DataModelEnum*
SemanticType*

Type	Wrapper	Description
DataModelClass*	-	List of classes of this data model.
DataModelEnum*	-
SemanticType*	-	The list of semantic types of this data model. A semantic type defines the semantic analysis to perform, and can be assigned to 'alphanumeric' properties. The semantic types available out of the box are: text, identifier, meta, and url.

XML representation

<DataModel name="" defaultClass="" possibleLanguages="" storeAllMetas="false" traceAllMetas="false" expandNamedEntitiesLikeIn2014="false" ></DataModel>

DataModelClass com.exalead.datamodel.v10.DataModelClass

No documentation for this element.

Parent objects

DataModel

Attributes

name string
parentClass string

Name	Type	Default value	Description
name	string	-
parentClass	string	-	The parent class name. May be null if this class has no parent class.

Nested elements

DynamicProperty*
ExpansionParams
Label*
Property*

Type	Wrapper	Description
DynamicProperty*	-	The dynamic properties of the class.
ExpansionParams	-
Label*	-	Labels for i18n. TODO not implemented
Property*	-	The properties of the class.

XML representation

<DataModelClass name="" parentClass="" ></DataModelClass>

DynamicProperty com.exalead.datamodel.v10.DynamicProperty

Allows you to define matching rules to map multiple document meta names to a single property, and consequently to a single index field or category facet (or output context, if neither index field nor category facet is selected).
This is useful when you do not know all the metas available in your data source. It also reduces the number of fields needed in the index.

Parent objects

DataModelClass

Attributes

storeMetaNames boolean

Name	Type	Default value	Description
storeMetaNames	boolean	False	Stores the meta name associated to each value, in the resulting index field (or category facet or output context). This is useful to search and retrieve specific metas within the field. If 'Store meta names' is not selected for a dynamic property, the resulting index field (or category facet or output context) is the same as any other alphanumerical or numerical field created with standard properties: it contains only meta values.

Nested elements

MatchingRule*
Property

Type	Wrapper	Description
MatchingRule*	-	List of rules to define which document metas should be mapped to this dynamic property.
Property	-	All property attributes apply to this dynamic property.

XML representation

<DynamicProperty storeMetaNames="false" ></DynamicProperty>

MatchingRule com.exalead.datamodel.v10.MatchingRule

No documentation for this element.

Parent objects

DynamicProperty

Attributes

pattern string
mode enum(prefix, suffix, substring, pattern, exact)
unprefix boolean

Name	Type	Default value	Description
pattern	string	-	If null, property name is used
mode	enum(prefix, suffix, substring, pattern, exact)
unprefix	boolean	False

XML representation

<MatchingRule mode="" unprefix="false" ></MatchingRule>

AlphanumProperty com.exalead.datamodel.v10.AlphanumProperty

Alphanumerical property.

Parent objects

DataModelClass
DynamicProperty

Attributes

overrideClassExpansionParams boolean
dedicatedField boolean
dynamicProperty string
searchable boolean
retrievable boolean
ramBased boolean
facet boolean
disjunctiveRefines boolean
defaultValue string
additionalMetaNames string
arity enum(zero_or_one, one, many)
name string
comment string
semanticType string
indexInTextField boolean
queryExpansionConfig string
enumerated boolean

Name	Type	Default value	Description
overrideClassExpansionParams	boolean	False
dedicatedField	boolean	True	Should we create a dedicated index field for this property, or use a multi-field encoding? A dedicated field is recommended if more than a few percent of objects have this property. Internal: if there is no dedicated field, we store in the "metas" field and index using prefixed-words.
dynamicProperty	string	-	Storing this meta in a dynamical property decreases index's memory footprint and improve index's commit time. Faceting and sorting performances will be impacted (slower).
searchable (UI: Searchable)	boolean	True	Allows users to search for matches in this property using a prefix. For example, to search for "API" in the title of a document, enter the search query <pre>title:API</pre>
retrievable (UI: Retrievable)	boolean	True	Displays the meta in the hit content of search results. If the property doesn't have a dedicated field, the value will be CSV-encoded in the 'metas' field.
ramBased (UI: RAM based)	boolean	False	Stores this property in RAM. Use this option if you need to: sort search results on this property. use this property in a virtual field. use this property in a virtual facet. use this property for an aggregation value. Only the "retrievable" part of the property is stored in RAM.
facet	boolean	False	Creates a navigation facet from the property values.
disjunctiveRefines (UI: Disjunctive refines)	boolean	False	Makes refinements on this facet disjunctive ("checkboxes") rather than exclusive (select one).
defaultValue	string	-	If there is no value for this property in the document, use this default value.
additionalMetaNames	string	-	Maps these metas (as a comma-separated list) to this index or facet field. Usable only for properties having a dedicated field, a facet, or are searchable without prefix.
arity (UI: Cardinality)	enum(zero_or_one, one, many)	zero_or_one	The arity of the property (zero_or_one, one, many) {@code enum(zero_or_one, one, many)}
name	string	-	Property name
comment	string	-	Optional comment to explain property use.
semanticType	string	-	Defines how the text is processed and indexed (tokenization, normalization and indexing-time semantic processing). This value must refer to a valid !SemanticType object in the DataModel.
indexInTextField	boolean	False	Allows users to search for matches in this property without using a prefix, by mapping the values to the generic "text" index field, as invisible text. For example, to search for "API" in a document, enter the search query <pre>API</pre>
queryExpansionConfig	string	-	Search-time query expansion config to use for this property. This follows the 'QEC' format specification.
enumerated	boolean	False	Is there a controlled and limited set of values for this property? Enumerated properties are indexed using a specific "Value" field, which stores entire values in an optimized way. Thus, performing faceting on this kind of field is faster than on a category field. It is not possible to perform any semantic processing on value fields.

Nested elements

tagsKeyValue*
AdvancedParams
KeyValue*
Label*

Type	Wrapper	Description
KeyValue*	tags	User-defined tags for this property. This can be used for example to implement 6W faceting of the data model.
AdvancedParams	-
KeyValue*	-
Label*	-	Labels for the internationalization of this property's name

XML representation

<AlphanumProperty overrideClassExpansionParams="false" dedicatedField="true" searchable="true" retrievable="true" ramBased="false" facet="false" disjunctiveRefines="false" defaultValue="" additionalMetaNames="" arity="zero_or_one" name="" comment="" semanticType="" indexInTextField="false" queryExpansionConfig="" enumerated="false" ></AlphanumProperty>

AdvancedParams com.exalead.datamodel.v10.AdvancedParams

Advanced configuration options of a property

Parent objects

DataModelClass

Attributes

generateAnalysis boolean
generateIndexSchema boolean
generatePrefix boolean
generateFacet boolean
generateMeta boolean
expandedPipelines string
expandedSearchLogics string
facetSortFunction string
facetReverseSort boolean
facetExplicitSortOrderValues string
facetInHits boolean
facetInNavigation boolean
facetRoot string
categoryField string
categoryFacetMaxPerLevel int
facetMaxInHits int
numericalBits int
doublePrecision int
enableTrustedQueries boolean
dateInputFormat string
dateOutputFormat string
gzip boolean
unit string
patternSearchEnabled boolean
patternSearchOptimizationType enum(ngramv1, ngramv2)
forcedRankToTextMapping nullableint
detectTitle boolean

Name	Type	Default value	Description
generateAnalysis (UI: Generate analysis config)	boolean	True	Creates the required mappings and document processors for this property. Only set this option to "false" to take complete control over analysis expansion for this property.
generateIndexSchema (UI: Generate index schema)	boolean	True	Creates the required index fields for this property. Only set this option to "false" to take complete control over index schema expansion for this property.
generatePrefix (UI: Generate prefix handler)	boolean	True	Creates a prefix handler in the search logics for this property. Only set this option to "false" to take complete control over query prefix handler expansion for this property.
generateFacet (UI: Generate category facet)	boolean	True	Creates a category facet in the search logics for this property. Only set this option to "false" to take complete control over facet config expansion for this property.
generateMeta (UI: Generate hit meta)	boolean	True	Creates a hit meta in the search logics for this property. Only set this option to "false" to take complete control over meta expansion for this property.
expandedPipelines	string	-	A comma-separated list of analysis pipelines to generate mappings and document processors on. If empty, generate on all pipelines.
expandedSearchLogics	string	-	A comma-separated list of search logics to generate prefix handlers, facets and hit metas on. If empty, generate on all search logics.
facetSortFunction	string	count	Sorting method if a category facet is created for this property. Select "ReverseSort" to reverse the sorting order.
facetReverseSort	boolean	False	Reverse sorting order if a category facet is created for this property.
facetExplicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
facetInHits (UI: Use for hit content)	boolean	True	Displays this property in the hits content (if a category facet is created).
facetInNavigation (UI: Use for navigation)	boolean	True	Displays this property in the navigation (if a category facet is created).
facetRoot (UI: Override root path)	string	-	Replaces the default facet root path (Top/ClassProperties/<Property name>) (if a category facet is created).
categoryField (UI: Override field)	string	-	Replaces the category field for this property (if a category facet is created).
categoryFacetMaxPerLevel (UI: Max. values)	int	100	Displays a maximum of N values per level for this property (if a category facet is created).
facetMaxInHits	int	100	Displays a maximum of N values for this property (if a category facet is created).
numericalBits (UI: Bits for numerical value)	int	63	Number of bits used to store a numerical value.
doublePrecision	int	4	Number of figures after the decimal point (if this is a double property).
enableTrustedQueries (UI: Trusted queries support)	boolean	False	Precomputes the indexing of prefixes for the Trusted Queries widget.
dateInputFormat	string	-	Specifies an input format for DateFormatter for numerical properties with date or date_time type
dateOutputFormat	string	-	Specifies an output format for numerical properties with date or date_time type, used in full hits
gzip	boolean	True	Activates content compression (if an alphanum field is created)
unit	string	-	Unit Of Measurement in which the value is stored
patternSearchEnabled	boolean	False	Calculates additional dictionaries to perform pattern search within words (native prefix, suffix, substring search using #anumpattern) without relying on pattern expansion. This has a very high cost in indexing time and indexing size. Do not select unless advised by Exalead teams.
patternSearchOptimizationType	enum(ngramv1, ngramv2)	ngramv1	Select algorithm based on dictionary ngrams or indexed ngrams (untokenized text only).
forcedRankToTextMapping	nullableint		Sets the ranking value for chunks in mapping to text field. -1 means that the chunk internal ranking value is kept.
detectTitle	boolean	False	Detect words set after # in path and use them as title

XML representation

<AdvancedParams generateAnalysis="true" generateIndexSchema="true" generatePrefix="true" generateFacet="true" generateMeta="true" expandedPipelines="" expandedSearchLogics="" facetSortFunction="count" facetReverseSort="false" facetInHits="true" facetInNavigation="true" facetRoot="" categoryField="" categoryFacetMaxPerLevel="100" facetMaxInHits="100" numericalBits="63" doublePrecision="4" enableTrustedQueries="false" dateInputFormat="" dateOutputFormat="" gzip="true" patternSearchEnabled="false" patternSearchOptimizationType="ngramv1" forcedRankToTextMapping="" detectTitle="false" ></AdvancedParams>

AdvancedPointParams com.exalead.datamodel.v10.AdvancedPointParams

Advanced configuration options of a point property

Parent objects

DataModelClass

Attributes

facetSortFunction string
facetReverseSort boolean
facetExplicitSortOrderValues string
facetInHits boolean
facetInNavigation boolean
facetRoot string
categoryField string
categoryFacetMaxPerLevel int
facetMaxInHits int
numericalBits int
doublePrecision int
enableTrustedQueries boolean
dateInputFormat string
dateOutputFormat string
gzip boolean
unit string
patternSearchEnabled boolean
patternSearchOptimizationType enum(ngramv1, ngramv2)
forcedRankToTextMapping nullableint
detectTitle boolean
generateAnalysis boolean
generateIndexSchema boolean
generatePrefix boolean
generateFacet boolean
generateMeta boolean
expandedPipelines string
expandedSearchLogics string
generateFromSeparateMetas boolean
firstCoordinateMeta string
secondCoordinateMeta string

Name	Type	Default value	Description
facetSortFunction	string	count	Sorting method if a category facet is created for this property. Select "ReverseSort" to reverse the sorting order.
facetReverseSort	boolean	False	Reverse sorting order if a category facet is created for this property.
facetExplicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
facetInHits (UI: Use for hit content)	boolean	True	Displays this property in the hits content (if a category facet is created).
facetInNavigation (UI: Use for navigation)	boolean	True	Displays this property in the navigation (if a category facet is created).
facetRoot (UI: Override root path)	string	-	Replaces the default facet root path (Top/ClassProperties/<Property name>) (if a category facet is created).
categoryField (UI: Override field)	string	-	Replaces the category field for this property (if a category facet is created).
categoryFacetMaxPerLevel (UI: Max. values)	int	100	Displays a maximum of N values per level for this property (if a category facet is created).
facetMaxInHits	int	100	Displays a maximum of N values for this property (if a category facet is created).
numericalBits (UI: Bits for numerical value)	int	63	Number of bits used to store a numerical value.
doublePrecision	int	4	Number of figures after the decimal point (if this is a double property).
enableTrustedQueries (UI: Trusted queries support)	boolean	False	Precomputes the indexing of prefixes for the Trusted Queries widget.
dateInputFormat	string	-	Specifies an input format for DateFormatter for numerical properties with date or date_time type
dateOutputFormat	string	-	Specifies an output format for numerical properties with date or date_time type, used in full hits
gzip	boolean	True	Activates content compression (if an alphanum field is created)
unit	string	-	Unit Of Measurement in which the value is stored
patternSearchEnabled	boolean	False	Calculates additional dictionaries to perform pattern search within words (native prefix, suffix, substring search using #anumpattern) without relying on pattern expansion. This has a very high cost in indexing time and indexing size. Do not select unless advised by Exalead teams.
patternSearchOptimizationType	enum(ngramv1, ngramv2)	ngramv1	Select algorithm based on dictionary ngrams or indexed ngrams (untokenized text only).
forcedRankToTextMapping	nullableint		Sets the ranking value for chunks in mapping to text field. -1 means that the chunk internal ranking value is kept.
detectTitle	boolean	False	Detect words set after # in path and use them as title
generateAnalysis (UI: Generate analysis config)	boolean	True	Creates the required mappings and document processors for this property. Only set this option to "false" to take complete control over analysis expansion for this property.
generateIndexSchema (UI: Generate index schema)	boolean	True	Creates the required index fields for this property. Only set this option to "false" to take complete control over index schema expansion for this property.
generatePrefix (UI: Generate prefix handler)	boolean	True	Creates a prefix handler in the search logics for this property. Only set this option to "false" to take complete control over query prefix handler expansion for this property.
generateFacet (UI: Generate category facet)	boolean	True	Creates a category facet in the search logics for this property. Only set this option to "false" to take complete control over facet config expansion for this property.
generateMeta (UI: Generate hit meta)	boolean	True	Creates a hit meta in the search logics for this property. Only set this option to "false" to take complete control over meta expansion for this property.
expandedPipelines	string	-	A comma-separated list of analysis pipelines to generate mappings and document processors on. If empty, generate on all pipelines.
expandedSearchLogics	string	-	A comma-separated list of search logics to generate prefix handlers, facets and hit metas on. If empty, generate on all search logics.
generateFromSeparateMetas	boolean	False	Content generated by several metas
firstCoordinateMeta	string	-	This meta will be used to define the first coordinate of the point
secondCoordinateMeta	string	-	This meta will be used to define the second coordinate of the point

XML representation

<AdvancedPointParams facetSortFunction="count" facetReverseSort="false" facetInHits="true" facetInNavigation="true" facetRoot="" categoryField="" categoryFacetMaxPerLevel="100" facetMaxInHits="100" numericalBits="63" doublePrecision="4" enableTrustedQueries="false" dateInputFormat="" dateOutputFormat="" gzip="true" patternSearchEnabled="false" patternSearchOptimizationType="ngramv1" forcedRankToTextMapping="" detectTitle="false" generateAnalysis="true" generateIndexSchema="true" generatePrefix="true" generateFacet="true" generateMeta="true" expandedPipelines="" expandedSearchLogics="" generateFromSeparateMetas="false" firstCoordinateMeta="" secondCoordinateMeta="" ></AdvancedPointParams>

AdvancedGeoParams com.exalead.datamodel.v10.AdvancedGeoParams

Advanced configuration options of a geo property

Parent objects

DataModelClass

Attributes

facetSortFunction string
facetReverseSort boolean
facetExplicitSortOrderValues string
facetInHits boolean
facetInNavigation boolean
facetRoot string
categoryField string
categoryFacetMaxPerLevel int
facetMaxInHits int
numericalBits int
doublePrecision int
enableTrustedQueries boolean
dateInputFormat string
dateOutputFormat string
gzip boolean
unit string
patternSearchEnabled boolean
patternSearchOptimizationType enum(ngramv1, ngramv2)
forcedRankToTextMapping nullableint
detectTitle boolean
generateAnalysis boolean
generateIndexSchema boolean
generatePrefix boolean
generateFacet boolean
generateMeta boolean
expandedPipelines string
expandedSearchLogics string
maxBlockSize int
precision int
bboxSuffix string

Name	Type	Default value	Description
facetSortFunction	string	count	Sorting method if a category facet is created for this property. Select "ReverseSort" to reverse the sorting order.
facetReverseSort	boolean	False	Reverse sorting order if a category facet is created for this property.
facetExplicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
facetInHits (UI: Use for hit content)	boolean	True	Displays this property in the hits content (if a category facet is created).
facetInNavigation (UI: Use for navigation)	boolean	True	Displays this property in the navigation (if a category facet is created).
facetRoot (UI: Override root path)	string	-	Replaces the default facet root path (Top/ClassProperties/<Property name>) (if a category facet is created).
categoryField (UI: Override field)	string	-	Replaces the category field for this property (if a category facet is created).
categoryFacetMaxPerLevel (UI: Max. values)	int	100	Displays a maximum of N values per level for this property (if a category facet is created).
facetMaxInHits	int	100	Displays a maximum of N values for this property (if a category facet is created).
numericalBits (UI: Bits for numerical value)	int	63	Number of bits used to store a numerical value.
doublePrecision	int	4	Number of figures after the decimal point (if this is a double property).
enableTrustedQueries (UI: Trusted queries support)	boolean	False	Precomputes the indexing of prefixes for the Trusted Queries widget.
dateInputFormat	string	-	Specifies an input format for DateFormatter for numerical properties with date or date_time type
dateOutputFormat	string	-	Specifies an output format for numerical properties with date or date_time type, used in full hits
gzip	boolean	True	Activates content compression (if an alphanum field is created)
unit	string	-	Unit Of Measurement in which the value is stored
patternSearchEnabled	boolean	False	Calculates additional dictionaries to perform pattern search within words (native prefix, suffix, substring search using #anumpattern) without relying on pattern expansion. This has a very high cost in indexing time and indexing size. Do not select unless advised by Exalead teams.
patternSearchOptimizationType	enum(ngramv1, ngramv2)	ngramv1	Select algorithm based on dictionary ngrams or indexed ngrams (untokenized text only).
forcedRankToTextMapping	nullableint		Sets the ranking value for chunks in mapping to text field. -1 means that the chunk internal ranking value is kept.
detectTitle	boolean	False	Detect words set after # in path and use them as title
generateAnalysis (UI: Generate analysis config)	boolean	True	Creates the required mappings and document processors for this property. Only set this option to "false" to take complete control over analysis expansion for this property.
generateIndexSchema (UI: Generate index schema)	boolean	True	Creates the required index fields for this property. Only set this option to "false" to take complete control over index schema expansion for this property.
generatePrefix (UI: Generate prefix handler)	boolean	True	Creates a prefix handler in the search logics for this property. Only set this option to "false" to take complete control over query prefix handler expansion for this property.
generateFacet (UI: Generate category facet)	boolean	True	Creates a category facet in the search logics for this property. Only set this option to "false" to take complete control over facet config expansion for this property.
generateMeta (UI: Generate hit meta)	boolean	True	Creates a hit meta in the search logics for this property. Only set this option to "false" to take complete control over meta expansion for this property.
expandedPipelines	string	-	A comma-separated list of analysis pipelines to generate mappings and document processors on. If empty, generate on all pipelines.
expandedSearchLogics	string	-	A comma-separated list of search logics to generate prefix handlers, facets and hit metas on. If empty, generate on all search logics.
maxBlockSize	int	24	Maximum number of objects in lookup structure leaves.
precision	int	6	The number of decimals that will be used in geometrical representations and computations.
bboxSuffix	string	_bbox	Meta name suffix for BBox.

XML representation

<AdvancedGeoParams facetSortFunction="count" facetReverseSort="false" facetInHits="true" facetInNavigation="true" facetRoot="" categoryField="" categoryFacetMaxPerLevel="100" facetMaxInHits="100" numericalBits="63" doublePrecision="4" enableTrustedQueries="false" dateInputFormat="" dateOutputFormat="" gzip="true" patternSearchEnabled="false" patternSearchOptimizationType="ngramv1" forcedRankToTextMapping="" detectTitle="false" generateAnalysis="true" generateIndexSchema="true" generatePrefix="true" generateFacet="true" generateMeta="true" expandedPipelines="" expandedSearchLogics="" maxBlockSize="24" precision="6" bboxSuffix="_bbox" ></AdvancedGeoParams>

Label com.exalead.datamodel.v10.Label

The translation of a data model label in a language

Parent objects

AlphanumProperty
DataModelClass
DataModelEnum
DataModelEnumElement
DataProperty
EnumProperty
GeoProperty
NumericalProperty
ObjectProperty
Property

Attributes

language string
value string

Name	Type	Default value	Description
language	string	-	Label language
value	string	-	Label value

XML representation

<Label language="en" value="" ></Label>

NumericalProperty com.exalead.datamodel.v10.NumericalProperty

Numerical properties offer the feature of optionally enabling range (superior, inferior, ...) on values

Parent objects

DataModelClass
DynamicProperty

Attributes

overrideClassExpansionParams boolean
dedicatedField boolean
dynamicProperty string
searchable boolean
retrievable boolean
ramBased boolean
facet boolean
disjunctiveRefines boolean
defaultValue string
additionalMetaNames string
arity enum(zero_or_one, one, many)
name string
comment string
type enum(integer, double, date, date_time, point_geo, point_xy, unsigned, measure)

Name	Type	Default value	Description
overrideClassExpansionParams	boolean	False
dedicatedField	boolean	True	Should we create a dedicated index field for this property, or use a multi-field encoding? A dedicated field is recommended if more than a few percent of objects have this property. Internal: if there is no dedicated field, we store in the "metas" field and index using prefixed-words.
dynamicProperty	string	-	Storing this meta in a dynamical property decreases index's memory footprint and improve index's commit time. Faceting and sorting performances will be impacted (slower).
searchable (UI: Searchable)	boolean	True	Allows users to search for matches in this property using a prefix. For example, to search for "API" in the title of a document, enter the search query <pre>title:API</pre>
retrievable (UI: Retrievable)	boolean	True	Displays the meta in the hit content of search results. If the property doesn't have a dedicated field, the value will be CSV-encoded in the 'metas' field.
ramBased (UI: RAM based)	boolean	False	Stores this property in RAM. Use this option if you need to: sort search results on this property. use this property in a virtual field. use this property in a virtual facet. use this property for an aggregation value. Only the "retrievable" part of the property is stored in RAM.
facet	boolean	False	Creates a navigation facet from the property values.
disjunctiveRefines (UI: Disjunctive refines)	boolean	False	Makes refinements on this facet disjunctive ("checkboxes") rather than exclusive (select one).
defaultValue	string	-	If there is no value for this property in the document, use this default value.
additionalMetaNames	string	-	Maps these metas (as a comma-separated list) to this index or facet field. Usable only for properties having a dedicated field, a facet, or are searchable without prefix.
arity (UI: Cardinality)	enum(zero_or_one, one, many)	zero_or_one	The arity of the property (zero_or_one, one, many) {@code enum(zero_or_one, one, many)}
name	string	-	Property name
comment	string	-	Optional comment to explain property use.
type	enum(integer, double, date, date_time, point_geo, point_xy, unsigned, measure)		Type of this numerical property. Depending on the type, additional validation and formating will be performed on the value. Valid values are: integer: The value must be an integer (this is the default). double: The value must be a double. It is possible to configure the precision in advanced parameters. date: A date, with a day precision. date_time: A date, with time (second precision). point_geo: A point on a sphere, in GPS coordinates. The value must have a "latitude,longitude" format, with latitude being a double between -90 and +90, and longitude being a double between -180 and 180. point_xy: A point on a XY plan. The value must have a "X,Y" format, with X and Y being positive integers. unsigned: The value must be a positive integer. measure: The value must be a double and could be associated to a unit of measurement symbol.

Nested elements

tagsKeyValue*
AdvancedParams
KeyValue*
Label*

Type	Wrapper	Description
KeyValue*	tags	User-defined tags for this property. This can be used for example to implement 6W faceting of the data model.
AdvancedParams	-
KeyValue*	-
Label*	-	Labels for the internationalization of this property's name

XML representation

<NumericalProperty overrideClassExpansionParams="false" dedicatedField="true" searchable="true" retrievable="true" ramBased="false" facet="false" disjunctiveRefines="false" defaultValue="" additionalMetaNames="" arity="zero_or_one" name="" comment="" type="" ></NumericalProperty>

GeoProperty com.exalead.datamodel.v10.GeoProperty

Geo property

Parent objects

DataModelClass
DynamicProperty

Attributes

overrideClassExpansionParams boolean
dedicatedField boolean
dynamicProperty string
searchable boolean
retrievable boolean
ramBased boolean
facet boolean
disjunctiveRefines boolean
defaultValue string
additionalMetaNames string
arity enum(zero_or_one, one, many)
name string
comment string
type enum(Planar)

Name	Type	Default value	Description
overrideClassExpansionParams	boolean	False
dedicatedField	boolean	True	Should we create a dedicated index field for this property, or use a multi-field encoding? A dedicated field is recommended if more than a few percent of objects have this property. Internal: if there is no dedicated field, we store in the "metas" field and index using prefixed-words.
dynamicProperty	string	-	Storing this meta in a dynamical property decreases index's memory footprint and improve index's commit time. Faceting and sorting performances will be impacted (slower).
searchable (UI: Searchable)	boolean	True	Allows users to search for matches in this property using a prefix. For example, to search for "API" in the title of a document, enter the search query <pre>title:API</pre>
retrievable (UI: Retrievable)	boolean	True	Displays the meta in the hit content of search results. If the property doesn't have a dedicated field, the value will be CSV-encoded in the 'metas' field.
ramBased (UI: RAM based)	boolean	False	Stores this property in RAM. Use this option if you need to: sort search results on this property. use this property in a virtual field. use this property in a virtual facet. use this property for an aggregation value. Only the "retrievable" part of the property is stored in RAM.
facet	boolean	False	Creates a navigation facet from the property values.
disjunctiveRefines (UI: Disjunctive refines)	boolean	False	Makes refinements on this facet disjunctive ("checkboxes") rather than exclusive (select one).
defaultValue	string	-	If there is no value for this property in the document, use this default value.
additionalMetaNames	string	-	Maps these metas (as a comma-separated list) to this index or facet field. Usable only for properties having a dedicated field, a facet, or are searchable without prefix.
arity (UI: Cardinality)	enum(zero_or_one, one, many)	zero_or_one	The arity of the property (zero_or_one, one, many) {@code enum(zero_or_one, one, many)}
name	string	-	Property name
comment	string	-	Optional comment to explain property use.
type	enum(Planar)		Coordinate system of this geo property. For now, only cartesian/planar systems are supported.

Nested elements

tagsKeyValue*
AdvancedParams
KeyValue*
Label*

Type	Wrapper	Description
KeyValue*	tags	User-defined tags for this property. This can be used for example to implement 6W faceting of the data model.
AdvancedParams	-
KeyValue*	-
Label*	-	Labels for the internationalization of this property's name

XML representation

<GeoProperty overrideClassExpansionParams="false" dedicatedField="true" searchable="true" retrievable="true" ramBased="false" facet="false" disjunctiveRefines="false" defaultValue="" additionalMetaNames="" arity="zero_or_one" name="" comment="" type="" ></GeoProperty>

ObjectProperty (deprecated) com.exalead.datamodel.v10.ObjectProperty

Property storing a link to another DataModelClass.
It can only store the identifier of the foreign object in the holding class. This only gives the ability to perform some high-level fetch of related objects. This does not enable searching for this object using foreign constraints.

Parent objects

DataModelClass
DynamicProperty

Attributes

arity enum(zero_or_one, one, many)
name string
comment string
type string
generateInnerjoinKey boolean
storeIdentifier boolean

Name	Type	Default value	Description
arity (UI: Cardinality)	enum(zero_or_one, one, many)	zero_or_one	The arity of the property (zero_or_one, one, many) {@code enum(zero_or_one, one, many)}
name	string	-	Property name
comment	string	-	Optional comment to explain property use.
type	string	-	Name of the related DataModelClass
generateInnerjoinKey	boolean	False	NOT IMPLEMENTED. Generates a field that can be used for performing INNERJOIN between this property and its class. Note that doing this incurs a memory cost.
storeIdentifier	boolean	False	Stores the unique identifiers of the property values for further retrieval.

Nested elements

tagsKeyValue*
KeyValue*
Label*

Type	Wrapper	Description
KeyValue*	tags	User-defined tags for this property. This can be used for example to implement 6W faceting of the data model.
KeyValue*	-
Label*	-	Labels for the internationalization of this property's name

XML representation

<ObjectProperty arity="zero_or_one" name="" comment="" type="" generateInnerjoinKey="false" storeIdentifier="false" ></ObjectProperty>

EnumProperty (deprecated) com.exalead.datamodel.v10.EnumProperty

A property referring to a data model enum.
Internals: always indexed as a category.
This is not recommended, as it has little added value over an alphanum property. An alphanum property can actually be faster when using the "enumerated" flag.

Parent objects

DataModelClass
DynamicProperty

Attributes

arity enum(zero_or_one, one, many)
name string
comment string
type string
disjunctiveRefines boolean

Name	Type	Default value	Description
arity (UI: Cardinality)	enum(zero_or_one, one, many)	zero_or_one	The arity of the property (zero_or_one, one, many) {@code enum(zero_or_one, one, many)}
name	string	-	Property name
comment	string	-	Optional comment to explain property use.
type	string	-	Name of the related DataModelEnum.
disjunctiveRefines	boolean	False	Are refinements on this facet disjunctive ("checkboxes") or exclusive (select one)?

Nested elements

tagsKeyValue*
KeyValue*
Label*

Type	Wrapper	Description
KeyValue*	tags	User-defined tags for this property. This can be used for example to implement 6W faceting of the data model.
KeyValue*	-
Label*	-	Labels for the internationalization of this property's name

XML representation

<EnumProperty arity="zero_or_one" name="" comment="" type="" disjunctiveRefines="false" ></EnumProperty>

ExpansionParams com.exalead.datamodel.v10.ExpansionParams

No documentation for this element.

Parent objects

DataModelClass

Attributes

Name	Type	Default value	Description
generateAnalysis (UI: Generate analysis config)	boolean	True	Creates the required mappings and document processors for this property. Only set this option to "false" to take complete control over analysis expansion for this property.
generateIndexSchema (UI: Generate index schema)	boolean	True	Creates the required index fields for this property. Only set this option to "false" to take complete control over index schema expansion for this property.
generatePrefix (UI: Generate prefix handler)	boolean	True	Creates a prefix handler in the search logics for this property. Only set this option to "false" to take complete control over query prefix handler expansion for this property.
generateFacet (UI: Generate category facet)	boolean	True	Creates a category facet in the search logics for this property. Only set this option to "false" to take complete control over facet config expansion for this property.
generateMeta (UI: Generate hit meta)	boolean	True	Creates a hit meta in the search logics for this property. Only set this option to "false" to take complete control over meta expansion for this property.
expandedPipelines	string	-	A comma-separated list of analysis pipelines to generate mappings and document processors on. If empty, generate on all pipelines.
expandedSearchLogics	string	-	A comma-separated list of search logics to generate prefix handlers, facets and hit metas on. If empty, generate on all search logics.

XML representation

<ExpansionParams generateAnalysis="true" generateIndexSchema="true" generatePrefix="true" generateFacet="true" generateMeta="true" expandedPipelines="" expandedSearchLogics="" ></ExpansionParams>

DataModelEnum com.exalead.datamodel.v10.DataModelEnum

A custom enum allows you to define a closed list of values for a property.
Note that i18n is not implemented.
This option is not recommended.

Parent objects

DataModel

Attributes

name string
strict boolean
defaultValue int

Name	Type	Default value	Description
name	string	-
strict	boolean	False	If strict, this data type will only accept values listed in elements. Warning: This is not implemented for the moment.
defaultValue	int	-1	Index of default value in the elements list. -1 = no default value for property

Nested elements

DataModelEnumElement*
Label*

Type	Wrapper	Description
DataModelEnumElement*	-	Possible values for the enum.
Label*	-	Labels for i18n. TODO not implemented

XML representation

<DataModelEnum name="" strict="false" defaultValue="-1" ></DataModelEnum>

DataModelEnumElement com.exalead.datamodel.v10.DataModelEnumElement

No documentation for this element.

Parent objects

DataModelEnum

Attributes

name string

Name	Type	Default value	Description
name	string	-

Nested elements

Label*

Type	Wrapper	Description
Label*	-	Labels for i18n. TODO not implemented

XML representation

<DataModelEnumElement name="" ></DataModelEnumElement>

SemanticType com.exalead.datamodel.v10.SemanticType

Parent objects

DataModel

Attributes

Name	Type	Default value	Description
name	string	-	Name for this type, to be used in the "type" field of the AlphanumProperty.
extraContexts	string	-	Extra analysis contexts (not datamodel-controlled) on which this text type must be applied.
tokenize (UI: Tokenize)	boolean	True	Splits phrases into individual words or tokens. Required for index-time semantic processing.
urlProcessing (UI: Process as an URL)	boolean	False	Creates 3 prefix handlers for this property, for the 'site', 'url', and 'inurl' features.
indexNormalized (UI: Index as normalized)	boolean	True	Indexes CaFé as cafe.
indexLowercase (UI: Index as lowercase)	boolean	False	Indexes CaFé as café
indexExact (UI: Index as exact)	boolean	False	Indexes CaFé as CaFé
indexSeparators (UI: Index separators)	boolean	True	Indexes the position of separators to enable search within a string. Select this option when using the "split" type prefix handler.
detectLanguage (UI: Detect language)	boolean	True	Determines the language of a document by analyzing its text. Required for extracting spell check ngrams, phonetic forms, named entities and related terms. For performance reasons, only select this option if the documents to be pushed do not already include a 'language' meta. Selecting this option creates a meta called 'language', as well as a Language facet for search results display in the Refinements panel.
extractWords (UI: Extract words)	boolean	True	Extracts the words of each document to the dictionary targeted by this semantic type.
extractNamedEntities (UI: Extract named entities)	boolean	False	Flags famous people, places, organizations or events, and annotates the corresponding index field with the prefix NE:<entity type>. This option adds: A Named Entities Matcher processor to the semantic analysis pipeline. Categories for each named entity annotation in a document. Named entity facets in the search logic, to be displayed in the Refinements panel.
extractRelatedTerms (UI: Extract related terms)	boolean	False	Finds important concepts within the corpus and stores them in the dictionary targeted by this semantic type. To display related terms in the Refinements panel of your search application, you must enable them through the search logics. This option adds a Related Terms Extractor processor to the semantic analysis pipeline.
extractSpellCheckNGrams (UI: Extract spell check ngrams)	boolean	False	Calculates probability of word occurrences or word phrases within the corpus and stores them in the dictionary targeted by this semantic type. This significantly improves the effectiveness of spell-checking. This option adds a NGrams Extractor processor to the semantic analysis pipeline.
extractPhoneticForms (UI: Extract phonetic forms)	boolean	False	Creates a phonetic version for each word and stores them in the dictionary targeted by this semantic type. This significantly improves the effectiveness of spell check and enables phonetic search (for example, soundslike: exaleed). This option adds a Phonetizer processor to the semantic analysis pipeline.
tokenizationConfig (UI: Tokenization config)	string	-	Defines the tokenization config to use for analysis and search. Found in the linguistic configuration.
rankForDedicatedMapping	long	4	Ranking value for the mapping to a dedicated index field.
rankForTextMapping	long	3	Ranking value for the mapping to the "text" index field.
dictionaryName (UI: Dictionary)	string	-	A dictionary is a structure separated from the index, that stores all the words of an indexed document, and their number of occurrences in the corpus. It's used for linguistic expansion mechanisms such as spell-checking or regular expression matching. If value is "_None_", words will not be stored in a dictionary. In admin-ui, select "None" if you want this behavior.

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-	Custom parameters.

XML representation

<SemanticType name="" extraContexts="" tokenize="true" urlProcessing="false" indexNormalized="true" indexLowercase="false" indexExact="false" indexSeparators="true" detectLanguage="true" extractWords="true" extractNamedEntities="false" extractRelatedTerms="false" extractSpellCheckNGrams="false" extractPhoneticForms="false" tokenizationConfig="tok0" rankForDedicatedMapping="4" rankForTextMapping="3" dictionaryName="" ></SemanticType>

ConnectorList com.exalead.mercury.mami.connect.v10.ConnectorList

List of connectors defined in the system.

Attributes

version long

Name	Type	Default value	Description
version	long	-	-

Nested elements

Connector*

Type	Wrapper	Description
Connector*	-	List of connectors.

XML representation

<ConnectorList version="" ></ConnectorList>

Connector com.exalead.mercury.mami.connect.v10.Connector

The configuration of a connector

Parent objects

ConnectorList

Attributes

name string
classId string
customClassId string
managed boolean
connectorServer string
buildGroup string
pushAPIServer string
authenticationMode enum(public, basic)
login string
password string
defaultDataModelClass string
documentsType string
generated boolean

Name	Type	Default value	Description
name	string	-	The name of the connector.
classId	string	-	Connector class identifier. This identifies the kind of data source this connectors connects to, as well as the actual implementation. The class identifier can be a reference to an exascript class, a Java class or a .NET Class. This attribute must be null for unmanaged connectors. The value provided should be one of the values returned by @see listConnectorTypes.
customClassId	string	-	Implementation class identifier. Optional. Can be used to specify a custom implementation of the connector.
managed	boolean	False	Indicates that the connector is managed by a framework. A managed connector is launched and operated as a CloudView service, whereas an 'unmanaged connector' is handled by a third-party process.
connectorServer	string	-	Defines the connector server hosting this connector. For managed connectors only. Connectors are deployed in a connector server. There are 3 kinds of connector server: exascript, Java or .NET. The value of this attribute should refer to a connector server defined in the deployment configuration (for example, exa0, java0, dotnet0).
buildGroup	string	-	Defines the build group that will receive the documents. For managed connectors only. For a single connector server, different connectors can push to different build groups.
pushAPIServer	string	-	Defines the Push API server that will receive the documents. For managed connectors only. For a single connector server, different connectors can push to different Push API servers.
authenticationMode	enum(public, basic)	public	The connector configuration parameters. Connector parameters such as the data sources or folders to actually index in the data source are provided as key-values. No configuration parameters should be provided for unmanaged connectors. The set of meta data to be automatically added to indexed documents. These meta-data items are inserted into each document from this connector when analyzing the document. Connector authentication mode values are: `public` and `basic`. If set to `basic`, the indexing-server will require authentication to push content with this connector name.
login	string	-	User login if `authenticationMode` parameter is set to `basic`.
password	string	-	User password if `authenticationMode` parameter is set to `basic`.
defaultDataModelClass	string	-	Specifies the class in which the documents should be indexed if the connector does not provide a dataModelClass indication for its documents. The default value (null) means that the documents will go in the default class of the DataModel.
documentsType	string	-	Type of documents produced by this source. The type of documents must match one of the types declared in your CloudView license file.
generated	boolean	False	Has this connector been generated automatically by another component or it has been manually added by the user. A generated connector must be configured by the component that has generated it and won't be editable in admin-ui

Nested elements

configKeyValue*
forcedMetaKeyValue*
ConnectorScheduledScan*
PostProcessingPipeline
PushConfig
SourceCachingConfig
SourceFetchConfig
SourcePreviewConfig
SourceThumbnailsConfig

Type	Wrapper	Description
KeyValue*	config
KeyValue*	forcedMeta
ConnectorScheduledScan*	-	List of scheduled scans for the connector.
PostProcessingPipeline	-	Pipeline post-processing documents sent by this connector.
PushConfig	-	Specifies how documents are sent to the indexing-server.
SourceCachingConfig	-
SourceFetchConfig	-
SourcePreviewConfig	-
SourceThumbnailsConfig	-

XML representation

<Connector name="" classId="" customClassId="" managed="false" connectorServer="" buildGroup="" pushAPIServer="" authenticationMode="public" login="" password="" defaultDataModelClass="" documentsType="" generated="false" ></Connector>

ConnectorScheduledScan com.exalead.mercury.mami.connect.v10.ConnectorScheduledScan

Defines the connector scheduling

Parent objects

Connector

Attributes

scanMode string

Name	Type	Default value	Description
scanMode	string	-	The scan mode to trigger.

Nested elements

scanModeConfigKeyValue*
ConnectorSchedulingTrigger

Type	Wrapper	Description
KeyValue*	scanModeConfig	The configuration given to the scheduled scan.
ConnectorSchedulingTrigger	-

XML representation

<ConnectorScheduledScan scanMode="" ></ConnectorScheduledScan>

ConnectorSchedulingTrigger com.exalead.mercury.mami.connect.v10.ConnectorSchedulingTrigger

Defines the scheduled interval.

Parent objects

ConnectorScheduledScan

Attributes

startTimestamp long
endTimestamp long

Name	Type	Default value	Description
startTimestamp	long	-
endTimestamp	long	-

XML representation

<ConnectorSchedulingTrigger startTimestamp="" endTimestamp="" ></ConnectorSchedulingTrigger>

SimpleConnectorSchedulingTrigger com.exalead.mercury.mami.connect.v10.SimpleConnectorSchedulingTrigger

Defines the scheduled interval.

Parent objects

ConnectorScheduledScan

Attributes

Name	Type	Default value	Description
startTimestamp	long	-
endTimestamp	long	-
repeatInterval	long	-

XML representation

<SimpleConnectorSchedulingTrigger startTimestamp="" endTimestamp="" repeatInterval="" ></SimpleConnectorSchedulingTrigger>

CronConnectorSchedulingTrigger com.exalead.mercury.mami.connect.v10.CronConnectorSchedulingTrigger

Configures the scheduled scan for a connector, given a start timestamp and an end timestamp.

Parent objects

ConnectorScheduledScan

Attributes

Name	Type	Default value	Description
startTimestamp	long	-
endTimestamp	long	-
cronExpression	string	-	The Quartz Cron expression made of five time and date fields.

XML representation

<CronConnectorSchedulingTrigger startTimestamp="" endTimestamp="" cronExpression="" ></CronConnectorSchedulingTrigger>

CustomPostProcessingPipeline com.exalead.mercury.mami.connect.v10.CustomPostProcessingPipeline

Post-processing pipeline based on a custom Java class. A component must be providing, implementing the com.exalead.dataprocessing.processors.cloudview.papi.connect.ConnectorDataProcessingPipelineBuilder.java interface.
The component must be packaged in a plugin.
The ConnectorDataProcessingPipelineBuilder must create its DataProcessing API pipeline, and all documents will be sent through it.

Parent objects

Connector

Attributes

builderClassId string

Name	Type	Default value	Description
builderClassId	string	-	Java class id of the PipelineBuilder implementation.

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-	Configuration of the PipelineBuilder implementation.

XML representation

<CustomPostProcessingPipeline builderClassId="" ></CustomPostProcessingPipeline>

SimplePostProcessingPipeline com.exalead.mercury.mami.connect.v10.SimplePostProcessingPipeline

Creates a configured post-processing pipeline

Nested elements

ScanPipelinePipelineBranch
FetchPipelinePipelineBranch

Type	Wrapper	Description
PipelineBranch	ScanPipeline	A PAPI Source processor will be automatically added at the beginning of the branch on connector's scan.
PipelineBranch	FetchPipeline	A PAPI Source processor will be automatically added at the beginning of the branch on connector's fetch.

XML representation

<SimplePostProcessingPipeline ></SimplePostProcessingPipeline>

BasicPipelineBranch com.exalead.mercury.mami.connect.v10.BasicPipelineBranch

A Simple PipelineBranch

Parent objects

BasicPipelineBranch
MultithreadedDispatchBranch
PipelineBranch

Attributes

input string
output string
autolink boolean
name string

Name	Type	Default value	Description
input	string	-	Name of the BranchAction used as input (with the processors'input if it is a Process action)
output	string	-	Name of the BranchAction used as input (with the processors'input if it is a Process action)
autolink	boolean	False	Links automatically successive BranchActions on their first input and output when they have no explicit link.
name	string	-	Unique name for this pipeline branch

Nested elements

BranchAction*
PipelineLink*

Type	Wrapper	Description
BranchAction*	-	Actions in the branch
PipelineLink*	-	Links between actions

XML representation

<BasicPipelineBranch input="" output="" autolink="false" name="" ></BasicPipelineBranch>

PipelineLink com.exalead.mercury.mami.connect.v10.PipelineLink

Link between two BranchActions

Parent objects

BasicPipelineBranch
MultithreadedDispatchBranch
PipelineBranch

Attributes

source string
target string

Name	Type	Default value	Description
source	string	-	Name of the output which is used as this link source
target	string	-	Name of the input which is used as this link target

XML representation

<PipelineLink source="" target="" ></PipelineLink>

MultithreadedDispatchBranch com.exalead.mercury.mami.connect.v10.MultithreadedDispatchBranch

Replicates a branch multiple times and connects a dispatch to the input and a union to the output

Parent objects

BasicPipelineBranch
MultithreadedDispatchBranch
PipelineBranch

Attributes

input string
output string
autolink boolean
name string
nbThreads int

Name	Type	Default value	Description
input	string	-	Name of the BranchAction used as input (with the processors'input if it is a Process action)
output	string	-	Name of the BranchAction used as input (with the processors'input if it is a Process action)
autolink	boolean	False	Links automatically successive BranchActions on their first input and output when they have no explicit link.
name	string	-	Unique name for this pipeline branch
nbThreads	int	4	Number of replications of this branch

Nested elements

BranchAction*
PipelineLink*

Type	Wrapper	Description
BranchAction*	-	Actions in the branch
PipelineLink*	-	Links between actions

XML representation

<MultithreadedDispatchBranch input="" output="" autolink="false" name="" nbThreads="4" ></MultithreadedDispatchBranch>

CustomProcess com.exalead.mercury.mami.connect.v10.CustomProcess

Processes the records with a custom processor. The action name will be the processor name.

Parent objects

BasicPipelineBranch
MultithreadedDispatchBranch
PipelineBranch

Attributes

name string
classId string

Name	Type	Default value	Description
name	string	-	Unique name for this pipeline branch
classId	string	-	Processor's class

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-	Processor's configuration

XML representation

<CustomProcess name="" classId="" ></CustomProcess>

QueueProcess com.exalead.mercury.mami.connect.v10.QueueProcess

Creates a QueueProcessor

Parent objects

BasicPipelineBranch
MultithreadedDispatchBranch
PipelineBranch

Attributes

name string
capacity int

Name	Type	Default value	Description
name	string	-	Unique name for this pipeline branch
capacity	int	0	Maximum number elements in the queue. 0 means no limit

XML representation

<QueueProcess name="" capacity="0" ></QueueProcess>

ForwardProcess com.exalead.mercury.mami.connect.v10.ForwardProcess

Creates a ForwardProcessor

Parent objects

BasicPipelineBranch
MultithreadedDispatchBranch
PipelineBranch

Attributes

name string

Name	Type	Default value	Description
name	string	-	Unique name for this pipeline branch

XML representation

<ForwardProcess name="" ></ForwardProcess>

CloudViewPushAPITargetProcess com.exalead.mercury.mami.connect.v10.CloudViewPushAPITargetProcess

Creates a CloudViewPushAPITargetProcessor. All instances will share the same PushAPI.

Parent objects

BasicPipelineBranch
MultithreadedDispatchBranch
PipelineBranch

Attributes

name string

Name	Type	Default value	Description
name	string	-	Unique name for this pipeline branch

XML representation

<CloudViewPushAPITargetProcess name="" ></CloudViewPushAPITargetProcess>

UnionProcess com.exalead.mercury.mami.connect.v10.UnionProcess

Creates a UnionProcessor

Parent objects

BasicPipelineBranch
MultithreadedDispatchBranch
PipelineBranch

Attributes

name string
orderingByMarker boolean

Name	Type	Default value	Description
name	string	-	Unique name for this pipeline branch
orderingByMarker	boolean	True	This behavior enables the restoration of sessions when a stream has been split into multiple streams which are joined by this processor. Even if some split streams process faster, the elements from the previous session will all be sent before beginning to send the next one.

XML representation

<UnionProcess name="" orderingByMarker="true" ></UnionProcess>

RoundRobinProcess com.exalead.mercury.mami.connect.v10.RoundRobinProcess

Creates a RoundRobinProcessor

Parent objects

BasicPipelineBranch
MultithreadedDispatchBranch
PipelineBranch

Attributes

name string

Name	Type	Default value	Description
name	string	-	Unique name for this pipeline branch

XML representation

<RoundRobinProcess name="" ></RoundRobinProcess>

SetFieldValue com.exalead.mercury.mami.connect.v10.SetFieldValue

Sets the value of a field

Parent objects

BasicPipelineBranch
MultithreadedDispatchBranch
PipelineBranch

Attributes

name string
field string
value string

Name	Type	Default value	Description
name	string	-	Unique name for this pipeline branch
field	string	-	Field name
value	string	-	Value to set the field to

XML representation

<SetFieldValue name="" field="" value="" ></SetFieldValue>

RenameField com.exalead.mercury.mami.connect.v10.RenameField

Renames a field

Parent objects

BasicPipelineBranch
MultithreadedDispatchBranch
PipelineBranch

Attributes

name string
origName string
newName string

Name	Type	Default value	Description
name	string	-	Unique name for this pipeline branch
origName	string	-	Original name of field.
newName	string	-	New name of field.

XML representation

<RenameField name="" origName="" newName="" ></RenameField>

DeleteFields com.exalead.mercury.mami.connect.v10.DeleteFields

Deletes a set of fields

Parent objects

BasicPipelineBranch
MultithreadedDispatchBranch
PipelineBranch

Attributes

name string
fields string

Name	Type	Default value	Description
name	string	-	Unique name for this pipeline branch
fields	string	-	Comma-separated list of fields to remove.

XML representation

<DeleteFields name="" fields="" ></DeleteFields>

CustomTransform com.exalead.mercury.mami.connect.v10.CustomTransform

Custom transformation

Parent objects

BasicPipelineBranch
MultithreadedDispatchBranch
PipelineBranch

Attributes

name string
classId string

Name	Type	Default value	Description
name	string	-	Unique name for this pipeline branch
classId	string	-

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-

XML representation

<CustomTransform name="" classId="" ></CustomTransform>

PushConfig com.exalead.mercury.mami.connect.v10.PushConfig

Specifies how documents are sent to the indexing-server.

Parent objects

Connector

Attributes

Name	Type	Default value	Description
buffer	boolean	True	Enables buffering of documents in the connector to send them as batches to the indexing server.
triggerIndexingAfterScan	boolean	False	Always triggers an indexing job after a completed scan.
profilePushOperations	boolean	False	Enables profiling of push operations. The profiling can be costly on small documents.
automaticRestartOnFailure	boolean	True	When a scan throws a `PushAPIException` does this connector automatically restart? This allows the connector to retry a scan operation when a component in the indexing chain crashes. Set this option to `false` to disable the retry behavior.

XML representation

<PushConfig buffer="true" triggerIndexingAfterScan="false" profilePushOperations="false" automaticRestartOnFailure="true" ></PushConfig>

SourceCachingConfig com.exalead.mercury.mami.connect.v10.SourceCachingConfig

Defines how source documents are put in the document cache

Parent objects

Connector

Attributes

Name	Type	Default value	Description
storeInDocumentCache	boolean	True
minSizeForCachingB	long	-
maxSizeForCachingB	long	-

XML representation

<SourceCachingConfig storeInDocumentCache="true" minSizeForCachingB="" maxSizeForCachingB="" ></SourceCachingConfig>

SourceFetchConfig com.exalead.mercury.mami.connect.v10.SourceFetchConfig

Defines how source documents are "fetched" for download, preview and thumbnails

Parent objects

Connector

Attributes

Name	Type	Default value	Description
allowRawDocumentFetch	boolean	True
customFetcherClass	string	-
customFetcherUrl	string	-	Base URL used for retrieving documents from this connector. This is used for preview, thumbnails and raw fetch from the search results. For unmanaged connectors, this is always used. For managed connectors, if this parameter is given, it completely replaces the retriever within the connectors server.
fetchProtocol	string	-	Protocol implemented by the `customFetcherURL`. One of `v1`, `v2`, `rpv3`, or `networkRetriever`

XML representation

<SourceFetchConfig allowRawDocumentFetch="true" customFetcherClass="" customFetcherUrl="" fetchProtocol="" ></SourceFetchConfig>

SourcePreviewConfig com.exalead.mercury.mami.connect.v10.SourcePreviewConfig

Defines the configuration for image and HTML preview of the documents of a source.

Parent objects

Connector

Attributes

allowHTMLPreview boolean
allowImagePreview boolean

Name	Type	Default value	Description
allowHTMLPreview	boolean	True
allowImagePreview	boolean	True

XML representation

<SourcePreviewConfig allowHTMLPreview="true" allowImagePreview="true" ></SourcePreviewConfig>

SourceThumbnailsConfig com.exalead.mercury.mami.connect.v10.SourceThumbnailsConfig

No documentation for this element.

Parent objects

Connector

Attributes

Name	Type	Default value	Description
allowThumbnails	boolean	True
precomputeThumbnails	boolean	False
precomputedThumbnailsHeight	int	120
precomputedThumbnailsWidth	int	120
homePageOnly	boolean	False	When crawling web sites, only compute and generate thumbnails for the home pages

XML representation

<SourceThumbnailsConfig allowThumbnails="true" precomputeThumbnails="false" precomputedThumbnailsHeight="120" precomputedThumbnailsWidth="120" homePageOnly="false" ></SourceThumbnailsConfig>

CrawlConfig com.exalead.mercury.mami.crawl.v21.CrawlConfig

The crawl configuration.

Attributes

version long
verbose boolean

Name	Type	Default value	Description
version	long	-	-
verbose	boolean	False

Nested elements

ICrawler*

Type	Wrapper	Description
ICrawler*	-

XML representation

<CrawlConfig version="" verbose="false" ></CrawlConfig>

Crawler com.exalead.mercury.mami.crawl.v21.Crawler

A crawler configuration.
A crawler may contain a CrawlSchedulerConfig to overwrite the default fifo priorities.
A crawler may contain a CustomCrawlConfig to enable custom processors.

Parent objects

CrawlConfig

Attributes

Name	Type	Default value	Description
name	string	-	The crawler name. It must be unique across all crawlers.
documentsType	string	-	The type of documents pushed by this connector. The type of documents must match one of the types declared in your CloudView license file.
fetcher	string	-	Which fetcher to use.
crawlerServer	string	-	Crawler server hosting this crawler. See Deployment configuration.
connectorServer	string	-	Connector server hosting the indexing part of this crawler. See Deployment configuration.
buildGroup	string	-	Target build group.
dataModel	string	-	The default data model for documents indexed by this crawler.
storeTextOnly	boolean	True	Whether to store original binary documents, or only converted text.
nthreads	int	1	The number of crawl threads which must be strictly positive.
aggressive	boolean	False	Whether to enable aggressive crawl, that never sleeps between two requests to the same host.
throttleTimeMS	int	2500	In the case of non-aggressive crawl, this defines the sleep interval between requests to the same host.
ignoreRobotsTxt	boolean	False	Whether to ignore robots.txt rules. Not recommended.
enableConvertProcessor	boolean	True	Whether to enable remoteconvert-based processor for links extracting in binary documents.
nearDuplicateDetector	boolean	True	Whether to enable the near-duplicate content detector.
patternsDetector	boolean	True	Whether to enable patterns detection in pages.
crawlSitemaps	boolean	True	Whether to crawl sitemaps.
disableConditionalGet	boolean	False	Whether to always fetch documents, even if the server tells it has not changed.
defaultAccept	boolean	False	Whether to crawl a url by default when it matches no other accept rule.
defaultIndex	boolean	False	Whether to index by default when a url matches no index rule.
defaultFollow	boolean	False	Whether to follow by default when a url matches no follow rule.
defaultFollowRoots	boolean	True	Whether to automatically follow root urls
enableSimpleSiteCollapsing	boolean	True	Whether to generate a site ID suitable for document collapsing.
simpleSiteCollapsingDepth	int	0	How many path segments to use to generate the site collapsing ID.
mimeTypesMode	string	exclude	Mime types white/black list
smartRefresh	boolean	True	Whether to crawl a fraction of refreshed urls.
smartRefreshMinAgeS	int	3600	Age in seconds at which we may refresh old urls.
smartRefreshMaxAgeS	int	604800	Age in seconds at which we force the refresh of old urls.
archiveDocuments	boolean	False	When enabled, deleted documents are not deleted, but kept with their deletion date.
enableConsolidation	boolean	True	Define if we use a standard PAPI or a consolidation PAPI.

Nested elements

mimeTypesStringConstantValue*
sessionIdBlacklistStringConstantValue*
PushAPIFilterKeyValue*
rootsRoot*
rootsetsRootSet*
CrawlSchedulerConfig
CustomCrawlConfig
Rules*
UrlTesterData

Type	Wrapper	Description
StringConstantValue*	mimeTypes
StringConstantValue*	sessionIdBlacklist	SessionId blacklist. These parameters are removed from URLs with a path or query part containing them.
KeyValue*	PushAPIFilter
Root*	roots	A list of root urls to start the crawl from.
RootSet*	rootsets	A list of files to load urls/sites from.
CrawlSchedulerConfig	-
CustomCrawlConfig	-
Rules*	-
UrlTesterData	-

XML representation

<Crawler name="" documentsType="" fetcher="" crawlerServer="" connectorServer="" buildGroup="" dataModel="" storeTextOnly="true" nthreads="1" aggressive="false" throttleTimeMS="2500" ignoreRobotsTxt="false" enableConvertProcessor="true" nearDuplicateDetector="true" patternsDetector="true" crawlSitemaps="true" disableConditionalGet="false" defaultAccept="false" defaultIndex="false" defaultFollow="false" defaultFollowRoots="true" enableSimpleSiteCollapsing="true" simpleSiteCollapsingDepth="0" mimeTypesMode="exclude" smartRefresh="true" smartRefreshMinAgeS="3600" smartRefreshMaxAgeS="604800" archiveDocuments="false" enableConsolidation="true" ></Crawler>

CrawlSchedulerConfig com.exalead.mercury.mami.crawl.v21.CrawlSchedulerConfig

Scheduler configuration.
Use at your own risk.

Parent objects

Crawler
FeedFetcher
ICrawler

Attributes

Name	Type	Default value	Description
priority0	int	100000	Fifo priorities. by default, only user-submitted urls
priority1	int	10000	by default, only redirected urls
priority2	int	1000	by default, only indexed and not followed urls
priority3	int	100	by default, only indexed and followed urls
priority4	int	10	by default, only followed and not indexed urls
refreshPriority	int	1	refresh source

XML representation

<CrawlSchedulerConfig priority0="100000" priority1="10000" priority2="1000" priority3="100" priority4="10" refreshPriority="1" ></CrawlSchedulerConfig>

CustomCrawlConfig com.exalead.mercury.mami.crawl.v21.CustomCrawlConfig

Custom processors specification.

Parent objects

Crawler
FeedFetcher
ICrawler

Attributes

Name	Type	Default value	Description
preProcessorClassId	string	-	Custom PreProcessor. Called at the end of the preprocess pipe.
fetcherClassId	string	-	Custom Fetcher.
processorClassId	string	-	Custom Processor. Called at the end of the process pipe. Catches all mime types.
htmlProcessorClassId	string	-	Custom HTML Processor. Called at the of the html process pipe. Catches only html documents.
linksFilterClassId	string	-	Custom LinksFilter. Called at the end of the links filter list. Can decide whether to crawl an outgoing link.
postProcessorClassId	string	-	Custom PostProcessor. Called at the end of the postprocess pipe.
crawlerTemplate	string	-	Alternatively, specify the url of a xml file describing the whole crawler.

XML representation

<CustomCrawlConfig preProcessorClassId="" fetcherClassId="" processorClassId="" htmlProcessorClassId="" linksFilterClassId="" postProcessorClassId="" crawlerTemplate="" ></CustomCrawlConfig>

Rules com.exalead.mercury.mami.crawl.v21.Rules

A rule set identified by a key.

Parent objects

Crawler
FeedFetcher
ICrawler

Attributes

key string
group string

Name	Type	Default value	Description
key	string	-	How to interpret these rules. auto, adminui: place index/follow/accept rules where relevant, automatically. may break complex rules depending on other key-values pre, post, link, papi: place in preprocessor, postprocessor, linksfilter or papi filter only.
group	string	default	Key used to group rules and root urls.

Nested elements

Rule*

Type	Wrapper	Description
Rule*	-

XML representation

<Rules key="" group="default" ></Rules>

Rule com.exalead.mercury.mami.crawl.v21.Rule

No documentation for this element.

Parent objects

Rules

Attributes

message string

Name	Type	Default value	Description
message	string	-

Nested elements

Action*
Pattern*
PostFilter*

Type	Wrapper	Description
Action*	-
Pattern*	-
PostFilter*	-

XML representation

<Rule message="" ></Rule>

ActionSetKV com.exalead.actionrules.v21.ActionSetKV

No documentation for this element.

Parent objects

Rule

Attributes

key string
value string

Name	Type	Default value	Description
key	string	-
value	string	-

XML representation

<ActionSetKV key="" value="" ></ActionSetKV>

ActionAppendKV com.exalead.actionrules.v21.ActionAppendKV

No documentation for this element.

Parent objects

Rule

Attributes

key string
value string

Name	Type	Default value	Description
key	string	-
value	string	-

XML representation

<ActionAppendKV key="" value="" ></ActionAppendKV>

ActionSetNoIndex com.exalead.actionrules.v21.ActionSetNoIndex

No documentation for this element.

XML representation

<ActionSetNoIndex ></ActionSetNoIndex>

ActionSetNoFollow com.exalead.actionrules.v21.ActionSetNoFollow

No documentation for this element.

XML representation

<ActionSetNoFollow ></ActionSetNoFollow>

ActionSetIgnore com.exalead.actionrules.v21.ActionSetIgnore

No documentation for this element.

XML representation

<ActionSetIgnore ></ActionSetIgnore>

ActionUrlDeleteQueryArg com.exalead.actionrules.v21.ActionUrlDeleteQueryArg

No documentation for this element.

Parent objects

Rule

Attributes

token string
caseSensitive boolean

Name	Type	Default value	Description
token	string	-
caseSensitive	boolean	True

XML representation

<ActionUrlDeleteQueryArg token="" caseSensitive="True" ></ActionUrlDeleteQueryArg>

ActionUrlDeleteQuerySessionId com.exalead.actionrules.v21.ActionUrlDeleteQuerySessionId

No documentation for this element.

Parent objects

Rule

Attributes

token string
caseSensitive boolean
strict boolean

Name	Type	Default value	Description
token	string	-
caseSensitive	boolean	True
strict	boolean	False

XML representation

<ActionUrlDeleteQuerySessionId token="" caseSensitive="True" strict="False" ></ActionUrlDeleteQuerySessionId>

ActionUrlDeletePathToken com.exalead.actionrules.v21.ActionUrlDeletePathToken

No documentation for this element.

Parent objects

Rule

Attributes

token string
caseSensitive boolean

Name	Type	Default value	Description
token	string	-
caseSensitive	boolean	True

XML representation

<ActionUrlDeletePathToken token="" caseSensitive="True" ></ActionUrlDeletePathToken>

ActionUrlDeletePathSessionId com.exalead.actionrules.v21.ActionUrlDeletePathSessionId

No documentation for this element.

Parent objects

Rule

Attributes

token string
caseSensitive boolean

Name	Type	Default value	Description
token	string	-
caseSensitive	boolean	True

XML representation

<ActionUrlDeletePathSessionId token="" caseSensitive="True" ></ActionUrlDeletePathSessionId>

ActionUrlRegexReplace com.exalead.actionrules.v21.ActionUrlRegexReplace

No documentation for this element.

Parent objects

Rule

Attributes

input string
output string
field string

Name	Type	Default value	Description
input	string	-
output	string	-
field	string	-

XML representation

<ActionUrlRegexReplace input="" output="" field="" ></ActionUrlRegexReplace>

ActionUrlAddQueryArg com.exalead.actionrules.v21.ActionUrlAddQueryArg

No documentation for this element.

Parent objects

Rule

Attributes

token string
value string

Name	Type	Default value	Description
token	string	-
value	string	-

XML representation

<ActionUrlAddQueryArg token="" ></ActionUrlAddQueryArg>

ActionSetRepetitiveTokens com.exalead.actionrules.v21.ActionSetRepetitiveTokens

No documentation for this element.

Parent objects

Rule

Attributes

keyName string
numerical boolean

Name	Type	Default value	Description
keyName	string	crawlUrl.repetitive
numerical	boolean	True

XML representation

<ActionSetRepetitiveTokens keyName="crawlUrl.repetitive" numerical="True" ></ActionSetRepetitiveTokens>

ActionUrlCapture com.exalead.actionrules.v21.ActionUrlCapture

No documentation for this element.

Parent objects

Rule

Attributes

token string
field string
keyNamePrefix string

Name	Type	Default value	Description
token	string	-
field	string	-
keyNamePrefix	string	crawlUrl.capture

XML representation

<ActionUrlCapture token="" field="" keyNamePrefix="crawlUrl.capture" ></ActionUrlCapture>

Accept com.exalead.mercury.mami.crawl.v21.Accept

Actions on urls.

XML representation

<Accept ></Accept>

Ignore com.exalead.mercury.mami.crawl.v21.Ignore

No documentation for this element.

XML representation

<Ignore ></Ignore>

Index com.exalead.mercury.mami.crawl.v21.Index

No documentation for this element.

XML representation

<Index ></Index>

NoIndex com.exalead.mercury.mami.crawl.v21.NoIndex

No documentation for this element.

XML representation

<NoIndex ></NoIndex>

Follow com.exalead.mercury.mami.crawl.v21.Follow

No documentation for this element.

XML representation

<Follow ></Follow>

NoFollow com.exalead.mercury.mami.crawl.v21.NoFollow

No documentation for this element.

XML representation

<NoFollow ></NoFollow>

AddMeta com.exalead.mercury.mami.crawl.v21.AddMeta

Add a meta on a url that will be pushed to the PAPI.

Parent objects

Rule

Attributes

name string
value string

Name	Type	Default value	Description
name	string	-
value	string	-

XML representation

<AddMeta name="" value="" ></AddMeta>

Source com.exalead.mercury.mami.crawl.v21.Source

Index matching urls in a different source.

Parent objects

Rule

Attributes

name string

Name	Type	Default value	Description
name	string	-

XML representation

<Source name="" ></Source>

DataModelClass com.exalead.mercury.mami.crawl.v21.DataModelClass

Change the datamodel class of matching urls.

Parent objects

Rule

Attributes

name string

Name	Type	Default value	Description
name	string	-

XML representation

<DataModelClass name="" ></DataModelClass>

Priority com.exalead.mercury.mami.crawl.v21.Priority

Shift the priority of urls.

Parent objects

Rule

Attributes

shift int

Name	Type	Default value	Description
shift	int	0	set a negative number to crawl faster, positive to crawl slower. For example, shift = -1 will make the url go in the next higher priority fifo.

XML representation

<Priority shift="0" ></Priority>

Or com.exalead.actionrules.v21.Or

No documentation for this element.

Nested elements

Pattern*

Type	Wrapper	Description
Pattern*	-

XML representation

And com.exalead.actionrules.v21.And

No documentation for this element.

Nested elements

Pattern*

Type	Wrapper	Description
Pattern*	-

XML representation

<And ></And>

Not com.exalead.actionrules.v21.Not

No documentation for this element.

Nested elements

Pattern

Type	Wrapper	Description
Pattern	-

XML representation

<Not ></Not>

Atom com.exalead.actionrules.v21.Atom

Raw patterns
Applies on the main url unless specified otherwise.

Parent objects

And
Config
Not
Or
Rule

Attributes

field string
kind string
norm string
value string
matchedUrl string
litteral boolean

Name	Type	Default value	Description
field	string	-	the field on which the pattern is applied A field may be the whole url or a part of it (url, scheme, host, port, path, query)
kind	string	-	specify the semantics of the attribute value - "length": specify the length of a field ([:10], [11:12], [30:]) - "exact", "prefix", "suffix", "inside": specify a regexp and its anchoring @warning for readability purpose, regexp escaping policy is reversed. Special characters must be backslash escaped: write "www.\.\.tv" instead of "www\..\.tv" (backslash must be itself escaped in C-style strings, a single backslash is really needed, in xml for example)
norm	string	none	Specify the normalization level (default is case insentive match) values: norm, lower or none
value	string	-	value (regexp)
matchedUrl	string	-	If not empty, this rule applies on the url provided with the name matchedUrl, instead of the main url.
litteral	boolean	True

XML representation

<Atom field="" kind="" norm="none" value="" matchedUrl="" litteral="True" ></Atom>

Url com.exalead.actionrules.v21.Url

shortcut for url-exact

Parent objects

And
Config
Not
Or
Rule

Attributes

val string
norm string
matchedUrl string
litteral boolean

Name	Type	Default value	Description
val	string	-
norm	string	norm
matchedUrl	string	-
litteral	boolean	True

XML representation

<Url val="" norm="norm" matchedUrl="" litteral="True" ></Url>

Scheme com.exalead.actionrules.v21.Scheme

shortcut for scheme-exact

Parent objects

And
Config
Not
Or
Rule

Attributes

val string
norm string
matchedUrl string
litteral boolean

Name	Type	Default value	Description
val	string	-
norm	string	norm
matchedUrl	string	-
litteral	boolean	True

XML representation

<Scheme val="" norm="norm" matchedUrl="" litteral="True" ></Scheme>

Host com.exalead.actionrules.v21.Host

No documentation for this element.

Parent objects

And
Config
Not
Or
Rule

Attributes

val string
norm string
matchedUrl string
litteral boolean

Name	Type	Default value	Description
val	string	-
norm	string	norm
matchedUrl	string	-
litteral	boolean	True

XML representation

<Host val="" norm="norm" matchedUrl="" litteral="True" ></Host>

Domain com.exalead.actionrules.v21.Domain

No documentation for this element.

Parent objects

And
Config
Not
Or
Rule

Attributes

val string
norm string
matchedUrl string
litteral boolean

Name	Type	Default value	Description
val	string	-
norm	string	norm
matchedUrl	string	-
litteral	boolean	True

XML representation

<Domain val="" norm="norm" matchedUrl="" litteral="True" ></Domain>

Port com.exalead.actionrules.v21.Port

shortcut for port-exact

Parent objects

And
Config
Not
Or
Rule

Attributes

val string
norm string
matchedUrl string
litteral boolean

Name	Type	Default value	Description
val	string	-
norm	string	norm
matchedUrl	string	-
litteral	boolean	True

XML representation

<Port val="" norm="norm" matchedUrl="" litteral="True" ></Port>

Path com.exalead.actionrules.v21.Path

shortcut for path-prefix

Parent objects

And
Config
Not
Or
Rule

Attributes

val string
norm string
matchedUrl string
litteral boolean

Name	Type	Default value	Description
val	string	-
norm	string	norm
matchedUrl	string	-
litteral	boolean	True

XML representation

<Path val="" norm="norm" matchedUrl="" litteral="True" ></Path>

Ext com.exalead.actionrules.v21.Ext

shortcut for path-suffix

Parent objects

And
Config
Not
Or
Rule

Attributes

val string
norm string
matchedUrl string
litteral boolean

Name	Type	Default value	Description
val	string	-
norm	string	norm
matchedUrl	string	-
litteral	boolean	True

XML representation

<Ext val="" norm="norm" matchedUrl="" litteral="True" ></Ext>

Query com.exalead.actionrules.v21.Query

shortcut for query-exact

Parent objects

And
Config
Not
Or
Rule

Attributes

val string
norm string
matchedUrl string
litteral boolean

Name	Type	Default value	Description
val	string	-
norm	string	norm
matchedUrl	string	-
litteral	boolean	True

XML representation

<Query val="" norm="norm" matchedUrl="" litteral="True" ></Query>

QueryArg com.exalead.actionrules.v21.QueryArg

shortcut for query ?tok= or &tok=

Parent objects

And
Config
Not
Or
Rule

Attributes

val string
norm string
matchedUrl string
litteral boolean

Name	Type	Default value	Description
val	string	-
norm	string	norm
matchedUrl	string	-
litteral	boolean	True

XML representation

<QueryArg val="" norm="norm" matchedUrl="" litteral="True" ></QueryArg>

InQuery com.exalead.actionrules.v21.InQuery

shortcut for query-inside

Parent objects

And
Config
Not
Or
Rule

Attributes

val string
norm string
matchedUrl string
litteral boolean

Name	Type	Default value	Description
val	string	-
norm	string	norm
matchedUrl	string	-
litteral	boolean	True

XML representation

<InQuery val="" norm="norm" matchedUrl="" litteral="True" ></InQuery>

InPath com.exalead.actionrules.v21.InPath

shortcut for path-inside

Parent objects

And
Config
Not
Or
Rule

Attributes

val string
norm string
matchedUrl string
litteral boolean

Name	Type	Default value	Description
val	string	-
norm	string	norm
matchedUrl	string	-
litteral	boolean	True

XML representation

<InPath val="" norm="norm" matchedUrl="" litteral="True" ></InPath>

Length com.exalead.actionrules.v21.Length

shortcut for field-length

Parent objects

And
Config
Not
Or
Rule

Attributes

val string
norm string
matchedUrl string
litteral boolean
field string

Name	Type	Default value	Description
val	string	-
norm	string	norm
matchedUrl	string	-
litteral	boolean	True
field	string	-

XML representation

<Length val="" norm="norm" matchedUrl="" litteral="True" field="" ></Length>

ExternKey com.exalead.actionrules.v21.ExternKey

Match on an extern key, not the default url

Parent objects

And
Config
Not
Or
Rule

Attributes

val string
norm string
matchedUrl string
litteral boolean
key string

Name	Type	Default value	Description
val	string	-
norm	string	norm
matchedUrl	string	-
litteral	boolean	True
key	string	-

XML representation

<ExternKey val="" norm="norm" matchedUrl="" litteral="True" key="" ></ExternKey>

Num com.exalead.actionrules.v21.Num

Test the numerical value of an extern key. Supported operators : <, <=, =, >, >=

Parent objects

And
Config
Not
Or
Rule

Attributes

key string
val string
norm string
matchedUrl string
litteral boolean

Name	Type	Default value	Description
key	string	-
val	string	-
norm	string	norm
matchedUrl	string	-
litteral	boolean	True

XML representation

<Num key="" val="" norm="norm" matchedUrl="" litteral="True" ></Num>

ExternKeyPrefix com.exalead.actionrules.v21.ExternKeyPrefix

Match the prefix of an extern key

Parent objects

And
Config
Not
Or
Rule

Attributes

val string
norm string
matchedUrl string
litteral boolean
key string

Name	Type	Default value	Description
val	string	-
norm	string	norm
matchedUrl	string	-
litteral	boolean	True
key	string	-

XML representation

<ExternKeyPrefix val="" norm="norm" matchedUrl="" litteral="True" key="" ></ExternKeyPrefix>

ExternKeyInside com.exalead.actionrules.v21.ExternKeyInside

Match inside a list extern keys

Parent objects

And
Config
Not
Or
Rule

Attributes

val string
norm string
matchedUrl string
litteral boolean
key string

Name	Type	Default value	Description
val	string	-
norm	string	norm
matchedUrl	string	-
litteral	boolean	True
key	string	-

XML representation

<ExternKeyInside val="" norm="norm" matchedUrl="" litteral="True" key="" ></ExternKeyInside>

PostFilterProba com.exalead.actionrules.v21.PostFilterProba

Randomly return true or false. @param value The probability to return true.

Parent objects

Rule

Attributes

value float

Name	Type	Default value	Description
value	float	-

XML representation

<PostFilterProba value="" ></PostFilterProba>

PostFilterRepetitiveTokens com.exalead.actionrules.v21.PostFilterRepetitiveTokens

Filter tested after the pattern has matched.
A postfilter allows to apply a filtering that cannot be expressed by rules.

Parent objects

Rule

Attributes

crossLevel string
numerical boolean

Name	Type	Default value	Description
crossLevel	string	-
numerical	boolean	True

XML representation

<PostFilterRepetitiveTokens crossLevel="" numerical="True" ></PostFilterRepetitiveTokens>

InferredDatePostFilter com.exalead.actionrules.v21.InferredDatePostFilter

Filter tested after the pattern has matched.
A postfilter allows to apply a filtering that cannot be expressed by rules.

Parent objects

Rule

Attributes

Name	Type	Default value	Description
searchDateInPath	boolean	True
searchDateInQuery	boolean	True
maxYearsBefore	int	-1
maxYearsAfter	int	-1
maxMonthsBefore	int	-1
maxMonthsAfter	int	-1
maxDaysBefore	int	-1
maxDaysAfter	int	-1
searchDateFormats	string	-
matchIfOutsideRange	boolean	False
matchIfNoDate	boolean	False

XML representation

<InferredDatePostFilter searchDateInPath="True" searchDateInQuery="True" maxYearsBefore="-1" maxYearsAfter="-1" maxMonthsBefore="-1" maxMonthsAfter="-1" maxDaysBefore="-1" maxDaysAfter="-1" searchDateFormats="" matchIfOutsideRange="False" matchIfNoDate="False" ></InferredDatePostFilter>

CustomPostFilter com.exalead.actionrules.v21.CustomPostFilter

Filter tested after the pattern has matched.
A postfilter allows to apply a filtering that cannot be expressed by rules.

Parent objects

Rule

Attributes

classId string

Name	Type	Default value	Description
classId	string	-	The specified class must implement the {@code com.exalead.actionrules.CustomPostFilter} Exascript interface.

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-

XML representation

<CustomPostFilter classId="" ></CustomPostFilter>

UrlTesterData com.exalead.mercury.mami.crawl.v21.UrlTesterData

No documentation for this element.

Nested elements

urlsUrlTestConfig*

Type	Wrapper	Description
UrlTestConfig*	urls

XML representation

<UrlTesterData ></UrlTesterData>

UrlTestConfig com.exalead.mercury.mami.crawl.v21.UrlTestConfig

No documentation for this element.

Parent objects

urls UrlTesterData

Attributes

url string
group string
enableAdvancedMode boolean
accept boolean
index boolean
follow boolean

Name	Type	Default value	Description
url	string	-
group	string	default
enableAdvancedMode	boolean	False
accept	boolean	False
index	boolean	False
follow	boolean	False

XML representation

<UrlTestConfig url="" group="default" enableAdvancedMode="false" accept="false" index="false" follow="false" ></UrlTestConfig>

Root com.exalead.mercury.mami.crawl.v21.Root

A crawl root.
Note: there is a 4KB limit on the whole url + metas storage.

Attributes

url string
site boolean
priority int
group string
kvs string

Name	Type	Default value	Description
url	string	-	The root url.
site	boolean	True	Enable site-mode: only crawl urls that belong to this 'site'.
priority	int	0	Priority shift. Increase or decrease priority. 0 means normal, -1 is higher priority, +1 lower.
group	string	default	Key used to group rules and root urls.
kvs	string	-	A semi-colon separated list of key-values. example: "key1=value1;key2=value2"

XML representation

<Root url="" site="true" priority="0" group="default" kvs="" ></Root>

RootSet com.exalead.mercury.mami.crawl.v21.RootSet

A file from which to load a set of urls or sites to crawl.

Parent objects

rootsets Crawler
rootsets FeedFetcher

Attributes

url string
group string

Name	Type	Default value	Description
url	string	-
group	string	default	Key used to group rules and root urls.

XML representation

<RootSet url="" group="default" ></RootSet>

FeedFetcher com.exalead.mercury.mami.crawl.v21.FeedFetcher

A feedfetcher configuration.

Parent objects

CrawlConfig

Attributes

Name	Type	Default value	Description
name	string	-	The crawler name. It must be unique across all crawlers.
documentsType	string	-	The type of documents pushed by this connector. The type of documents must match one of the types declared in your CloudView license file.
fetcher	string	-	Which fetcher to use.
crawlerServer	string	-	Crawler server hosting this crawler. See Deployment configuration.
connectorServer	string	-	Connector server hosting the indexing part of this crawler. See Deployment configuration.
buildGroup	string	-	Target build group.
dataModel	string	-	The default data model for documents indexed by this crawler.
storeTextOnly	boolean	True	Whether to store original binary documents, or only converted text.
nthreads	int	1	The number of crawl threads which must be strictly positive.
aggressive	boolean	False	Whether to enable aggressive crawl, that never sleeps between two requests to the same host.
throttleTimeMS	int	2500	In the case of non-aggressive crawl, this defines the sleep interval between requests to the same host.
ignoreRobotsTxt	boolean	False	Whether to ignore robots.txt rules. Not recommended.
enableConvertProcessor	boolean	True	Whether to enable remoteconvert-based processor for links extracting in binary documents.
nearDuplicateDetector	boolean	True	Whether to enable the near-duplicate content detector.
patternsDetector	boolean	True	Whether to enable patterns detection in pages.
crawlSitemaps	boolean	True	Whether to crawl sitemaps.
disableConditionalGet	boolean	False	Whether to always fetch documents, even if the server tells it has not changed.
defaultAccept	boolean	False	Whether to crawl a url by default when it matches no other accept rule.
defaultIndex	boolean	False	Whether to index by default when a url matches no index rule.
defaultFollow	boolean	False	Whether to follow by default when a url matches no follow rule.
defaultFollowRoots	boolean	True	Whether to automatically follow root urls
enableSimpleSiteCollapsing	boolean	True	Whether to generate a site ID suitable for document collapsing.
simpleSiteCollapsingDepth	int	0	How many path segments to use to generate the site collapsing ID.
mimeTypesMode	string	exclude	Mime types white/black list
smartRefresh	boolean	True	Whether to crawl a fraction of refreshed urls.
smartRefreshMinAgeS	int	3600	Age in seconds at which we may refresh old urls.
smartRefreshMaxAgeS	int	604800	Age in seconds at which we force the refresh of old urls.
archiveDocuments	boolean	False	When enabled, deleted documents are not deleted, but kept with their deletion date.
enableConsolidation	boolean	True	Define if we use a standard PAPI or a consolidation PAPI.
refreshDelayS	int	60	minimum delay before refreshing any url, default is 1 minute

Nested elements

mimeTypesStringConstantValue*
sessionIdBlacklistStringConstantValue*
PushAPIFilterKeyValue*
feedsFeed*
rootsetsRootSet*
CrawlSchedulerConfig
CustomCrawlConfig
Rules*
UrlTesterData

Type	Wrapper	Description
StringConstantValue*	mimeTypes
StringConstantValue*	sessionIdBlacklist	SessionId blacklist. These parameters are removed from URLs with a path or query part containing them.
KeyValue*	PushAPIFilter
Feed*	feeds	a list of feeds
RootSet*	rootsets	A list of files to load urls/sites from.
CrawlSchedulerConfig	-
CustomCrawlConfig	-
Rules*	-
UrlTesterData	-

XML representation

<FeedFetcher name="" documentsType="" fetcher="" crawlerServer="" connectorServer="" buildGroup="" dataModel="" storeTextOnly="true" nthreads="1" aggressive="false" throttleTimeMS="2500" ignoreRobotsTxt="false" enableConvertProcessor="true" nearDuplicateDetector="true" patternsDetector="true" crawlSitemaps="true" disableConditionalGet="false" defaultAccept="false" defaultIndex="false" defaultFollow="false" defaultFollowRoots="true" enableSimpleSiteCollapsing="true" simpleSiteCollapsingDepth="0" mimeTypesMode="exclude" smartRefresh="true" smartRefreshMinAgeS="3600" smartRefreshMaxAgeS="604800" archiveDocuments="false" enableConsolidation="true" refreshDelayS="60" ></FeedFetcher>

Feed com.exalead.mercury.mami.crawl.v21.Feed

A feed.
Contains KeyValue* that are mapped to metas on all documents crawled from this root.
Beware: there is a 4KB limit on the whole url + metas storage.

Attributes

url string
site boolean
priority int
group string
kvs string
refreshPeriodS int
indexFeedItems boolean
indexItemDocuments boolean
findFeeds boolean
forceFeedMimeType boolean
findMediaLinks boolean

Name	Type	Default value	Description
url	string	-	The root url.
site	boolean	True	Enable site-mode: only crawl urls that belong to this 'site'.
priority	int	0	Priority shift. Increase or decrease priority. 0 means normal, -1 is higher priority, +1 lower.
group	string	default	Key used to group rules and root urls.
kvs	string	-	A semi-colon separated list of key-values. example: "key1=value1;key2=value2"
refreshPeriodS	int	600	how often to refresh this feed, default 10min
indexFeedItems	boolean	True	whether to index all items found in the feed with metas, before crawling them
indexItemDocuments	boolean	True	whether to crawl the items and index the full item pages
findFeeds	boolean	False	whether to crawl feeds found in html headers <link href="" rel="alternate" />
forceFeedMimeType	boolean	True	force processing of url as xml feed (for servers returning buggy content types) can't work with findFeeds enabled.
findMediaLinks	boolean	True	find <img src="" /> and youtube/dailymotion links in item text and push them as metas

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-

XML representation

<Feed url="" site="true" priority="0" group="default" kvs="" refreshPeriodS="600" indexFeedItems="true" indexItemDocuments="true" findFeeds="false" forceFeedMimeType="true" findMediaLinks="true" ></Feed>

FetchConfig com.exalead.mercury.mami.fetch.v21.FetchConfig

Data model //

Attributes

Name	Type	Default value	Description
version	long	-	-
defaultFetcher	string	-
dnsServer	string	-
defaultMaxSizeKB	int	-
defaultTruncate	boolean	False
sessionLifetime	int	600	Cookie session lifetime in seconds
fullDocumentMaxSizeKB	int	32768
crawlCacheProxyAddress	string	-	Crawl through multibox (WebExperiencePlatform mode).
crawlCacheProxyUsername	string	-
crawlCacheProxyPassword	string	-
crawlCacheRequestTimeoutMS	long	10000
globalProxyHost	string	-
globalProxyPort	int	-
globalProxyUsername	string	-
globalProxyPassword	string	-
globalProxyDomain	string	-
nonProxyHosts	string	-

Nested elements

mimesMimeConfig*
Fetcher*

Type	Wrapper	Description
MimeConfig*	mimes
Fetcher*	-

XML representation

<FetchConfig version="" defaultFetcher="" dnsServer="" defaultMaxSizeKB="" defaultTruncate="false" sessionLifetime="600" fullDocumentMaxSizeKB="32768" crawlCacheProxyAddress="" crawlCacheProxyUsername="" crawlCacheProxyPassword="" crawlCacheRequestTimeoutMS="10000" globalProxyHost="" globalProxyPort="" globalProxyUsername="" globalProxyPassword="" globalProxyDomain="" nonProxyHosts="" ></FetchConfig>

Fetcher com.exalead.mercury.mami.fetch.v21.Fetcher

No documentation for this element.

Parent objects

FetchConfig

Attributes

name string
type string
classId string
readTimeoutS int
writeTimeoutS int
connectTimeoutS int
maxDownloadTimeS int
userAgent string
from string
cookies boolean
proxyAddr string
proxyUsername string
proxyPassword string
proxyDomain string
useConnectForHttpsOverProxy boolean
useCrawlCache boolean
maxAgeS long

Name	Type	Default value	Description
name	string	-
type	string	-
classId	string	-
readTimeoutS	int	15
writeTimeoutS	int	15
connectTimeoutS	int	30
maxDownloadTimeS	int	600
userAgent	string	-
from	string	-
cookies	boolean	False
proxyAddr	string	-
proxyUsername	string	-
proxyPassword	string	-
proxyDomain	string	-
useConnectForHttpsOverProxy	boolean	True
useCrawlCache	boolean	False	Crawl through crawl cache proxy. (WebExperiencePlatform mode).
maxAgeS	long	2592000	Max allowed age of docs fetched in cache. Older docs are recrawled. Default is 1 month

Nested elements

headersHeader*
parametersKeyValue*
configRulesConfig*

Type	Wrapper	Description
Header*	headers
KeyValue*	parameters
Config*	configRules

XML representation

<Fetcher name="" type="" classId="" readTimeoutS="15" writeTimeoutS="15" connectTimeoutS="30" maxDownloadTimeS="600" userAgent="" from="" cookies="false" proxyAddr="" proxyUsername="" proxyPassword="" proxyDomain="" useConnectForHttpsOverProxy="true" useCrawlCache="false" maxAgeS="2592000" ></Fetcher>

Header com.exalead.mercury.mami.fetch.v21.Header

No documentation for this element.

Parent objects

headers AddHeaders
headers Fetcher

Attributes

name string
value string

Name	Type	Default value	Description
name	string	-
value	string	-

XML representation

<Header name="" value="" ></Header>

Config com.exalead.mercury.mami.fetch.v21.Config

No documentation for this element.

Parent objects

configRules Fetcher

Attributes

name string

Name	Type	Default value	Description
name	string	-

Nested elements

Pattern*
RulesConfig

Type	Wrapper	Description
Pattern*	-
RulesConfig	-

XML representation

<Config name="" ></Config>

Cookies com.exalead.mercury.mami.fetch.v21.Cookies

No documentation for this element.

Parent objects

Config

Attributes

enable boolean

Name	Type	Default value	Description
enable	boolean	False

XML representation

<Cookies enable="false" ></Cookies>

Proxy com.exalead.mercury.mami.fetch.v21.Proxy

No documentation for this element.

Parent objects

Config

Attributes

addr string
username string
password string
domain string

Name	Type	Default value	Description
addr	string	-
username	string	-
password	string	-
domain	string	-

XML representation

<Proxy addr="" username="" password="" domain="" ></Proxy>

AddHeaders com.exalead.mercury.mami.fetch.v21.AddHeaders

No documentation for this element.

Nested elements

headersHeader*

Type	Wrapper	Description
Header*	headers

XML representation

<AddHeaders ></AddHeaders>

Auth com.exalead.mercury.mami.fetch.v21.Auth

No documentation for this element.

Parent objects

Config

Attributes

type string
username string
password string
realm string
domain string
host string

Name	Type	Default value	Description
type	string	-
username	string	-
password	string	-
realm	string	-
domain	string	-
host	string	-

Nested elements

conditionCond
Post

Type	Wrapper	Description
Cond	condition
Post	-

XML representation

<Auth type="" username="" password="" realm="" domain="" host="" ></Auth>

Post com.exalead.mercury.mami.fetch.v21.Post

No documentation for this element.

Parent objects

Auth

Attributes

gatewayUrl string
formId string
formClass string
formName string
method string
action string
autoSubmittedForms int
additionalRequest string

Name	Type	Default value	Description
gatewayUrl	string	-
formId	string	-	if there are more than one form on the gateway Url, a formName formId and/or a formClass can be specified to find the right one.
formClass	string	-
formName	string	-
method	string	-	when method or action are not null, they override the ones found in the form. when gatewayUrl is null, just use them. gatewayUrl, method and action must not all be null.
action	string	-
autoSubmittedForms	int	0	How many times to try to find and submit a form after the login procedure. This is needed to get cookies for other domains in some SSOs.
additionalRequest	string	-	An additional URL to fetch after the HTML form authentication procedure, following all redirections to allow new cookies. Useful for some SSO including google sites.

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-

XML representation

<Post formId="" formClass="" formName="" method="" action="" autoSubmittedForms="0" additionalRequest="" ></Post>

Status com.exalead.mercury.mami.fetch.v21.Status

No documentation for this element.

Parent objects

And
Not
Or
condition Auth

Attributes

success boolean
code int

Name	Type	Default value	Description
success	boolean	True
code	int	-

XML representation

<Status success="true" code="" ></Status>

Redirect com.exalead.mercury.mami.fetch.v21.Redirect

No documentation for this element.

Parent objects

And
Not
Or
condition Auth

Attributes

success boolean
matches string

Name	Type	Default value	Description
success	boolean	True
matches	string	-	only match redirections to a url with some string inside, else all redirections

XML representation

<Redirect success="true" matches="" ></Redirect>

ResponseHeader com.exalead.mercury.mami.fetch.v21.ResponseHeader

No documentation for this element.

Parent objects

And
Not
Or
condition Auth

Attributes

success boolean
name string
value string

Name	Type	Default value	Description
success	boolean	True
name	string	-
value	string	-

XML representation

<ResponseHeader success="true" name="" value="" ></ResponseHeader>

InBody com.exalead.mercury.mami.fetch.v21.InBody

No documentation for this element.

Parent objects

And
Not
Or
condition Auth

Attributes

success boolean
text string

Name	Type	Default value	Description
success	boolean	True
text	string	-

XML representation

<InBody success="true" text="" ></InBody>

And com.exalead.mercury.mami.fetch.v21.And

No documentation for this element.

Parent objects

And
Not
Or
condition Auth

Attributes

success boolean

Name	Type	Default value	Description
success	boolean	True

Nested elements

Cond*

Type	Wrapper	Description
Cond*	-

XML representation

<And success="true" ></And>

Or com.exalead.mercury.mami.fetch.v21.Or

No documentation for this element.

Parent objects

And
Not
Or
condition Auth

Attributes

success boolean

Name	Type	Default value	Description
success	boolean	True

Nested elements

Cond*

Type	Wrapper	Description
Cond*	-

XML representation

Not com.exalead.mercury.mami.fetch.v21.Not

No documentation for this element.

Parent objects

And
Not
Or
condition Auth

Attributes

success boolean

Name	Type	Default value	Description
success	boolean	True

Nested elements

Cond

Type	Wrapper	Description
Cond	-

XML representation

<Not success="true" ></Not>

AddParameters com.exalead.mercury.mami.fetch.v21.AddParameters

No documentation for this element.

Nested elements

parametersKeyValue*

Type	Wrapper	Description
KeyValue*	parameters

XML representation

<AddParameters ></AddParameters>

MimeConfig com.exalead.mercury.mami.fetch.v21.MimeConfig

No documentation for this element.

Parent objects

mimes FetchConfig

Attributes

mime string
maxSizeKB int
truncate boolean

Name	Type	Default value	Description
mime	string	-
maxSizeKB	int	-
truncate	boolean	True	When setting truncate to false, the file won't be crawled if size exceeds maxSizeKB. This is useful for binary files, such as pdf, because truncated binary files can not be processed

XML representation

<MimeConfig mime="" maxSizeKB="" truncate="true" ></MimeConfig>

ConvertConfig com.exalead.mercury.mami.convert.v10.ConvertConfig

Global configuration for document conversion.
This configuration impacts how binary files (e.g. images, office documents, ...) are handled during: - Indexing, Analysis, when using the ConvertDocumentProcessor - Document Preview.

Attributes

version long

Name	Type	Default value	Description
version	long	-	-

Nested elements

ConvertDocumentInputSettings
ConvertDocumentOutputSettings
ConvertDocumentProcessingSettings
ConvertInternalSettings
ConvertJavaPluginsSettings

Type	Wrapper	Description
ConvertDocumentInputSettings	-
ConvertDocumentOutputSettings	-
ConvertDocumentProcessingSettings	-
ConvertInternalSettings	-
ConvertJavaPluginsSettings	-

XML representation

<ConvertConfig version="" ></ConvertConfig>

ConvertDocumentInputSettings com.exalead.mercury.mami.convert.v10.ConvertDocumentInputSettings

Configuration of convert inputs handling.
Parameters to accept inputs for conversion.

Parent objects

ConvertConfig

Attributes

Name	Type	Default value	Description
minSizeKB	int	-1	Default minimum size in kilobytes for a document to be converted. The default is -1 which means no limit. Note: This setting defines the process default value, which can be overridden for each conversion command.
maxSizeKB	int	-1	Default maximumm size in kilobytes for a document to be converted. The default is -1 which means use program defaults (250MB). Note: This setting defines the process default value, which can be overridden for each conversion command.
maxSizeForTextDocumentsKB	int	-1	Default maximum size in kilobytes for textual (html, xml, text) documents to be converted. The default is -1 which means use program defaults (250MB). Note: This setting defines the process default value, which can be overridden for each conversion command.

XML representation

<ConvertDocumentInputSettings minSizeKB="-1" maxSizeKB="-1" maxSizeForTextDocumentsKB="-1" ></ConvertDocumentInputSettings>

ConvertDocumentOutputSettings com.exalead.mercury.mami.convert.v10.ConvertDocumentOutputSettings

Configuration of convert outputs handling.
Parameters to tune conversion outputs.

Parent objects

ConvertConfig

Attributes

Name	Type	Default value	Description
maxSizeKB	int	-1	Default maximum size in bytes for converted documents. The default is -1 which means no limit. Note: This setting defines the process default value, which can be overridden for each conversion command.
maxConvertedPagesForXmlOutput	int	-1	Default maximum number of document pages to be converted into xml. The default is -1 which means no limit. The definition of a page is tightly linked to document type: (pdf page, doc page, etc) Note: This setting defines the process default value, which can be overridden for each conversion command.
maxConvertedPagesForHtmlOutput	int	-1	Default maximum number of document pages to be converted into html. The default is -1 which means no limit. The definition of a page is tightly linked to document type: (pdf page, doc page, etc) Note: This setting defines the process default value, which can be overridden for each conversion command.

XML representation

<ConvertDocumentOutputSettings maxSizeKB="-1" maxConvertedPagesForXmlOutput="-1" maxConvertedPagesForHtmlOutput="-1" ></ConvertDocumentOutputSettings>

ConvertDocumentProcessingSettings com.exalead.mercury.mami.convert.v10.ConvertDocumentProcessingSettings

Configuration of conversion settings in processing stage.

Parent objects

ConvertConfig

Attributes

conversionTimeoutS int
conversionTimeoutPerMegabyteS int
conversionGraceTimeoutS int
enableImageResizing boolean
netcamMode string
netcamTimeoutS int
netcamAsyncTimeoutS int
netcamJobsPerMinion int
netcamMaxJobsPerMinion int
netcamJobsRetries int
netcamProxy string
ttfDir string
indexSingleContainersAsOneDocument boolean
singleContainersMaxRecursionDepth int
singleContainersMaxRecursionDocuments int
singleContainersMaxRecursionDocumentsTotal int
allowUnicode32 boolean
allowDocumentChars boolean
metaSeparator string
iFilterExtensions string
excelDateFormat int

Name	Type	Default value	Description
conversionTimeoutS	int	-1	Default timeout in seconds for conversion. The default is -1 which means use program defaults (30 seconds). The conversion will be considered as failed if it takes longer than conversionTimeout. Note: This setting defines the process default value, which can be overridden for each conversion command.
conversionTimeoutPerMegabyteS	int	-1	Default timeout in seconds per megabyte for conversion. The conversion fails if it takes more than conversionTimeout * (size of document in Megabytes). The default is -1 which means "undefined". Note: This setting defines the process default value, which can be overridden for each conversion command.
conversionGraceTimeoutS	int	-1	Global conversion grace timeout value in seconds. The convert process will kill a non-responding minion thread after this timeout. The default is -1 which means use program defaults (30 seconds).
enableImageResizing	boolean	True	Enables commands related to images resizing (used in thumbnails computation).
netcamMode	string	optional	Enables Netcam feature: disabled: Disables the feature enabled: Enables the feature optional: Enables the feature if available {@code enum("disabled", "enabled", "optional")}
netcamTimeoutS	int	-1	Default Netcam conversion timeout in seconds. The default is -1 which means use program defaults (30 seconds). Note: This setting defines the process default value, which can be overridden for each conversion command.
netcamAsyncTimeoutS	int	-1	Netcam asynchronous command timeout in seconds. It should be set to a low value, as it is applied to async commands. The default is -1 which means use program defaults (10 seconds).
netcamJobsPerMinion	int	-1	The number of Netcam jobs per minion thread. The default is -1 which means use program defaults (4).
netcamMaxJobsPerMinion	int	-1	Total number of jobs a minion can process before being recycled. The default is -1 which means use program defaults (128).
netcamJobsRetries	int	-1	Number of Netcam jobs retries if a remote exception occurs. The default is -1 which means use program defaults (4).
netcamProxy	string	-	Optional proxy, or null if undefined.
ttfDir	string	-	The font path (necessary on UNIX). Used for thumbnail generation.
indexSingleContainersAsOneDocument	boolean	False	Default indexing mode for containers (ZIP, TAR, PST, ...) to be converted as a single document, instead of indexing the directory only. By default, it indexes only the container directory. Note: This function has no impact on container commands (i.e., opening, listing, ...). Note: This setting defines the process default value, which can be overridden for each conversion command.
singleContainersMaxRecursionDepth	int	1	Default maximum recursion depth (for containers, or containers inside containers). Only taken in account if indexSingleContainersAsOneDocument = true Note: This setting defines the process default value, which can be overridden for each conversion command.
singleContainersMaxRecursionDocuments	int	2147483647	Default maximum number of documents that can be indexed in a container. Only taken in account if indexSingleContainersAsOneDocument = true Note: This setting defines the process default value, which can be overridden for each conversion command.
singleContainersMaxRecursionDocumentsTotal	int	2147483647	Default maximum number of documents that can be indexed in a container and all its children (for containers inside containers). Only taken in account if indexSingleContainersAsOneDocument = true Note: This setting defines the process default value, which can be overridden for each conversion command.
allowUnicode32	boolean	True	Allows the use of 32-bit unicode points when processing documents. This will allow to produce Unicode characters greater than 65536.
allowDocumentChars	boolean	True	Allows the use of Unicode private range characters (E0XX) for separators (keyword, sentence, paragraph separators, ...)
metaSeparator	string	:	Character separator for meta-data namespaces. Note: MUST be a printable ascii character (Unicode codepoint must be higher than 32 and strictly lower than 128)
iFilterExtensions	string	-	Comma-separated list of extensions to be processed through the Windows IFilter interface. Only available on Windows.
excelDateFormat	int	0	Date default format to be used to interpret date cells in excel: 0:MM/DD/YYYY, 1:YYYY/MM/DD, 2:DD/MM/YYYY

XML representation

<ConvertDocumentProcessingSettings conversionTimeoutS="-1" conversionTimeoutPerMegabyteS="-1" conversionGraceTimeoutS="-1" enableImageResizing="true" netcamMode="optional" netcamTimeoutS="-1" netcamAsyncTimeoutS="-1" netcamJobsPerMinion="-1" netcamMaxJobsPerMinion="-1" netcamJobsRetries="-1" netcamProxy="" ttfDir="" indexSingleContainersAsOneDocument="false" singleContainersMaxRecursionDepth="1" singleContainersMaxRecursionDocuments="2147483647" singleContainersMaxRecursionDocumentsTotal="2147483647" allowUnicode32="true" allowDocumentChars="true" metaSeparator=":" iFilterExtensions="" excelDateFormat="0" ></ConvertDocumentProcessingSettings>

ConvertInternalSettings com.exalead.mercury.mami.convert.v10.ConvertInternalSettings

Configuration of convert system settings.

The StringValue list can be used to add new supported options, in raw format (i.e., the leading -- must be present).

The legacy KeyValue list can be used to add new supported options, using short format for the key (without the leading -- ; such as "enable-foo").

The value should be set to "true" if no value is to be set on the commandline side.

Parent objects

ConvertConfig

Attributes

Name	Type	Default value	Description
retryOnMMAPFailed	boolean	False	Default convert retry with regular I/O if mmap fails when the convert is responsible for fetching the bytes of a document. This is useful for FileSystem mounted in direct I/O. Note: This setting defines the process default value, which can be overridden for each conversion command.
disableSharedMemory	boolean	False	Disables shared memory support.
loggingLevel	string	-	Sets logging verbosity of the convert. verbose: Logs converted urls quiet: Logs errors only. {@code enum ("verbose", "quiet")}
tmpDir	string	-	The temporary path to override the system temporary path.
selftestOnStartup	boolean	False	The converter tests itself on startup.
restrictUserId	boolean	False	Restricts connections to the user running the server. Supported only on Windows and Linux platforms.

Nested elements

ConvertInternalCacheSettings
ConvertInternalChildrenSettings
ConvertInternalPOSIXSettings
KeyValue*
StringValue*

Type	Wrapper	Description
ConvertInternalCacheSettings	-
ConvertInternalChildrenSettings	-
ConvertInternalPOSIXSettings	-
KeyValue*	-
StringValue*	-

XML representation

<ConvertInternalSettings retryOnMMAPFailed="false" disableSharedMemory="false" loggingLevel="" tmpDir="" selftestOnStartup="false" restrictUserId="false" ></ConvertInternalSettings>

ConvertInternalCacheSettings com.exalead.mercury.mami.convert.v10.ConvertInternalCacheSettings

Cache settings.
The cache is mainly used to store preview files: css, javascript and images.

Parent objects

ConvertInternalSettings

Attributes

Name	Type	Default value	Description
minAgeS	long	-1	Minimum age (in seconds) for an item in the cache. This is useful for big files that we don't want to frequently update in the cache, for performance reasons. The default is -1 which means no limit.
maxAgeS	long	-1	Maximum age (in seconds) for an item in the cache. The default is -1 which means no limit.
maxSizeMB	int	-1	Maximum size (in megabytes) for the cache. The default is -1 which means no limit.
maxSizePerFileKB	int	-1	Maximum size (in kilobytes) for an item to be cached. The default is -1 which means no limit.

XML representation

<ConvertInternalCacheSettings minAgeS="-1" maxAgeS="-1" maxSizeMB="-1" maxSizePerFileKB="-1" ></ConvertInternalCacheSettings>

ConvertInternalChildrenSettings com.exalead.mercury.mami.convert.v10.ConvertInternalChildrenSettings

Configuration of convert children.

The convert forks children processes to run parallel conversion and make the service more robust to crashes.

Parent objects

ConvertInternalSettings

Attributes

Name	Type	Default value	Description
maxChildren	int	-1	Maximum authorized number of children. The default is -1 which means use program defaults (128).
maxIdleChildren	int	-1	Children that were created can be reused for future conversion. This parameter specifies the max number of idle children for future use. After long inactivity, all children will die and be re-forked if necessary. The default is -1 which means use program defaults (32).
maxIdleChildrenPerGroup	int	-1	Children that were created for a specific group can be reused for future conversion. This parameter specifies the max number of idle children for each group for future use. After long inactivity, all children will die and be re-forked if necessary. The default is -1 which means use program defaults (automatic).
childSpawnTimeoutS	int	-1	Children spawn timeout in seconds. The default is -1 which means use program defaults (20 seconds).
ChildConvertInitTimeoutS	int	-1	Minion convert libraries and plugins initialization timeout in seconds. The default is -1 which means use program defaults (30 seconds).
exec32Mode	string	disabled	Select 32-bit support disabled: Disables the feature enabled: Enables the feature optional: Enables the feature if available {@code enum("disabled", "enabled", "optional")}

XML representation

<ConvertInternalChildrenSettings maxChildren="-1" maxIdleChildren="-1" maxIdleChildrenPerGroup="-1" childSpawnTimeoutS="-1" ChildConvertInitTimeoutS="-1" exec32Mode="disabled" ></ConvertInternalChildrenSettings>

ConvertInternalPOSIXSettings com.exalead.mercury.mami.convert.v10.ConvertInternalPOSIXSettings

Convert process parameters, valid for a main of child process.

Valid only for POSIX systems.

Parent objects

ConvertInternalSettings

Attributes

Name	Type	Default value	Description
maxProcessMemorySizeMB	int	-1	Maximum allowed memory for a convert process (main or child) (posix: maxas). The default is -1 which means no limit.
maxCoreFileSizeMB	int	-1	Maximum allowed size for core files (posix: maxcore). The default is -1 which means no limit.
maxCreatedFileSizeKB	int	-1	Maximum allowed size for created files such as temporary files etc (posix: maxfsize). The default is -1 which means no limit.
maxNumberOfOpenedFiles	int	-1	Maximum allowed number of opened files (posix: maxnofile). The default is -1 which means no limit.
maxResidentMemorySizeMB	int	-1	Maximum allowed size in bytes for resident memory (posix: maxrss). The default is -1 which means no limit.

XML representation

<ConvertInternalPOSIXSettings maxProcessMemorySizeMB="-1" maxCoreFileSizeMB="-1" maxCreatedFileSizeKB="-1" maxNumberOfOpenedFiles="-1" maxResidentMemorySizeMB="-1" ></ConvertInternalPOSIXSettings>

ConvertJavaPluginsSettings com.exalead.mercury.mami.convert.v10.ConvertJavaPluginsSettings

Java global "format plugins" settings.

Nested elements

ConvertJavaPluginSetting*

Type	Wrapper	Description
ConvertJavaPluginSetting*	-

XML representation

<ConvertJavaPluginsSettings ></ConvertJavaPluginsSettings>

ConvertJavaPluginSetting com.exalead.mercury.mami.convert.v10.ConvertJavaPluginSetting

Java global "format plugin" setting.

Parent objects

ConvertJavaPluginsSettings

Attributes

classId string
disabled boolean

Name	Type	Default value	Description
classId	string	-	Java class name.
disabled	boolean	False	Is the plugin disabled?

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-

XML representation

<ConvertJavaPluginSetting classId="" disabled="false" ></ConvertJavaPluginSetting>

AnalysisConfig com.exalead.indexing.analysis.v10.AnalysisConfig

AnalysisConfig represents a self-contained module for Document Analysis.

AnalysisConfig is referenced by a BuildGroup.

An analysis module defines a set of pipelines that are applied in this module.

Attributes

name string
linguistic boolean

Name	Type	Default value	Description
name	string	-	Name of the analysis module. Must be unique.
linguistic	boolean	True	Extracts linguistic data for the dictionary, such as word counts. This impacts the ability to compute related terms and use word counts for ranking.

Nested elements

AnalysisPipeline*

Type	Wrapper	Description
AnalysisPipeline*	-

XML representation

<AnalysisConfig name="" linguistic="true" ></AnalysisConfig>

AnalysisPipeline com.exalead.indexing.analysis.v10.AnalysisPipeline

A document analysis pipeline.

Each pipeline has an associated accept condition. This condition is tested for each input document.

If a document matches the condition, it is processed by this pipeline.

If not, the condition is tested for the next pipeline in the list of pipelines defined in a DocumentAnalysis object.

A document refused by all pipelines is neither processed nor indexed.

Pipeline processing is made of several stages:

Document Processing Stage - is performed by a list of DocumentProcessor which process each Document sequentially.
Document Processors manipulate the 'DocumentParts' (binary data pushed through the PAPI) and the 'DocumentChunks' (textual data obtained either from PAPI meta or by processing of Document Part or by processing of pre-existing Document Chunks)

Each DocumentChunk has a textual content, a ContextName, a language, a score, may belong to a DocumentPart.

A DocumentChunk belonging to no DocumentPart is called a root DocumentChunk.
Semantic Processing Stage - involves a list of SemanticProcessor which process each Document Chunk of each Document sequentially (except those for which Semantic Processing is disabled in the mapping).

Semantic Processing segments text into 'tokens' and then processes text as a flow of tokens. SemanticAnnotations are produced on each token.
Mapping - involves mapping DocumentChunk and Semantic Annotations to index fields.

Parent objects

AnalysisConfig

Attributes

Name	Type	Default value	Description
name	string	-
errorAction	string	continue	Specifies the action to launch if there is a document error during processing: "discard": Discards the document from the job. If the document was already in the index, it's not removed if it already existed. "delete": Discards the document from the job and deletes it from the index. "continue": Keeps processing the document. The document will probably be incomplete in the index.
reportDocumentErrors	boolean	True	Reports the document errors in the global reporting store, for further analysis.
globalLogDocumentErrors	boolean	False	Logs errors and exceptions reported by the processors in the global log (without stack trace).
autoBlacklistDocuments	boolean	True	Tries to add to a block list the documents triggering serious failure automatically. This option helps preventing loop failures, that is to say, when documents always trigger the same analysis process failures.
tokenizationConfig	string	-	Reference to the TokenizationConfig object to use for tokenization during Semantic Processing Stage.
autoconfigureFromDataModel	boolean	True
documentProcessorsProfiling	boolean	False	Logs the CPU time spent for each document processor and for the main indexing phase. The total time spent for each processor is dumped in the analyzer log at the end of the job.
semanticPipeTimeout	int	0	CPU-time limit for the processing of a text chunk by the semantic pipe, in seconds.
slowDocumentWarningTimeUS	long	5000000	If the processing of a document is longer than this time, a message will be printed in the analyzer log. A value of 0 disables the warning feature.
semanticProcessorsProfiling	boolean	False	Logs the CPU time spent for each semantic processor. The total time spent for each processor is dumped in the analyzer log at the end of the job. Warning: This feature strongly impacts performance, only enable it if required.

Nested elements

AcceptCondition
DocumentProcessor*
FilteringConfiguration
LanguageConfiguration*
MappingConfiguration
SemanticProcessor*

Type	Wrapper	Description
AcceptCondition	-
DocumentProcessor*	-
FilteringConfiguration	-
LanguageConfiguration*	-
MappingConfiguration	-
SemanticProcessor*	-

XML representation

<AnalysisPipeline name="" errorAction="continue" reportDocumentErrors="true" globalLogDocumentErrors="false" autoBlacklistDocuments="true" tokenizationConfig="tok0" autoconfigureFromDataModel="true" documentProcessorsProfiling="false" semanticPipeTimeout="0" slowDocumentWarningTimeUS="5000000" semanticProcessorsProfiling="false" ></AnalysisPipeline>

AndCondition com.exalead.indexing.analysis.v10.AndCondition

AndCondition matches if all children AcceptCondition match.

Nested elements

AcceptCondition*

Type	Wrapper	Description
AcceptCondition*	-

XML representation

<AndCondition ></AndCondition>

OrCondition com.exalead.indexing.analysis.v10.OrCondition

OrCondition matches if one child matches.

Nested elements

AcceptCondition*

Type	Wrapper	Description
AcceptCondition*	-

XML representation

<OrCondition ></OrCondition>

NotCondition com.exalead.indexing.analysis.v10.NotCondition

Matches if the child condition does not match.

If there is no child condition (null), this condition never matches.

Nested elements

AcceptCondition

Type	Wrapper	Description
AcceptCondition	-

XML representation

<NotCondition ></NotCondition>

SourceCondition com.exalead.indexing.analysis.v10.SourceCondition

SourceCondition matches if the source of the document matches 'source'.

Parent objects

AnalysisPipeline
AndCondition
CGRDocumentProcessor
ConcatValues
ContentCleanup
ConvertTextExtractor
CoordinatesFormatter
CopyContext
CustomDocumentProcessor
DataModelClassResolver
DateFormatter
DebugCrashProcessor
DebugProcessor
DiscardDocument
DocumentProcessor
DocumentProcessorGroup
DoubleToLong
FixedRangeNumericalPartitioning
ForcedRangeNumericalPartitioning
FormatCheckerDate
GeoBBoxProcessor
GeoCategorizer
HTMLCSSExtractor
HTMLCSSSelector
HTMLRelevantContentExtractor
HTMLTableExtractor
InferFileExtension
InsertCurrentDate
JavaDocumentProcessor
JavaProcessor
JavaScriptProcessor
LanguageDetector
LanguageSetter
MIMEDetector
MathDocumentProcessor
MetaFinder
MimeTypeSetter
MultiContextCSVEncoder
MultiContextDocumentProcessor
NativeTextExtractor
NewChunk
NotCondition
NumericalFormatter
OrCondition
PLMExpandDocumentProcessor
PrecomputedThumbnailsDocumentProcessor
PrintfValues
PublicUrlProcessor
RealTimeAlerting
RemoteHTTPTransformer
RemoteMOTAPIDocumentProcessor
RemoveContexts
RenameContext
RenameUnmappedContexts
ReplaceContextNames
ReplaceRegexp
ReplaceValues
SemanticPipeDocumentProcessor
SetDefaultValue
SimilarStringToPart
SingleContextDocumentProcessor
SplitValues
StandardPartsMerger
StorageServiceDocumentProcessor
StringHash
StringHash32
StringHash64
StringTransform
TextToNum
URLCodec
URLTransformer
UTF8Checker
UniformRandomContextGenerator
UnitsOfMeasurementNormalizer
ValueSelector
WildcardIndexing
XpathExtractor
XpathFragmentExtractor
ZipfRandomContextGenerator

Attributes

source string

Name	Type	Default value	Description
source	string	-	Value of the 'source' for the document against which to check.

XML representation

<SourceCondition source="" ></SourceCondition>

BuildGroupCondition com.exalead.indexing.analysis.v10.BuildGroupCondition

BuildGroupCondition matches if the current buildgroup matches 'name'.

Parent objects

AnalysisPipeline
AndCondition
CGRDocumentProcessor
ConcatValues
ContentCleanup
ConvertTextExtractor
CoordinatesFormatter
CopyContext
CustomDocumentProcessor
DataModelClassResolver
DateFormatter
DebugCrashProcessor
DebugProcessor
DiscardDocument
DocumentProcessor
DocumentProcessorGroup
DoubleToLong
FixedRangeNumericalPartitioning
ForcedRangeNumericalPartitioning
FormatCheckerDate
GeoBBoxProcessor
GeoCategorizer
HTMLCSSExtractor
HTMLCSSSelector
HTMLRelevantContentExtractor
HTMLTableExtractor
InferFileExtension
InsertCurrentDate
JavaDocumentProcessor
JavaProcessor
JavaScriptProcessor
LanguageDetector
LanguageSetter
MIMEDetector
MathDocumentProcessor
MetaFinder
MimeTypeSetter
MultiContextCSVEncoder
MultiContextDocumentProcessor
NativeTextExtractor
NewChunk
NotCondition
NumericalFormatter
OrCondition
PLMExpandDocumentProcessor
PrecomputedThumbnailsDocumentProcessor
PrintfValues
PublicUrlProcessor
RealTimeAlerting
RemoteHTTPTransformer
RemoteMOTAPIDocumentProcessor
RemoveContexts
RenameContext
RenameUnmappedContexts
ReplaceContextNames
ReplaceRegexp
ReplaceValues
SemanticPipeDocumentProcessor
SetDefaultValue
SimilarStringToPart
SingleContextDocumentProcessor
SplitValues
StandardPartsMerger
StorageServiceDocumentProcessor
StringHash
StringHash32
StringHash64
StringTransform
TextToNum
URLCodec
URLTransformer
UTF8Checker
UniformRandomContextGenerator
UnitsOfMeasurementNormalizer
ValueSelector
WildcardIndexing
XpathExtractor
XpathFragmentExtractor
ZipfRandomContextGenerator

Attributes

name string

Name	Type	Default value	Description
name	string	-	Value of the "buildgroup" for the document against which to check.

XML representation

<BuildGroupCondition name="" ></BuildGroupCondition>

MetaCondition com.exalead.indexing.analysis.v10.MetaCondition

MetaCondition matches if the Document contains a DocumentChunk whose meta name and value match the specified condition.

Parent objects

AnalysisPipeline
AndCondition
CGRDocumentProcessor
ConcatValues
ContentCleanup
ConvertTextExtractor
CoordinatesFormatter
CopyContext
CustomDocumentProcessor
DataModelClassResolver
DateFormatter
DebugCrashProcessor
DebugProcessor
DiscardDocument
DocumentProcessor
DocumentProcessorGroup
DoubleToLong
FixedRangeNumericalPartitioning
ForcedRangeNumericalPartitioning
FormatCheckerDate
GeoBBoxProcessor
GeoCategorizer
HTMLCSSExtractor
HTMLCSSSelector
HTMLRelevantContentExtractor
HTMLTableExtractor
InferFileExtension
InsertCurrentDate
JavaDocumentProcessor
JavaProcessor
JavaScriptProcessor
LanguageDetector
LanguageSetter
MIMEDetector
MathDocumentProcessor
MetaFinder
MimeTypeSetter
MultiContextCSVEncoder
MultiContextDocumentProcessor
NativeTextExtractor
NewChunk
NotCondition
NumericalFormatter
OrCondition
PLMExpandDocumentProcessor
PrecomputedThumbnailsDocumentProcessor
PrintfValues
PublicUrlProcessor
RealTimeAlerting
RemoteHTTPTransformer
RemoteMOTAPIDocumentProcessor
RemoveContexts
RenameContext
RenameUnmappedContexts
ReplaceContextNames
ReplaceRegexp
ReplaceValues
SemanticPipeDocumentProcessor
SetDefaultValue
SimilarStringToPart
SingleContextDocumentProcessor
SplitValues
StandardPartsMerger
StorageServiceDocumentProcessor
StringHash
StringHash32
StringHash64
StringTransform
TextToNum
URLCodec
URLTransformer
UTF8Checker
UniformRandomContextGenerator
UnitsOfMeasurementNormalizer
ValueSelector
WildcardIndexing
XpathExtractor
XpathFragmentExtractor
ZipfRandomContextGenerator

Attributes

name string
nameMode enum(equals, matches)
valueMode enum(equals, contains, exists, matches)
value string

Name	Type	Default value	Description
name	string	-	Name of the meta against which to check.
nameMode	enum(equals, matches)	equals	Meta name test mode: "equals": Evaluates the DocumentChunk with a name equal to the specified one. "matches": Evaluates the DocumentChunk with a name matching the specified regular expression.The match is case insensitive.
valueMode	enum(equals, contains, exists, matches)	exists	Value test mode: "exists": Matches if a DocumentChunk pass the name condition. "equals": Matches if a DocumentChunk pass the name condition and the textual content is equal to the 'value' attribute. "contains": Matches if a DocumentChunk pass the name condition and the textual content contains 'value' (Pure string matching is performed without tokenization). "matches": Matches if a DocumentChunk pass the name condition and the textual content matches the regular expression specified by the 'value' attribute. The match is case insensitive.
value	string	-	The string to check against the value of DocumentChunks.

XML representation

<MetaCondition name="" nameMode="equals" valueMode="exists" value="" ></MetaCondition>

MimeCondition com.exalead.indexing.analysis.v10.MimeCondition

A condition that matches if the FIRST document part mime type is in the list.

Note: Conditions work on document but mimes are set per document part.

The MimeCondition only tests the mime type of the first part, if present.

Nested elements

mimesStringValue*

Type	Wrapper	Description
StringValue*	mimes

XML representation

<MimeCondition ></MimeCondition>

URLMatchCondition com.exalead.indexing.analysis.v10.URLMatchCondition

A condition that matches if the URI matches the regexp.

Parent objects

AnalysisPipeline
AndCondition
CGRDocumentProcessor
ConcatValues
ContentCleanup
ConvertTextExtractor
CoordinatesFormatter
CopyContext
CustomDocumentProcessor
DataModelClassResolver
DateFormatter
DebugCrashProcessor
DebugProcessor
DiscardDocument
DocumentProcessor
DocumentProcessorGroup
DoubleToLong
FixedRangeNumericalPartitioning
ForcedRangeNumericalPartitioning
FormatCheckerDate
GeoBBoxProcessor
GeoCategorizer
HTMLCSSExtractor
HTMLCSSSelector
HTMLRelevantContentExtractor
HTMLTableExtractor
InferFileExtension
InsertCurrentDate
JavaDocumentProcessor
JavaProcessor
JavaScriptProcessor
LanguageDetector
LanguageSetter
MIMEDetector
MathDocumentProcessor
MetaFinder
MimeTypeSetter
MultiContextCSVEncoder
MultiContextDocumentProcessor
NativeTextExtractor
NewChunk
NotCondition
NumericalFormatter
OrCondition
PLMExpandDocumentProcessor
PrecomputedThumbnailsDocumentProcessor
PrintfValues
PublicUrlProcessor
RealTimeAlerting
RemoteHTTPTransformer
RemoteMOTAPIDocumentProcessor
RemoveContexts
RenameContext
RenameUnmappedContexts
ReplaceContextNames
ReplaceRegexp
ReplaceValues
SemanticPipeDocumentProcessor
SetDefaultValue
SimilarStringToPart
SingleContextDocumentProcessor
SplitValues
StandardPartsMerger
StorageServiceDocumentProcessor
StringHash
StringHash32
StringHash64
StringTransform
TextToNum
URLCodec
URLTransformer
UTF8Checker
UniformRandomContextGenerator
UnitsOfMeasurementNormalizer
ValueSelector
WildcardIndexing
XpathExtractor
XpathFragmentExtractor
ZipfRandomContextGenerator

Attributes

regexp string

Name	Type	Default value	Description
regexp	string	-	The regexp. Note: It is not anchored by default ; i.e., use '.*\.asp to match .asp URIs.

XML representation

<URLMatchCondition regexp="" ></URLMatchCondition>

FilenameMatchCondition com.exalead.indexing.analysis.v10.FilenameMatchCondition

A condition that matches if the FIRST document part Filename type matches the regexp.

Note: Conditions work on document but Filenames are set per document part.

FilenameMatchCondition only tests the Filename type of the first part, if present.

Parent objects

AnalysisPipeline
AndCondition
CGRDocumentProcessor
ConcatValues
ContentCleanup
ConvertTextExtractor
CoordinatesFormatter
CopyContext
CustomDocumentProcessor
DataModelClassResolver
DateFormatter
DebugCrashProcessor
DebugProcessor
DiscardDocument
DocumentProcessor
DocumentProcessorGroup
DoubleToLong
FixedRangeNumericalPartitioning
ForcedRangeNumericalPartitioning
FormatCheckerDate
GeoBBoxProcessor
GeoCategorizer
HTMLCSSExtractor
HTMLCSSSelector
HTMLRelevantContentExtractor
HTMLTableExtractor
InferFileExtension
InsertCurrentDate
JavaDocumentProcessor
JavaProcessor
JavaScriptProcessor
LanguageDetector
LanguageSetter
MIMEDetector
MathDocumentProcessor
MetaFinder
MimeTypeSetter
MultiContextCSVEncoder
MultiContextDocumentProcessor
NativeTextExtractor
NewChunk
NotCondition
NumericalFormatter
OrCondition
PLMExpandDocumentProcessor
PrecomputedThumbnailsDocumentProcessor
PrintfValues
PublicUrlProcessor
RealTimeAlerting
RemoteHTTPTransformer
RemoteMOTAPIDocumentProcessor
RemoveContexts
RenameContext
RenameUnmappedContexts
ReplaceContextNames
ReplaceRegexp
ReplaceValues
SemanticPipeDocumentProcessor
SetDefaultValue
SimilarStringToPart
SingleContextDocumentProcessor
SplitValues
StandardPartsMerger
StorageServiceDocumentProcessor
StringHash
StringHash32
StringHash64
StringTransform
TextToNum
URLCodec
URLTransformer
UTF8Checker
UniformRandomContextGenerator
UnitsOfMeasurementNormalizer
ValueSelector
WildcardIndexing
XpathExtractor
XpathFragmentExtractor
ZipfRandomContextGenerator

Attributes

regexp string

Name	Type	Default value	Description
regexp	string	-	The regexp. Note: It is not anchored by default ; i.e., use '.*\.doc' to match .doc files.

XML representation

<FilenameMatchCondition regexp="" ></FilenameMatchCondition>

BinaryContentCondition com.exalead.indexing.analysis.v10.BinaryContentCondition

A condition that matches if the FIRST document part binary content type matches the binary string.

Note: Conditions work on document but content is set per document part.

BinaryContentCondition only tests the binary content of the first part, if present.

Parent objects

AnalysisPipeline
AndCondition
CGRDocumentProcessor
ConcatValues
ContentCleanup
ConvertTextExtractor
CoordinatesFormatter
CopyContext
CustomDocumentProcessor
DataModelClassResolver
DateFormatter
DebugCrashProcessor
DebugProcessor
DiscardDocument
DocumentProcessor
DocumentProcessorGroup
DoubleToLong
FixedRangeNumericalPartitioning
ForcedRangeNumericalPartitioning
FormatCheckerDate
GeoBBoxProcessor
GeoCategorizer
HTMLCSSExtractor
HTMLCSSSelector
HTMLRelevantContentExtractor
HTMLTableExtractor
InferFileExtension
InsertCurrentDate
JavaDocumentProcessor
JavaProcessor
JavaScriptProcessor
LanguageDetector
LanguageSetter
MIMEDetector
MathDocumentProcessor
MetaFinder
MimeTypeSetter
MultiContextCSVEncoder
MultiContextDocumentProcessor
NativeTextExtractor
NewChunk
NotCondition
NumericalFormatter
OrCondition
PLMExpandDocumentProcessor
PrecomputedThumbnailsDocumentProcessor
PrintfValues
PublicUrlProcessor
RealTimeAlerting
RemoteHTTPTransformer
RemoteMOTAPIDocumentProcessor
RemoveContexts
RenameContext
RenameUnmappedContexts
ReplaceContextNames
ReplaceRegexp
ReplaceValues
SemanticPipeDocumentProcessor
SetDefaultValue
SimilarStringToPart
SingleContextDocumentProcessor
SplitValues
StandardPartsMerger
StorageServiceDocumentProcessor
StringHash
StringHash32
StringHash64
StringTransform
TextToNum
URLCodec
URLTransformer
UTF8Checker
UniformRandomContextGenerator
UnitsOfMeasurementNormalizer
ValueSelector
WildcardIndexing
XpathExtractor
XpathFragmentExtractor
ZipfRandomContextGenerator

Attributes

offset int
match string

Name	Type	Default value	Description
offset	int	-	Offset in bytes for the binary data to be compared, in bytes (0 for the beginning of the file). Negative values are taken as offset from the end of the file (-1 for the last byte).
match	string	-	Binary string to be compared. The string may contain any ASCII (7-bit) character, or the following '\' escape sequences: \xNN An hexadecimal-encoded character (N part of '0'..'9' or 'A'..'F') \NNN An octal-encoded character (N part of '0'..'9') \n Character 10 \r Character 13 \\ Character '\' \" Character '"' \? Any character

XML representation

<BinaryContentCondition offset="" match="" ></BinaryContentCondition>

DataModelClassCondition com.exalead.indexing.analysis.v10.DataModelClassCondition

A condition that matches if the document has the corresponding DataModel.

Parent objects

AnalysisPipeline
AndCondition
CGRDocumentProcessor
ConcatValues
ContentCleanup
ConvertTextExtractor
CoordinatesFormatter
CopyContext
CustomDocumentProcessor
DataModelClassResolver
DateFormatter
DebugCrashProcessor
DebugProcessor
DiscardDocument
DocumentProcessor
DocumentProcessorGroup
DoubleToLong
FixedRangeNumericalPartitioning
ForcedRangeNumericalPartitioning
FormatCheckerDate
GeoBBoxProcessor
GeoCategorizer
HTMLCSSExtractor
HTMLCSSSelector
HTMLRelevantContentExtractor
HTMLTableExtractor
InferFileExtension
InsertCurrentDate
JavaDocumentProcessor
JavaProcessor
JavaScriptProcessor
LanguageDetector
LanguageSetter
MIMEDetector
MathDocumentProcessor
MetaFinder
MimeTypeSetter
MultiContextCSVEncoder
MultiContextDocumentProcessor
NativeTextExtractor
NewChunk
NotCondition
NumericalFormatter
OrCondition
PLMExpandDocumentProcessor
PrecomputedThumbnailsDocumentProcessor
PrintfValues
PublicUrlProcessor
RealTimeAlerting
RemoteHTTPTransformer
RemoteMOTAPIDocumentProcessor
RemoveContexts
RenameContext
RenameUnmappedContexts
ReplaceContextNames
ReplaceRegexp
ReplaceValues
SemanticPipeDocumentProcessor
SetDefaultValue
SimilarStringToPart
SingleContextDocumentProcessor
SplitValues
StandardPartsMerger
StorageServiceDocumentProcessor
StringHash
StringHash32
StringHash64
StringTransform
TextToNum
URLCodec
URLTransformer
UTF8Checker
UniformRandomContextGenerator
UnitsOfMeasurementNormalizer
ValueSelector
WildcardIndexing
XpathExtractor
XpathFragmentExtractor
ZipfRandomContextGenerator

Attributes

className string

Name	Type	Default value	Description
className	string	-	The restricted DataModel class

XML representation

<DataModelClassCondition className="" ></DataModelClassCondition>

CustomDirectiveCondition com.exalead.indexing.analysis.v10.CustomDirectiveCondition

A condition that matches if the document has the specified directive name, with an optional specific value.

Parent objects

AnalysisPipeline
AndCondition
CGRDocumentProcessor
ConcatValues
ContentCleanup
ConvertTextExtractor
CoordinatesFormatter
CopyContext
CustomDocumentProcessor
DataModelClassResolver
DateFormatter
DebugCrashProcessor
DebugProcessor
DiscardDocument
DocumentProcessor
DocumentProcessorGroup
DoubleToLong
FixedRangeNumericalPartitioning
ForcedRangeNumericalPartitioning
FormatCheckerDate
GeoBBoxProcessor
GeoCategorizer
HTMLCSSExtractor
HTMLCSSSelector
HTMLRelevantContentExtractor
HTMLTableExtractor
InferFileExtension
InsertCurrentDate
JavaDocumentProcessor
JavaProcessor
JavaScriptProcessor
LanguageDetector
LanguageSetter
MIMEDetector
MathDocumentProcessor
MetaFinder
MimeTypeSetter
MultiContextCSVEncoder
MultiContextDocumentProcessor
NativeTextExtractor
NewChunk
NotCondition
NumericalFormatter
OrCondition
PLMExpandDocumentProcessor
PrecomputedThumbnailsDocumentProcessor
PrintfValues
PublicUrlProcessor
RealTimeAlerting
RemoteHTTPTransformer
RemoteMOTAPIDocumentProcessor
RemoveContexts
RenameContext
RenameUnmappedContexts
ReplaceContextNames
ReplaceRegexp
ReplaceValues
SemanticPipeDocumentProcessor
SetDefaultValue
SimilarStringToPart
SingleContextDocumentProcessor
SplitValues
StandardPartsMerger
StorageServiceDocumentProcessor
StringHash
StringHash32
StringHash64
StringTransform
TextToNum
URLCodec
URLTransformer
UTF8Checker
UniformRandomContextGenerator
UnitsOfMeasurementNormalizer
ValueSelector
WildcardIndexing
XpathExtractor
XpathFragmentExtractor
ZipfRandomContextGenerator

Attributes

directiveName string
directiveValue string

Name	Type	Default value	Description
directiveName	string	-	The expected directive name
directiveValue	string	-	An optional expected value for the given directive

XML representation

<CustomDirectiveCondition directiveName="" ></CustomDirectiveCondition>

LanguageDetector com.exalead.indexing.analysis.v10.LanguageDetector

Language detection is performed using the text of all the DocumentChunks associated with the specified input ContextNames for which language was not already detected or specified.

The whole text of all these DocumentChunks is taken into account by a statistical algorithm that detects the language.

This language is then set as the language for all specified chunks.
For example, the language attribute of a DocumentChunk is used by semantic processing.

Language is represented by its iso639-1 code: fr, en.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
languageContext	string	-	If this is not null and if there is a DocumentChunk with a ContextName matching 'languageContext': no automatic detection will be performed, the language specified will be used as the language of the DocumentChunks associated with the ContextNames specified as input.
languagesToDetect	string	-	If not null, restrict the language detector to a set of languages. If you only have a small set of languages to detect, you can restrict language detector to this set to improve precision. List is comma-separated, ex: "en,fr"
defaultLanguage	string	-	If not null, 'defaultLanguage' will be used as the default language when automatic detection fails.
exclude	boolean	False	If true, "inputContexts" is an exclude list instead of an include list. Language detection is then performed on all DocumentChunks except those whose ContextName appears in 'inputContexts'.
outputContext	string	-	ContextName of the DocumentChunk to create. It will contain the language detected in the processed DocumentChunks as defined in ISO 639-1.
minLangPercentage	int	33	Minimum ratio ([0-100]) of language to be detected (0 = always keeps a detected language)
languagesToKeep	int	0	Keeps the n most represented languages in the document. A value of 0 lets the minLangPercentage select the languages.

Nested elements

inputContextsStringValue*
fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
StringValue*	inputContexts	The processor will only be applied to DocumentChunks with a ContextName specified in this list.
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<LanguageDetector name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" languageContext="" languagesToDetect="" defaultLanguage="" exclude="false" outputContext="" minLangPercentage="33" languagesToKeep="0" ></LanguageDetector>

LanguageSetter com.exalead.indexing.analysis.v10.LanguageSetter

The language is set as the language for all the DocumentChunks associated with the specified input ContextNames.

For example, the language attribute of a DocumentChunk is used by semantic processing.

The language is represented by its iso639-1 code: fr, en

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
language iso code
outputContext string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
language	iso code	-	Language specified by ISO 639-1 code.
outputContext	string	-	ContextName of the DocumentChunk to create. It will contain the language name as defined in ISO 639-1.

Nested elements

inputContextsStringValue*
fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
StringValue*	inputContexts	The processor will only be applied to DocumentChunks with a ContextName specified in this list.
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<LanguageSetter name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" language="en" outputContext="" ></LanguageSetter>

ContentCleanup com.exalead.indexing.analysis.v10.ContentCleanup

Analyzes each DocumentChunk and performs whitespace removal, 'Whitespaces' being defined by the Unicode specification.

This includes ' ' '\r' and '\n'.

Input: All DocumentChunks associated with the specified 'inputContext' ContextNames.

Output: Same as input

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
duplicateWhitespaces boolean
leading boolean
trailing boolean
spaces boolean
stripHTML boolean

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
duplicateWhitespaces	boolean	False	Removes duplicate whitespaces. (' ' -> ' ')
leading	boolean	False	Removes the leading whitespaces
trailing	boolean	False	Removes the trailing whitespaces
spaces	boolean	False	Removes all whitespaces.
stripHTML	boolean	False	Strips HTML tags

Nested elements

inputContextsStringValue*
fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
StringValue*	inputContexts	The processor will only be applied to DocumentChunks with a ContextName specified in this list.
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<ContentCleanup name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" duplicateWhitespaces="false" leading="false" trailing="false" spaces="false" stripHTML="false" ></ContentCleanup>

ValueSelector com.exalead.indexing.analysis.v10.ValueSelector

Takes the input contexts in the specified order, and as soon as one is found, it copies the content to the output context and stops.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	ContextName to be associated with the DocumentChunk created for each selection.

Nested elements

inputContextsStringValue*
fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
StringValue*	inputContexts	The processor will only be applied to DocumentChunks with a ContextName specified in this list.
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<ValueSelector name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" ></ValueSelector>

UTF8Checker com.exalead.indexing.analysis.v10.UTF8Checker

Checks that the text passing through is valid UTF-8.
Emits a warning with the document URI and the context name if input is malformed.
Optionally deletes invalid chunks.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
deleteInvalidChunks boolean

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
deleteInvalidChunks	boolean	False	Removes invalid chunks from documents.

Nested elements

inputContextsStringValue*
fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
StringValue*	inputContexts	The processor will only be applied to DocumentChunks with a ContextName specified in this list.
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<UTF8Checker name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" deleteInvalidChunks="false" ></UTF8Checker>

ConcatValues com.exalead.indexing.analysis.v10.ConcatValues

Concatenates all textual content of DocumentChunks where ContextName matches 'inputContexts', and joins them with the 'join' string.

A single DocumentChunk with ContextName 'outputContext' is created as an output.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string
join string
strict boolean
allowDuplicates boolean
cartesianProduct boolean

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	ContextName to be associated with the DocumentChunk created for each concatenated value.
join	string	-	Optional string inserted between concatenated values.
strict	boolean	True	Forces all the input contexts found to generate the concatenation.
allowDuplicates	boolean	True	If true, and if there are multiple DocumentChunks with the same ContextName, it concatenates them all. If false, only the first DocumentChunk among all those with the same ContextName is kept.
cartesianProduct	boolean	False	If there are multiple DocumentChunks with the same ContextName, it generates the cartesian product between all values.

Nested elements

inputContextsStringValue*
fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
StringValue*	inputContexts	The processor will only be applied to DocumentChunks with a ContextName specified in this list.
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<ConcatValues name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" join="" strict="true" allowDuplicates="true" cartesianProduct="false" ></ConcatValues>

RemoveContexts com.exalead.indexing.analysis.v10.RemoveContexts

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor

Nested elements

inputContextsStringValue*
fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
StringValue*	inputContexts	The processor will only be applied to DocumentChunks with a ContextName specified in this list.
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<RemoveContexts name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></RemoveContexts>

MultiContextCSVEncoder com.exalead.indexing.analysis.v10.MultiContextCSVEncoder

Creates a DocumentChunk containing the ContextName and the textual value of the DocumentChunks matching 'inputContexts'.

This processor can be used, for instance, to store arbitrary (key,value) pairs into one single index field.

Note that this storing method is inefficient and should be used with caution. @csh AC_MULTICONTEXT_ENCODER_ID

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string
processUnmappedContexts boolean

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	The ContextName used for newly created chunks.
processUnmappedContexts	boolean	False	All DocumentChunks with an unmapped ContextName in the document will be used for input. This can be used to emulate the 'default meta' and 'content' field feature of CloudView 4.6.

Nested elements

inputContextsStringValue*
fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
StringValue*	inputContexts	The processor will only be applied to DocumentChunks with a ContextName specified in this list.
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<MultiContextCSVEncoder name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" processUnmappedContexts="false" ></MultiContextCSVEncoder>

StringHash com.exalead.indexing.analysis.v10.StringHash

The StringHash processor computes a signed hash of the textual input value.

For example, this value can be used in a field used for grouping.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
nbBits int
outputContext string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
nbBits	int	64	The size of the hash, in bits, including the sign bit. The hash values will be in [-2^(nbBits-1); 2^(nbBits-1) - 1].
outputContext	string	-	The ContextName used for the newly created chunk.

Nested elements

inputContextsStringValue*
fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
StringValue*	inputContexts	The processor will only be applied to DocumentChunks with a ContextName specified in this list.
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<StringHash name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" nbBits="64" outputContext="" ></StringHash>

StringHash64 com.exalead.indexing.analysis.v10.StringHash64

The StringHash processor computes a signed hash of the textual input value on 64 bits.

For example, this value can be used in a field used for grouping.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	The ContextName used for the newly created chunk.

Nested elements

inputContextsStringValue*
fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
StringValue*	inputContexts	The processor will only be applied to DocumentChunks with a ContextName specified in this list.
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<StringHash64 name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" ></StringHash64>

StringHash32 com.exalead.indexing.analysis.v10.StringHash32

The StringHash processor computes a signed hash of the textual input value on 32 bits.

For example, this value can be used in a field used for grouping.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	The ContextName used for the newly created chunk.

Nested elements

inputContextsStringValue*
fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
StringValue*	inputContexts	The processor will only be applied to DocumentChunks with a ContextName specified in this list.
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<StringHash32 name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" ></StringHash32>

NumericalFormatter com.exalead.indexing.analysis.v10.NumericalFormatter

The Numerical Formatter processor creates valid numerical chunks from various number formats.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string
precision int
round int
removeTrailingZeros boolean
groupSeparator string
decimalSeparator string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	The ContextName used for the newly created chunk. If null, it uses the same name as the input.
precision	int	0	Number of digits relevant in the decimal part.
round	int	0	Rounds the integer part with this range.
removeTrailingZeros	boolean	True	Removes the trailing zeros in the decimal part.
groupSeparator	string	-	group separator
decimalSeparator	string	.	decimal separator

Nested elements

inputContextsStringValue*
fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
StringValue*	inputContexts	The processor will only be applied to DocumentChunks with a ContextName specified in this list.
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<NumericalFormatter name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" precision="0" round="0" removeTrailingZeros="true" ></NumericalFormatter>

CoordinatesFormatter com.exalead.indexing.analysis.v10.CoordinatesFormatter

The Coordinates Formatter processor creates a normalized chunk for the latitude and longitude.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string
latitudeContext string
latitudeFormat enum(DMS, Decimal)
longitudeContext string
longitudeFormat enum(DMS, Decimal)

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	The ContextName used for the newly created chunk.
latitudeContext	string	-	The ContextName used as input for the latitude
latitudeFormat	enum(DMS, Decimal)		The input format for the latitude Value can be one of DMS Decimal
longitudeContext	string	-	The ContextName used as input for the longitude
longitudeFormat	enum(DMS, Decimal)		The input format for the longitude Value can be one of DMS Decimal

Nested elements

inputContextsStringValue*
fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
StringValue*	inputContexts	The processor will only be applied to DocumentChunks with a ContextName specified in this list.
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<CoordinatesFormatter name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" latitudeContext="" latitudeFormat="" longitudeContext="" longitudeFormat="" ></CoordinatesFormatter>

DebugProcessor com.exalead.indexing.analysis.v10.DebugProcessor

Dumps all the DocumentChunks named after 'inputContexts' on Standard Output.

This provides a log of the 'Analysis' process. @descr

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
dump boolean
outputContext string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
dump	boolean	True
outputContext	string	-	The ContextName used for the newly created chunk.

Nested elements

inputContextsStringValue*
fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
StringValue*	inputContexts	The processor will only be applied to DocumentChunks with a ContextName specified in this list.
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<DebugProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" dump="true" ></DebugProcessor>

RemoteMOTAPIDocumentProcessor com.exalead.indexing.analysis.v10.RemoteMOTAPIDocumentProcessor

The processing of each input context will be handled by the targeted remote API. @param targetBuildGroups list of build groups that should be used to handle processing. @param remoteMOTAPIConfigName the name of the RemoteMOTAPIConfig object as seen in RemoteMOTAPIConfig.xml high level configuration file.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
remoteMOTAPIConfigName string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
remoteMOTAPIConfigName	string	-

Nested elements

inputContextsStringValue*
fromDataModelDocumentProcessor
targetInstancesStringValue*
AcceptCondition

Type	Wrapper	Description
StringValue*	inputContexts	The processor will only be applied to DocumentChunks with a ContextName specified in this list.
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
StringValue*	targetInstances
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<RemoteMOTAPIDocumentProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" remoteMOTAPIConfigName="" ></RemoteMOTAPIDocumentProcessor>

StringTransform com.exalead.indexing.analysis.v10.StringTransform

Applies textual transformations on chunks from several contexts:

trims blanks at the beginning and end of chunks
reduces sequences of blanks to just one
changes text to uppercase/lowercase/normalized/capitalized

Outputs replace inputs.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
spaces string
form string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
spaces	string	-	What to do with spaces ("trim" or "normalize-spaces", default set to nothing)
form	string	-	What transformation to apply ("lowercase", "uppercase", "normalized", "capitalized", default set to nothing)

Nested elements

inputContextsStringValue*
fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
StringValue*	inputContexts	The processor will only be applied to DocumentChunks with a ContextName specified in this list.
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<StringTransform name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" spaces="" form="" ></StringTransform>

ReplaceValues com.exalead.indexing.analysis.v10.ReplaceValues

The ReplaceValues processor compares all DocumentChunks for a given inputContext with the specified KeyValue map.

When the DocumentChunk value is an exact match, it is replaced by the specified string.

This processor can be used, for instance, to normalize different spelling for document metadata.
@csh AC_REPLACE_VALUES_ID

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

inputContext string
name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean

Name	Type	Default value	Description
inputContext	string	-	The processor will only be applied to DocumentChunks with this ContextName.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor

Nested elements

fromDataModelDocumentProcessor
AcceptCondition
KeyValue*

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.
KeyValue*	-

XML representation

<ReplaceValues inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></ReplaceValues>

PublicUrlProcessor com.exalead.indexing.analysis.v10.PublicUrlProcessor

For each input DocumentChunk associated with the 'inputContext' ContextName, 4 DocumentChunks are created, each associated with a different ContextName:

'treeOutputContext'
'leafOutputContext'
'urlOutputContext'
'urlCategoryOutputContext'

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

Name	Type	Default value	Description
inputContext	string	-	The processor will only be applied to DocumentChunks with this ContextName.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
treeOutputContext	string	-	The ContextName for the DocumentChunk created from the category path encoding the web site tree.
leafOutputContext	string	-	The ContextName for the DocumentChunks created from the complete, normalized, URL.
urlOutputContext	string	-	The ContextName for the DocumentChunk created from the complete, normalized URL.
urlPathOutputContext	string	-	The ContextName for the DocumentChunk created from the normalized URL.
maxPathDepth	int	4	maximum depth of url path

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<PublicUrlProcessor inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" treeOutputContext="" leafOutputContext="" urlOutputContext="" urlPathOutputContext="" maxPathDepth="4" ></PublicUrlProcessor>

DateFormatter com.exalead.indexing.analysis.v10.DateFormatter

If a document chunk matches either:

a custom input format defined with UNIX date syntax (for example,%Y/%m/%d-%H:%M:%S)
one of the automatically recognized date formats (click icon for more information)

the Date Formatter generates three additional document chunks, each with its own context name, using the following naming convention:

$inputContext$dateTimeOutputContext (Default format: %Y/%m/%d-%H:%M:%S)
$inputContext$dateOutputContext (Default format: %Y/%m/%d)
$inputContext$timeOutputContext (Default format: %H:%M:%S)

@csh AC_DATE_FORMATTER_ID

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

Name	Type	Default value	Description
inputContext	string	-	The processor will only be applied to DocumentChunks with this ContextName.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
dateTimeOutputContext	string	-	Suffix for the name of the DocumentChunk containing the date as defined by dateTimeOutputFormat (default YYYY/MM/DD-HH:MM:SS). The original ContextName of the input DocumentChunk and this suffix are concatenated ($orig$dateTimeOutputContext) to produce the ContextName actually used. This DocumentChunk is usually used for date display.
dateTimeOutputFormat	string	-	A date and time output format compliant with libc's strftime.
dateOutputContext	string	-	Suffix for the name of the DocumentChunk containing the date as defined by dateOutputFormat (default YYYY/MM/DD). The original name of the input DocumentChunk and this suffix are concatenated ($orig$dateTimeOutputContext) to produce the name actually used. This DocumentChunk is usually remapped to a category for navigation.
dateOutputFormat	string	-	A date output format compliant with libc's strftime.
timeOutputContext	string	-	Suffix for the name of the DocumentChunk containing the date as defined by timeOutputFormat (default HH:MM:SS). The original name of the input DocumentChunk and this suffix are concatenated ($orig$dateTimeOutputContext) to produce the name actually used.
timeOutputFormat	string	-	A time output format compliant with libc's strftime.
inputFormat	string	-	An optional date input format, compliant with libc's `strptime()` format. If such a format is provided, the automatic date format heuristic is disabled, and the provided date format is used exclusively. Documentation of accepted formats: (days and month literals are only recognized in English) Day %a: weekday abbreviated ("Mon", ...) %A: weekday full ("Monday", ...) %d: day of the month, zero filled [01-31] %e: Equivalent to %d [1-31] %j: day year, zero filled [001-366] %u: day of week starting with Monday (1), i.e. mtwtfss [7 (for Sunday)] %w: day of week as a decimal number [0,6], with 0 representing Sunday Week %U: week number of the year (Sunday as first day of the week) as a decimal number [00,53] %W: week number of the year (Monday as the first day of the week) as a decimal number [01,53] %V: week of the year [01-53] Month %m: the month number [01-12] %b: month locale abbreviated ("Aug", ...) %h: equivalent to %b %B: locale's full month, variable length ("August") Year %y: The year within the century with two-digit dates, for example [69,99] is mapped to [1969,1999] and [00,68] is mapped to [2000,2068] %Y: The year, including the century (for example, 2014) %g: last two digits of year of ISO week number (see %G) %G: year of ISO week number (see %V), for example, 2014; normally useful only with %V Century %C: The century number [00,99] Date %D: Equivalent to mm/dd/yy (08/20/14) %x: locale's date representation (mm/dd/yy), 08/20/2014 %F: %Y-%m-%d (2014-08-20) Hours %l: hour (12-hour clock), for example, [1-12] %I: hour (12-hour clock) zero filled, [01-12] %k: hour (24 hour), for example, 17 %H: hour (24 hour) zero padded, 17 %p: locale's upper case AM or PM (blank in many locales), for example, PM %P: locale's lower case am or pm, for example, pm Minutes %M: The minute [00-59] Seconds %s: seconds since 00:00:00 1970-01-01 UTC (Unix epoch), for example, 1345483096 %S: seconds [00-60], (The 60 is necessary to accommodate a leap second) Time %r: hours, minutes, seconds (12-hour clock), for example, 05:18:16 PM %R: hours, minutes (24-hour clock), for example, 17:18 %T: hours, minutes, seconds (24-hour clock), for example, 17:18:16 %X: locale's time representation, for example, 11:07:26 AM %dt: AM or PM Date and Time %c: locale's date and time, for example, Sat Nov 04 12:02:33 EST 1989 Others %n: Any white space %t: Any white space %%: Replaced by %
removeOriginalChunk	boolean	True	Removes the original input chunk.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<DateFormatter inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" dateTimeOutputContext="" dateTimeOutputFormat="" dateOutputContext="" dateOutputFormat="" timeOutputContext="" timeOutputFormat="" inputFormat="" removeOriginalChunk="true" ></DateFormatter>

FormatCheckerDate com.exalead.indexing.analysis.v10.FormatCheckerDate

The FormatCheckDate processor checks the chunk matches either:

a custom input format defined with UNIX date syntax (for example,%Y/%m/%d-%H:%M:%S)
one of the automatically recognized date formats

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

inputContext string
name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
inputFormat string

Name	Type	Default value	Description
inputContext	string	-	The processor will only be applied to DocumentChunks with this ContextName.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
inputFormat	string	-	An optional date input format, compliant with libc's `strptime()` format. If such a format is provided, the automatic date format heuristic is disabled, and the provided date format is used exclusively. Documentation of accepted formats: (days and month literals are only recognized in English) %a: The day of the week ("Monday", ...) %A: Equivalent to %a %b: The month ("January", ...) %B: Equivalent to %b %c: Equivalent to %a %b %e %H:%M:%S %Y %C: The century number [00,99] %d: The day of the month [01,31] %D: Equivalent to %m/%d/%y %e: Equivalent to %d %h: Equivalent to %b %H: The hour (24-hour clock) [00,23] %I: The hour (12-hour clock) [01,12] %j: The day number of the year [001,366] %m: The month number [01,12] %M: The minute [00,59] %n: Any white space %dt: AM or PM %r: Equivalent to %I:%M:%S %p %R: Equivalent to %H:%M %S: The seconds [00,60] %t: Any white space %T: Equivalent to %H:%M:%S %U: The week number of the year (Sunday as the first day of the week) as a decimal number [00,53] %w: The weekday as a decimal number [0,6], with 0 representing Sunday %W: The week number of the year (Monday as the first day of the week) as a decimal number [00,53] %x: Equivalent to %m/%d/%y %X: Equivalent to %H:%M:%S %y: The year within century. (for two-digit dates, [69,99] is mapped to [1969,1999] and [00,68] is mapped to [2000,2068]) %Y: The year, including the century (for example, 1988) %%: Replaced by %

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<FormatCheckerDate inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" inputFormat="" ></FormatCheckerDate>

SplitValues com.exalead.indexing.analysis.v10.SplitValues

Splits the content of all DocumentChunks associated with the ContextName 'inputContext' using 'separator' as a separator regular expression.

A new DocumentChunk is created for each segment, with 'outputContext' as the ContextName.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

inputContext string
name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string
separator string

Name	Type	Default value	Description
inputContext	string	-	The processor will only be applied to DocumentChunks with this ContextName.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	ContextName to be associated with the DocumentChunk created for each split segment.
separator	string	-	Separator around which to split. ASTL library is used to perform regular expression matching. The regular expression language supported is Perl 5, WITHOUT support for: assertions like \b, \B, ?=, ?!, ?<=, ?<! backreferences \1, \2, ... UNICODE escaping like \u0020 or \p{name} non-greedy (lazy) repeat operators like ??, *?, +?

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<SplitValues inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" separator="" ></SplitValues>

RenameContext com.exalead.indexing.analysis.v10.RenameContext

Each DocumentChunk with ContextName matching 'inputContext' is renamed with a ContextName 'outputContext'.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

inputContext string
name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string

Name	Type	Default value	Description
inputContext	string	-	The processor will only be applied to DocumentChunks with this ContextName.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	The new ContextName for DocumentChunks with ContextName matching 'inputContext'.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<RenameContext inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" ></RenameContext>

CopyContext com.exalead.indexing.analysis.v10.CopyContext

Copies all DocumentChunks with 'inputContext' as ContextName, and creates new DocumentChunks with the same score, language and part but with 'outputContext' as ContextName.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

inputContext string
name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string
requiredAnnotation string
restrictValues string

Name	Type	Default value	Description
inputContext	string	-	The processor will only be applied to DocumentChunks with this ContextName.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	The ContextName used for newly created chunks.
requiredAnnotation	string	-	The name of the required annotation the chunk must have to be copied. If null, no special handling is done on annotations.
restrictValues	string	-	A regexp which values of the chunk must match to be copied to the output context. Values that don't match the regexp will not be copied.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<CopyContext inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" requiredAnnotation="" restrictValues="" ></CopyContext>

FixedRangeNumericalPartitioning com.exalead.indexing.analysis.v10.FixedRangeNumericalPartitioning

Matches numerical values in a range. It transforms a numerical value into a matching range, based on a fixed range size.

For example, with rangeSize = 100,

101 -> 100_199
234 -> 200_299

It also works for negative numbers:

-20 -> -100_-1
0 -> 0_99

This helps to create categories (for navigation) from numerical values.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

inputContext string
name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string
separator string
rangeSize long

Name	Type	Default value	Description
inputContext	string	-	The processor will only be applied to DocumentChunks with this ContextName.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	The ContextName used for newly created chunks.
separator	string	_	The range separator.
rangeSize	long	1	The size of the range to consider.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<FixedRangeNumericalPartitioning inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" separator="_" rangeSize="1" ></FixedRangeNumericalPartitioning>

ForcedRangeNumericalPartitioning com.exalead.indexing.analysis.v10.ForcedRangeNumericalPartitioning

Transforms a numerical value into the text value associated to its matching range from a set of predetermined ranges specified in 'NumericalRange'.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

inputContext string
name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string
separator string

Name	Type	Default value	Description
inputContext	string	-	The processor will only be applied to DocumentChunks with this ContextName.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	The ContextName used for newly created chunks.
separator	string	_	The separator between the beginning and the end of the range. This parameter is deprecated.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition
NumericalRange*

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.
NumericalRange*	-	The forced ranges.

XML representation

<ForcedRangeNumericalPartitioning inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" separator="_" ></ForcedRangeNumericalPartitioning>

NumericalRange com.exalead.indexing.analysis.v10.NumericalRange

Associates text with a numerical range.

The range includes all values >= beg and <= end (beg <= x <= end).
A range corresponding to a unique value with beg = end is allowed.

Parent objects

ForcedRangeNumericalPartitioning

Attributes

beg long
end long
text string

Name	Type	Default value	Description
beg	long	-	The lower bound.
end	long	-	The upper bound.
text	string	-	The associated text.

XML representation

<NumericalRange beg="" end="" text="" ></NumericalRange>

TextToNum com.exalead.indexing.analysis.v10.TextToNum

Processor to hack an approximate sort on a text field.

Implements a surjection from the set of strings to the set of integers [0..N] with N close but inferior or equal to 18,446,744,073,709,551,615

User defines an ordered alphabet. A first surjection from the set of all strings to the set of finite sequences of symbols taken from this alphabet is applied (strip the string from symbols out of the alphabet).

A partial order relation is inferred on the latter set by the alphabet (lexicographical order).

For obvious cardinal numbers reasons (one set is infinite the other is not), the second surjection cannot be partial-order preserving. The idea is to preserve the relation on the shorter strings, AND preserve the relation between shorter strings and longer strings, such as:

if STRING2ULONG('shortstring1') <= STRING2ULONG('shortstring2') then 'shortstring1' <= 'shortstring2'
STRING2ULONG('longstring1') <= STRING2ULONG('longstring2') does NOT insure 'longstring1' <= 'longstring2'
if STRING2ULONG('shortstring1') <= STRING2ULONG('longstring2') then 'shortstring1' <= 'longstring2'

The size of the prefix obviously depends on the size of the alphabet.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

inputContext string
name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
alphabet string
outputContext string
nbBits int

Name	Type	Default value	Description
inputContext	string	-	The processor will only be applied to DocumentChunks with this ContextName.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
alphabet	string	0123456789abcdefghijklmnopqrstuvwxyz	The ordered alphabet.
outputContext	string	-	The ContextName used for the newly created chunk.
nbBits	int	63	Number of bits of unsigned field used for sorting.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<TextToNum inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" alphabet="0123456789abcdefghijklmnopqrstuvwxyz" outputContext="" nbBits="63" ></TextToNum>

DoubleToLong com.exalead.indexing.analysis.v10.DoubleToLong

Using this processor you can store floating point values into signed fields that can then be queried with the DoublePrefixHandler.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

inputContext string
name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
precision int
outputContext string

Name	Type	Default value	Description
inputContext	string	-	The processor will only be applied to DocumentChunks with this ContextName.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
precision	int	1000	The multiplicator. Each value will be multiplied by this factor.
outputContext	string	-	The ContextName used for the newly created chunk.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<DoubleToLong inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" precision="1000" outputContext="" ></DoubleToLong>

GeoBBoxProcessor com.exalead.indexing.analysis.v10.GeoBBoxProcessor

The Geo BBox processor converts the input geometry from WKT to WKB
and compute its bouding box. Both WKB and bounding box are returned as chunks. @descr

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

inputContext string
name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
precision int
bboxMetaName string

Name	Type	Default value	Description
inputContext	string	-	The processor will only be applied to DocumentChunks with this ContextName.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
precision	int	6	The number of decimals that will be used in geometrical representations and computations.
bboxMetaName	string	-

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<GeoBBoxProcessor inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" precision="6" bboxMetaName="" ></GeoBBoxProcessor>

JavaProcessor (deprecated) com.exalead.indexing.analysis.v10.JavaProcessor

Allows documents to be sent to a java process for analysis.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

inputContext string
name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
id string
target string

Name	Type	Default value	Description
inputContext	string	-	The processor will only be applied to DocumentChunks with this ContextName.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
id	string
target	string

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<JavaProcessor inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" id="" target="" ></JavaProcessor>

ReplaceRegexp com.exalead.indexing.analysis.v10.ReplaceRegexp

Substitutes the content substring of all DocumentChunks having the ContextName 'inputContext', using:

'pattern' as the matching substring regular expression
and 'value' as the replacement value.

This value may have the form of sed output format using references to captures \0 through \9.

A new DocumentChunk is created with the substitutions.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

inputContext string
name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string
pattern string
value string
replaceAll boolean

Name	Type	Default value	Description
inputContext	string	-	The processor will only be applied to DocumentChunks with this ContextName.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	ContextName to be associated with the DocumentChunk created for each new context.
pattern	string	-	Pattern used to match the substrings to replace. ASTL library is used to perform regular expression matching. The regular expression language supported is Perl 5, WITHOUT support for: lazy (non-greedy) quantifiers like ?, +?, ??, {n}?, {n,}?, {n,m}? possessive quantifiers like +, ++, ?+, {n}+, {n,}+, {n,m}+ assertions like \b, \B, \A, \z, \Z, \G look-around assertions (?=pattern), (?!pattern), (?<=pattern), (?<!pattern) named captures (?'name'pattern), (?<name>pattern) numeric and named backreferences like \1, \g1, g{-1}, \g{name}, k<name>, k'name' named Unicode character \N{name} all operators related to Perl code inlining like (?{ code }) all operators related to backtracking algorithm control like independent subexpression (?>pattern) \C matching a single C char (octet) of the pattern-match modifiers (?pimsx-imsx) only (?i:pattern) and (?i) are supported (no negative form)
value	string	-	The replacement value (sed-like output format).
replaceAll	boolean	True	Replaces all first occurrences of patterns.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<ReplaceRegexp inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" pattern="" value="" replaceAll="true" ></ReplaceRegexp>

URLCodec com.exalead.indexing.analysis.v10.URLCodec

URL encode/decode with UTF-8 charset only

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

inputContext string
name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string
encodeURIComponent boolean
mode enum(encode, decode)

Name	Type	Default value	Description
inputContext	string	-	The processor will only be applied to DocumentChunks with this ContextName.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	Stores URL encoded form in outputContext. If outputContext = inputContext, it removes the original chunk.
encodeURIComponent	boolean	True	If true (default), it encodes the following characters: ',' '/' '?' ':' '@' '&' '=' '+' '$' '#'
mode	enum(encode, decode)	encode	mode = "encode" or "decode"

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<URLCodec inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" encodeURIComponent="true" mode="encode" ></URLCodec>

WildcardIndexing com.exalead.indexing.analysis.v10.WildcardIndexing

Computes the input chunk substring to perform efficient prefix/substring/suffix search

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

inputContext string
name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string
exactScore int
prefixSearch boolean
prefixScore int
suffixSearch boolean
suffixScore int
substringSearch boolean
substringScore int
maxStringSize int

Name	Type	Default value	Description
inputContext	string	-	The processor will only be applied to DocumentChunks with this ContextName.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	Stores exact/prefix/substring/suffix in outputContext. If outputContext = inputContext, it removes the original chunk.
exactScore	int	4	Specifies the score for an exact match.
prefixSearch	boolean	True	Enables the prefix search.
prefixScore	int	3	Specifies the score for a prefix match.
suffixSearch	boolean	True	Enables the suffix search.
suffixScore	int	2	Specifies the score for a suffix match.
substringSearch	boolean	True	Enables the substring search.
substringScore	int	1	Specifies the score for a substring match.
maxStringSize	int	100	Specifies the max string size for which this processor will be applied.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<WildcardIndexing inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" exactScore="4" prefixSearch="true" prefixScore="3" suffixSearch="true" suffixScore="2" substringSearch="true" substringScore="1" maxStringSize="100" ></WildcardIndexing>

URLTransformer com.exalead.indexing.analysis.v10.URLTransformer

Parses a context string as a regular URL (RFC 2396, "Uniform Resource Identifier") and transforms it according to the given URL pattern.

A new DocumentChunk is created with the substitution.

Pattern used to transform the URL (in the form <scheme>://<authority><path>?<query>#<fragment>):

Characters other than '$' or '\' are kept as-is
The '$' character and the '\' character must be escaped with a leading \
The ${expression} form allows to compute a string expression based on URL components (see "Expression" below)

Expression used inside the enclosing ${}:

url: Original URL
scheme: Scheme name ("http", "https", "file", ...)
authority: Authority (host:port or host) (may be empty)
host: Hostname part of the authority (may be empty)
port: Port number part of the authority (may be empty)
userInfo: username:password field of the authority (may be empty)
file: File starting with / and query string, if any
pathurl: Normalized absolute path starting with /
path: Normalized absolute path (may start with C:\ on Windows)
query: Normalized query part starting with ? (may be empty)
args: Query part without the leading ? (may be empty)
fragment: Fragment part starting with #(may be empty)
reference: Reference part ; i.e., fragment without the leading # (may be empty)
arg:name: Query part argument identified by its name, unescaped (you must re-escape it using "urlencode:" when necessary)
str:string: The final argument is not a variable name, but a string (only useful for clarity purpose)
tolower:expression: Transform into lowercase (ONLY A-Z)
toupper:expression: Transform into uppercase (ONLY a-z)
urlencode:expression :URL encoding (%NN or +)
urlpathencode:expression: URL encoding outside / fragments
urldecode:expression: URL decoding
pathslash:expression: Convert \ into /
pathantislash:expression: Convert / into \

Notes:

Unreserved characters are unescaped during URL processing (i.e., never '%' or '\')
The lower other similar prefix accept recursion (i.e., the expression "${urlpathencode:pathantislash:toupper:path}" is valid)
Both "file://C:\path" and "file:///C:\path" will produce path="/C:\path"

Examples:

With the input context value "http://www.example.com/bar/foo?bar=42"
- "hello, world" => "hello, world"
- "the scheme is ${scheme}" => "the scheme is http"
- "the scheme is \${scheme}" => "the scheme is \${scheme}
- "http://myserver${path}${query}" => "http://myserver/bar/foo?bar=42"
- "http://myserver/applet?f=${urlpathencode:path}&t=${arg:bar}" => "http://myserver/applet?f=/bar/foo&t=42"
- "http://myserver/applet?f=${urlencode:path}&t=${arg:bar}" => "http://myserver/applet?f=%2Fbar%2Ffoo&t=42"
- "http://myserver/applet?f=${urlpathencode:pathantislash:toupper:path}" => "http://myserver/applet?f=%5CBAR%5CFOO"
With the input context value "file:///C:/My%20Documents/Document.doc"
- "${pathantislash:urldecode:path}" => "C:\My Documents\Document.doc"

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

inputContext string
name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string
urlPattern string

Name	Type	Default value	Description
inputContext	string	-	The processor will only be applied to DocumentChunks with this ContextName.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	ContextName to be associated with the DocumentChunk created for each new context.
urlPattern	string	-	Pattern used to transform the URL.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<URLTransformer inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" urlPattern="" ></URLTransformer>

GeoCategorizer com.exalead.indexing.analysis.v10.GeoCategorizer

A processor that categorizes geographic points given their inclusion in a GeoDomain.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

inputContext string
name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string

Name	Type	Default value	Description
inputContext	string	-	The processor will only be applied to DocumentChunks with this ContextName.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	ContextName of the chunk to create.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition
GeoDomain*

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.
GeoDomain*	-

XML representation

<GeoCategorizer inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" ></GeoCategorizer>

DiskDomain com.exalead.search.v30.DiskDomain

No documentation for this element.

Parent objects

ExplicitGeoFacet
GeoCategorizer

Attributes

title string
id int
radius double
x double
y double

Name	Type	Default value	Description
title	string	-
id	int	0	Unique identifier of this domain. If id=0 (its default value) the category path will be the set of vertices. Otherwise, it will be the id value.
radius (UI: Radius)	double	-	Disk radius in meters
x	double	-	First coordinate of the center for the DiskDomain. If the point type is XY, it will be interpreted as the X coordinate (integer units). For geographic points (GPS), it will be interpreted as the latitude coordinate.
y	double	-	Second coordinate of the center for the DiskDomain. If the point type is XY, it will be interpreted as the Y coordinate (integer units). For geographic points (GPS), it will be interpreted as the longitude coordinate.

XML representation

<DiskDomain title="" id="0" radius="" x="" y="" ></DiskDomain>

PolygonDomain com.exalead.search.v30.PolygonDomain

No documentation for this element.

Parent objects

ExplicitGeoFacet
GeoCategorizer

Attributes

title string
id int
vertices string

Name	Type	Default value	Description
title	string	-
id	int	0	Unique identifier of this domain. If id=0 (its default value) the category path will be the set of vertices. Otherwise, it will be the id value.
vertices	string	-	Polygon vertices, as a list of (x,y) coordinates. For example: "0.0,0.0;1.1,0.1;1.1,1.1"

XML representation

<PolygonDomain title="" id="0" vertices="" ></PolygonDomain>

KMLDomain com.exalead.search.v30.KMLDomain

Definition of a geographic domain using a KML or KMZ resource

Parent objects

ExplicitGeoFacet
GeoCategorizer

Attributes

title string
id int
resource string
KMZ boolean

Name	Type	Default value	Description
title	string	-
id	int	0	Unique identifier of this domain. If id=0 (its default value) the category path will be the set of vertices. Otherwise, it will be the id value.
resource	string	-
KMZ	boolean	False	Is this resource a KMZ resource?

XML representation

<KMLDomain title="" id="0" resource="" KMZ="false" ></KMLDomain>

SHPDomain com.exalead.search.v30.SHPDomain

No documentation for this element.

Parent objects

ExplicitGeoFacet
GeoCategorizer

Attributes

title string
id int
shpResource string
shxResource string
dbfResource string

Name	Type	Default value	Description
title	string	-
id	int	0	Unique identifier of this domain. If id=0 (its default value) the category path will be the set of vertices. Otherwise, it will be the id value.
shpResource	string	-
shxResource	string	-
dbfResource	string	-

XML representation

<SHPDomain title="" id="0" shpResource="" shxResource="" dbfResource="" ></SHPDomain>

MimeTypeSetter com.exalead.indexing.analysis.v10.MimeTypeSetter

Manually sets the mime type

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
value string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
value	string	-	New mime type

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<MimeTypeSetter name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" value="" ></MimeTypeSetter>

MetaFinder com.exalead.indexing.analysis.v10.MetaFinder

Keeps track of all document metas

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<MetaFinder name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></MetaFinder>

JavaDocumentProcessor com.exalead.indexing.analysis.v10.JavaDocumentProcessor

Takes Java code either inline or from a file, and executes it on-the-fly.

For production mode, we recommend packaging your custom code as a Java Plugin (CVPlugin) and using the Custom Document Processor to call it.
Plugins allow better packaging and source code maintenance.
Note that this processor does not work when the Java JVM used to run Cloudview is only a JRE and not a JDK

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
path string
priority int
sourceCode string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
path	string	-	User defined path to a Java file containing the processor code
priority	int	-	Defines which path to use (0: user defined path, 1: resource managed path (inlined Java))
sourceCode	string	-	Inline Java code

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<JavaDocumentProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" path="" priority="" sourceCode="" ></JavaDocumentProcessor>

JavaScriptProcessor (deprecated) com.exalead.indexing.analysis.v10.JavaScriptProcessor

This document processor is deprecated. Use the Java document processor instead.
The JavaScript Processor takes a JS script and executes it.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
path string
priority int
script string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
path	string	-	User defined path to a JS file containing the processor code
priority	int	-	Defines which path to use (0: user defined path, 1: resource managed path (inlined JS))
script	string	-	Inline script

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<JavaScriptProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" path="" priority="" script="" ></JavaScriptProcessor>

StorageServiceDocumentProcessor com.exalead.indexing.analysis.v10.StorageServiceDocumentProcessor

Queries the storage for any meta to attach to the document.

Multi-valued pairs are pushed as multi-valued metas.

For example:

The storage key "nb_comment" will be attached as "nb_comment" meta on the document.
The storage key "tags[]" will be attached as "tags" multi-valued meta on the document.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
instance string
metaIdentifier string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
instance	string	-	Storage service instance
metaIdentifier	string	-	Defines an optional meta name that will be used as storage Identifier instead of the document Uri.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<StorageServiceDocumentProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" instance="" metaIdentifier="" ></StorageServiceDocumentProcessor>

MathDocumentProcessor com.exalead.indexing.analysis.v10.MathDocumentProcessor

Performs mathematical operations on a numerical field. Expressions must be prefaced by a $.

For example, the expression `$ht_price * 1.196` finds the first chunk in the `ht_price` context, and replaces all occurrences of `ht_price` with the mathematical expression.

The result will be a new text chunk, either in the Output context (if specified), or in the original `ht_price` context.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
expression string
outputContext string
floatingPoint boolean

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
expression	string	-	Arithmetic expression to evaluate. For example: "$file_size + 42"
outputContext	string	-	ContextName of the chunk to create.
floatingPoint	boolean	False	Output: A floating point number instead of the default integer one.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<MathDocumentProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" expression="" outputContext="" floatingPoint="false" ></MathDocumentProcessor>

PrecomputedThumbnailsDocumentProcessor com.exalead.indexing.analysis.v10.PrecomputedThumbnailsDocumentProcessor

The Precomputed Thumbnails Document Processor precomputes thumbnails of the first DocumentPart.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
convertAddresses string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
convertAddresses	string	-	Semicolon separated list of convert instance names or urls to use.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<PrecomputedThumbnailsDocumentProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></PrecomputedThumbnailsDocumentProcessor>

RealTimeAlerting com.exalead.indexing.analysis.v10.RealTimeAlerting

The Real-time alerting document processor matches queries defined by end-users and alerts them as soon as possible a new matching document is indexed. To be used only when not in task queue mode.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor

Nested elements

fromDataModelDocumentProcessor
alertGroupsAlertGroup*
customPublishersCustomPublisher*
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AlertGroup*	alertGroups	List of alert groups handled by this processor, empty means ALL groups
CustomPublisher*	customPublishers
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<RealTimeAlerting name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></RealTimeAlerting>

AlertGroup com.exalead.indexing.analysis.v10.AlertGroup

No documentation for this element.

Parent objects

alertGroups RealTimeAlerting

Attributes

name string

Name	Type	Default value	Description
name	string	-

XML representation

<AlertGroup name="" ></AlertGroup>

CustomPublisher com.exalead.indexing.analysis.v10.CustomPublisher

Custom publisher configuration

Parent objects

customPublishers RealTimeAlerting

Attributes

classId string

Name	Type	Default value	Description
classId	string	-	Custom publisher type

Nested elements

configKeyValue*

Type	Wrapper	Description
KeyValue*	config

XML representation

<CustomPublisher classId="" ></CustomPublisher>

MIMEDetector com.exalead.indexing.analysis.v10.MIMEDetector

The MIME detector operates on each DocumentPart for which a MIME-type is not available.

The MIME-type can be specified for each DocumentPart in the PAPI.

For DocumentPart, the 'bytes' and the 'filename' are used to guess the real MIME-type and charset.

The guessed MIME-type and the charset are then set as attributes of the DocumentPart.

Input: The DocumentPart of the document.

Output: 'mime' and 'encodingToUse' attributes of DocumentParts.

This document processor does not create any document chunks.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
defaultValue string
defaultCharset string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
defaultValue	string	-	Default mime to use if not detected.
defaultCharset	string	-	On text or HTML files, the MIME detector tries to detect charset encoding automatically. If the encoding cannot be detected, this 'defaultCharset' is used.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<MIMEDetector name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" defaultValue="" defaultCharset="" ></MIMEDetector>

NativeTextExtractor com.exalead.indexing.analysis.v10.NativeTextExtractor

Extraction is performed for the following data types:

text/plain for Text files.
text/html for HTML Files.
application/x-exalead-document for CloudView 4.6 document format (com.exalead.document)
application/x-exalead-ndoc for CloudView 5 internal document format, binary.
application/x-exalead-ndoc-v10+xml for CloudView internal document format, XML.

@csh AC_TEXTEXTRACTOR_HTML_ID

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
annotateHTML	boolean	False	Adds some stylish annotations to DocumentChunks (for HTML files only): html:p for DocumentChunks generated from <p> html:row for DocumentChunks generated from <tr> html:column for DocumentChunks generated from <td> or <th> html:table for DocumentChunks generated from <table> html:h1 for DocumentChunks generated from <h1> html:h2 for DocumentChunks generated from <h2> html:h3 for DocumentChunks generated from <h3> html:h4 for DocumentChunks generated from <h4> html:h5 for DocumentChunks generated from <h5> html:h6 for DocumentChunks generated from <h6> html:link for DocumentChunks generated from <a>, <iframe> or <frame> html:link:rel if the link has a "rel" attribute html:link:name if the link has a "name" attribute html:list for DocumentChunks generated from <ul>, <ol> or <dl> html:item for DocumentChunks generated from <li> html:bold for DocumentChunks generated from <b> or <strong> html:italic for DocumentChunks generated from <i> or <em> html:underline for DocumentChunks generated from <u> html:strike for DocumentChunks generated from <s> or <strike> html:pre for DocumentChunks generated from <pre> html:invisible for DocumentChunks containing invisible text (display: none, white on white) html:class for DocumentChunks taken in a CSS class html:id for DocumentChunks taken in a CSS id html:img:src for DocumentChunks created from a <img> It also creates specific HTML DocumentChunks with the following contexts: html:lang when parsing a <html> containing the "lang" attribute html:xml:lang when parsing a <html> containing the "xml:lang" attribute html:title when parsing a <title> html:title:other when parsing a second <title> html:base:href when parsing a <base> html:link when parsing a <link> containing the "src" attribute and annotated by: html:link:rel if the link has a "rel" attribute html:link:type if the link has a "type" attribute html:http-equiv:NAME when parsing a http-equiv meta html:meta:NAME when parsing a meta named "NAME"
skipInvisibleHTMLText	boolean	False	Skips the invisible text. For example, white fonts on white backgrounds (for HTML files only).
extractJs	boolean	False	Tries to parse JavaScript and then extract links.
extractHTMLTables	boolean	False	Adds annotations on table, tr, td, th
extractHTMLStyles	boolean	False	Adds annotations on style attributes.
extractHTMLForms	boolean	False	Add annotations on Forms, select.
maxHTMLAnnotationDepth	int	20	Prevents new annotations from being created after @c maxHTMLAnnotationDepth HTML level.
disableAutomaticHTMLDTDFix	boolean	False	Disables automatic DTD fix on HTML documents.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<NativeTextExtractor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" annotateHTML="false" skipInvisibleHTMLText="false" extractJs="false" extractHTMLTables="false" extractHTMLStyles="false" extractHTMLForms="false" maxHTMLAnnotationDepth="20" disableAutomaticHTMLDTDFix="false" ></NativeTextExtractor>

ConvertTextExtractor com.exalead.indexing.analysis.v10.ConvertTextExtractor

This processor performs text content extraction for all MIME-types (300+ file formats are currently handled).

See the "Supported Formats" technical note for more information.

Text, HTML, and built-in data types must be processed by the 'NativeTextExtractor' rather than this processor.

Make sure to have a 'NativeTextExtractor' before the ConvertTextExtractor in your pipeline. @csh AC_TEXTEXTRACTOR_MIME_ID

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
looseTextDetection boolean
forceContent boolean
minInputSizeKB long
maxInputSizeKB long
maxRecursionDepth int
maxRecursionDocuments int
maxRecursionDocumentsTotal int
strictSizeCheck boolean
retryIO string
filter string
timeoutMs long
priority string
embedded string
attachments string
styles string
forceConversion boolean
startPage long
maxPages long
maxOutputSizeKB long
allowUnicode32 boolean
allowDocumentChars boolean
outsideIn string
outsideInFallback string
outsideInOnly string
outsideInForPreview string
outsideInSimpleXHTMLFallback string
ocr string
ocrFallback string
ocrDetect string
ocrQuality string
ocrLang string
ocrTimeoutMs long
ocrMaxPages int
ocrPriority string
httpProxyUrl string
disablePlugins boolean
overrideAddresses string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
looseTextDetection	boolean	True	Looses text detection to detect more text files, including suspicious ones (not .txt or .html) ("true", "false")
forceContent	boolean	False	Forces to accept the content, even if the MIME type does not seem to be a known or supported MIME type.
minInputSizeKB	long	-1	Minimum document size accepted, in kilobytes.
maxInputSizeKB	long	-1	Maximum document size accepted, in kilobytes.
maxRecursionDepth	int	-1	Maximum recursion depth.
maxRecursionDocuments	int	-1	Maximum number of documents that can be converted in one directory level.
maxRecursionDocumentsTotal	int	-1	Maximum number of documents that can be converted over all levels.
strictSizeCheck	boolean	False	Strict size validation mode (even for partial reads).
retryIO	string	-	Uses regular I/O when mmap fails. ("true", "false")
filter	string	-	Native filter identifier list to be used specifically. The list is a comma-separated (,) list of filter identifiers with optional ending argument(s) separated by semi-colons (;). If the filter identifier is prefixed by '!', the corresponding filter will be explicitly excluded. The special filter identifier '' stands for "all other filters". First match wins: ",!doc" is identical to "". For example: filter="!jpeg," will accept all filters but the jpeg filter.
timeoutMs	long	-1	Conversion timeout value, in milliseconds. If the conversion process takes longer, the remote side attemps to abort the conversion process.
priority	string	-	Worker thread(s) priority to be used for the processing ("normal", "lowest", "very low", "low", "normal", "high", "very high")
embedded	string	-	Includes embedded images ("true", "false", "optional")
attachments	string	-	Includes embedded attachments ("true", "false", "optional")
styles	string	-	Attempts to extract more text styles for HTML conversion ("true", "false", "optional")
forceConversion	boolean	False	Attempts to generate an empty document upon conversion error (may be ignored)
startPage	long	-1	Starts conversion from this page number (page number starts at 1). This parameter is only taken into account for image processing and may be ignored.
maxPages	long	-1	Maximum number of pages to process for xml conversion (may be ignored).
maxOutputSizeKB	long	-1	Maximum output size on the remote side, in kilobytes. If the generated output exceeds this value, the document may be truncated or invalid.
allowUnicode32	boolean	False	Allows the use of 32-bit unicode points.
allowDocumentChars	boolean	False	Allows the use of Unicode private range characters (E0XX) for separators (keyword, sentence, paragraph separators, ...)
outsideIn	string	-	This feature is no longer supported. ("true", "false", "optional")
outsideInFallback	string	-	This feature is no longer supported. ("true", "false", "optional")
outsideInOnly	string	-	This feature is no longer supported. ("true", "false", "optional")
outsideInForPreview	string	-	This feature is no longer supported. ("true", "false", "optional")
outsideInSimpleXHTMLFallback	string	-	This feature is no longer supported. ("true", "false", "optional")
ocr	string	-	Converts using OCR ("true", "false", "optional")
ocrFallback	string	-	Fallback to OCR if heuristics deem it necessary ("true", "false", "optional")
ocrDetect	string	-	Detects documents requiring OCR (and rejects them) ("true", "false")
ocrQuality	string	-	OCR quality ("fast", "normal", "best")
ocrLang	string	-	OCR language(s) ("en" for English, "en;fr" for French and English, etc.)
ocrTimeoutMs	long	-1	OCR conversion timeout value, in milliseconds. If the OCR process takes longer, the remote side attemps to abort the conversion process. This value overrides the timeout value if the processing involves an OCR operation.
ocrMaxPages	int	-1	Maximum number of pages to process for OCR.
ocrPriority	string	-	Worker thread(s) priority to be used for the OCR processing ("normal", "lowest", "very low", "low", "normal", "high", "very high")
httpProxyUrl	string	-	Optional HTTP proxy URL. The URL can embed credentials if required.
disablePlugins	boolean	False	Disables external plugins.
overrideAddresses	string	-

Nested elements

fromDataModelDocumentProcessor
AcceptCondition
KeyValue*

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.
KeyValue*	-

XML representation

<ConvertTextExtractor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" looseTextDetection="true" forceContent="false" minInputSizeKB="-1" maxInputSizeKB="-1" maxRecursionDepth="-1" maxRecursionDocuments="-1" maxRecursionDocumentsTotal="-1" strictSizeCheck="false" retryIO="" filter="" timeoutMs="-1" priority="" embedded="" attachments="" styles="" forceConversion="false" startPage="-1" maxPages="-1" maxOutputSizeKB="-1" allowUnicode32="false" allowDocumentChars="false" outsideIn="" outsideInFallback="" outsideInOnly="" outsideInForPreview="" outsideInSimpleXHTMLFallback="" ocr="" ocrFallback="" ocrDetect="" ocrQuality="" ocrLang="" ocrTimeoutMs="-1" ocrMaxPages="-1" ocrPriority="" httpProxyUrl="" disablePlugins="false" overrideAddresses="" ></ConvertTextExtractor>

RemoteHTTPTransformer com.exalead.indexing.analysis.v10.RemoteHTTPTransformer

The processor posts part bytes to the remote HTTP service, and gets the typed resource as a result.

The remote service may return a Document.MIME_V10 document, or any other document that can later be processed in the pipeline.

If the remote service returns a non "OK" HTTP status (!= 200 error code), the corresponding error is passed as a regular error.

The service may also advertise a filename, using the standard Content-Disposition's 'filename' attribute.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
remoteUrl string
timeoutMs int
httpIdleTimeoutMs int

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
remoteUrl	string	-	Remote URL
timeoutMs	int	-	Remote processor timeout, in milliseconds. This is the timeout.
httpIdleTimeoutMs	int	-	Cached HTTP connection idle timeout. This is an advanced setting. For efficiency, the RemoteHTTPTransformer maintains a pool of opened connections to the remote HTTP service. This defines the timeout for connections which are no longer used. Default is 10.000.

Nested elements

fromDataModelDocumentProcessor
argMappingRemoteHTTPTransformerRemoteArgMapping*
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
RemoteHTTPTransformerRemoteArgMapping*	argMapping	Argument(s) mapping, if any. @see RemoteHTTPTransformerRemoteArgMapping
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<RemoteHTTPTransformer name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" remoteUrl="" timeoutMs="" httpIdleTimeoutMs="" ></RemoteHTTPTransformer>

RemoteHTTPTransformerRemoteArgMapping com.exalead.indexing.analysis.v10.RemoteHTTPTransformerRemoteArgMapping

Transformation
RemoteHTTPTransformer argument mapping.

Parent objects

argMapping RemoteHTTPTransformer

Attributes

key string
value string
defaultValue string

Name	Type	Default value	Description
key	string	-	URL key to map. This key name will be used as remote HTTP argument name.
value	string	-	Value to use. If @c null, the @c defaultValue value will be used. The following values names are reserved: $docname: the document name or URI $msg.uri: see @c com.exalead.mercury.papi.PAPIMessage $msg.source: see @c com.exalead.mercury.papi.PAPIMessage $part.name: see @c com.exalead.indexing.DocPart $part.filename: see @c com.exalead.indexing.DocPart $part.encoding: see @c com.exalead.indexing.DocPart $part.forcedMime: see @c com.exalead.indexing.DocPart $part.mimeHint: see @c com.exalead.indexing.DocPart $part.mime: see @c com.exalead.indexing.DocPart $part.encodingToUse: see @c com.exalead.indexing.DocPart $part.bytes.length: see @c com.exalead.indexing.DocPart $part.customDirectives.*: see @c com.exalead.indexing.DocPart $$$foo: escaping for $foo
defaultValue	string	-	Value to use if the @c value is @c null. If this value is @c null, the empty string will be used.

XML representation

<RemoteHTTPTransformerRemoteArgMapping key="" value="" defaultValue="" ></RemoteHTTPTransformerRemoteArgMapping>

StandardPartsMerger com.exalead.indexing.analysis.v10.StandardPartsMerger

This processor does nothing if there are no DocumentParts (only root DocumentChunks).

This processor needs one DocumentPart called the 'Master Part'.

If there is only one part, this part is the 'Master Part'.

If there are multiple parts, the part named after the 'masterPart' attribute is the 'Master Part'. @csh AC_STANDARDPARTS_MERGER_ID

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
masterPart string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
masterPart	string	-	Name of the master part. This name should be "master" to follow the convention used by connectors that send documents composed of multiple parts (e.g. mails with attachments).

Nested elements

fromDataModelDocumentProcessor
partSpecificContextsStringValue*
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
StringValue*	partSpecificContexts	The ContextNames of the DocumentChunk from the non-master part that should be copied to the root document.
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<StandardPartsMerger name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" masterPart="" ></StandardPartsMerger>

SemanticPipeDocumentProcessor com.exalead.indexing.analysis.v10.SemanticPipeDocumentProcessor

Instantiates a semantic pipe and creates chunks out of resulting annotations.

It can be used to instantiate classification processors, and perform document level operations from their output.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
pipeline string
annotations string
topLevelAnnotationsOnly boolean

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
pipeline	string	-	Analysis pipeline on which semantic processors will be used.
annotations	string	-	A chunk will be created for each annotation which name is in the list. Comma-separated list of annotations.
topLevelAnnotationsOnly	boolean	False	Considers top level annotations only. For example, results from the QueryMatcher or Fast Rules.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition
SemanticProcessor*

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.
SemanticProcessor*	-	List of semantic processors to use

XML representation

<SemanticPipeDocumentProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" annotations="" topLevelAnnotationsOnly="false" ></SemanticPipeDocumentProcessor>

Anchorer com.exalead.indexing.analysis.v10.Anchorer

Adds an annotation on the first and last tokens of either a processed sequence (first/last) or a range defined by an annotation a (first_a/last_a)

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
tagsToAnchor	string	-	List of comma-separated tags on which to work
finalAnnotationOnNextToken	boolean	False	If true, sets final annotation on the token after the last token of annotation a
finalCannotBeSepSpace	boolean	False	If final can't be a space, the annotation last may be set on the next non-blank token

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<Anchorer name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" tagsToAnchor="" finalAnnotationOnNextToken="false" finalCannotBeSepSpace="false" ></Anchorer>

CompoundWordSplitter com.exalead.indexing.analysis.v10.CompoundWordSplitter

Annotates compound words that use CamelCase (like SearchServer) or underscores (like my_variable) to separate the root words. This allows users to search for the root words individually.
Annotations generated:

"compound": for example, compound="search server"

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
tokenizeAnnotations boolean
doCamelCase boolean
doUnderscore boolean

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
tokenizeAnnotations	boolean	True	Subtokenizes "SearchServer" into "Search" "Server" automatically, and keep original annotations.
doCamelCase	boolean	True	Separates compound words before each capital letter. For example, the annotation for "CamelCase" is compound="camel case".
doUnderscore	boolean	True	Separates multi-word strings wherever there is an underscore. For example, the annotation for "under_score" is compound="under score".

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<CompoundWordSplitter name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" tokenizeAnnotations="true" doCamelCase="true" doUnderscore="true" ></CompoundWordSplitter>

OntologyMatcher com.exalead.indexing.analysis.v10.OntologyMatcher

An OntologyMatcher detects concepts defined in an ontology in the textual content of the Document Chunks.

Typically, an ontology contains a list of business terms to be detected. Resulting Annotations are mapped to enable navigation by business concepts.
Annotations generated:

Depends on the resource (See Pkg).

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
enableApproxMatching	boolean	False	Enables approximative matching in ontology. Approximative matching uses the Damerau-Levenshtein edit distance.
minWordSizeForDist1	int	3	Minimum number of chars in token to enable the Damerau-Levenshtein distance of 1.
minWordSizeForDist2	int	8	Minimum number of chars in token to enable the Damerau-Levenshtein distance of 2.
resourceDir	string	-	URL for the directory containing the ontology (data://, file;// or resource://).
restrictLanguage	boolean	True	Keeps only the expression added with language == Language.XX or with the document language. For example, if the Ontology contains an expression added with language=En, it will be extracted only for an English document if restrictLanguage is set to true.
keepLongestMatch	boolean	True	Keeps only the longest match. For example, if you have 5 tokens ('a', 'b', 'c', 'd', 'e') and 4 annotations 'a', 'a-c', 'b-c-d' and 'd-e', this option will only keep 'b-c-d' and remove all other annotations.
keepLongestMatchInterTag	boolean	False	Keeps only the longest match (tag independant). For example, if you have 5 tokens ('a', 'b', 'c', 'd', 'e') and 4 annotations 'a', 'a-c', 'b-c-d' and 'd-e', this option will only keep 'b-c-d' and remove all other annotations.
tokenizeAnnotations	boolean	False	If you have some multi-tokens annotations (like "super market" annotation on token "supermarket", this option will automatically subtokenize "supermarket" in "super" "market" and keep original annotations. If you enable this option, keepLongestMatch and keepLongestMatcherInterTag will be set to true.
annotationsToIgnore	string	-	Sets the list of annotations to be ignored (comma-separated). This feature allows you to define a list of words/expressions to ignore in the recognition of this ontology. For example, if you add: the expressions "of" and "the" with the tag "toIgnore" in ontology A, and the expression "website embassy" in ontology B with tagsToIgnore="toIgnore", ... you will be able to match "website of the embassy", "website of embassy" and "website embassy".
ignoreSpaces	boolean	False	If your ontology was compiled with matchOnSeparators=false, this allows 'lemonde' to retrieve 'le monde' or 'le monde' to retrieve 'lemonde'. If your ontology was compiled with matchOnSeparators=true, this allows 'le monde' to retrieve 'le monde'.
annotationPrefix	string		A prefix to add to each annotation tag. For example, if the package of the entry matched in the ontology is "exalead.location.country" and the annotationPrefix is "myOntology_", an annotation will be added with the tag "myOntology_exalead.location.country".
trustLevelBasedDedup	boolean	False	Keeps only the annotation with the highest trust level when several entries from a package match the same text chunk.

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<OntologyMatcher name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" enableApproxMatching="false" minWordSizeForDist1="3" minWordSizeForDist2="8" resourceDir="" restrictLanguage="true" keepLongestMatch="true" keepLongestMatchInterTag="false" tokenizeAnnotations="false" annotationsToIgnore="" ignoreSpaces="false" annotationPrefix="" trustLevelBasedDedup="false" ></OntologyMatcher>

NamedEntitiesMatcher com.exalead.indexing.analysis.v10.NamedEntitiesMatcher

The Named Entities Matcher detects named entities such as people, organizations, or places, in the textual content of the document.

It generates annotations like NE.person or NE.organization, using ontology-based matching and/or rule-based matching.

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
resourceDir string
rules string
prefix string
language string
partOfSpeechFiltering boolean
useKnownWordsForDisambiguisation boolean

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
resourceDir	string	-	URL for the resource (data://, file;// or resource://).
rules	string	ne	Defines which entities will be extracted: The default value, `ne` triggers the extraction of people, organizations, locations and events. The value `ne-all` triggers the extraction of all types of entities.
prefix	string	NE	Prefix to add in front of each annotation generated by the named entity matcher.
language	string	-	Languages for which the processor is activated; if no language is specified, the processor is activated for all languages.
partOfSpeechFiltering	boolean	True	It discards annotations for parts of text made of a name followed by a verb or an adverb with the first letter in uppercase. This filter is useful if your documents contain a lot of titles with several capitalized words (what is called 'Title Case'). It applies to `NE.person`, `NE.place` and `NE.organization`.
useKnownWordsForDisambiguisation	boolean	True	Uses a resource of known words to disambiguate named entities candidates. It works only for English and French.

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<NamedEntitiesMatcher name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="" prefix="NE" language="en" partOfSpeechFiltering="true" useKnownWordsForDisambiguisation="true" ></NamedEntitiesMatcher>

Classifier com.exalead.indexing.analysis.v10.Classifier

A Classifier classifies a whole document according to the existing annotations on selected Document Chunks.

The annotations are matched against a learning resource.

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
resourceDir	string	-	URL for the vocabulary resource (data://, file;// or resource://)
annotationName	string	-	Name of the annotation to add.
language	iso code	-	Language for which the vocabulary classifier is activated.
excludedLanguages	string		Language for which the vocabulary classifier is deactivated (works only if language=xx, comma-separated).
addAnnotationsOnKeywords	boolean	False	If true, it adds annotations to all matching tokens.
maxAnnotations	int	-1	Maximum number of annotations per document.
minTrustLevel	int	0	The minimum trust level of categories to keep.
maxKeywords	int	-1	The maximum number of keywords to keep.
minKeywords	int	1	The minimum number of keywords per class.
collapseToken	boolean	False	If true, all identical tokens are collapsed.
extraPrefixAnnotations	string		The optional list of prefix annotations to keep (comma-separated).
extraAnnotationsMinTrustLevel	int	100	The minimum trust level to keep an extra annotation.

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<Classifier name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="" annotationName="" language="en" excludedLanguages="" addAnnotationsOnKeywords="false" maxAnnotations="-1" minTrustLevel="0" maxKeywords="-1" minKeywords="1" collapseToken="false" extraPrefixAnnotations="" extraAnnotationsMinTrustLevel="100" ></Classifier>

HierarchicalClassifier com.exalead.indexing.analysis.v10.HierarchicalClassifier

A Classifier classifies a whole document according to the existing annotations on selected Document Chunks.

The annotations are matched against a learning resource.

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

Name	Type	Default value	Description
resourceDir	string	-	URL for the vocabulary resource (data://, file;// or resource://)
annotationName	string	-	Name of the annotation to add.
language	iso code	-	Language for which the vocabulary classifier is activated.
excludedLanguages	string		Language for which the vocabulary classifier is deactivated (works only if language=xx, comma-separated).
addAnnotationsOnKeywords	boolean	False	If true, it adds annotations to all matching tokens.
maxAnnotations	int	-1	Maximum number of annotations per document.
minTrustLevel	int	0	The minimum trust level of categories to keep.
maxKeywords	int	-1	The maximum number of keywords to keep.
minKeywords	int	1	The minimum number of keywords per class.
collapseToken	boolean	False	If true, all identical tokens are collapsed.
extraPrefixAnnotations	string		The optional list of prefix annotations to keep (comma-separated).
extraAnnotationsMinTrustLevel	int	100	The minimum trust level to keep an extra annotation.
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<HierarchicalClassifier resourceDir="" annotationName="" language="en" excludedLanguages="" addAnnotationsOnKeywords="false" maxAnnotations="-1" minTrustLevel="0" maxKeywords="-1" minKeywords="1" collapseToken="false" extraPrefixAnnotations="" extraAnnotationsMinTrustLevel="100" name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></HierarchicalClassifier>

RulesMatcher com.exalead.indexing.analysis.v10.RulesMatcher

A RuleMatcher applies a rule engine on the textual content of the DocumentChunks.

The rules are defined in a separate XML 'resourceFile' and are a combination of regular expression, word matching and boolean operators over content.
Annotations generated:

The matching rule defined in the XML specifies the annotation to generate

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
resourceFile string
language iso code

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
resourceFile	string	-	URL for the resource (data://, file;// or resource://).
language	iso code	-	Language for which this processor is activated.

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<RulesMatcher name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceFile="" language="en" ></RulesMatcher>

RelatedTerms com.exalead.indexing.analysis.v10.RelatedTerms

Extracts all possible related terms.

Only one instance of this processor may exist per input context.
Annotations generated:

"relatedTerm": RelatedTerm identifier (stored in the dictionary and in the index)
"relatedTermDisplay": display form of the RelatedTerm (stored in the dictionary)

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
relatedTermsMinSpan	int	3	Minimum number of words (excluding stop words) in an automatically extracted term (not applicable to allow list).
relatedTermsMaxSpan	int	6	Maximum number of words (excluding stop words) in an automatically extracted term (not applicable to allow list).
maxRelatedTermsPerDoc	int	64	The maximum number of related terms per document.
keepLongestMatch	boolean	True	Keeps only the longest term when several overlap. For example, if you have 5 tokens ('a', 'b', 'c', 'd', 'e') and 4 related terms 'a', 'a-c', 'b-c-d' and 'd-e', this option will only keep 'b-c-d' and remove all other related terms.
dictionaryName	string	-	Name of the dictionary populated by terms extracted by this processor. If null, use the default dictionary.
preprocResourceDir	string	-	URL for the resource of the related terms preprocessor (data://, file;// or resource://). If null, we use the standard preprocessor of the product.
whitelistResource	string	-	Path to a related terms allow list resource.
blacklistResource	string	-	Path to a related terms blocklist resource.
withPartOfSpeech	boolean	True	Adds a PartOfSpeechTagger to the list of processors automatically. Improves quality of automatically extracted terms.

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<RelatedTerms name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" relatedTermsMinSpan="3" relatedTermsMaxSpan="6" maxRelatedTermsPerDoc="64" keepLongestMatch="true" preprocResourceDir="" withPartOfSpeech="true" ></RelatedTerms>

PartOfSpeechTagger com.exalead.indexing.analysis.v10.PartOfSpeechTagger

A PartOfSpeechTagger detects the part of speech for each word in the text of Document Chunks.

It improves the quality of other processors, such as the named entity detector or the sentiment analyzer.
Annotations generated:

"tagger"

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
resourceDir string
language string

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
resourceDir	string	-	URL for the resource (data://, file;// or resource://).
language	string	-	Languages for which the processor is activated; if no language is specified, the processor is activated for all languages.

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<PartOfSpeechTagger name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="" language="en" ></PartOfSpeechTagger>

Phonetizer com.exalead.indexing.analysis.v10.Phonetizer

Creates a phonetic form for each word.

This processor is used:

as a helper for other processors (like Ontology Matcher, or Semantic Extractor), which need to perform phonetic matches.
to perform search-time phonetic analysis using the Phonetic expansion module (this creates the dictionary of phonetic forms that will be used by the expansion module at search-time).
to greatly improve the quality of spell checking.

Annotations generated:

"phonetic"

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
resourceFile string
language string

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
resourceFile	string	-	URL for the resource (data://, file;// or resource://).
language	string	-	Languages for which the processor is activated; if no language is specified, the processor is activated for all languages.

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<Phonetizer name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceFile="" language="en" ></Phonetizer>

NGramsExtractor com.exalead.indexing.analysis.v10.NGramsExtractor

Extracts normalized word-grams. N-grams are useful for spell checking and statistical processings.
Annotations generated:

"ngram"

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
min int
max int

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
min	int	2	Minimum ngram size
max	int	3	Maximum ngram size

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<NGramsExtractor name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" min="2" max="3" ></NGramsExtractor>

Lemmatizer com.exalead.indexing.analysis.v10.Lemmatizer

Creates a lemmatized form for each word (nouns and adjectives only).

This processor is mostly used as a helper for other processors (like Ontology Matcher, or Semantic Extractor), which need to perform lemmatized matches.

Annotations generated:

"lemma": normalized lemmatized form of the word (singular/masculine)
"lemma_lowercase": lemmatized form of the word (singular/masculine)
"fsingular": normalized singular form of the word
"fsingular_lowercase": singular form of the word
"masculine": if the token is a masculine word
"feminine": if the token is a feminine word
"neuter": if the token is neuter
"singular": if the word is singular
"plural": if the word is plural
"unnumbered": if the word is unnumbered
"pos": the static Part of Speech

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
resourceDir string
language string
lemmatizeNormalizedAnnotations boolean

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
resourceDir	string	-	URL for the resource (data://, file;// or resource://).
language	string	-	Languages for which the processor is activated; if no language is specified, the processor is activated for all languages.
lemmatizeNormalizedAnnotations	boolean	False

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<Lemmatizer name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="" language="en" lemmatizeNormalizedAnnotations="false" ></Lemmatizer>

AcronymDetector com.exalead.indexing.analysis.v10.AcronymDetector

Detects acronyms like 'o.n.u' and extracts 'onu'.
'.', '-' and ' ' are the standard acronym separators.

Custom alphanumeric separators can be added with the "separators" attribute.
Annotations generated:

"acronym"

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
addNormalizerAnnotation boolean
separators string
language string
strict boolean

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
addNormalizerAnnotation	boolean	False
separators	string	-	List of allowed separators chars separated by ',' (can only be alphanumerical, for example, 'and' to handle '1 and 1')
language	string	-	Languages for which the processor is activated; if no language is specified, the processor is activated for all languages.
strict	boolean	True	In strict mode, the only separator is dot.

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<AcronymDetector name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" addNormalizerAnnotation="false" separators="" language="en" strict="true" ></AcronymDetector>

Normalizer com.exalead.indexing.analysis.v10.Normalizer

Normalizes all tags given in input tags field.
Annotations generated:

"NORMALIZE"
"LOWERCASE"

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
inputTags string
trustLevel int
transliteration boolean

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
inputTags	string	-	Normalize all tags of "inputTags" (comma-separated list of tags).
trustLevel	int	100
transliteration	boolean	True	When normalizing, convert some characters to their latin equivalent

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<Normalizer name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" inputTags="" trustLevel="100" transliteration="true" ></Normalizer>

FarTextAnnotator com.exalead.indexing.analysis.v10.FarTextAnnotator

A FarTextAnnotator annotates alphanumeric tokens with 'annotation' if they are farther than 'startOffset'

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
startOffset int
annotation string

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
startOffset	int	8192
annotation	string	fartext

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<FarTextAnnotator name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" startOffset="8192" annotation="fartext" ></FarTextAnnotator>

Chunker com.exalead.indexing.analysis.v10.Chunker

A chunker detects noun groups.
Annotations generated:

"gadv": adverbal group
"gadj": adjectival group
"gnoun": noun group
"gverb": verbal group
"gprep": prepositional group

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
resourceDir string
language string

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
resourceDir	string	-	URL for the resource (data://, file;// or resource://).
language	string	-	Languages for which the processor is activated; if no language is specified, the processor is activated for all languages.

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<Chunker name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="" language="en" ></Chunker>

SentimentAnalyzer com.exalead.indexing.analysis.v10.SentimentAnalyzer

Analyzes the nouns and adjectives present in the text. It detects topics and annotates the document with:

a global rating of good, bad or neutral
a rating per topic
the adjective(s) used in the document

@require Tokenizer, Lemmatizer, PartOfSpeechTagger, RelatedTermsPreprocessor, RelatedTermsExtractor, NamedEntitiesMatcher, Chunker @annotations "sentiment" annotation on nouns with a modulated ("really", "quite", "not") appreciation @document-annotations "document_sentiment" annotation on the document with either "good", "bad" or "neutral" and a confidence ratio @attribute resourceDir (defaults to resource://sentiment/sentiment.bin): @attribute language (defaults to all supported languages): @attribute summarize (defaults to false): @attribute annotateGlobally (defaults to false): @attribute showPackage (defaults to false): @attribute packageCount (defaults to false): @attribute nounPackage DEPRECATED (defaults to true): @attribute ignorePartOfSpeech (defaults to false):

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
resourceDir string
language iso code
annotateGlobally boolean
annotatePronouns boolean
ignorePartOfSpeech boolean
ignoreRelatedTerms boolean
legacyAnnotations boolean
notApplicableAnnotations boolean
normalizeTrustLevels boolean
nounPackage boolean
packageCount boolean
showPackage boolean
suggest boolean
summarize boolean
suggestOutput string

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
resourceDir	string	-	URL for the resource (data://, file;// or resource://).
language	iso code	-
annotateGlobally	boolean	False
annotatePronouns	boolean	False
ignorePartOfSpeech	boolean	False
ignoreRelatedTerms	boolean	False
legacyAnnotations	boolean	False
notApplicableAnnotations	boolean	True
normalizeTrustLevels	boolean	True
nounPackage	boolean	True
packageCount	boolean	False
showPackage	boolean	False
suggest	boolean	False
summarize	boolean	False
suggestOutput	string	-

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<SentimentAnalyzer name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="" language="en" annotateGlobally="false" annotatePronouns="false" ignorePartOfSpeech="false" ignoreRelatedTerms="false" legacyAnnotations="false" notApplicableAnnotations="true" normalizeTrustLevels="true" nounPackage="true" packageCount="false" showPackage="false" suggest="false" summarize="false" suggestOutput="" ></SentimentAnalyzer>

FastRulesMatcher com.exalead.indexing.analysis.v10.FastRulesMatcher

Annotates a document using a set of XML rules, compiled for efficiency.

The rules are described with the query language using the AND, OR and NOT operators, as well as 'context' matching operators.

The rules can also match whole chunks (and not just words) per regular expressions.
Annotations generated:

Depending on the resources (See FastRulesDefinition)

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
resourceDir string
allowsExprStartingBySeparators boolean

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
resourceDir	string	-	Directory containing the matcher resources. Must not be empty.
allowsExprStartingBySeparators	boolean	False	If you have expressions starting with a separator (",", ";", "&", ...), then you must set this option to true.

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<FastRulesMatcher name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="" allowsExprStartingBySeparators="false" ></FastRulesMatcher>

SnowballStemmer com.exalead.indexing.analysis.v10.SnowballStemmer

Creates the stemmed form of each word. This uses the Snowball stemming algorithms.

This processor is mostly used as a helper for other processors (like Ontology Matcher, or Semantic Extractor), which need to perform stemmed matches.

Annotations generated:

"stem"

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<SnowballStemmer name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></SnowballStemmer>

DebugSemanticProcessor com.exalead.indexing.analysis.v10.DebugSemanticProcessor

Dumps all annotated tokens in the specified format on Standard Output, or in @c outputFile. (Log of the 'Analysis' process)

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputFile string
format enum(html, xml)

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
outputFile	string	-
format	enum(html, xml)	html	Output format.

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<DebugSemanticProcessor name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputFile="" format="html" ></DebugSemanticProcessor>

SQI (deprecated) com.exalead.indexing.analysis.v10.SQI

A SemanticProcessor applies semantic processing on the textual content of the DocumentChunks.

A Semantic Processor creates SemanticAnnotations on tokens.

These SemanticAnnotations can then be used in the Mapping.

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
resourceDir string
breakOnSentence boolean
breakOnParagraph boolean
breakOnLine boolean
matchAllRules boolean

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
resourceDir	string	-	URL for the resource (data://, file:// or resource://)
breakOnSentence	boolean	False	If true, there will be maximum one match per sentence, and no match for inter-sentence. This option will add the SentenceFinder automatically.
breakOnParagraph	boolean	True	If true, there will be maximum one match per paragraph, and no match for inter-paragraph.
breakOnLine	boolean	False	If true, there will be maximum one match per line, and no match for inter-line.
matchAllRules	boolean	True	If true, it returns the full list of matched rules. If false, it returns the first matched rule only.

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<SQI name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="" breakOnSentence="false" breakOnParagraph="true" breakOnLine="false" matchAllRules="true" ></SQI>

SemanticExtractor com.exalead.indexing.analysis.v10.SemanticExtractor

The resource describes the features to extract, with their term, type and range for numerical values according to a set of rules.
Annotations generated:

Depending on the resource (See SemanticExtractorConfig)

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
resourceDir string
prefix string
breakOnSentence boolean
breakOnParagraph boolean
breakOnLine boolean
matchAllRules boolean
language iso code
annotateUnusedTokensWith string
overlappingMatches boolean

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
resourceDir	string	-	URL of the compiled semantic extractor file. Use the format data://, file:// or resource://.
prefix	string	-	Output annotations prefix
breakOnSentence	boolean	False	If true, there will be maximum one match per sentence, and no match for inter-sentence. This option will add the SentenceFinder automatically.
breakOnParagraph	boolean	True	If true, there will be maximum one match per paragraph, and no match for inter-paragraph.
breakOnLine	boolean	False	If true, there will be maximum one match per line, and no match for inter-line.
matchAllRules	boolean	True	If true, it returns the full list of matched rules. If false, it returns only the first matched rule.
language	iso code	-	Language for which the extractor is activated. If null, all languages are activated.
annotateUnusedTokensWith	string	-	Used in the context of query rewriting by the Semantic Query Analyzer.
overlappingMatches	boolean	True	If true, reports all matches even if their locations overlap. Only makes sense when matchAllRules is true.

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<SemanticExtractor name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="" prefix="" breakOnSentence="false" breakOnParagraph="true" breakOnLine="false" matchAllRules="true" language="en" overlappingMatches="true" ></SemanticExtractor>

ProximityProcessor com.exalead.indexing.analysis.v10.ProximityProcessor

A proximity processor detects and annotates pieces of text where several annotations occur given distance constraints.
Possible constraints (non mutually exclusive):

token window size
distance between annotations
sentence/paragraph scope

Annotations generated:

Depending on the resource (See Proximity)

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
resourceFile string

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
resourceFile	string	-	URL for the resource (data://, file:// or resource://)

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<ProximityProcessor name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceFile="" ></ProximityProcessor>

AnnotationManager com.exalead.indexing.analysis.v10.AnnotationManager

An annotation manager implements basic operations on annotations: copy/removal/selection according to a number of conditions like:

Removal of overlaping annotations
Selection of the most frequent annotations
Copy of an annotation unless block listed

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
resourceFile string

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
resourceFile	string	-	URL for the resource (data://, file:// or resource://)

Nested elements

fromDataModelSemanticProcessor

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor

XML representation

<AnnotationManager name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceFile="" ></AnnotationManager>

CustomSemanticProcessor com.exalead.indexing.analysis.v10.CustomSemanticProcessor

A custom semantic processor allows you to plug in custom code in the semantic pipeline.

Parent objects

AnalysisPipeline
SemanticPipeDocumentProcessor
SemanticProcessorModule
SemanticQueryAnalysisConfig

Attributes

name string
contexts string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
classId string

Name	Type	Default value	Description
name	string	-	Name of the Semantic Processor. This name is only used for tracing and debugging purposes.
contexts	string	-	Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied. If this list is empty, all DocumentChunks are processed.
dataModelState	string	-	Is this semantic processor managed by a data model? @enum{null,auto,customized, error}. If null, this semantic processor is not related to the data model. If "auto", this semantic processor is auto-generated by the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disables the DocumentProcessor
classId	string	-	The specified class must implement the {@code com.exalead.indexing.analysis.semantic.CustomSemanticProcessorInterface} Exascript interface.

Nested elements

fromDataModelSemanticProcessor
KeyValue*

Type	Wrapper	Description
SemanticProcessor	fromDataModel	If dataModelState is "customized", you will find here the original semantic processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
KeyValue*	-

XML representation

<CustomSemanticProcessor name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" classId="" ></CustomSemanticProcessor>

PrintfValues com.exalead.indexing.analysis.v10.PrintfValues

Prints textual content of DocumentChunks according to a formatting string.

This string contains variables in one of the 3 following formats:
1. $(name), the name of a context: output is the textual content of this context.
2. $/name:regexp/, the name of a context whose chunks must match the regexp: output is the piece of text that has matched.
3. $/name:regexp:format/, the name of a context whose chunks must match the regexp: output is defined by a sed-like format referencing the regexp subexpressions.

Warning: In the regexp and format parts, colons and slashes must be escaped with a backslash.

For example : "$(firstname) $(lastname) : $/age:[0-9]+/ $/date:([0-9]{2})([0-9]{2})([0-9]{4}):day=\\1 month=\\2 year=\\3"

Warning: The context used in this method cannot be produced by another processor. It should come from the connector.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
formattingString string
outputContext string
strict boolean

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
formattingString	string	-	This string contains variables in one of the 3 following formats: 1. $(name), the name of a context: output is the textual content of this context. 2. $/name:regexp/, the name of a context whose chunks must match the regexp: output is the piece of text that has matched. 3. $/name:regexp:format/, the name of a context whose chunks must match the regexp: output is defined by a sed-like format referencing the regexp subexpressions. Warning: Colons and slashes must be escaped with a backslash. For example : "$(firstname) $(lastname) : $/age:[0-9]+/ $/date:([0-9]{2})([0-9]{2})([0-9]{4}):day=\\1 month=\\2 year=\\3"
outputContext	string	-	ContextName to be associated with the DocumentChunk created for each generated value.
strict	boolean	True	Forces all the manipulated contexts found to process.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<PrintfValues name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" formattingString="" outputContext="" strict="true" ></PrintfValues>

RenameUnmappedContexts com.exalead.indexing.analysis.v10.RenameUnmappedContexts

This Document Processor changes the ContextName for all DocumentChunks associated with a ContextName that does not have a Mapping Configuration.

This avoids extensive renaming using RenameContext.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	The new ContextName for DocumentChunks with an unmapped ContextName.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<RenameUnmappedContexts name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" ></RenameUnmappedContexts>

NewChunk com.exalead.indexing.analysis.v10.NewChunk

Creates a new DocumentChunk with 'outputContext' as ContextName, and textual content specified in 'value'.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string
value string
partName string
language iso code

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	The ContextName used for newly created chunks.
value	string	-	The value used for newly created chunks.
partName	string	-	The part to which the chunk should belong. If nothing is specified here, the chunk will be handled as a global chunk.
language	iso code	-	Language of the chunk, as an ISO639 code.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<NewChunk name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" value="" partName="" language="en" ></NewChunk>

UniformRandomContextGenerator com.exalead.indexing.analysis.v10.UniformRandomContextGenerator

Adds a new DocumentChunk for one document out of 'modulo' documents processed.

The textual content of the DocumentChunk is picked out of the list specified in 'values', with a uniform distribution. @descr

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string
modulo int

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	The ContextName used for newly created chunks.
modulo	int	-	Inverse probability of adding the new chunk. Must be a strictly positive integer.

Nested elements

fromDataModelDocumentProcessor
valuesStringValue*
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
StringValue*	values	List of possible values.
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<UniformRandomContextGenerator name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" modulo="" ></UniformRandomContextGenerator>

ZipfRandomContextGenerator com.exalead.indexing.analysis.v10.ZipfRandomContextGenerator

Adds a new document chunk for one document out of 'modulo'.

The textual content of the document chunk is picked out of the list specified in 'values', with a non-uniform discrete Zipf distribution. @descr

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

outputContext string
modulo int
name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
parameter double

Name	Type	Default value	Description
outputContext	string	-	The ContextName used for newly created chunks.
modulo	int	-	Inverse probability of adding the new chunk. Must be a strictly positive integer.
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
parameter	double	-	The exponent characterizing the distribution.

Nested elements

valuesStringValue*
fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
StringValue*	values	List of possible values.
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<ZipfRandomContextGenerator outputContext="" modulo="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" parameter="" ></ZipfRandomContextGenerator>

HTMLRelevantContentExtractor com.exalead.indexing.analysis.v10.HTMLRelevantContentExtractor

The HTMLRelevantContentExtractor extracts the most relevant parts of an HTML document.

Generally, the relevant part of an HTML document is the article on the middle of the page. The header, the footer and the menus are often the same on all pages and should not be indexed.

The extraction can be tuned using different attributes. @csh AC_HTMLRELEVANT_CONTENT_ID

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
relevantChunkContext	string	relevantcontent	Relevant text chunks will be copied in this context.
newContextName	string	relevantcontent	Deprecated, use 'relevantChunkContext'.
irrelevantChunkContext	string	excludedcontent	Irrelevant text chunks will be copied in this context.
retrieveFieldContext	string	htmlcontent	Original text chunks will be moved in this context.
irrelevantChunkAnnotation	string	-	If set, the HTMLRelevantContentExtractor will annotate each irrelevant chunk with an annotation.
minScore	int	15	Internally, the HTMLRelevantContentExtractor assigns a score to each chunk of its input. Use 'minScore' to keep only chunks having a score greater than a value.
minParagraphWords	int	7	The minimum number of words a <p> chunk must have to be considered as a paragraph and be boosted.
minTitleWords	int	3	The minimum number of words a title must have to be boosted.
linkAllowedInTitle	boolean	True	By default, the links contained in a page title produce a malus, this can be disabled.
paragraphBoost	int	10	Each time a paragraph will be detected, the score will be increased by this value.
maxWordInLinkRatio	int	2	The maximum allowed ratio of words contained in links in a chunk of text.
titleBoost	int	5	Each time a title will be detected, the score will be increased by this value.
classBoost	int	10	Each time a CSS class included in 'idsAndClassesToKeep' will be detected, the score will be increased by this value.
keepOnlyBestChunk	boolean	False	If true, the 'relevantcontent' will only be composed by the main article of the page.
skipBlockquotes	boolean	False	Ability to skip HTML blockquote tags.
skipPre	boolean	False	Ability to skip HTML pre tags.
keepImages	boolean	False	If true, the HTML image annotations will be kept in the new context.

Nested elements

fromDataModelDocumentProcessor
idsAndClassesToIgnoreStringValue*
idsAndClassesToKeepStringValue*
annotationsToCopyStringValue*
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
StringValue*	idsAndClassesToIgnore	The list of CSS classes and HTML ids to ignore.
StringValue*	idsAndClassesToKeep	The list of CSS classes and HTML ids to boost.
StringValue*	annotationsToCopy	The list of annotations to keep in the new context.
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<HTMLRelevantContentExtractor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" relevantChunkContext="relevantcontent" newContextName="relevantcontent" irrelevantChunkContext="excludedcontent" retrieveFieldContext="htmlcontent" irrelevantChunkAnnotation="" minScore="15" minParagraphWords="7" minTitleWords="3" linkAllowedInTitle="true" paragraphBoost="10" maxWordInLinkRatio="2" titleBoost="5" classBoost="10" keepOnlyBestChunk="false" skipBlockquotes="false" skipPre="false" keepImages="false" ></HTMLRelevantContentExtractor>

HTMLTableExtractor com.exalead.indexing.analysis.v10.HTMLTableExtractor

Extracts all HTML tables having minColumnsRequired < nb cols < maxColumnsRequired and duplicates them in context {@link newContextName}

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
newContextName	string	webtable	The ContextName used for newly created chunks.
minColumnsRequired	int	2	The minimum number of columns required to extract.
maxColumnsRequired	int	2147483647	The maximum number of columns required to extract.
concatenateRows	boolean	False	Concatenates all rows.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<HTMLTableExtractor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" newContextName="webtable" minColumnsRequired="2" maxColumnsRequired="2147483647" concatenateRows="false" ></HTMLTableExtractor>

DiscardDocument com.exalead.indexing.analysis.v10.DiscardDocument

DEPRECATED. It does not stop the processing of the document. To do so, add a custom document processor with the following code:

document.setProcessingFlag(Operation.DISCARD_AND_DELETE);
((AnalysisDocumentProcessingContext) context).stopProcessingAfterCurrentProcessor();

@desc Discards documents from the pipeline.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
logDiscardedDocuments boolean

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
logDiscardedDocuments	boolean	False	If true, the URI of each discarded documents is logged on each analysis process log file.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<DiscardDocument name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" logDiscardedDocuments="false" ></DiscardDocument>

ReplaceContextNames com.exalead.indexing.analysis.v10.ReplaceContextNames

Replaces the first matching substring of context names with the given replacement.

For example, inputSubstring="abc" and outputReplacement="bar" will rename context abcdef to bardef and somethingabcstuff to somethingbarstuff

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
inputSubstring string
outputReplacement string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
inputSubstring	string	-	The piece of string to be replaced.
outputReplacement	string	-	The replacement string.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<ReplaceContextNames name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" inputSubstring="" outputReplacement="" ></ReplaceContextNames>

HTMLCSSSelector com.exalead.indexing.analysis.v10.HTMLCSSSelector

Deletes all text chunks that are not annotated with a class or an id specified in {@link classes} or {@link ids}

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor

Nested elements

fromDataModelDocumentProcessor
classesStringValue*
idsStringValue*
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
StringValue*	classes
StringValue*	ids
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<HTMLCSSSelector name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></HTMLCSSSelector>

HTMLCSSExtractor com.exalead.indexing.analysis.v10.HTMLCSSExtractor

Extracts all text chunks annotated with a class or an id specified in {@link classes} or {@link ids}, and duplicates them in context {@link outputContext}

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	ContextName to be associated with the DocumentChunk created for each new context.

Nested elements

fromDataModelDocumentProcessor
classesStringValue*
idsStringValue*
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
StringValue*	classes	List of classes used to determine whether a chunk must be duplicated.
StringValue*	ids	List of ids used to determine whether a chunk must be duplicated.
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<HTMLCSSExtractor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" ></HTMLCSSExtractor>

DataModelClassResolver com.exalead.indexing.analysis.v10.DataModelClassResolver

This processor takes the value of the "datamodel_class" papi directive to determine the DataModelClass of the document.

If this directive is not found, we assume this is the default class.

If this is not the default class, all metas corresponding to an existing DataModelProperty are prefixed with the type of the class declaring the property (it may be a superclass of the class).

For the processors following this processor in the pipeline, you must refer to the Data Model property by prefixing it with its class name. For processors preceding this processor in the pipeline, use the meta name only (without prefix).

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<DataModelClassResolver name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></DataModelClassResolver>

SetDefaultValue com.exalead.indexing.analysis.v10.SetDefaultValue

This processor looks for specified contexts. If they are not present in document, they are created with a configured value.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor

Nested elements

fromDataModelDocumentProcessor
AcceptCondition
KeyValue*

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.
KeyValue*	-

XML representation

<SetDefaultValue name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></SetDefaultValue>

CustomDocumentProcessor com.exalead.indexing.analysis.v10.CustomDocumentProcessor

A Custom document processor allows you to plug in custom code packaged as a CVPlugin into the document processing pipeline.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
classId string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
classId	string	-	Class identifier. The specified class must implement the com.exalead.pdoc.analysis.CustomDocumentProcessor Java Interface.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition
KeyValue*

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.
KeyValue*	-

XML representation

<CustomDocumentProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" classId="" ></CustomDocumentProcessor>

InferFileExtension com.exalead.indexing.analysis.v10.InferFileExtension

When the file_extension meta is not present, finds the file extension based on the file name or the mime meta (if one of these two is present).

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<InferFileExtension name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></InferFileExtension>

InsertCurrentDate com.exalead.indexing.analysis.v10.InsertCurrentDate

Adds the current date in an output context

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
outputContext string
format string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
outputContext	string	-	The ContextName used for newly created chunks.
format	string	-	Either "unixts" or a SimpleDateFormat specification

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<InsertCurrentDate name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" format="" ></InsertCurrentDate>

XpathExtractor com.exalead.indexing.analysis.v10.XpathExtractor

Extraction is performed for the following data types:

text/html. HTML Files.
application/xml. XML Files.

Warning: To put before the NativeTextExtractor because the 'bytes' of each Document Binary Part are deleted by the NativeTextExtractor.

Limitations: This extractor handles node set and string functions. Not number and boolean. You can use number or boolean functions inside your xpath //img[starts-with(@src, "http://")] because this xpath return a set of nodes (<img>) but xpath count(//img) doesn't work because it returns a number. @csh AC_XPATH_EXTRACTOR_ID

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
htmlParserToUse enum(htmlCleaner, tagSoup)

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
htmlParserToUse	enum(htmlCleaner, tagSoup)	htmlCleaner	HTML parser to use in priority.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition
XpathRule*

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.
XpathRule*	-

XML representation

<XpathExtractor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" htmlParserToUse="htmlCleaner" ></XpathExtractor>

XpathRule com.exalead.indexing.analysis.v10.XpathRule

No documentation for this element.

Parent objects

XpathExtractor

Attributes

metaName string
xpath string
concatMutiMatch boolean

Name	Type	Default value	Description
metaName	string	-
xpath	string	-
concatMutiMatch	boolean	True	Concatenates all results in a value when the xpath expression returns several results. Otherwise, it adds each match in a multiValued meta. It should be unselected if you want each node returned by xpath expression in different value (like list of item).

XML representation

<XpathRule metaName="" xpath="" concatMutiMatch="true" ></XpathRule>

XpathFragmentExtractor com.exalead.indexing.analysis.v10.XpathFragmentExtractor

Input: All DocumentChunks associated with the specified 'inputContext' ContextNames. Input can be XML or HTML fragment.

Output: DocumentChunks are created for each Xpath Fragment Rule. Each DocumentChunk is associated with the specified 'Meta name' ContextName.

Warning: To put before the NativeTextExtractor because the 'bytes' of each Document Binary Part are deleted by the NativeTextExtractor.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
inputFragmentMeta string
parserToUse enum(htmlCleaner, tagSoup, xmlParser)

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
inputFragmentMeta	string	-
parserToUse	enum(htmlCleaner, tagSoup, xmlParser)	xmlParser	Parser to use in priority.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition
XpathFragmentRule*

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.
XpathFragmentRule*	-

XML representation

<XpathFragmentExtractor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" inputFragmentMeta="" parserToUse="xmlParser" ></XpathFragmentExtractor>

XpathFragmentRule com.exalead.indexing.analysis.v10.XpathFragmentRule

No documentation for this element.

Parent objects

XpathFragmentExtractor

Attributes

metaName string
xpath string

Name	Type	Default value	Description
metaName	string	-
xpath	string	-

XML representation

<XpathFragmentRule metaName="" xpath="" ></XpathFragmentRule>

SimilarStringToPart com.exalead.indexing.analysis.v10.SimilarStringToPart

Converts the signatures in a string format from a meta to a binary part

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
version int

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
version	int	1	Specifies the version.

Nested elements

fromDataModelDocumentProcessor
valuesStringValue*
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
StringValue*	values	List of the names of the metas to parse and to transform to part.
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<SimilarStringToPart name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" version="1" ></SimilarStringToPart>

DocumentProcessorGroup com.exalead.indexing.analysis.v10.DocumentProcessorGroup

Contains a list of document processors, which are executed only if this group document processor condition matches.
It avoids condition duplication or distinct pipelines creation when several processors share the same condition.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor

Nested elements

fromDataModelDocumentProcessor
AcceptCondition
DocumentProcessor*

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.
DocumentProcessor*	-

XML representation

<DocumentProcessorGroup name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></DocumentProcessorGroup>

UnitsOfMeasurementNormalizer com.exalead.indexing.analysis.v10.UnitsOfMeasurementNormalizer

Unit of measurement detector and convertor

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
indexField string
indexFieldUnitSymbol string
suffixName string
removeContext boolean

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
indexField	string	-	The index field in which the value will be stored.
indexFieldUnitSymbol	string	-	The output unit symbol
suffixName	string	_um	Output suffix to create a new meta as output
removeContext	boolean	False	Remove contexts after processing

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<UnitsOfMeasurementNormalizer name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" indexField="" indexFieldUnitSymbol="" suffixName="_um" removeContext="false" ></UnitsOfMeasurementNormalizer>

DebugCrashProcessor com.exalead.indexing.analysis.v10.DebugCrashProcessor

Causes crashes for debugging purpose

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
type string
delay int
count int

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
type	string	exception	The crash type {@code enum(noop,exception,oom,infiniteloop,nullptr,abort,assert,segv,intdiv)}
delay	int	0	Trigger delay in seconds.
count	int	3	Trigger document count.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<DebugCrashProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" type="exception" delay="0" count="3" ></DebugCrashProcessor>

PLMExpandDocumentProcessor com.exalead.indexing.analysis.v10.PLMExpandDocumentProcessor

Treat plm metas to generate octrees and matrices for PLMExpand.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
metaMatrix string
fieldMatrix string
fieldInvMatrix string
metaCGR string
fieldOctree string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
metaMatrix	string	matrix	Name of the meta containing the matrix data.
fieldMatrix	string	matrix	Name of the target matrix field.
fieldInvMatrix	string	invmatrix	Name of the target matrix field.
metaCGR	string	cgr	Name of the meta containing the CGRs.
fieldOctree	string	octree	Name of target octree field.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<PLMExpandDocumentProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" metaMatrix="matrix" fieldMatrix="matrix" fieldInvMatrix="invmatrix" metaCGR="cgr" fieldOctree="octree" ></PLMExpandDocumentProcessor>

CGRDocumentProcessor com.exalead.indexing.analysis.v10.CGRDocumentProcessor

Calls convert to generate octrees.

Parent objects

AnalysisPipeline
DocumentProcessorGroup

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string
disabled boolean
partCGR string
partOctree string
docIdentifyer string

Name	Type	Default value	Description
name	string	-	Name of this processor. The name of a processor is used only for tracing and debugging purposes.
dataModelState	string	-	Is this document processor managed by a data model? @enum{null,auto,customized, error}. If null, this document processor is not related to a data model. If "auto", this document processor is auto-generated by a data model. If "customized", this document processor was auto-generated by a data model and then customized. If "error", there is a conflict between this document processor and the data model.
dataModelClass	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
dataModelProperty	string	-	If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
disabled	boolean	False	Disable the DocumentProcessor
partCGR	string	CGR	Name of the part containing the CGR data (tesselation).
partOctree	string	octree	Name of the part used to store the resulting octree.
docIdentifyer	string	majorid	Name of the meta identifying the document.

Nested elements

fromDataModelDocumentProcessor
AcceptCondition

Type	Wrapper	Description
DocumentProcessor	fromDataModel	If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition	-	Expresses the enablement condition of this DocumentProcessor.

XML representation

<CGRDocumentProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" partCGR="CGR" partOctree="octree" docIdentifyer="majorid" ></CGRDocumentProcessor>

FilteringConfiguration com.exalead.indexing.analysis.v10.FilteringConfiguration

Filters to apply to the words extracted from the semantic processors.

Words that do not satisfy these conditions will not be indexed.

The filtered values are expressed by the number of unicode characters.

Parent objects

AnalysisPipeline

Attributes

Name	Type	Default value	Description
wordMaxLength	int	100	Maximal length of a word. 100 is the default value.
hexCharMax	int	0	Maximal number of hexadecimal characters that can appear in a word. This filter applies only for words bigger than 'hexLengthMin'. 0 = no filter (default value)
hexLengthMin	int	0	Minimal number of characters in a word for the hexadecimal filter to apply. 0 = no filter (default value)
maxNumChars	int	0	Maximal number of characters in a word. 0 = no filter (default value)

XML representation

<FilteringConfiguration wordMaxLength="100" hexCharMax="0" hexLengthMin="0" maxNumChars="0" ></FilteringConfiguration>

LanguageConfiguration com.exalead.indexing.analysis.v10.LanguageConfiguration

Configuration of the linguistic extraction for a given language.

Parent objects

AnalysisPipeline

Attributes

language iso code
generateWordDict boolean
wordDictModulo int
maxWordDictWordsPerDocument long
maxExtractedWordLength int
spellCheckNGramMaxSize int
spellCheckNGramsDictModulo int
maxSpellCheckNGramsPerDocument long
maxExtractedSpellCheckNGramLength int
relatedTermsDictModulo int
maxRelatedTermsDictContextsPerDocument long

Name	Type	Default value	Description
language	iso code	-	The language iso code
generateWordDict	boolean	False	Extracts words for the global dictionary.
wordDictModulo	int	1	Word extraction modulo, by default extract all words.
maxWordDictWordsPerDocument	long	-1	Maximum number of words extracted per document.
maxExtractedWordLength	int	64	Maximum length a word must have to be extracted.
spellCheckNGramMaxSize	int	3	Maximum number of consecutive words for spellchecking. If the value is set to '-1', spellcheck data is not generated for this language. 0 and 1 values are illegal, default is 3.
spellCheckNGramsDictModulo	int	5	NGrams extraction modulo. It extracts 1 ngram out of 5 by default.
maxSpellCheckNGramsPerDocument	long	-1	Maximum number of ngrams extracted per document.
maxExtractedSpellCheckNGramLength	int	256	Maximum length an ngram must have to be extracted.
relatedTermsDictModulo	int	1	Submits 1 out of X documents for related terms generation. If the value is set to 0, related terms are not generated for this language.
maxRelatedTermsDictContextsPerDocument	long	-1	Maximum number of related terms extracted per document.

XML representation

<LanguageConfiguration language="en" generateWordDict="false" wordDictModulo="1" maxWordDictWordsPerDocument="-1" maxExtractedWordLength="64" spellCheckNGramMaxSize="3" spellCheckNGramsDictModulo="5" maxSpellCheckNGramsPerDocument="-1" maxExtractedSpellCheckNGramLength="256" relatedTermsDictModulo="1" maxRelatedTermsDictContextsPerDocument="-1" ></LanguageConfiguration>

MappingConfiguration com.exalead.indexing.analysis.v10.MappingConfiguration

Specifies how DocumentChunks and their SemanticAnnotations populate the index and the dictionary.

Nested elements

AnnotationMapping*
ContextMapping*
FieldIndexingLimit*
FieldRetrievalLimit*
GenerateAnnotationsForContext*
PartMapping*
WordCountMapping*

Type	Wrapper	Description
AnnotationMapping*	-	List of mappings from annotations to index targets, with associated parameters.
ContextMapping*	-	List mappings from contexts to index targets, with associated parameters.
FieldIndexingLimit*	-	Word count limits to apply to texts mapped to index fields for search.
FieldRetrievalLimit*	-	Size limits (in bytes) to apply to texts mapped to the index for retrieval.
GenerateAnnotationsForContext*	-	List of contexts to process with a semantic pipeline before mapping.
PartMapping*	-	List mappings from parts to index targets, with associated parameters.
WordCountMapping*	-	Specify where to map Word count.

XML representation

<MappingConfiguration ></MappingConfiguration>

AnnotationMapping com.exalead.indexing.analysis.v10.AnnotationMapping

Defines how SemanticAnnotations are used to populate index fields.

Parent objects

MappingConfiguration

Attributes

name string
context string
patternMatch boolean
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
name	string	-	Name of the SemanticAnnotation to map.
context	string	-	Optional input context restricting the mapping from the annotations coming from a specific context. Incompatible with the patternMatch feature.
patternMatch	boolean	False	Matches all annotations matching this pattern (must be a valid regular expression).
dataModelState	string	-	Is this annotation target managed by a data model? @enum{null,auto,customized}. If null, this annotation mapping is not related to a data model. If "auto", this annotation mapping is auto-generated by a data model If "customized", this annotation mapping was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this annotation mapping.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this annotation mapping.

Nested elements

fromDataModelAnnotationMapping
AnnotationTarget*

Type	Wrapper	Description
AnnotationMapping	fromDataModel	If dataModelState is "customized", you will find here the original annotation mapping generated by the data model. Use this to easily show what reverting to "auto" from "customized" would imply
AnnotationTarget*	-

XML representation

<AnnotationMapping name="" context="" patternMatch="false" dataModelState="" dataModelClass="" dataModelProperty="" ></AnnotationMapping>

CategoryAnnotationTarget com.exalead.indexing.analysis.v10.CategoryAnnotationTarget

CategoryAnnotationTarget is used to create a new category path inside an index category field, out of a SemanticAnnotation.

The category path is built by the concatenation of the 'categoryRoot' and the selected 'form' of the annotation.

Parent objects

AnnotationMapping

Attributes

indexField string
forcedRank long
rankBoost long
form string
dataModelState string
dataModelClass string
dataModelProperty string
categoryRoot string
categoryAppend boolean
appendAnnotationNameToRoot boolean
retrievable boolean
cleanupContent boolean
detectTitle boolean

Name	Type	Default value	Description
indexField	string	-
forcedRank	long	-
rankBoost	long	0
form	string	normalized	Which form of SemanticAnnotation value should we index? {@code enum(exact,normalized)}
dataModelState	string	-	Is this annotation target managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this AnnotationTarget.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this AnnotationTarget.
categoryRoot	string	-	Prefix used to build the CategoryPath.
categoryAppend	boolean	True	Builds the category path by concatenating the categoryRoot and the selected 'form' of the annotation. If false, only the category root will be used.
appendAnnotationNameToRoot	boolean	False	Appends the annotation name between the root and the value.
retrievable	boolean	False	If true, the category path is retrievable and can be used to create facets. If false, the category path is only searchable. (Advanced usage. langdate hacks)
cleanupContent	boolean	True	Removes trailing and leading spaces. Removes category path without AlphaNum character.
detectTitle	boolean	False	Detect words set after # in path and use them as title

Nested elements

fromDataModelAnnotationTarget

Type	Wrapper	Description
AnnotationTarget	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily see what reverting to "auto" from "customized" would imply.

XML representation

<CategoryAnnotationTarget indexField="" forcedRank="" rankBoost="0" form="normalized" dataModelState="" dataModelClass="" dataModelProperty="" categoryRoot="" categoryAppend="true" appendAnnotationNameToRoot="false" retrievable="false" cleanupContent="true" detectTitle="false" ></CategoryAnnotationTarget>

StandardAnnotationTarget com.exalead.indexing.analysis.v10.StandardAnnotationTarget

StandardAnnotationTarget is used to index the textual content of a SemanticAnnotation.

The selected 'form' of the SemanticAnnotation is used to populate an index field.

Parent objects

AnnotationMapping

Attributes

indexField string
forcedRank long
rankBoost long
form string
dataModelState string
dataModelClass string
dataModelProperty string
searchable boolean
indexLevel string
customIndexKind int
retrievable boolean
retrieveField string

Name	Type	Default value	Description
indexField	string	-
forcedRank	long	-
rankBoost	long	0
form	string	normalized	Which form of SemanticAnnotation value should we index? {@code enum(exact,normalized)}
dataModelState	string	-	Is this annotation target managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this AnnotationTarget.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this AnnotationTarget.
searchable	boolean	False	If true, the SemanticAnnotation can be searched for.
indexLevel	string	-	If searchable, index kind where data will be indexed. Can be "exact", "lowercase", "normalized" or "custom".
customIndexKind	int	-	If indexLevel = "custom", this index kind will be used.
retrievable	boolean	False	If true, the SemanticAnnotation can be retrieved.
retrieveField	string	-	The field where the SemanticAnnotation is stored for retrieval, if 'retrievable' is set to true. If null, 'indexField' will be used to store the SemanticAnnotation for retrieval.

Nested elements

fromDataModelAnnotationTarget

Type	Wrapper	Description
AnnotationTarget	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily see what reverting to "auto" from "customized" would imply.

XML representation

<StandardAnnotationTarget indexField="" forcedRank="" rankBoost="0" form="normalized" dataModelState="" dataModelClass="" dataModelProperty="" searchable="false" indexLevel="" customIndexKind="" retrievable="false" retrieveField="" ></StandardAnnotationTarget>

EnumFacetAnnotationTarget com.exalead.indexing.analysis.v10.EnumFacetAnnotationTarget

EnumFacetAnnotationTarget maps the annotations according to the specified EnumFacet.

Parent objects

AnnotationMapping

Attributes

indexField string
forcedRank long
rankBoost long
form string
dataModelState string
dataModelClass string
dataModelProperty string
enumFacetId string

Name	Type	Default value	Description
indexField	string	-
forcedRank	long	-
rankBoost	long	0
form	string	normalized	Which form of SemanticAnnotation value should we index? {@code enum(exact,normalized)}
dataModelState	string	-	Is this annotation target managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this AnnotationTarget.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this AnnotationTarget.
enumFacetId	string	-	The id of the EnumFacetAnnotationTarget this target refers to.

Nested elements

fromDataModelAnnotationTarget

Type	Wrapper	Description
AnnotationTarget	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily see what reverting to "auto" from "customized" would imply.

XML representation

<EnumFacetAnnotationTarget indexField="" forcedRank="" rankBoost="0" form="normalized" dataModelState="" dataModelClass="" dataModelProperty="" enumFacetId="" ></EnumFacetAnnotationTarget>

ContextMapping com.exalead.indexing.analysis.v10.ContextMapping

ContextMapping specifies how DocumentChunks with a given ContextName are remapped to index fields and whether they are used to populate the dictionary.

Parent objects

MappingConfiguration

Attributes

name string
prefixMatch boolean
unprefix boolean
patternMatch boolean
semantic boolean
resourceFreq int
tokenizationConfig string
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
name	string	-	ContextName of the DocumentChunks to map.
prefixMatch	boolean	False	Matches all context that starts with this prefix.
unprefix	boolean	False	Remove the prefix that was used to match.
patternMatch	boolean	False	Matches all context matching this pattern (must be a valid regular expression).
semantic	boolean	True	Performs semantic processing on the DocumentChunks processed by this mapping. If false, the textual content of the DocumentChunks will not be tokenized before indexing. This can be used to index 'exact raw values'.
resourceFreq	int	1	To extract a resource, select the frequency to add. For example, if you have a 'firstname lastname' entry, you may want to simulate a frequency of 1000 to avoid spellcheck on this entry.
tokenizationConfig	string	-
dataModelState	string	-	Is this content target managed by a data model? @enum{null,auto,customized}. If null, this context mapping is not related to a data model. If "auto", this context mapping is auto-generated by a data model If "customized", this context mapping was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this context mapping.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this ContextMapping

Nested elements

fromDataModelContextMapping
Target*

Type	Wrapper	Description
ContextMapping	fromDataModel	If dataModelState is "customized", you will find here the original context mapping generated by the data model. Use this to easily show what reverting to "auto" from "customized" would imply.
Target*	-

XML representation

<ContextMapping name="" prefixMatch="false" unprefix="false" patternMatch="false" semantic="true" resourceFreq="1" tokenizationConfig="tok0" dataModelState="" dataModelClass="" dataModelProperty="" ></ContextMapping>

CategoryContentTarget com.exalead.indexing.analysis.v10.CategoryContentTarget

CategoryContentTarget is used to map a DocumentChunk to a category.

A Category Path is created for each DocumentChunk processed.

The textual content of the DocumentChunk is used to build a Category Path.
'indexField' should be a category field (usually called 'categories' or 'security').

Parent objects

ContextMapping

Attributes

indexField string
forcedRank long
rankBoost long
categoryRoot string
categoryAppend boolean
appendContextNameToRoot boolean
form string
retrievable boolean
cleanupContent boolean
detectTitle boolean

Name	Type	Default value	Description
indexField	string	-	The indexField to populate with this content. If null, the contextName of the DocumentChunk will be used for the index field.
forcedRank	long	-	Sets the ranking value for chunks in this mapping. -1 means that the chunk internal ranking value is kept.
rankBoost	long	0	Offsets the chunk internal ranking value. Use it only when forcedRank = -1 For example, if forcedRank=-1, rankBoost=2, and the chunk internal ranking value is 4, the final rank will be 6.
categoryRoot	string	-	Builds the category path.
categoryAppend	boolean	True	Appends the textual content of the DocumentChunk to the category root. If false, only the category root will be used.
appendContextNameToRoot	boolean	False	Appends the context name between the root and the value.
form	string	normalized	The form of the word to be used to build the category path. {@code enum(exact,normalized)}
retrievable	boolean	False	Stores the category path, which enables display and navigation by category path. If false, we only index the SemanticAnnotation (Advanced usage - langdate hacks).
cleanupContent	boolean	True	If true: Removes trailing and leading unicode-spaces. Replaces all sequences of unicode-space characters by a single 'space' character. Does not map to the category in append mode if the DocumentChunk does not contain at least one unicode alpha-numerical character.
detectTitle	boolean	False	Detect words set after # in path and use them as title

XML representation

<CategoryContentTarget indexField="" forcedRank="" rankBoost="0" categoryRoot="" categoryAppend="true" appendContextNameToRoot="false" form="normalized" retrievable="false" cleanupContent="true" detectTitle="false" ></CategoryContentTarget>

DateCategoryContentTarget com.exalead.indexing.analysis.v10.DateCategoryContentTarget

CategoryContentTarget specific to date.

Parent objects

ContextMapping

Attributes

categoryRoot string
categoryAppend boolean
appendContextNameToRoot boolean
form string
retrievable boolean
cleanupContent boolean
detectTitle boolean
indexField string
forcedRank long
rankBoost long
inputFormat string

Name	Type	Default value	Description
categoryRoot	string	-	Builds the category path.
categoryAppend	boolean	True	Appends the textual content of the DocumentChunk to the category root. If false, only the category root will be used.
appendContextNameToRoot	boolean	False	Appends the context name between the root and the value.
form	string	normalized	The form of the word to be used to build the category path. {@code enum(exact,normalized)}
retrievable	boolean	False	Stores the category path, which enables display and navigation by category path. If false, we only index the SemanticAnnotation (Advanced usage - langdate hacks).
cleanupContent	boolean	True	If true: Removes trailing and leading unicode-spaces. Replaces all sequences of unicode-space characters by a single 'space' character. Does not map to the category in append mode if the DocumentChunk does not contain at least one unicode alpha-numerical character.
detectTitle	boolean	False	Detect words set after # in path and use them as title
indexField	string	-	The indexField to populate with this content. If null, the contextName of the DocumentChunk will be used for the index field.
forcedRank	long	-	Sets the ranking value for chunks in this mapping. -1 means that the chunk internal ranking value is kept.
rankBoost	long	0	Offsets the chunk internal ranking value. Use it only when forcedRank = -1 For example, if forcedRank=-1, rankBoost=2, and the chunk internal ranking value is 4, the final rank will be 6.
inputFormat	string	-	Specifies the input format of the date, in UNIX date format. Set null value for automatic detection of standard formats.

XML representation

<DateCategoryContentTarget categoryRoot="" categoryAppend="true" appendContextNameToRoot="false" form="normalized" retrievable="false" cleanupContent="true" detectTitle="false" indexField="" forcedRank="" rankBoost="0" inputFormat="" ></DateCategoryContentTarget>

StandardContentTarget com.exalead.indexing.analysis.v10.StandardContentTarget

A StandardContentTarget is used to populate a textual, numerical or date index field, with the content of a DocumentChunk.

Parent objects

ContextMapping

Attributes

Name	Type	Default value	Description
indexField	string	-	The indexField to populate with this content. If null, the contextName of the DocumentChunk will be used for the index field.
forcedRank	long	-	Sets the ranking value for chunks in this mapping. -1 means that the chunk internal ranking value is kept.
rankBoost	long	0	Offsets the chunk internal ranking value. Use it only when forcedRank = -1 For example, if forcedRank=-1, rankBoost=2, and the chunk internal ranking value is 4, the final rank will be 6.
prefixWithContext	boolean	False	Enables prefixing of all words in inverted lists by 'contextName#'.
addStartEnd	boolean	False	Enables the introduction of a word __start__ before chunk content and a word __end__ after chunk content. Only valid if Chunk is mapped with semantic=true. This option is compatible with prefixContextName: produce contextName#__start__ and contextName#__end__)
indexPrefixes	boolean	False	Enables the indexing of all prefixes for each word with a score = prefixScore. The prefix can be mapped to a specific type if you add 'prefix' in formIndexingConfig.
prefixesScore	int	1	Score given to words' prefixes. The document relevance is determined by its score. The text matching score basically represents the "distance" between a search query and a document.
maxPrefixLength	int	0	Maximum length of the extracted prefixes.
indexSuffixes	boolean	False	Enables the indexing of all suffixes for each word with a score = suffixScore. The suffix can be mapped to a specific kind if you add 'suffix' in formIndexingConfig.
suffixesScore	int	1	Score given to words' prefixes. The document relevance is determined by its score. The text matching score basically represents the "distance" between a search query and a document.
maxSuffixLength	int	0	Maximum length of the extracted suffixes.
indexSubstrings	boolean	False	Enables the indexing of all substrings for each word with a score = substringScore. The suffix can be mapped to a specific kind if you add 'substring' in formIndexingConfig.
substringsScore	int	1	Score given to extracted substrings. Document relevance is determined by its score. The text matching score basically represents the "distance" between a search query and a document.
searchable	boolean	True	Marks the content of the DocumentChunk as indexed and searchable.
retrievable	boolean	True	Enables the content of the DocumentChunk to be directly stored in the index, so that it can be retrieved. For numerical values, retrievability allows you to sort results by field.
retrieveField	string	-	The index field in which the content will be stored. If null, the content will be put in 'indexField'.
indexNormalized	boolean	True	Enables the indexing of the normalized form of the word.
indexLowercase	boolean	False	Enables the indexing of the lowercase (non-normalized) form of each token.
indexExact	boolean	False	Enables the indexing of the exact (non-normalized) form of each token.
indexSeparators	boolean	False	Enables the indexing of the index standard separators. Indexed standard separators are: paragraph, sentence and page. Standard separators indexing is required for the SPLIT operator to work with these separators.
addBreakBetweenChunks	boolean	True	Enables the introduction of a break between document chunks by the indexer. This forbids phrase matching across these chunks and has an impact on search when using double-quotes expressions or the 'NEXT' operator. For example, if a document has a "title" chunk containing "foo" and a "text" chunk containing "bar", and they are both remapped to the text field. If addBreakBetweenChunks is false, then the document will match on the query `"foo bar"`, `foo NEXT bar` If addBreakBetweenChunks is true, then the document will not match the query `"foo bar"` nor `foo NEXT bar` but will match the query `foo AND bar`

Nested elements

DecreaseRankOnAnnotation*
IncreaseRankOnAnnotation*
RankOnAnnotation*

Type	Wrapper	Description
DecreaseRankOnAnnotation*	-	List of DecreaseRankOnAnnotation
IncreaseRankOnAnnotation*	-	List of IncreaseRankOnAnnotation
RankOnAnnotation*	-	List of RankOnAnnotation

XML representation

<StandardContentTarget indexField="" forcedRank="" rankBoost="0" prefixWithContext="false" addStartEnd="false" indexPrefixes="false" prefixesScore="1" maxPrefixLength="0" indexSuffixes="false" suffixesScore="1" maxSuffixLength="0" indexSubstrings="false" substringsScore="1" searchable="true" retrievable="true" retrieveField="" indexNormalized="true" indexLowercase="false" indexExact="false" indexSeparators="false" addBreakBetweenChunks="true" ></StandardContentTarget>

DateContentTarget com.exalead.indexing.analysis.v10.DateContentTarget

DateContentTarget defines indexing a date.

Parent objects

ContextMapping

Attributes

Name	Type	Default value	Description
prefixWithContext	boolean	False	Enables prefixing of all words in inverted lists by 'contextName#'.
addStartEnd	boolean	False	Enables the introduction of a word __start__ before chunk content and a word __end__ after chunk content. Only valid if Chunk is mapped with semantic=true. This option is compatible with prefixContextName: produce contextName#__start__ and contextName#__end__)
indexPrefixes	boolean	False	Enables the indexing of all prefixes for each word with a score = prefixScore. The prefix can be mapped to a specific type if you add 'prefix' in formIndexingConfig.
prefixesScore	int	1	Score given to words' prefixes. The document relevance is determined by its score. The text matching score basically represents the "distance" between a search query and a document.
maxPrefixLength	int	0	Maximum length of the extracted prefixes.
indexSuffixes	boolean	False	Enables the indexing of all suffixes for each word with a score = suffixScore. The suffix can be mapped to a specific kind if you add 'suffix' in formIndexingConfig.
suffixesScore	int	1	Score given to words' prefixes. The document relevance is determined by its score. The text matching score basically represents the "distance" between a search query and a document.
maxSuffixLength	int	0	Maximum length of the extracted suffixes.
indexSubstrings	boolean	False	Enables the indexing of all substrings for each word with a score = substringScore. The suffix can be mapped to a specific kind if you add 'substring' in formIndexingConfig.
substringsScore	int	1	Score given to extracted substrings. Document relevance is determined by its score. The text matching score basically represents the "distance" between a search query and a document.
searchable	boolean	True	Marks the content of the DocumentChunk as indexed and searchable.
retrievable	boolean	True	Enables the content of the DocumentChunk to be directly stored in the index, so that it can be retrieved. For numerical values, retrievability allows you to sort results by field.
retrieveField	string	-	The index field in which the content will be stored. If null, the content will be put in 'indexField'.
indexNormalized	boolean	True	Enables the indexing of the normalized form of the word.
indexLowercase	boolean	False	Enables the indexing of the lowercase (non-normalized) form of each token.
indexExact	boolean	False	Enables the indexing of the exact (non-normalized) form of each token.
indexSeparators	boolean	False	Enables the indexing of the index standard separators. Indexed standard separators are: paragraph, sentence and page. Standard separators indexing is required for the SPLIT operator to work with these separators.
addBreakBetweenChunks	boolean	True	Enables the introduction of a break between document chunks by the indexer. This forbids phrase matching across these chunks and has an impact on search when using double-quotes expressions or the 'NEXT' operator. For example, if a document has a "title" chunk containing "foo" and a "text" chunk containing "bar", and they are both remapped to the text field. If addBreakBetweenChunks is false, then the document will match on the query `"foo bar"`, `foo NEXT bar` If addBreakBetweenChunks is true, then the document will not match the query `"foo bar"` nor `foo NEXT bar` but will match the query `foo AND bar`
indexField	string	-	The indexField to populate with this content. If null, the contextName of the DocumentChunk will be used for the index field.
forcedRank	long	-	Sets the ranking value for chunks in this mapping. -1 means that the chunk internal ranking value is kept.
rankBoost	long	0	Offsets the chunk internal ranking value. Use it only when forcedRank = -1 For example, if forcedRank=-1, rankBoost=2, and the chunk internal ranking value is 4, the final rank will be 6.
inputFormat	string	-	Specifies the input format of the date, in UNIX date format. Set null value for automatic detection of standard formats.

Nested elements

DecreaseRankOnAnnotation*
IncreaseRankOnAnnotation*
RankOnAnnotation*

Type	Wrapper	Description
DecreaseRankOnAnnotation*	-	List of DecreaseRankOnAnnotation
IncreaseRankOnAnnotation*	-	List of IncreaseRankOnAnnotation
RankOnAnnotation*	-	List of RankOnAnnotation

XML representation

<DateContentTarget prefixWithContext="false" addStartEnd="false" indexPrefixes="false" prefixesScore="1" maxPrefixLength="0" indexSuffixes="false" suffixesScore="1" maxSuffixLength="0" indexSubstrings="false" substringsScore="1" searchable="true" retrievable="true" retrieveField="" indexNormalized="true" indexLowercase="false" indexExact="false" indexSeparators="false" addBreakBetweenChunks="true" indexField="" forcedRank="" rankBoost="0" inputFormat="" ></DateContentTarget>

DecreaseRankOnAnnotation com.exalead.indexing.analysis.v10.DecreaseRankOnAnnotation

Allows you to decrease the ranking when some words are flagged by an annotation (part of speech, ontology, ...).

Parent objects

DateContentTarget
StandardContentTarget

Attributes

Name	Type	Default value	Description
annotationName	string	-	Name of the targeted annotation.
annotationValue	string	-	Value of the annotation that will trigger the decrease in ranking.
value	int	-	Number to decrease from the ranking when triggered.

XML representation

<DecreaseRankOnAnnotation annotationName="" annotationValue="" value="" ></DecreaseRankOnAnnotation>

IncreaseRankOnAnnotation com.exalead.indexing.analysis.v10.IncreaseRankOnAnnotation

Allows you to increase the ranking when some words are flagged by an annotation (part of speech, ontology, ...).

Parent objects

DateContentTarget
StandardContentTarget

Attributes

Name	Type	Default value	Description
annotationName	string	-	Name of the targeted annotation.
annotationValue	string	-	Value of the annotation that will trigger the increase in ranking.
value	int	-	Number to increase in the ranking when triggered.

XML representation

<IncreaseRankOnAnnotation annotationName="" annotationValue="" value="" ></IncreaseRankOnAnnotation>

RankOnAnnotation com.exalead.indexing.analysis.v10.RankOnAnnotation

Modifies ranking when some words are flagged by a given annotation.

Parent objects

DateContentTarget
StandardContentTarget

Attributes

Name	Type	Default value	Description
annotationName	string	-	The annotation that triggers the ranking modification.
annotationValue	string	-	The annotation value required to trigger the ranking modification.
forcedRank	int	-	The new ranking.

XML representation

<RankOnAnnotation annotationName="" annotationValue="" forcedRank="" ></RankOnAnnotation>

CustomContentTarget com.exalead.indexing.analysis.v10.CustomContentTarget

CustomerContentTarget defines indexing by a custom 'Index Kind'.

Parent objects

ContextMapping

Attributes

indexField string
forcedRank long
rankBoost long
searchable boolean
retrieveField string
retrievable boolean
indexKind int
addBreakBetweenChunks boolean

Name	Type	Default value	Description
indexField	string	-	The indexField to populate with this content. If null, the contextName of the DocumentChunk will be used for the index field.
forcedRank	long	-	Sets the ranking value for chunks in this mapping. -1 means that the chunk internal ranking value is kept.
rankBoost	long	0	Offsets the chunk internal ranking value. Use it only when forcedRank = -1 For example, if forcedRank=-1, rankBoost=2, and the chunk internal ranking value is 4, the final rank will be 6.
searchable	boolean	True	If true, the content of the DocumentChunk will be indexed and searchable.
retrieveField	string	-	The index field in which the content will be stored. If null, the content will be put in 'indexField'.
retrievable	boolean	True	Stores the content of the DocumentChunk directly in the index, so that it can be retrieved. For numerical values, retrievability enables to sort results by field.
indexKind	int	-	Index 'Kind' to use for indexing content.
addBreakBetweenChunks	boolean	True	If true, the indexer introduces a break between document chunks. This forbids phrase matching across these chunks and has an impact on search when using double-quotes expressions or the 'NEXT' operator. For example, if a document has a "title" chunk containing "foo" and a "text" chunk containing "bar", and they are both remapped to the text field: If addBreakBetweenChunks is false, then the document will match on the query `"foo bar"`, `foo NEXT bar` If addBreakBetweenChunks is true, then the document will not match the query `"foo bar"` nor `foo NEXT bar` but will match the query `foo AND bar`

XML representation

<CustomContentTarget indexField="" forcedRank="" rankBoost="0" searchable="true" retrieveField="" retrievable="true" indexKind="" addBreakBetweenChunks="true" ></CustomContentTarget>

EnumFacetContentTarget com.exalead.indexing.analysis.v10.EnumFacetContentTarget

EnumFacetContentTarget maps the content according to the specified EnumFacet.

Parent objects

ContextMapping

Attributes

indexField string
forcedRank long
rankBoost long
enumFacetId string
form string

Name	Type	Default value	Description
indexField	string	-	The indexField to populate with this content. If null, the contextName of the DocumentChunk will be used for the index field.
forcedRank	long	-	Sets the ranking value for chunks in this mapping. -1 means that the chunk internal ranking value is kept.
rankBoost	long	0	Offsets the chunk internal ranking value. Use it only when forcedRank = -1 For example, if forcedRank=-1, rankBoost=2, and the chunk internal ranking value is 4, the final rank will be 6.
enumFacetId	string	-	The id of the EnumFacet this target refers to.
form	string	normalized	The form of the values for the facet stringValues {@code enum(exact,normalized)}

XML representation

<EnumFacetContentTarget indexField="" forcedRank="" rankBoost="0" enumFacetId="" form="normalized" ></EnumFacetContentTarget>

DictionaryTarget com.exalead.indexing.analysis.v10.DictionaryTarget

A DictionaryTarget specifies how a DocumentChunk or semantic annotation is processed to the dictionary.

Parent objects

ContextMapping

Attributes

dictionaryName string
words boolean
ngrams boolean
rt boolean
phonemes boolean

Name	Type	Default value	Description
dictionaryName	string	-
words	boolean	True
ngrams	boolean	False
rt	boolean	False
phonemes	boolean	False

XML representation

<DictionaryTarget dictionaryName="" words="true" ngrams="false" rt="false" phonemes="false" ></DictionaryTarget>

PartTarget com.exalead.indexing.analysis.v10.PartTarget

A PartTarget specifies how a Part is processed to populate the index.

Parent objects

ContextMapping

Attributes

indexField string

Name	Type	Default value	Description
indexField	string	-	The index field in which the content will be stored.

XML representation

<PartTarget indexField="" ></PartTarget>

FieldIndexingLimit com.exalead.indexing.analysis.v10.FieldIndexingLimit

Limits the number of words that can be retrieved from a given field.

Parent objects

MappingConfiguration

Attributes

fieldName string
maxNbWords int

Name	Type	Default value	Description
fieldName	string	-	Field to limit.
maxNbWords	int	-	Maximum number of words for this field.

XML representation

<FieldIndexingLimit fieldName="" maxNbWords="" ></FieldIndexingLimit>

FieldRetrievalLimit com.exalead.indexing.analysis.v10.FieldRetrievalLimit

Limits the size of text that can be retrieved from a given field.

In some standard configuration, a FieldRetrievalLimit on the 'text' field is set to "maxLength=4096".

This limits the size of the index on disk.

Parent objects

MappingConfiguration

Attributes

retrieveField string
maxLength int

Name	Type	Default value	Description
retrieveField	string	-	Field to limit.
maxLength	int	-	Max text size in bytes. The text will be clipped to the nearest word. Text is stored in UTF-8.

XML representation

<FieldRetrievalLimit retrieveField="" maxLength="" ></FieldRetrievalLimit>

GenerateAnnotationsForContext com.exalead.indexing.analysis.v10.GenerateAnnotationsForContext

Forces a context to be processed by the SemanticProcessor pipeline and to process semantic annotations.

Parent objects

MappingConfiguration

Attributes

name string
prefixMatch boolean
patternMatch boolean
tokenizationConfig string

Name	Type	Default value	Description
name	string	-	ContextName of the DocumentChunks to map.
prefixMatch	boolean	False	Matches any context starting with this prefix.
patternMatch	boolean	False	Matches any context matching this regular expression.
tokenizationConfig	string	-	If set, it forces the tokenization configuration to use.

XML representation

<GenerateAnnotationsForContext name="" prefixMatch="false" patternMatch="false" tokenizationConfig="tok0" ></GenerateAnnotationsForContext>

PartMapping com.exalead.indexing.analysis.v10.PartMapping

PartMapping specifies how parts are remapped to index fields.

Parent objects

MappingConfiguration

Attributes

name string
prefixMatch boolean
patternMatch boolean

Name	Type	Default value	Description
name	string	-	Name of the Part to map.
prefixMatch	boolean	False	Matches all parts that starts with this prefix.
patternMatch	boolean	False	Matches all parts matching this pattern (must be a valid regular expression).

Nested elements

PartTarget*

Type	Wrapper	Description
PartTarget*	-

XML representation

<PartMapping name="" prefixMatch="false" patternMatch="false" ></PartMapping>

WordCountMapping com.exalead.indexing.analysis.v10.WordCountMapping

Specify where to map Word count.

Parent objects

MappingConfiguration

Attributes

fromName string
toName string

Name	Type	Default value	Description
fromName	string	-	Compute the word count of this field.
toName	string	-	Store the word count to this field.

XML representation

<WordCountMapping fromName="" toName="" ></WordCountMapping>

IndexSchema com.exalead.mercury.mami.indexing.v10.IndexSchema

Configuration for an index schema.
This defines the fields actually stored in an index.

Most commonly, only one index schema is defined, and used by all build groups (for all slices).

This configuration is referenced in the BuildGroup element in 'Deployment'.

Attributes

name string
allowIntensiveDiskAccess boolean

Name	Type	Default value	Description
name	string	-
allowIntensiveDiskAccess	boolean	False	Allows intensive operations like sorting or faceting to be performed on disk (SSD should be preferred).

Nested elements

AttributeGroupStore*
FieldConfig*

Type	Wrapper	Description
AttributeGroupStore*	-
FieldConfig*	-

XML representation

<IndexSchema name="" allowIntensiveDiskAccess="false" ></IndexSchema>

AttributeGroupStore com.exalead.mercury.mami.indexing.v10.AttributeGroupStore

Configuration of an attribute group. An attribute group define how attributes should be persisted on disk.

Parent objects

IndexSchema

Attributes

id int
label string
format enum(SimpleRowOrientedStore, ItemOrientedStore)
retrievableRoles string
leafSize int

Name	Type	Default value	Description
id	int	-	A unique identifier for this attribute group.
label	string	-	A human readable name for this attribute group.
format	enum(SimpleRowOrientedStore, ItemOrientedStore)	ItemOrientedStore	Specifies how to persist the data on disk for this attribute group.
retrievableRoles	string	-	Specifies a comma-separated list of annotations to be handled in this attribute group store. Ex: @Facetable,@Sortable,@Display
leafSize	int	30720	If the format is SimpleRowOrientedStore, configures the leaf size (i.e., maximum IO size read per DID).

XML representation

<AttributeGroupStore id="" label="" format="ItemOrientedStore" retrievableRoles="" leafSize="30720" ></AttributeGroupStore>

AlphanumFieldConfig com.exalead.mercury.mami.indexing.v10.AlphanumFieldConfig

This field stores alphanumeric values (i.e., 'text', 'title').

Parent objects

IndexSchema

Attributes

ramBased boolean
multiContext boolean
fieldName string
searchable boolean
retrievable boolean
dataModelState string
dataModelClass string
dataModelProperty string
multivalued boolean
version int
maxStoredWordPosition int
maxInlineWordPositions int
useVariablePositionsEncoding boolean
storeTf boolean
bloomFilter boolean
gzip boolean
implementation enum(strbtree, trie, fsm)
nbWordsPerLeaf int
optimizePatternSearch boolean
patternSearchOptimizationType enum(ngramv1, ngramv2)

Name	Type	Default value	Description
ramBased	boolean	False	A value field must be RAM-based to perform synthesis efficiently.
multiContext	boolean	False
fieldName	string	-	The name of the field. The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
searchable	boolean	False	Allows users to query on this field (using a prefix handler).
retrievable	boolean	False	Allows the content of this field to be retrieved at query time and displayed in the search results.
dataModelState	string	-	Is this index field config managed by a data model? @enum{null,auto,customized}. If null, this is not related to a data model. If "auto", this is auto-generated by a data model. If "customized", this was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelClass that generated this field config.
dataModelProperty	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelProperty that generated this field config.
multivalued	boolean	False
version	int	0
maxStoredWordPosition	int	-	Number of words, starting from the beginning of the document, for which word positions will be stored in the index. This enables proximity ranking and position searching (NEAR, NEXT, ...) up to this number of words in the document. '0' should be used to disable position storing.
maxInlineWordPositions	int	2	Advanced setting controlling how many positions are inlined in the main data file for each word of each document.
useVariablePositionsEncoding	boolean	False	Advanced setting to choose which positions encoding algorithm should be used. Variable position encoding should be used to reduce index size when indexing big documents.
storeTf	boolean	False	Stores the number of terms of each document. This information may be used by the ranking algorithm to normalize term frequencies (as "nbTerms"). This costs a few bytes of RAM per document.
bloomFilter	boolean	False	Activates a Bloom filter per slot. This speeds up requests containing words that are not present in the field on a given slot. Disable this option if all words of the request for this field are always matching, and if you compact into big slots regularly. Enable this option if there is either a lot of misses (e.g. on the "text" field) or if you have small updates (e.g. with real-time indexing).
gzip	boolean	True	Activates content compression
implementation	enum(strbtree, trie, fsm)	fsm	Advanced configuration. Internal structure used to store the field dictionary.
nbWordsPerLeaf	int	1000	Advanced configuration. If using the strbtree structure, it configures the number of words per leaf.
optimizePatternSearch	boolean	True	Adds extra informations to the index for pattern search optimization. If false, optimizes data structures for size.
patternSearchOptimizationType	enum(ngramv1, ngramv2)	ngramv1	Select algorithm based on dictionary ngrams or indexed ngrams (untokenized text only).

Nested elements

fromDataModelFieldConfig
ListsEncoderConfig

Type	Wrapper	Description
FieldConfig	fromDataModel	If dataModelState is "customized", you will find here the original object generated by the data model. Use this to easily revert to "auto" state from "customized".
ListsEncoderConfig	-	Configuration of the inverted lists encoder. If no configuration is specified, a Rice encoder is used.

XML representation

<AlphanumFieldConfig ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" maxStoredWordPosition="" maxInlineWordPositions="2" useVariablePositionsEncoding="false" storeTf="false" bloomFilter="false" gzip="true" implementation="fsm" nbWordsPerLeaf="1000" optimizePatternSearch="true" patternSearchOptimizationType="ngramv1" ></AlphanumFieldConfig>

RiceEncoderConfig com.exalead.mercury.mami.indexing.v10.RiceEncoderConfig

No documentation for this element.

Parent objects

AlphanumFieldConfig
BinaryFieldConfig
CategoryFieldConfig
DateFieldConfig
DoubleFieldConfig
FieldConfig
GeoFieldConfig
HierarchyFieldConfig
LegacySignedFieldConfig
LegacyUnsignedFieldConfig
NumericalFieldConfig
PointFieldConfig
ReferenceFieldConfig
SignedFieldConfig
SortableFieldConfig
StandardFieldConfig
TextFieldConfig
TimeFieldConfig
UidFieldConfig
UnsignedFieldConfig
ValueFieldConfig

Attributes

Name	Type	Default value	Description
bytesPerBlock	int	1024
positionsRiceCodingParam	int	1024
dataFilesPrefetchPages	int	2
extFilesPrefetchPages	int	2

XML representation

<RiceEncoderConfig bytesPerBlock="1024" positionsRiceCodingParam="1024" dataFilesPrefetchPages="2" extFilesPrefetchPages="2" ></RiceEncoderConfig>

VarIntEncoderConfig com.exalead.mercury.mami.indexing.v10.VarIntEncoderConfig

Stores each integer in varint encoding

XML representation

<VarIntEncoderConfig ></VarIntEncoderConfig>

Apollo11EncoderConfig com.exalead.mercury.mami.indexing.v10.Apollo11EncoderConfig

Stores each integer in Apollo11 encoding

XML representation

<Apollo11EncoderConfig ></Apollo11EncoderConfig>

NoOpEncoderConfig com.exalead.mercury.mami.indexing.v10.NoOpEncoderConfig

Trivial encoder. For debugging purposes only

XML representation

<NoOpEncoderConfig ></NoOpEncoderConfig>

FastNoPosEncoderConfig com.exalead.mercury.mami.indexing.v10.FastNoPosEncoderConfig

An encoder that only stores docids, not ranks nor positions.

Parent objects

AlphanumFieldConfig
BinaryFieldConfig
CategoryFieldConfig
DateFieldConfig
DoubleFieldConfig
FieldConfig
GeoFieldConfig
HierarchyFieldConfig
LegacySignedFieldConfig
LegacyUnsignedFieldConfig
NumericalFieldConfig
PointFieldConfig
ReferenceFieldConfig
SignedFieldConfig
SortableFieldConfig
StandardFieldConfig
TextFieldConfig
TimeFieldConfig
UidFieldConfig
UnsignedFieldConfig
ValueFieldConfig

Attributes

didsPerBlock int

Name	Type	Default value	Description
didsPerBlock	int	256

XML representation

<FastNoPosEncoderConfig didsPerBlock="256" ></FastNoPosEncoderConfig>

LegacyUnsignedFieldConfig com.exalead.mercury.mami.indexing.v10.LegacyUnsignedFieldConfig

No documentation for this element.

Parent objects

IndexSchema

Attributes

ramBased boolean
multiContext boolean
fieldName string
searchable boolean
retrievable boolean
dataModelState string
dataModelClass string
dataModelProperty string
multivalued boolean
version int
bitsForValue int

Name	Type	Default value	Description
ramBased	boolean	False	A value field must be RAM-based to perform synthesis efficiently.
multiContext	boolean	False
fieldName	string	-	The name of the field. The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
searchable	boolean	False	Allows users to query on this field (using a prefix handler).
retrievable	boolean	False	Allows the content of this field to be retrieved at query time and displayed in the search results.
dataModelState	string	-	Is this index field config managed by a data model? @enum{null,auto,customized}. If null, this is not related to a data model. If "auto", this is auto-generated by a data model. If "customized", this was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelClass that generated this field config.
dataModelProperty	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelProperty that generated this field config.
multivalued	boolean	False
version	int	0
bitsForValue	int	32	Number of bits used to store numerical values.

Nested elements

fromDataModelFieldConfig
ListsEncoderConfig

Type	Wrapper	Description
FieldConfig	fromDataModel	If dataModelState is "customized", you will find here the original object generated by the data model. Use this to easily revert to "auto" state from "customized".
ListsEncoderConfig	-	Configuration of the inverted lists encoder. If no configuration is specified, a Rice encoder is used.

XML representation

<LegacyUnsignedFieldConfig ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" bitsForValue="32" ></LegacyUnsignedFieldConfig>

LegacySignedFieldConfig com.exalead.mercury.mami.indexing.v10.LegacySignedFieldConfig

No documentation for this element.

Parent objects

IndexSchema

Attributes

ramBased boolean
multiContext boolean
fieldName string
searchable boolean
retrievable boolean
dataModelState string
dataModelClass string
dataModelProperty string
multivalued boolean
version int

Name	Type	Default value	Description
ramBased	boolean	False	A value field must be RAM-based to perform synthesis efficiently.
multiContext	boolean	False
fieldName	string	-	The name of the field. The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
searchable	boolean	False	Allows users to query on this field (using a prefix handler).
retrievable	boolean	False	Allows the content of this field to be retrieved at query time and displayed in the search results.
dataModelState	string	-	Is this index field config managed by a data model? @enum{null,auto,customized}. If null, this is not related to a data model. If "auto", this is auto-generated by a data model. If "customized", this was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelClass that generated this field config.
dataModelProperty	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelProperty that generated this field config.
multivalued	boolean	False
version	int	0

Nested elements

fromDataModelFieldConfig
ListsEncoderConfig

Type	Wrapper	Description
FieldConfig	fromDataModel	If dataModelState is "customized", you will find here the original object generated by the data model. Use this to easily revert to "auto" state from "customized".
ListsEncoderConfig	-	Configuration of the inverted lists encoder. If no configuration is specified, a Rice encoder is used.

XML representation

<LegacySignedFieldConfig ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" ></LegacySignedFieldConfig>

PointFieldConfig com.exalead.mercury.mami.indexing.v10.PointFieldConfig

This type of field is used to store geographical points using either GPS coordinates (WGS84) or planar X,Y coordinates (Meter).

Parent objects

IndexSchema

Attributes

ramBased boolean
multiContext boolean
fieldName string
searchable boolean
retrievable boolean
dataModelState string
dataModelClass string
dataModelProperty string
multivalued boolean
version int
geoType enum(WGS84, Meter)
blockSize int
exact boolean

Name	Type	Default value	Description
ramBased	boolean	False	A value field must be RAM-based to perform synthesis efficiently.
multiContext	boolean	False
fieldName	string	-	The name of the field. The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
searchable	boolean	False	Allows users to query on this field (using a prefix handler).
retrievable	boolean	False	Allows the content of this field to be retrieved at query time and displayed in the search results.
dataModelState	string	-	Is this index field config managed by a data model? @enum{null,auto,customized}. If null, this is not related to a data model. If "auto", this is auto-generated by a data model. If "customized", this was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelClass that generated this field config.
dataModelProperty	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelProperty that generated this field config.
multivalued	boolean	False
version	int	0
geoType	enum(WGS84, Meter)	WGS84	Value can be one of WGS84 Meter
blockSize	int	8192
exact	boolean	True

Nested elements

fromDataModelFieldConfig
ListsEncoderConfig

Type	Wrapper	Description
FieldConfig	fromDataModel	If dataModelState is "customized", you will find here the original object generated by the data model. Use this to easily revert to "auto" state from "customized".
ListsEncoderConfig	-	Configuration of the inverted lists encoder. If no configuration is specified, a Rice encoder is used.

XML representation

<PointFieldConfig ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" geoType="WGS84" blockSize="8192" exact="true" ></PointFieldConfig>

GeoFieldConfig com.exalead.mercury.mami.indexing.v10.GeoFieldConfig

This type of field is used to store 2D geometries using either planar X,Y coordinates (Meter).

Parent objects

IndexSchema

Attributes

ramBased boolean
multiContext boolean
fieldName string
searchable boolean
retrievable boolean
dataModelState string
dataModelClass string
dataModelProperty string
multivalued boolean
version int
geoType enum(Meter)
maxBlockSize int
precision int
bboxFieldName string

Name	Type	Default value	Description
ramBased	boolean	False	A value field must be RAM-based to perform synthesis efficiently.
multiContext	boolean	False
fieldName	string	-	The name of the field. The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
searchable	boolean	False	Allows users to query on this field (using a prefix handler).
retrievable	boolean	False	Allows the content of this field to be retrieved at query time and displayed in the search results.
dataModelState	string	-	Is this index field config managed by a data model? @enum{null,auto,customized}. If null, this is not related to a data model. If "auto", this is auto-generated by a data model. If "customized", this was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelClass that generated this field config.
dataModelProperty	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelProperty that generated this field config.
multivalued	boolean	False
version	int	0
geoType	enum(Meter)	Meter	Value can be one of Meter
maxBlockSize	int	24
precision	int	6
bboxFieldName	string	-

Nested elements

fromDataModelFieldConfig
ListsEncoderConfig

Type	Wrapper	Description
FieldConfig	fromDataModel	If dataModelState is "customized", you will find here the original object generated by the data model. Use this to easily revert to "auto" state from "customized".
ListsEncoderConfig	-	Configuration of the inverted lists encoder. If no configuration is specified, a Rice encoder is used.

XML representation

<GeoFieldConfig ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" geoType="Meter" maxBlockSize="24" precision="6" bboxFieldName="" ></GeoFieldConfig>

UidFieldConfig com.exalead.mercury.mami.indexing.v10.UidFieldConfig

This field stores a unique value in order to facilitate search.

Parent objects

IndexSchema

Attributes

ramBased boolean
multiContext boolean
fieldName string
searchable boolean
retrievable boolean
dataModelState string
dataModelClass string
dataModelProperty string
multivalued boolean
version int
dictStorage enum(strbtree, trie, fsm)
bitsetThreshold int

Name	Type	Default value	Description
ramBased	boolean	False	A value field must be RAM-based to perform synthesis efficiently.
multiContext	boolean	False
fieldName	string	-	The name of the field. The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
searchable	boolean	False	Allows users to query on this field (using a prefix handler).
retrievable	boolean	False	Allows the content of this field to be retrieved at query time and displayed in the search results.
dataModelState	string	-	Is this index field config managed by a data model? @enum{null,auto,customized}. If null, this is not related to a data model. If "auto", this is auto-generated by a data model. If "customized", this was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelClass that generated this field config.
dataModelProperty	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelProperty that generated this field config.
multivalued	boolean	False
version	int	0
dictStorage	enum(strbtree, trie, fsm)	fsm	Associative array implementation.
bitsetThreshold	int	10000	Number of requested documents before switching from a dynamic array to a bitset representation.

Nested elements

fromDataModelFieldConfig
ListsEncoderConfig

Type	Wrapper	Description
FieldConfig	fromDataModel	If dataModelState is "customized", you will find here the original object generated by the data model. Use this to easily revert to "auto" state from "customized".
ListsEncoderConfig	-	Configuration of the inverted lists encoder. If no configuration is specified, a Rice encoder is used.

XML representation

<UidFieldConfig ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" dictStorage="fsm" bitsetThreshold="10000" ></UidFieldConfig>

ValueFieldConfig com.exalead.mercury.mami.indexing.v10.ValueFieldConfig

Stores alphanumerical content with an internal ordinal mapping, which makes it suitable for efficient facetting. Each term is limited to 1024 bytes.

Parent objects

IndexSchema

Attributes

Name	Type	Default value	Description
deltaRefEncodeMultivaluedValues	boolean	True	Delta ref encode multivalued values.
sortMultivaluedValues	boolean	True	Storing multivalued RAM-based values in an increasing order consumes less RAM. This must be disabled to use some advanced multivalued virtual functions.
ramBased	boolean	False	A value field must be RAM-based to perform synthesis efficiently.
multiContext	boolean	False
fieldName	string	-	The name of the field. The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
searchable	boolean	False	Allows users to query on this field (using a prefix handler).
retrievable	boolean	False	Allows the content of this field to be retrieved at query time and displayed in the search results.
dataModelState	string	-	Is this index field config managed by a data model? @enum{null,auto,customized}. If null, this is not related to a data model. If "auto", this is auto-generated by a data model. If "customized", this was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelClass that generated this field config.
dataModelProperty	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelProperty that generated this field config.
multivalued	boolean	False
version	int	0
ignorePresentBit	boolean	False	Uses and loads the present bit.
minMemberNbBits	int	5	Min number of bits for attr part for value field.
bloomFilter	boolean	False	Activates a Bloom filter per slot. This speeds up requests containing words that are not present in the field on a given slot. Disable this option if all words of the request for this field are always matching, and if you compact into big slots regularly. Enable this option if there is either a lot of misses (e.g. on the "text" field) or if you have small updates (e.g. with real-time indexing).
hashThreshold	int	128	Stores a hash value in field dictionary instead of the original data if value length is greater than this threshold.
implementation	enum(strbtree, fsm)	fsm	Advanced configuration. Internal structure used to store the field dictionary.
optimizeListsForPatternSearch	boolean	False	speed up pattern search by reducing the number of opened inverted lists at the expense of indexing time and disk space.

Nested elements

fromDataModelFieldConfig
ListsEncoderConfig

Type	Wrapper	Description
FieldConfig	fromDataModel	If dataModelState is "customized", you will find here the original object generated by the data model. Use this to easily revert to "auto" state from "customized".
ListsEncoderConfig	-	Configuration of the inverted lists encoder. If no configuration is specified, a Rice encoder is used.

XML representation

<ValueFieldConfig deltaRefEncodeMultivaluedValues="true" sortMultivaluedValues="true" ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" ignorePresentBit="false" minMemberNbBits="5" bloomFilter="false" hashThreshold="128" implementation="fsm" optimizeListsForPatternSearch="false" ></ValueFieldConfig>

TextFieldConfig com.exalead.mercury.mami.indexing.v10.TextFieldConfig

Stores alphanumerical content with an internal ordinal mapping, which makes it suitable for efficient facetting. Each term is limited to 1024 bytes.

Parent objects

IndexSchema

Attributes

Name	Type	Default value	Description
ramBased	boolean	True	A value field must be RAM-based to perform synthesis efficiently.
multiContext	boolean	False
retrievable	boolean	True
ignorePresentBit	boolean	False	Uses and loads the present bit.
minMemberNbBits	int	5	Min number of bits for attr part for value field.
bloomFilter	boolean	False	Activates a Bloom filter per slot. This speeds up requests containing words that are not present in the field on a given slot. Disable this option if all words of the request for this field are always matching, and if you compact into big slots regularly. Enable this option if there is either a lot of misses (e.g. on the "text" field) or if you have small updates (e.g. with real-time indexing).
hashThreshold	int	128	Stores a hash value in field dictionary instead of the original data if value length is greater than this threshold.
implementation	enum(strbtree, fsm)	fsm	Advanced configuration. Internal structure used to store the field dictionary.
optimizeListsForPatternSearch	boolean	False	speed up pattern search by reducing the number of opened inverted lists at the expense of indexing time and disk space.
deltaRefEncodeMultivaluedValues	boolean	True	Delta ref encode multivalued values.
sortMultivaluedValues	boolean	True	Storing multivalued RAM-based values in an increasing order consumes less RAM. This must be disabled to use some advanced multivalued virtual functions.
fieldName	string	-	The name of the field. The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
searchable	boolean	False	Allows users to query on this field (using a prefix handler).
dataModelState	string	-	Is this index field config managed by a data model? @enum{null,auto,customized}. If null, this is not related to a data model. If "auto", this is auto-generated by a data model. If "customized", this was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelClass that generated this field config.
dataModelProperty	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelProperty that generated this field config.
multivalued	boolean	False
version	int	0
storePositions	boolean	True	Store positions for seq nodes and proximity scoring.

Nested elements

fromDataModelFieldConfig
ListsEncoderConfig

Type	Wrapper	Description
FieldConfig	fromDataModel	If dataModelState is "customized", you will find here the original object generated by the data model. Use this to easily revert to "auto" state from "customized".
ListsEncoderConfig	-	Configuration of the inverted lists encoder. If no configuration is specified, a Rice encoder is used.

XML representation

<TextFieldConfig ramBased="true" multiContext="false" retrievable="true" ignorePresentBit="false" minMemberNbBits="5" bloomFilter="false" hashThreshold="128" implementation="fsm" optimizeListsForPatternSearch="false" deltaRefEncodeMultivaluedValues="true" sortMultivaluedValues="true" fieldName="" searchable="false" dataModelState="" multivalued="false" version="0" storePositions="true" ></TextFieldConfig>

ReferenceFieldConfig com.exalead.mercury.mami.indexing.v10.ReferenceFieldConfig

Stores alphanumerical content with an internal ordinal mapping, which makes it suitable for efficient facetting. Each term is limited to 1024 bytes.

Parent objects

IndexSchema

Attributes

Name	Type	Default value	Description
ramBased	boolean	True	A value field must be RAM-based to perform synthesis efficiently.
multiContext	boolean	False
retrievable	boolean	True
ignorePresentBit	boolean	False	Uses and loads the present bit.
minMemberNbBits	int	5	Min number of bits for attr part for value field.
bloomFilter	boolean	False	Activates a Bloom filter per slot. This speeds up requests containing words that are not present in the field on a given slot. Disable this option if all words of the request for this field are always matching, and if you compact into big slots regularly. Enable this option if there is either a lot of misses (e.g. on the "text" field) or if you have small updates (e.g. with real-time indexing).
hashThreshold	int	128	Stores a hash value in field dictionary instead of the original data if value length is greater than this threshold.
implementation	enum(strbtree, fsm)	fsm	Advanced configuration. Internal structure used to store the field dictionary.
optimizeListsForPatternSearch	boolean	False	speed up pattern search by reducing the number of opened inverted lists at the expense of indexing time and disk space.
deltaRefEncodeMultivaluedValues	boolean	True	Delta ref encode multivalued values.
sortMultivaluedValues	boolean	True	Storing multivalued RAM-based values in an increasing order consumes less RAM. This must be disabled to use some advanced multivalued virtual functions.
fieldName	string	-	The name of the field. The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
searchable	boolean	False	Allows users to query on this field (using a prefix handler).
dataModelState	string	-	Is this index field config managed by a data model? @enum{null,auto,customized}. If null, this is not related to a data model. If "auto", this is auto-generated by a data model. If "customized", this was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelClass that generated this field config.
dataModelProperty	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelProperty that generated this field config.
multivalued	boolean	False
version	int	0

Nested elements

fromDataModelFieldConfig
ListsEncoderConfig

Type	Wrapper	Description
FieldConfig	fromDataModel	If dataModelState is "customized", you will find here the original object generated by the data model. Use this to easily revert to "auto" state from "customized".
ListsEncoderConfig	-	Configuration of the inverted lists encoder. If no configuration is specified, a Rice encoder is used.

XML representation

<ReferenceFieldConfig ramBased="true" multiContext="false" retrievable="true" ignorePresentBit="false" minMemberNbBits="5" bloomFilter="false" hashThreshold="128" implementation="fsm" optimizeListsForPatternSearch="false" deltaRefEncodeMultivaluedValues="true" sortMultivaluedValues="true" fieldName="" searchable="false" dataModelState="" multivalued="false" version="0" ></ReferenceFieldConfig>

UnsignedFieldConfig com.exalead.mercury.mami.indexing.v10.UnsignedFieldConfig

No documentation for this element.

Parent objects

IndexSchema

Attributes

Name	Type	Default value	Description
bitsForValue	int	63	Number of bits used to store numerical values. For unsigned numerical fields, the possible values are [0; 2^N - 1], and the field values are stored on N bits. For signed fields (signed integer and double), the possible values are [-2^N, 2^N - 1], and the field values are stored on (N+1) bits.
blockSize	int	8192
deltaRefEncodeMultivaluedValues	boolean	True	Delta ref encode multivalued values.
sortMultivaluedValues	boolean	True	Storing multivalued RAM-based values in an increasing order consumes less RAM. This must be disabled to use some advanced multivalued virtual functions.
ramBased	boolean	False	A value field must be RAM-based to perform synthesis efficiently.
multiContext	boolean	False
fieldName	string	-	The name of the field. The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
searchable	boolean	False	Allows users to query on this field (using a prefix handler).
retrievable	boolean	False	Allows the content of this field to be retrieved at query time and displayed in the search results.
dataModelState	string	-	Is this index field config managed by a data model? @enum{null,auto,customized}. If null, this is not related to a data model. If "auto", this is auto-generated by a data model. If "customized", this was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelClass that generated this field config.
dataModelProperty	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelProperty that generated this field config.
multivalued	boolean	False
version	int	0

Nested elements

fromDataModelFieldConfig
ListsEncoderConfig

Type	Wrapper	Description
FieldConfig	fromDataModel	If dataModelState is "customized", you will find here the original object generated by the data model. Use this to easily revert to "auto" state from "customized".
ListsEncoderConfig	-	Configuration of the inverted lists encoder. If no configuration is specified, a Rice encoder is used.

XML representation

<UnsignedFieldConfig bitsForValue="63" blockSize="8192" deltaRefEncodeMultivaluedValues="true" sortMultivaluedValues="true" ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" ></UnsignedFieldConfig>

SignedFieldConfig com.exalead.mercury.mami.indexing.v10.SignedFieldConfig

No documentation for this element.

Parent objects

IndexSchema

Attributes

Name	Type	Default value	Description
bitsForValue	int	63	Number of bits used to store numerical values. For unsigned numerical fields, the possible values are [0; 2^N - 1], and the field values are stored on N bits. For signed fields (signed integer and double), the possible values are [-2^N, 2^N - 1], and the field values are stored on (N+1) bits.
blockSize	int	8192
deltaRefEncodeMultivaluedValues	boolean	True	Delta ref encode multivalued values.
sortMultivaluedValues	boolean	True	Storing multivalued RAM-based values in an increasing order consumes less RAM. This must be disabled to use some advanced multivalued virtual functions.
ramBased	boolean	False	A value field must be RAM-based to perform synthesis efficiently.
multiContext	boolean	False
fieldName	string	-	The name of the field. The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
searchable	boolean	False	Allows users to query on this field (using a prefix handler).
retrievable	boolean	False	Allows the content of this field to be retrieved at query time and displayed in the search results.
dataModelState	string	-	Is this index field config managed by a data model? @enum{null,auto,customized}. If null, this is not related to a data model. If "auto", this is auto-generated by a data model. If "customized", this was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelClass that generated this field config.
dataModelProperty	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelProperty that generated this field config.
multivalued	boolean	False
version	int	0

Nested elements

fromDataModelFieldConfig
ListsEncoderConfig

Type	Wrapper	Description
FieldConfig	fromDataModel	If dataModelState is "customized", you will find here the original object generated by the data model. Use this to easily revert to "auto" state from "customized".
ListsEncoderConfig	-	Configuration of the inverted lists encoder. If no configuration is specified, a Rice encoder is used.

XML representation

<SignedFieldConfig bitsForValue="63" blockSize="8192" deltaRefEncodeMultivaluedValues="true" sortMultivaluedValues="true" ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" ></SignedFieldConfig>

DoubleFieldConfig com.exalead.mercury.mami.indexing.v10.DoubleFieldConfig

Configuration of a double precision floating point number field.

Parent objects

IndexSchema

Attributes

Name	Type	Default value	Description
bitsForValue	int	63	Number of bits used to store numerical values. For unsigned numerical fields, the possible values are [0; 2^N - 1], and the field values are stored on N bits. For signed fields (signed integer and double), the possible values are [-2^N, 2^N - 1], and the field values are stored on (N+1) bits.
blockSize	int	8192
deltaRefEncodeMultivaluedValues	boolean	True	Delta ref encode multivalued values.
sortMultivaluedValues	boolean	True	Storing multivalued RAM-based values in an increasing order consumes less RAM. This must be disabled to use some advanced multivalued virtual functions.
ramBased	boolean	False	A value field must be RAM-based to perform synthesis efficiently.
multiContext	boolean	False
fieldName	string	-	The name of the field. The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
searchable	boolean	False	Allows users to query on this field (using a prefix handler).
retrievable	boolean	False	Allows the content of this field to be retrieved at query time and displayed in the search results.
dataModelState	string	-	Is this index field config managed by a data model? @enum{null,auto,customized}. If null, this is not related to a data model. If "auto", this is auto-generated by a data model. If "customized", this was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelClass that generated this field config.
dataModelProperty	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelProperty that generated this field config.
multivalued	boolean	False
version	int	0
precision	int	4	Number of relevant digits in the decimal part.

Nested elements

fromDataModelFieldConfig
ListsEncoderConfig

Type	Wrapper	Description
FieldConfig	fromDataModel	If dataModelState is "customized", you will find here the original object generated by the data model. Use this to easily revert to "auto" state from "customized".
ListsEncoderConfig	-	Configuration of the inverted lists encoder. If no configuration is specified, a Rice encoder is used.

XML representation

<DoubleFieldConfig bitsForValue="63" blockSize="8192" deltaRefEncodeMultivaluedValues="true" sortMultivaluedValues="true" ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" precision="4" ></DoubleFieldConfig>

TimeFieldConfig com.exalead.mercury.mami.indexing.v10.TimeFieldConfig

No documentation for this element.

Parent objects

IndexSchema

Attributes

deltaRefEncodeMultivaluedValues boolean
sortMultivaluedValues boolean
ramBased boolean
multiContext boolean
fieldName string
searchable boolean
retrievable boolean
dataModelState string
dataModelClass string
dataModelProperty string
multivalued boolean
version int

Name	Type	Default value	Description
deltaRefEncodeMultivaluedValues	boolean	True	Delta ref encode multivalued values.
sortMultivaluedValues	boolean	True	Storing multivalued RAM-based values in an increasing order consumes less RAM. This must be disabled to use some advanced multivalued virtual functions.
ramBased	boolean	False	A value field must be RAM-based to perform synthesis efficiently.
multiContext	boolean	False
fieldName	string	-	The name of the field. The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
searchable	boolean	False	Allows users to query on this field (using a prefix handler).
retrievable	boolean	False	Allows the content of this field to be retrieved at query time and displayed in the search results.
dataModelState	string	-	Is this index field config managed by a data model? @enum{null,auto,customized}. If null, this is not related to a data model. If "auto", this is auto-generated by a data model. If "customized", this was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelClass that generated this field config.
dataModelProperty	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelProperty that generated this field config.
multivalued	boolean	False
version	int	0

Nested elements

fromDataModelFieldConfig
ListsEncoderConfig

Type	Wrapper	Description
FieldConfig	fromDataModel	If dataModelState is "customized", you will find here the original object generated by the data model. Use this to easily revert to "auto" state from "customized".
ListsEncoderConfig	-	Configuration of the inverted lists encoder. If no configuration is specified, a Rice encoder is used.

XML representation

<TimeFieldConfig deltaRefEncodeMultivaluedValues="true" sortMultivaluedValues="true" ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" ></TimeFieldConfig>

DateFieldConfig com.exalead.mercury.mami.indexing.v10.DateFieldConfig

No documentation for this element.

Parent objects

IndexSchema

Attributes

deltaRefEncodeMultivaluedValues boolean
sortMultivaluedValues boolean
ramBased boolean
multiContext boolean
fieldName string
searchable boolean
retrievable boolean
dataModelState string
dataModelClass string
dataModelProperty string
multivalued boolean
version int

Name	Type	Default value	Description
deltaRefEncodeMultivaluedValues	boolean	True	Delta ref encode multivalued values.
sortMultivaluedValues	boolean	True	Storing multivalued RAM-based values in an increasing order consumes less RAM. This must be disabled to use some advanced multivalued virtual functions.
ramBased	boolean	False	A value field must be RAM-based to perform synthesis efficiently.
multiContext	boolean	False
fieldName	string	-	The name of the field. The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
searchable	boolean	False	Allows users to query on this field (using a prefix handler).
retrievable	boolean	False	Allows the content of this field to be retrieved at query time and displayed in the search results.
dataModelState	string	-	Is this index field config managed by a data model? @enum{null,auto,customized}. If null, this is not related to a data model. If "auto", this is auto-generated by a data model. If "customized", this was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelClass that generated this field config.
dataModelProperty	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelProperty that generated this field config.
multivalued	boolean	False
version	int	0

Nested elements

fromDataModelFieldConfig
ListsEncoderConfig

Type	Wrapper	Description
FieldConfig	fromDataModel	If dataModelState is "customized", you will find here the original object generated by the data model. Use this to easily revert to "auto" state from "customized".
ListsEncoderConfig	-	Configuration of the inverted lists encoder. If no configuration is specified, a Rice encoder is used.

XML representation

<DateFieldConfig deltaRefEncodeMultivaluedValues="true" sortMultivaluedValues="true" ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" ></DateFieldConfig>

BinaryFieldConfig com.exalead.mercury.mami.indexing.v10.BinaryFieldConfig

No documentation for this element.

Parent objects

IndexSchema

Attributes

ramBased boolean
multiContext boolean
fieldName string
searchable boolean
retrievable boolean
dataModelState string
dataModelClass string
dataModelProperty string
multivalued boolean
version int
gzip boolean

Name	Type	Default value	Description
ramBased	boolean	False	A value field must be RAM-based to perform synthesis efficiently.
multiContext	boolean	False
fieldName	string	-	The name of the field. The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
searchable	boolean	False	Allows users to query on this field (using a prefix handler).
retrievable	boolean	False	Allows the content of this field to be retrieved at query time and displayed in the search results.
dataModelState	string	-	Is this index field config managed by a data model? @enum{null,auto,customized}. If null, this is not related to a data model. If "auto", this is auto-generated by a data model. If "customized", this was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelClass that generated this field config.
dataModelProperty	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelProperty that generated this field config.
multivalued	boolean	False
version	int	0
gzip	boolean	False	Activates content compression

Nested elements

fromDataModelFieldConfig
ListsEncoderConfig

Type	Wrapper	Description
FieldConfig	fromDataModel	If dataModelState is "customized", you will find here the original object generated by the data model. Use this to easily revert to "auto" state from "customized".
ListsEncoderConfig	-	Configuration of the inverted lists encoder. If no configuration is specified, a Rice encoder is used.

XML representation

<BinaryFieldConfig ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" gzip="false" ></BinaryFieldConfig>

CategoryFieldConfig com.exalead.mercury.mami.indexing.v10.CategoryFieldConfig

Stores hierarchy content. Each term is limited to 1024 bytes.

Parent objects

IndexSchema

Attributes

fieldName string
searchable boolean
retrievable boolean
dataModelState string
dataModelClass string
dataModelProperty string
multivalued boolean
version int
ramBased boolean
implementation enum(strbtree, fsm)

Name	Type	Default value	Description
fieldName	string	-	The name of the field. The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
searchable	boolean	False	Allows users to query on this field (using a prefix handler).
retrievable	boolean	False	Allows the content of this field to be retrieved at query time and displayed in the search results.
dataModelState	string	-	Is this index field config managed by a data model? @enum{null,auto,customized}. If null, this is not related to a data model. If "auto", this is auto-generated by a data model. If "customized", this was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelClass that generated this field config.
dataModelProperty	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelProperty that generated this field config.
multivalued	boolean	False
version	int	0
ramBased	boolean	True	A value field must be RAM-based to perform synthesis efficiently.
implementation	enum(strbtree, fsm)	strbtree	Advanced configuration. Internal structure used to store the field dictionary.

Nested elements

fromDataModelFieldConfig
ListsEncoderConfig

Type	Wrapper	Description
FieldConfig	fromDataModel	If dataModelState is "customized", you will find here the original object generated by the data model. Use this to easily revert to "auto" state from "customized".
ListsEncoderConfig	-	Configuration of the inverted lists encoder. If no configuration is specified, a Rice encoder is used.

XML representation

<CategoryFieldConfig fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" ramBased="true" implementation="strbtree" ></CategoryFieldConfig>

HierarchyFieldConfig com.exalead.mercury.mami.indexing.v10.HierarchyFieldConfig

Stores hierarchy content. Each term is limited to 1024 bytes.

Parent objects

IndexSchema

Attributes

ramBased boolean
implementation enum(strbtree, fsm)
fieldName string
searchable boolean
retrievable boolean
dataModelState string
dataModelClass string
dataModelProperty string
multivalued boolean
version int

Name	Type	Default value	Description
ramBased	boolean	True	A value field must be RAM-based to perform synthesis efficiently.
implementation	enum(strbtree, fsm)	strbtree	Advanced configuration. Internal structure used to store the field dictionary.
fieldName	string	-	The name of the field. The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
searchable	boolean	False	Allows users to query on this field (using a prefix handler).
retrievable	boolean	False	Allows the content of this field to be retrieved at query time and displayed in the search results.
dataModelState	string	-	Is this index field config managed by a data model? @enum{null,auto,customized}. If null, this is not related to a data model. If "auto", this is auto-generated by a data model. If "customized", this was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelClass that generated this field config.
dataModelProperty	string	-	If dataModelState is "auto" or customized", you will find here the name of the DataModelProperty that generated this field config.
multivalued	boolean	False
version	int	0

Nested elements

fromDataModelFieldConfig
ListsEncoderConfig

Type	Wrapper	Description
FieldConfig	fromDataModel	If dataModelState is "customized", you will find here the original object generated by the data model. Use this to easily revert to "auto" state from "customized".
ListsEncoderConfig	-	Configuration of the inverted lists encoder. If no configuration is specified, a Rice encoder is used.

XML representation

<HierarchyFieldConfig ramBased="true" implementation="strbtree" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" ></HierarchyFieldConfig>

IndexingConfig com.exalead.mercury.mami.indexing.v10.IndexingConfig

No documentation for this element.

Attributes

name string

Name	Type	Default value	Description
name	string	-

Nested elements

AnalysisPolicy
CommitTriggerCondition*
ImportPolicy
IndexManagementPolicy
WriteAttributeSlotConfig*
WriteSlotConfig

Type	Wrapper	Description
AnalysisPolicy	-
CommitTriggerCondition*	-
ImportPolicy	-
IndexManagementPolicy	-
WriteAttributeSlotConfig*	-
WriteSlotConfig	-

XML representation

<IndexingConfig name="" ></IndexingConfig>

FixedThreadsAnalysisPolicy com.exalead.mercury.mami.indexing.v10.FixedThreadsAnalysisPolicy

Instantiates a fixed number of analysis threads.

Dispatches documents according to their DIDs (Document IDs) and slice.

Parent objects

IndexingConfig

Attributes

maxRAMConsumptionThreshold enum(disabled, enabled, auto)
maxRAMConsumptionMB int
nbThreads int

Name	Type	Default value	Description
maxRAMConsumptionThreshold	enum(disabled, enabled, auto)	enabled	When reaching the RAM value specified, analysis is stopped and analyzed documents are imported to the index. Then analysis starts again. Enabled: Commits when the RAM size reaches the Threshold value specified (by default, 2048 MB). Auto: Commits when the RAM size reaches 2048 MB.'
maxRAMConsumptionMB	int	2048	The maximum of non-java RAM the analyzer can allocate. Reaching this limit triggers a commit.
nbThreads	int	4	Number of threads to allocate.

XML representation

<FixedThreadsAnalysisPolicy maxRAMConsumptionThreshold="enabled" maxRAMConsumptionMB="2048" nbThreads="4" ></FixedThreadsAnalysisPolicy>

PerSliceAnalysisPolicy com.exalead.mercury.mami.indexing.v10.PerSliceAnalysisPolicy

Instantiates an analysis thread for each slice.

Dispatches documents according to their slice.

Consumes less RAM than the 'FixedThreadsAnalysisPolicy'.

Parent objects

IndexingConfig

Attributes

maxRAMConsumptionThreshold enum(disabled, enabled, auto)
maxRAMConsumptionMB int
nbThreads int

Name	Type	Default value	Description
maxRAMConsumptionThreshold	enum(disabled, enabled, auto)	enabled	When reaching the RAM value specified, analysis is stopped and analyzed documents are imported to the index. Then analysis starts again. Enabled: Commits when the RAM size reaches the Threshold value specified (by default, 2048 MB). Auto: Commits when the RAM size reaches 2048 MB.'
maxRAMConsumptionMB	int	2048	The maximum of non-java RAM the analyzer can allocate. Reaching this limit triggers a commit.
nbThreads	int	1	Uses N threads per slice.

XML representation

<PerSliceAnalysisPolicy maxRAMConsumptionThreshold="enabled" maxRAMConsumptionMB="2048" nbThreads="1" ></PerSliceAnalysisPolicy>

SameThreadAnalysisPolicy com.exalead.mercury.mami.indexing.v10.SameThreadAnalysisPolicy

Instantiates an analysis thread for each incoming PAPI thread.

Each PAPI thread analyzes its tasks synchronously.

Parent objects

IndexingConfig

Attributes

maxRAMConsumptionThreshold enum(disabled, enabled, auto)
maxRAMConsumptionMB int

Name	Type	Default value	Description
maxRAMConsumptionThreshold	enum(disabled, enabled, auto)	enabled	When reaching the RAM value specified, analysis is stopped and analyzed documents are imported to the index. Then analysis starts again. Enabled: Commits when the RAM size reaches the Threshold value specified (by default, 2048 MB). Auto: Commits when the RAM size reaches 2048 MB.'
maxRAMConsumptionMB	int	2048	The maximum of non-java RAM the analyzer can allocate. Reaching this limit triggers a commit.

XML representation

<SameThreadAnalysisPolicy maxRAMConsumptionThreshold="enabled" maxRAMConsumptionMB="2048" ></SameThreadAnalysisPolicy>

AutomaticAnalysisPolicy com.exalead.mercury.mami.indexing.v10.AutomaticAnalysisPolicy

Depending on the number of threads specified, CloudView automatically chooses the most efficient analysis policy.

Changes made in Analyze require a restart of CloudView, or at least of the indexing server process, to be taken into account.

Parent objects

IndexingConfig

Attributes

maxRAMConsumptionThreshold enum(disabled, enabled, auto)
maxRAMConsumptionMB int
nbThreads int

Name	Type	Default value	Description
maxRAMConsumptionThreshold	enum(disabled, enabled, auto)	enabled	When reaching the RAM value specified, analysis is stopped and analyzed documents are imported to the index. Then analysis starts again. Enabled: Commits when the RAM size reaches the Threshold value specified (by default, 2048 MB). Auto: Commits when the RAM size reaches 2048 MB.'
maxRAMConsumptionMB	int	2048	The maximum of non-java RAM the analyzer can allocate. Reaching this limit triggers a commit.
nbThreads	int	0	If not set or set with a multiple of 'nbSlices', it uses the 'PerSliceAnalysisPolicy'. Otherwise, it uses 'FixedThreadsAnalysisPolicy'.

XML representation

<AutomaticAnalysisPolicy maxRAMConsumptionThreshold="enabled" maxRAMConsumptionMB="2048" nbThreads="0" ></AutomaticAnalysisPolicy>

NumberOfTasksBasedCommitTriggerCondition com.exalead.mercury.mami.indexing.v10.NumberOfTasksBasedCommitTriggerCondition

Triggers a commit after the specified No. tasks has been processed.
The No. of tasks calculation is executed each time a batch of documents received, to avoid performance penalties. You might therefore have a bit more than the specified No. of tasks analyzed.

Parent objects

IndexingConfig
AggregationTriggerConditions ConsolidationConfig
CommitTriggerConditions ConsolidationConfig

Attributes

nbTasks int

Name	Type	Default value	Description
nbTasks	int	-	The number of tasks

XML representation

<NumberOfTasksBasedCommitTriggerCondition nbTasks="" ></NumberOfTasksBasedCommitTriggerCondition>

SizeBasedCommitTriggerCondition com.exalead.mercury.mami.indexing.v10.SizeBasedCommitTriggerCondition

Triggers a commit when the Max size (MB) is reached.

Parent objects

IndexingConfig
AggregationTriggerConditions ConsolidationConfig
CommitTriggerConditions ConsolidationConfig

Attributes

maxSizeMB int

Name	Type	Default value	Description
maxSizeMB	int	-	Max size threshold in MB

XML representation

<SizeBasedCommitTriggerCondition maxSizeMB="" ></SizeBasedCommitTriggerCondition>

RAMUsageCommitTriggerCondition com.exalead.mercury.mami.indexing.v10.RAMUsageCommitTriggerCondition

Triggers a commit when RAM usage reaches the limit.

Parent objects

IndexingConfig
AggregationTriggerConditions ConsolidationConfig
CommitTriggerConditions ConsolidationConfig

Attributes

maxRAMUsageInMB int

Name	Type	Default value	Description
maxRAMUsageInMB	int	-	Max RAM usage in MB

XML representation

<RAMUsageCommitTriggerCondition maxRAMUsageInMB="" ></RAMUsageCommitTriggerCondition>

PeriodicCommitTriggerCondition com.exalead.mercury.mami.indexing.v10.PeriodicCommitTriggerCondition

Commits every N seconds after the first push order done after the last commit.

Parent objects

IndexingConfig
AggregationTriggerConditions ConsolidationConfig
CommitTriggerConditions ConsolidationConfig

Attributes

delayS long

Name	Type	Default value	Description
delayS	long	-	Time in seconds between two commits.

XML representation

<PeriodicCommitTriggerCondition delayS="" ></PeriodicCommitTriggerCondition>

InactivityCommitTriggerCondition com.exalead.mercury.mami.indexing.v10.InactivityCommitTriggerCondition

Inactivity-based condition.
This condition is triggered when:

there is no new data for the specified time period
AND at least the specified No. tasks has been analyzed.

Parent objects

IndexingConfig
AggregationTriggerConditions ConsolidationConfig
CommitTriggerConditions ConsolidationConfig

Attributes

Name	Type	Default value	Description
numberOfTasks	int	-	Minimum number of tasks to trigger a commit.
inactivityTimeS	long	-	After N seconds of no indexing activity, it is defined as inactive.
inactivityTimeMs	long	-	After N milliseconds of no indexing activity, it is defined as inactive. If both timeS and timeMs are set, wait 1000*timeS + timeMs milliseconds.

XML representation

<InactivityCommitTriggerCondition numberOfTasks="" inactivityTimeS="" inactivityTimeMs="" ></InactivityCommitTriggerCondition>

ParallelImportPolicy com.exalead.mercury.mami.indexing.v10.ParallelImportPolicy

For each analysis buffers one generation is created.

Analysis buffers are imported in parallel.

Parent objects

IndexingConfig

Attributes

nbThreads int

Name	Type	Default value	Description
nbThreads	int	8	The number of parallel import.

XML representation

<ParallelImportPolicy nbThreads="8" ></ParallelImportPolicy>

MergedImportPolicy com.exalead.mercury.mami.indexing.v10.MergedImportPolicy

All analysis buffers are merged into a single one to be imported in an unique generation.

XML representation

<MergedImportPolicy ></MergedImportPolicy>

StandardIndexManagementPolicy com.exalead.mercury.mami.indexing.v10.StandardIndexManagementPolicy

Default index (service + build) runtime configuration

Parent objects

IndexingConfig

Attributes

gcEveryS int

Name	Type	Default value	Description
gcEveryS	int	120	Trigger a GC every N seconds.

Nested elements

CommitPolicy
CompactPolicies
UploadPolicy

Type	Wrapper	Description
CommitPolicy	-	The commit policy used to configured how the index persists its file to disk.
CompactPolicies	-	The compact policies used to trigger slots compaction.
UploadPolicy	-	The upload policy used to replicate new slots to replicas.

XML representation

<StandardIndexManagementPolicy gcEveryS="120" ></StandardIndexManagementPolicy>

StandardCommitPolicy com.exalead.mercury.mami.indexing.v10.StandardCommitPolicy

Default commit policy

XML representation

<StandardCommitPolicy ></StandardCommitPolicy>

CompactPolicies com.exalead.mercury.mami.indexing.v10.CompactPolicies

No documentation for this element.

Parent objects

AdvancedConfig
StandardIndexManagementPolicy

Attributes

Name	Type	Default value	Description
synchronous	boolean	False	By default, compaction jobs are asynchronous. If set, compacts will be done synchronously just after imports.
maxParallelFullCompacts	int	0	Limit the number of full compacts in parallel, can be useful when you don't have too much disk space available. 0 means no limit.
type	enum(mmap, pagecache)	mmap	Specifies which I/O mode is used while compacting. ( Value can be null or one of mmap pagecache )
maxPageCacheSizeMB	int	32	If the policy uses the PageCache mode, it specifies the max cache size.
pageCachePageSizeKB	int	8	If the policy uses the PageCache mode, it specifies the page size.
priorityCompactThreshold	int	64	When compacting a slot gen0-gen1, consider as a priority compact if gen1-gen0 < priorityCompactThreshold. Default is 48. (0: disabled)
lowPriorityCompactNbThreads	int	2	Number of threads to use for a compact having low priority (0: all available threads).
highPriorityCompactNbThreads	int	0	Number of threads to use for a compact having high priority (0: all available threads).

Nested elements

AutoCompactPolicy*

Type	Wrapper	Description
AutoCompactPolicy*	-	Specifies the auto-compact policies.

XML representation

<CompactPolicies synchronous="false" maxParallelFullCompacts="0" type="mmap" maxPageCacheSizeMB="32" pageCachePageSizeKB="8" priorityCompactThreshold="64" lowPriorityCompactNbThreads="2" highPriorityCompactNbThreads="0" ></CompactPolicies>

NumberOfSlotsBasedCompactPolicy com.exalead.mercury.mami.indexing.v10.NumberOfSlotsBasedCompactPolicy

Compaction policy based on a fixed number of slots for a given number of generations.

Parent objects

CompactPolicies

Attributes

component string
arity int
maxSlotSizeMb long

Name	Type	Default value	Description
component	string	-
arity	int	4	Specifies the number of slots of the same length required to compact.
maxSlotSizeMb	long	5000	If a slot reaches this size; it will never be used by the next automatic compaction processes.

Nested elements

FullCompactPolicy

Type	Wrapper	Description
FullCompactPolicy	-

XML representation

<NumberOfSlotsBasedCompactPolicy arity="4" maxSlotSizeMb="5000" ></NumberOfSlotsBasedCompactPolicy>

MaxSizeFullCompactPolicy com.exalead.mercury.mami.indexing.v10.MaxSizeFullCompactPolicy

A FullCompactPolicy that compacts all slots into one whenever the "tail" of small slots exceeds a certain ratio of the large first slot.

This policy is appropriate when auto-compacts are restricted to slots under a certain size for performance reasons.

In this case, a full optimization can occasionally be triggered to purge the deletes.

If not, the deletes occurring in later slots would never be deleted, incurring performance costs at query-time and extra disk space consumption.

Parent objects

AutoCompactPolicy
LowLatencyCompactPolicy
NoCompactPolicy
NumberOfSlotsBasedCompactPolicy
SlotsLogSizeBasedCompactPolicy
SlotsSizeBasedCompactPolicy

Attributes

percentage int
minSlots int

Name	Type	Default value	Description
percentage	int	100	Minimum percentage to launch a full compaction. Compacts all slots into one whenever the "tail" of small slots exceeds a certain percentage of the large first slot. Eg: with percentage=100, when cumulated size of all slots except biggest is higher than size of the biggest slot, a full compact is triggered.
minSlots	int	2	Minimum number of slots before triggering a full compact.

XML representation

<MaxSizeFullCompactPolicy percentage="100" minSlots="2" ></MaxSizeFullCompactPolicy>

ArityBasedFullCompactPolicy com.exalead.mercury.mami.indexing.v10.ArityBasedFullCompactPolicy

A FullCompactPolicy that compacts all slots into one whenever the "tail" of slots with smaller arities exceeds together a certain arity.

The idea is that the arity-based policy guarantees occasional full-compaction, but the time interval between full-compaction increases exponentially.

This add-on policy caps the increase at a certain arity, and schedules full-compacts at regular intervals afterwards.

This policy is appropriate when auto-compacts are managed per generation-arity.

In this case, a full optimization can occasionally be triggered to purge the deletes.

If not, the deletes occurring in later slots would never be deleted, incurring performance costs at query-time and extra disk space consumption.

Parent objects

AutoCompactPolicy
LowLatencyCompactPolicy
NoCompactPolicy
NumberOfSlotsBasedCompactPolicy
SlotsLogSizeBasedCompactPolicy
SlotsSizeBasedCompactPolicy

Attributes

maxArity int
minSize long

Name	Type	Default value	Description
maxArity	int	256	Whenever the long tail total arity reaches maxArity, a full compact is scheduled. The "long tail" are the slots whose span has an arity inferior to this parameter. This is generally a multiple of the auto-compact Arity policy arity parameter.
minSize	long	0	Slots below this size are considered neglectable.

XML representation

<ArityBasedFullCompactPolicy maxArity="256" minSize="0" ></ArityBasedFullCompactPolicy>

SlotsSizeBasedCompactPolicy com.exalead.mercury.mami.indexing.v10.SlotsSizeBasedCompactPolicy

Compaction policy based on size that produces slots with similar size.

When N consecutive slots have a size below targetSizeForCompactionMB, it performs a compaction if:

N is at least minArity AND

The N+1 slot makes the size above targetSizeForCompactionMB OR
The size is above minSizeForCompactionMB

Parent objects

CompactPolicies

Attributes

Name	Type	Default value	Description
component	string	-
targetSizeForCompactionMB	int	200	Targeted size for a compacted slot.
minSizeForCompactionMB	int	50	Minimum size required to compact.
minArity	int	2	Minimum number of slots required to compact.

Nested elements

FullCompactPolicy

Type	Wrapper	Description
FullCompactPolicy	-

XML representation

<SlotsSizeBasedCompactPolicy targetSizeForCompactionMB="200" minSizeForCompactionMB="50" minArity="2" ></SlotsSizeBasedCompactPolicy>

SlotsLogSizeBasedCompactPolicy com.exalead.mercury.mami.indexing.v10.SlotsLogSizeBasedCompactPolicy

A CompactPolicy that tries to compact slots into levels of exponentially increasing size, where each level has fewer slots than the value of the compact factor.

Whenever extra slots (beyond the compact factor upper bound) are encountered, all slots within the level are compacted.

Parent objects

CompactPolicies

Attributes

component string
compactFactor int
minSize long
maxSize long

Name	Type	Default value	Description
component	string	-
compactFactor	int	10	Determines how often slots are compacted. With smaller values, less RAM is used while indexing, and searches on unoptimized indices are faster, but indexing speed is slower. With larger values, more RAM is used during indexing, and while searches on unoptimized indices are slower, indexing is faster. Thus larger values (greater than 10) are best for batch index creation, and smaller values (lower than 10) for indices that are interactively maintained.
minSize	long	1048576	A size setting type which sets the minimum size for the lowest level slots. Slots below this size are considered to be on the same level (even if they vary drastically in size) and will be merged whenever there are mergeFactor for them. This effectively truncates the "long tail" of small slots that would otherwise be created into a single level. If you set this too large, it can greatly increase the merging cost during indexing (if you flush many small slots).
maxSize	long	9223372036854775807	A size setting type which sets the largest slot that may be merged with other segments.

Nested elements

FullCompactPolicy

Type	Wrapper	Description
FullCompactPolicy	-

XML representation

<SlotsLogSizeBasedCompactPolicy compactFactor="10" minSize="1048576" maxSize="9223372036854775807" ></SlotsLogSizeBasedCompactPolicy>

LowLatencyCompactPolicy com.exalead.mercury.mami.indexing.v10.LowLatencyCompactPolicy

Compacts when the size of all small slots is above the average large slot size, or when the number of slots is above nbLargeSlots + maxNbSmallSlots.

Parent objects

CompactPolicies

Attributes

Name	Type	Default value	Description
component	string	-
nbLargeSlots	int	8	The number of large slots to keep.
maxNbSmallSlots	int	8	Maximum number of small slots allowed. As soon as this limit is reached, small slots are compacted together.
gatherSmallsAtTheEnd	boolean	True
contiguousCompact	boolean	False

Nested elements

FullCompactPolicy

Type	Wrapper	Description
FullCompactPolicy	-

XML representation

<LowLatencyCompactPolicy nbLargeSlots="8" maxNbSmallSlots="8" gatherSmallsAtTheEnd="true" contiguousCompact="false" ></LowLatencyCompactPolicy>

NoCompactPolicy com.exalead.mercury.mami.indexing.v10.NoCompactPolicy

Compact policy that does not perform any compact.

Parent objects

CompactPolicies

Attributes

component string

Name	Type	Default value	Description
component	string	-

Nested elements

FullCompactPolicy

Type	Wrapper	Description
FullCompactPolicy	-

XML representation

<NoCompactPolicy ></NoCompactPolicy>

StandardUploadPolicy com.exalead.mercury.mami.indexing.v10.StandardUploadPolicy

Default upload policy

Parent objects

StandardIndexManagementPolicy

Attributes

waitBetweenSwitchesS int

Name	Type	Default value	Description
waitBetweenSwitchesS	int	-	If strictly positive, all slices switch to a generation sequentially, and we wait this time in seconds between two slices. This spreads the temporary memory consumption to avoid large memory spikes and swap out.

XML representation

<StandardUploadPolicy waitBetweenSwitchesS="" ></StandardUploadPolicy>

WriteAttributeSlotConfig com.exalead.mercury.mami.indexing.v10.WriteAttributeSlotConfig

Write attribute slot configuration

Parent objects

IndexingConfig

Attributes

type enum(directio, sequential)
groupId int

Name	Type	Default value	Description
type	enum(directio, sequential)	directio	Access type for writing the new slots. Value can be null or one of directio sequential
groupId	int	-	Specifies which attribute group store this access configuration applies to.

XML representation

<WriteAttributeSlotConfig type="directio" groupId="" ></WriteAttributeSlotConfig>

WriteSlotConfig com.exalead.mercury.mami.indexing.v10.WriteSlotConfig

Write slot configuration

Parent objects

IndexingConfig

Attributes

type enum(directio, sequential)

Name	Type	Default value	Description
type	enum(directio, sequential)	sequential	Access type for writing the new slots. Value can be null or one of directio sequential

XML representation

<WriteSlotConfig type="sequential" ></WriteSlotConfig>

IndexRuntimeConfigList com.exalead.mercury.mami.indexing.v10.IndexRuntimeConfigList

Lists all index runtime config list.

Attributes

version long

Name	Type	Default value	Description
version	long	-	-

Nested elements

CacheConfig*
IndexRuntimeConfig*

Type	Wrapper	Description
CacheConfig*	-	Lists PageCache configurations
IndexRuntimeConfig*	-	Lists runtime configurations

XML representation

<IndexRuntimeConfigList version="" ></IndexRuntimeConfigList>

CacheConfig com.exalead.mercury.mami.indexing.v10.CacheConfig

PageCache configuration. Warning: The index page cache is limited to 32000 files in the index directory. If you get an error like "FileRAM: too many cached files (c_max_files=32767)", it means that the limit has been crossed and you should set a more aggressive compact policy.

Parent objects

IndexRuntimeConfigList

Attributes

Name	Type	Default value	Description
name	string	-	The cache ID.
cacheSizeMB	int	256	Maximum cache size in MB.
pageSizeKB	int	8	Page size in KB.
maxSimultaneousIOOperations	int	32	Specifies the max number of simultaneous I/O.

XML representation

<CacheConfig name="" cacheSizeMB="256" pageSizeKB="8" maxSimultaneousIOOperations="32" ></CacheConfig>

IndexRuntimeConfig com.exalead.mercury.mami.indexing.v10.IndexRuntimeConfig

Index runtime configuration for an instance of an index slice.
Use key values arguments to provide custom configuration keys.

Parent objects

IndexRuntimeConfigList

Attributes

name string
newGenerationBandwidthLimitKB int
compactBandwidthLimitKB int
ramBasedAttrGroupLoadPolicy enum(rebuild, copyAndPatch)

Name	Type	Default value	Description
name	string	-
newGenerationBandwidthLimitKB	int	0
compactBandwidthLimitKB	int	0
ramBasedAttrGroupLoadPolicy	enum(rebuild, copyAndPatch)	copyAndPatch	Value can be one of rebuild copyAndPatch

Nested elements

AttributeGroupAccess*
FieldRuntimeConfig*
QueryAutocacheConfig
ReplicationConfig
WarmupConfig

Type	Wrapper	Description
AttributeGroupAccess*	-
FieldRuntimeConfig*	-
QueryAutocacheConfig	-
ReplicationConfig	-
WarmupConfig	-

XML representation

<IndexRuntimeConfig name="" newGenerationBandwidthLimitKB="0" compactBandwidthLimitKB="0" ramBasedAttrGroupLoadPolicy="copyAndPatch" ></IndexRuntimeConfig>

AttributeGroupAccess com.exalead.mercury.mami.indexing.v10.AttributeGroupAccess

Configuration specifying how to access the attribute group at runtime.

Parent objects

IndexRuntimeConfig

Attributes

groupId string
runType enum(mmap, pagecache, direct, RAMRow, RAMColumnDense)
preload boolean
mlock boolean
cacheId string

Name	Type	Default value	Description
groupId	string	-	Specifies which attribute group store this access configuration applies to.
runType	enum(mmap, pagecache, direct, RAMRow, RAMColumnDense)	mmap	Specifies how the attribute group should be accessed at runtime.
preload	boolean	False	For RAM-based access type, specifies if the attribute group should be loaded in RAM at startup instead of at access time.
mlock	boolean	False	For RAM-based access type, specifies if the attribute group should be locked in RAM. Preventing it being moved to the swap area.
cacheId	string	-	For pagecache I/O type, specifies the cache ID.

XML representation

<AttributeGroupAccess groupId="" runType="mmap" preload="false" mlock="false" cacheId="" ></AttributeGroupAccess>

FieldRuntimeConfig com.exalead.mercury.mami.indexing.v10.FieldRuntimeConfig

Configuration specifying the index field at runtime.

Parent objects

IndexRuntimeConfig

Attributes

name string
dictType enum(mmap, pagecache)
type enum(mmap, pagecache)
preload boolean
mlock boolean
cacheId string

Name	Type	Default value	Description
name	string	-	The index field name.
dictType	enum(mmap, pagecache)	mmap	Specifies the I/O mode used to load the dictionary part of an index field. ( Value can be one of mmap pagecache )
type	enum(mmap, pagecache)	mmap	Specifies the I/O mode used to load the component. ( Value can be one of mmap pagecache )
preload	boolean	False	Should the field be preloaded? This will force the field to be loaded in RAM at startup.
mlock	boolean	False	Should the field be locked in RAM.
cacheId	string	-	If PageCache is used, it specifies the cache ID.

XML representation

<FieldRuntimeConfig name="" dictType="mmap" type="mmap" preload="false" mlock="false" cacheId="" ></FieldRuntimeConfig>

QueryAutocacheConfig com.exalead.mercury.mami.indexing.v10.QueryAutocacheConfig

Query #autocache configuration.

Parent objects

IndexRuntimeConfig

Attributes

Name	Type	Default value	Description
totalCacheSizeMB	int	20	Maximum cache size in MB (cross queries).
queryCacheSizeMB	int	5	Maximum cached query size.
maxCachedQueries	int	20	Number of queries cached.

XML representation

<QueryAutocacheConfig totalCacheSizeMB="20" queryCacheSizeMB="5" maxCachedQueries="20" ></QueryAutocacheConfig>

ReplicationConfig com.exalead.mercury.mami.indexing.v10.ReplicationConfig

Slice replication configuration

Nested elements

AttributeReplicationConfig*
FieldReplicationConfig*

Type	Wrapper	Description
AttributeReplicationConfig*	-	Configures the direction usage in attribute replication.
FieldReplicationConfig*	-	Configures the direction usage in field replication.

XML representation

<ReplicationConfig ></ReplicationConfig>

AttributeReplicationConfig com.exalead.mercury.mami.indexing.v10.AttributeReplicationConfig

Attribute's replication configuration

Parent objects

ReplicationConfig

Attributes

groupId string
type enum(directio, sequential)

Name	Type	Default value	Description
groupId	string	-	Group id of the attribute to configure
type	enum(directio, sequential)	directio	Access type Value can be null or one of directio sequential

XML representation

<AttributeReplicationConfig type="directio" ></AttributeReplicationConfig>

FieldReplicationConfig com.exalead.mercury.mami.indexing.v10.FieldReplicationConfig

Index field replication configuration

Parent objects

ReplicationConfig

Attributes

name string
type enum(directio, sequential)
dictType enum(directio, sequential)

Name	Type	Default value	Description
name	string	-	Name of the field to configure.
type	enum(directio, sequential)	directio	Access type Value can be null or one of directio sequential
dictType	enum(directio, sequential)	directio	Access type for the dictionary Value can be null or one of directio sequential

XML representation

<FieldReplicationConfig type="directio" dictType="directio" ></FieldReplicationConfig>

WarmupConfig com.exalead.mercury.mami.indexing.v10.WarmupConfig

Index warmup configuration

Parent objects

IndexRuntimeConfig

Attributes

warmupQueryFile string
maxWarmupDurationS int

Name	Type	Default value	Description
warmupQueryFile	string	-	Warmup list of single queries
maxWarmupDurationS	int	5	Maximum time for warmup. Open the index after and prints a warning indicating which line number has been reached

XML representation

<WarmupConfig warmupQueryFile="" maxWarmupDurationS="5" ></WarmupConfig>

BuildGroupConfig com.exalead.mercury.mami.deploy.v10.BuildGroupConfig

Configuration of a build group.

A "Build Group" is defined by references to sub-configurations defined in other MAMI:

Analysis (how documents are processed).
Index Builder (how indexing jobs are scheduled and managed).
Index Schema (schema of the index slices being built).
Task Queue (how input document processing tasks are queued before jobs).
Similar Document (optional)

Several build groups may share some or all their sub-configuration.
In most configuration, all build groups would share the same index schema configuration.

When built with the same schema, index slices built by different build groups can be queried together (see the Search MAMI).

Attributes

buildGroup string
dataModel string
indexingConfig string

Name	Type	Default value	Description
buildGroup	string	-	Name of the build group. This name should be unique.
dataModel	string	-	Name of the data model.
indexingConfig	string	-	Name of an indexing configuration (IndexingConfig element in Indexing MAMI).

Nested elements

DIHConfig
DidAllocationPolicy
DocumentCacheConfig
PrecomputedThumbnailsConfig
ScratchHook*
SlicePartioningPolicy

Type	Wrapper	Description
DIHConfig	-
DidAllocationPolicy	-
DocumentCacheConfig	-
PrecomputedThumbnailsConfig	-
ScratchHook*	-
SlicePartioningPolicy	-

XML representation

<BuildGroupConfig buildGroup="" dataModel="" indexingConfig="" ></BuildGroupConfig>

DIHConfig com.exalead.mercury.mami.deploy.v10.DIHConfig

A DIHConfig is a set of parameters for a DIH.

Parent objects

BuildGroupConfig

Attributes

compactArity int
nbBloomBitsPerElement int
nbElementsInLeaf int
readMode enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential)
enumMode enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential)
compactMode enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential)

Name	Type	Default value	Description
compactArity	int	4	Number of consecutive slots to trigger a compact.
nbBloomBitsPerElement	int	20	Number of bits per elements in the DIH's StrBTree's bloom filter.
nbElementsInLeaf	int	100	Number of entries in each of the DIH's StrBTree's leaves.
readMode	enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential)	mmap	Read mode of the DIH's StrBTree, except for enumeration. Value can be null or one of auto direct mmap mmap_mlock mmap_mload pagecache random sequential
enumMode	enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential)	mmap	Read mode of the DIH's StrBTree, for enumeration. Value can be null or one of auto direct mmap mmap_mlock mmap_mload pagecache random sequential
compactMode	enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential)	mmap	Read mode of the DIH's StrBTree, for compact. Value can be null or one of auto direct mmap mmap_mlock mmap_mload pagecache random sequential

XML representation

<DIHConfig compactArity="4" nbBloomBitsPerElement="20" nbElementsInLeaf="100" readMode="mmap" enumMode="mmap" compactMode="mmap" ></DIHConfig>

ContiguousDidAllocationPolicy com.exalead.mercury.mami.deploy.v10.ContiguousDidAllocationPolicy

Base-class specifying how DIDs (Document IDs) are assigned to the documents.

Parent objects

BuildGroupConfig

Attributes

increasing boolean
startingPoint int
endingPoint nullableint

Name	Type	Default value	Description
increasing	boolean	True	Assign DIDs in an increasing order.
startingPoint	int	0	Start point of the allocation. By default, the first DID will have value '1'.
endingPoint	nullableint	-	End point of the allocation. By default, it will be Integer.MAX_VALUE if increasing or 1 if decreasing.

XML representation

<ContiguousDidAllocationPolicy increasing="true" startingPoint="0" endingPoint="" ></ContiguousDidAllocationPolicy>

DocumentCacheConfig com.exalead.mercury.mami.deploy.v10.DocumentCacheConfig

Configuration for the document cache.

Parent objects

BuildGroupConfig

Attributes

path string
compactArity int
nbBloomBitsPerElement int
nbElementsInLeaf int
readMode enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential)
enumMode enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential)
compactMode enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential)
diskCompressionAlgorithm enum(none, fastlz, gzip, lcs, lz4)
temporaryFilesCompressionAlgorithm enum(none, fastlz, gzip, lz4)

Name	Type	Default value	Description
path	string	-	Location of the document cache on the filesystem. Unless otherwise specified, the document cache is located in the "cache" subdirectory of the build group.
compactArity	int	4	Number of consecutive slots to trigger a compact.
nbBloomBitsPerElement	int	10	Number of bits per element in the document cache StrBTree bloom filter.
nbElementsInLeaf	int	20	Number of entries in each of the document cache StrBTree leaves.
readMode	enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential)	auto	Read mode of the document cache StrBTree, except for enumeration. Value can be null or one of auto direct mmap mmap_mlock mmap_mload pagecache random sequential
enumMode	enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential)	auto	Read mode of the document cache StrBTree, for enumeration. Value can be null or one of auto direct mmap mmap_mlock mmap_mload pagecache random sequential
compactMode	enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential)	auto	Read mode of the document cache StrBTree, for compact. Value can be null or one of auto direct mmap mmap_mlock mmap_mload pagecache random sequential
diskCompressionAlgorithm	enum(none, fastlz, gzip, lcs, lz4)	fastlz	Algorithm to compress the document cache on disk. Value can be null or one of none fastlz gzip lcs lz4
temporaryFilesCompressionAlgorithm	enum(none, fastlz, gzip, lz4)	fastlz	Algorithm to compress the temporary files on disk. Value can be null or one of none fastlz gzip lz4

XML representation

<DocumentCacheConfig path="" compactArity="4" nbBloomBitsPerElement="10" nbElementsInLeaf="20" readMode="auto" enumMode="auto" compactMode="auto" diskCompressionAlgorithm="fastlz" temporaryFilesCompressionAlgorithm="fastlz" ></DocumentCacheConfig>

PrecomputedThumbnailsConfig com.exalead.mercury.mami.deploy.v10.PrecomputedThumbnailsConfig

No documentation for this element.

Parent objects

BuildGroupConfig

Attributes

computeThreads int

Name	Type	Default value	Description
computeThreads	int	4

XML representation

<PrecomputedThumbnailsConfig computeThreads="4" ></PrecomputedThumbnailsConfig>

FSPrecomputedThumbnailsConfig (deprecated) com.exalead.mercury.mami.deploy.v10.FSPrecomputedThumbnailsConfig

No documentation for this element.

Parent objects

BuildGroupConfig

Attributes

computeThreads int

Name	Type	Default value	Description
computeThreads	int	4

XML representation

<FSPrecomputedThumbnailsConfig computeThreads="4" ></FSPrecomputedThumbnailsConfig>

GDSPrecomputedThumbnailsConfig (deprecated) com.exalead.mercury.mami.deploy.v10.GDSPrecomputedThumbnailsConfig

No documentation for this element.

Parent objects

BuildGroupConfig

Attributes

computeThreads int
ramBufferSizeMB long
readMode enum(normal, direct)

Name	Type	Default value	Description
computeThreads	int	4
ramBufferSizeMB	long	16
readMode	enum(normal, direct)	direct	Value can be null or one of normal direct

XML representation

<GDSPrecomputedThumbnailsConfig computeThreads="4" ramBufferSizeMB="16" readMode="direct" ></GDSPrecomputedThumbnailsConfig>

ScratchHook com.exalead.mercury.mami.deploy.v10.ScratchHook

A Hook to plug custom exa code on BuildGroup scratch

Parent objects

BuildGroupConfig

Attributes

classId string

Name	Type	Default value	Description
classId	string	-	The specified class must implement the {@code com.exalead.mercury.indexing.CustomScratchHook} Exascript interface.

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-

XML representation

<ScratchHook classId="" ></ScratchHook>

BasicSlicePartioningPolicy com.exalead.mercury.mami.deploy.v10.BasicSlicePartioningPolicy

Basic partionning function based on a URL hash and a '%' (modulo).

XML representation

<BasicSlicePartioningPolicy ></BasicSlicePartioningPolicy>

SearchLogic com.exalead.mercury.mami.search.v20.SearchLogic

Controls how searches are processed.

Attributes

name string
tokenizationConfig string
dataModel string
customQueryProcessingFactoryClassId string
fetchMasks boolean
sliceDownAction enum(error, ignore)
sliceDownReportMode enum(quiet, local, global)
searchTimeoutAction enum(error, ignore)
searchTimeoutReportMode enum(quiet, local, global)
searchLimitAction enum(error, ignore)
searchLimitReportMode enum(quiet, local, global)
retrieveSourceField boolean
handleNegativeSecurityTokens boolean

Name	Type	Default value	Description
name	string	-	Name of the search logic. This name is referred to with the 'logic' parameter in the Search API.
tokenizationConfig	string	-	Name of the tokenization configuration to use when using this search logic. For consistency, this should be the same tokenization configuration as in the dictionary and analysis configurations.
dataModel	string	-	Name of the data model this search logic targets. A Search Logic can only target one data model.
customQueryProcessingFactoryClassId	string	-	Java class implementing the QueryProcessingRunnerFactory interface to build the thread-specific QueryProcessingRunners (which expand the actual query).
fetchMasks	boolean	False	Fetches masks in searcher. Required to get the "mask" field in the hits.
sliceDownAction	enum(error, ignore)	ignore	What to do if one of the slice is down when the search is performed. A slice is considered as down if all replicas are down (and if the master is also down in the default search target configuration). One of "error" or "ignore". Default is "ignore".
sliceDownReportMode	enum(quiet, local, global)	local	Specifies how to report a down slice error in 'Troubleshooting > Logs': quiet: No log. local: Error is reported in the Process log. global: Error is reported in both Process and Global logs.
searchTimeoutAction	enum(error, ignore)	ignore	What to do when slices time out. One of "error" or "ignore". Default is "ignore".
searchTimeoutReportMode	enum(quiet, local, global)	local	Specifies how to report a timeout error in 'Troubleshooting > Logs': quiet: No log. local: Error is reported in the Process log. global: Error is reported in both Process and Global logs.
searchLimitAction	enum(error, ignore)	ignore	What to do if a search limit is reached. One of "error" or "ignore". Default is "ignore".
searchLimitReportMode	enum(quiet, local, global)	local	Specifies how to report a search limit error in 'Troubleshooting > Logs': quiet: No log. local: Error is reported in the Process log. global: Error is reported in both Process and Global logs.
retrieveSourceField	boolean	True	Should the "source" field be automatically added to the retrieved field?
handleNegativeSecurityTokens	boolean	True	When enabled the security restriction will handle document security token starting with ~ as negative token. The negative security tokens are prioritary above standard tokens. The query restriction generated for a query with the tokens tok1, tok2, tok3 is ((tok1 OR tok2 OR tok3) AND NOT ~tok1 AND NOT ~tok2 AND NOT ~tok3). If the security sources return a lot of security tokens and the connectors do not push negative security tokens it can be interesting to disable negative security tokens handling to speed up the queries (the security query restriction will be (tok1 OR tok2 OR tok3) in this case.

Nested elements

ContentRestriction
FacetsConfig
HitConfig
LimitsConfig
PostFilterConfig
QueryRestrictionConfig
RelatedTermsSynthesisConfig
RelevanceConfig
Sort
SpellCheckerConfig
TimezoneConfig
UQLConfig
VirtualFieldDefinition*

Type	Wrapper	Description
ContentRestriction	-	Content restriction configuration for this search logic.
FacetsConfig	-	Facets configuration for this search logic.
HitConfig	-	Hit content configuration for this search logic.
LimitsConfig	-	Limits configuration for this search logic.
PostFilterConfig	-	Post filtering configuration for this search logic.
QueryRestrictionConfig	-	Query restriction configuration for this search logic.
RelatedTermsSynthesisConfig	-	Related terms configuration for this search logic.
RelevanceConfig	-	Relevance configuration for this search logic.
Sort	-	Sort configuration for this search logic.
SpellCheckerConfig	-	Spell checker configuration for this search logic.
TimezoneConfig	-
UQLConfig	-	UQL configuration for this search logic.
VirtualFieldDefinition*	-	Virtual fields configuration for this search logic.

XML representation

<SearchLogic name="" tokenizationConfig="tok0" dataModel="" customQueryProcessingFactoryClassId="" fetchMasks="false" sliceDownAction="ignore" sliceDownReportMode="local" searchTimeoutAction="ignore" searchTimeoutReportMode="local" searchLimitAction="ignore" searchLimitReportMode="local" retrieveSourceField="true" handleNegativeSecurityTokens="true" ></SearchLogic>

ContentRestriction com.exalead.mercury.mami.search.v20.ContentRestriction

No documentation for this element.

Parent objects

SearchLogic

Attributes

queryString string

Name	Type	Default value	Description
queryString	string	-	Query string to restrict content. Only the documents matching this query string will be taken into consideration. The query string is not subject to any kind of linguistic expansion whatsoever. This query string is AND' ed with the user actual query.

XML representation

<ContentRestriction queryString="" ></ContentRestriction>

FacetsConfig com.exalead.search.v30.FacetsConfig

Configuration for all facets calculated at search time.

Parent objects

SearchLogic

Attributes

maxHits int
enabled boolean
optimizedForExhaustiveRankedSynthesis boolean
categoryArrayBufferSize int

Name	Type	Default value	Description
maxHits	int	-	Number of hits on which category navigation will be computed. 0 means exhaustive navigation, within the limits of the matched hits. Note: Heapsort executor is used if this setting is defined in ranked mode. If not in ranked mode, this setting is applied per slice.
enabled	boolean	True	Enables faceting.
optimizedForExhaustiveRankedSynthesis	boolean	True	When search is running in "heap" mode, it locally sorts the DIDs before calculating facets. This decreases significantly LLC miss.
categoryArrayBufferSize	int	1000	Size of the result blocks sent to the searchserver. This is an optimization parameter. Lower values slightly decrease memory consumption when there is a large number of slices, but increase global CPU consumption.

Nested elements

Facet*

Type	Wrapper	Description
Facet*	-	Definition of a facet.

XML representation

<FacetsConfig maxHits="" enabled="true" optimizedForExhaustiveRankedSynthesis="true" categoryArrayBufferSize="1000" ></FacetsConfig>

CategoryFacet com.exalead.search.v30.CategoryFacet

Configuration for a facet.

Parent objects

FacetsConfig

Attributes

id string
type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
refinementPolicy enum(exclusive, disjunctive, norefine)
implementation enum(autodetect, favor_cpu, favor_memory)
inNavigation boolean
inHits boolean
maxSliceCategories int
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
root string
maxCategoryTreeDepth int
maxCategoryTreeDepthFromRoot int
categoryField string
form enum(exact, lowercase, normalized)
hitsMaxCategories int
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean
index int
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
id	string	-	Unique identifier of this facet. Used in Mashup-Builder and client applications to identify the facet.
type	enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)		Used for convenience in various introspection code
refinementPolicy (UI: Refinement policy)	enum(exclusive, disjunctive, norefine)	exclusive	Defines the Refinements panel behavior. Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected. Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected. No refine: Allows you to display all categories without enabling refinements.
implementation (UI: Implementation)	enum(autodetect, favor_cpu, favor_memory)	autodetect	Internal algorithm for implementation of this facet. Value can be null or one of autodetect favor_cpu favor_memory . "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index). "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set). "autodetect" automatically selects a suitable policy based on the result set and facet data.
inNavigation (UI: Use for navigation)	boolean	True	Computes navigation on this facet.
inHits (UI: Use for hit content)	boolean	True	Computes the value of this facet for each hit.
maxSliceCategories (UI: Max categories per slices)	int	0	Maximum number of categories returned by each slice for one facet. 0 means no limit. Warning: Results merged by the search server may be incorrect.
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
root (UI: Root)	string	-	In the categories field, category values are stored in a tree. The root of the tree is called "Top".
maxCategoryTreeDepth (UI: Max. tree depth from current refinement)	int	0	Maximum depth of the returned category tree. Whatever the level of refinement, there will never be more levels retrieved (this ensures we are not overtaking the maxCategoryTreeDepthFromRoot).
maxCategoryTreeDepthFromRoot (UI: Max. tree depth from root)	int	0	Maximum depth of the returned category tree, starting from the global root of the category. Setting this attribute to N means that no categories will be kept below this level.
categoryField (UI: Index field)	string	-	Specifies from which field the categories should be retrieved.
form (UI: Form)	enum(exact, lowercase, normalized)	normalized	Normalization form for the category path. Value can be one of exact lowercase normalized This must match the form with which the category has been mapped at indexing time.
hitsMaxCategories (UI: Max. categories in hits)	int	100	Maximum number of facet values displayed in hits.
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.
index	int	-	Internal.
dataModelState	string	-
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

Nested elements

fromDataModelFacet
AdditionalFacetView*
AggregationFunction*

Type	Wrapper	Description
Facet	fromDataModel
AdditionalFacetView*	-
AggregationFunction*	-	A numerical computation to perform on hits matching each value of the facet.

XML representation

<CategoryFacet id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" root="" maxCategoryTreeDepth="0" maxCategoryTreeDepthFromRoot="0" categoryField="" form="normalized" hitsMaxCategories="100" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></CategoryFacet>

AdditionalFacetView com.exalead.search.v30.AdditionalFacetView

No documentation for this element.

Parent objects

AutoTileGeoFacet
AutoTileGeoV2Facet
CategoryFacet
ConcatValueFacet
DateFacet
DynamicDateFacet
DynamicRangesNumericalFacet
ExplicitGeoFacet
ExplicitRangesNumericalFacet
Facet
FixedRangesNumericalFacet
Hierarchical2DFacet
LuaFacet
MultiDimensionFacet
NumericalFacet
OptimizedEnumFacet
RelatedTermFacet
ValueFacet
VirtualFacet

Attributes

sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
id string
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean

Name	Type	Default value	Description
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
id	string	-	Unique sub-identifier of this view. Will be concatenated to the facet's id. Used in Mashup-Builder and client applications to identify the facet.
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.

XML representation

<AdditionalFacetView sortFunction="count" aggregateScores="false" id="" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" ></AdditionalFacetView>

AdditionalCategoryFacetView com.exalead.search.v30.AdditionalCategoryFacetView

No documentation for this element.

Parent objects

AutoTileGeoFacet
AutoTileGeoV2Facet
CategoryFacet
ConcatValueFacet
DateFacet
DynamicDateFacet
DynamicRangesNumericalFacet
ExplicitGeoFacet
ExplicitRangesNumericalFacet
Facet
FixedRangesNumericalFacet
Hierarchical2DFacet
LuaFacet
MultiDimensionFacet
NumericalFacet
OptimizedEnumFacet
RelatedTermFacet
ValueFacet
VirtualFacet

Attributes

id string
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
maxCategoryTreeDepth int
maxCategoryTreeDepthFromRoot int
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean

Name	Type	Default value	Description
id	string	-	Unique sub-identifier of this view. Will be concatenated to the facet's id. Used in Mashup-Builder and client applications to identify the facet.
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
maxCategoryTreeDepth (UI: Max. tree depth from current refinement)	int	0	Maximum number of categories displayed in the facet from the current refinement. Setting this attribute to N means that no categories will be kept below this level. 0 means no limit.
maxCategoryTreeDepthFromRoot (UI: Max. tree depth from root)	int	0	Maximum depth of the category tree displayed in the facet from the Top/ level. You can't go further than this limit whatever the refinement level. 0 means no limit.
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.

XML representation

<AdditionalCategoryFacetView id="" sortFunction="count" aggregateScores="false" maxCategoryTreeDepth="0" maxCategoryTreeDepthFromRoot="0" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" ></AdditionalCategoryFacetView>

AdditionalDateFacetView com.exalead.search.v30.AdditionalDateFacetView

No documentation for this element.

Parent objects

AutoTileGeoFacet
AutoTileGeoV2Facet
CategoryFacet
ConcatValueFacet
DateFacet
DynamicDateFacet
DynamicRangesNumericalFacet
ExplicitGeoFacet
ExplicitRangesNumericalFacet
Facet
FixedRangesNumericalFacet
Hierarchical2DFacet
LuaFacet
MultiDimensionFacet
NumericalFacet
OptimizedEnumFacet
RelatedTermFacet
ValueFacet
VirtualFacet

Attributes

id string
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
withYear boolean
yearDesc boolean
withMonth boolean
monthDesc boolean
withWeek boolean
weekDesc boolean
withDay boolean
dayDesc boolean
withHour boolean
hourDesc boolean
withMinute boolean
minuteDesc boolean
withSecond boolean
secondDesc boolean
maxCategoryTreeDepth int
maxCategoryTreeDepthFromRoot int
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean

Name	Type	Default value	Description
id	string	-	Unique sub-identifier of this view. Will be concatenated to the facet's id. Used in Mashup-Builder and client applications to identify the facet.
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
withYear	boolean	True	Creates a virtual category for each year.
yearDesc	boolean	True
withMonth	boolean	True	Creates a virtual category for each month.
monthDesc	boolean	False
withWeek	boolean	False	Creates a virtual category for each week.
weekDesc	boolean	False
withDay	boolean	True	Creates a virtual category for each day.
dayDesc	boolean	False
withHour	boolean	False	Creates a virtual category for each hour.
hourDesc	boolean	False
withMinute	boolean	False	Creates a virtual category for each minute.
minuteDesc	boolean	False
withSecond	boolean	False	Creates a virtual category for each second.
secondDesc	boolean	False
maxCategoryTreeDepth (UI: Max. tree depth from current refinement)	int	0	Maximum depth of the returned category tree. Whatever the level of refinement, there will never be more levels retrieved (this ensures we are not overtaking the maxCategoryTreeDepthFromRoot).
maxCategoryTreeDepthFromRoot (UI: Max. tree depth from root)	int	0	Maximum depth of the returned category tree, starting from the global root of the category. Setting this attribute to N means that no categories will be kept below this level.
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.

XML representation

<AdditionalDateFacetView id="" sortFunction="count" aggregateScores="false" withYear="true" yearDesc="true" withMonth="true" monthDesc="false" withWeek="false" weekDesc="false" withDay="true" dayDesc="false" withHour="false" hourDesc="false" withMinute="false" minuteDesc="false" withSecond="false" secondDesc="false" maxCategoryTreeDepth="0" maxCategoryTreeDepthFromRoot="0" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" ></AdditionalDateFacetView>

AggregationFunction com.exalead.search.v30.AggregationFunction

No documentation for this element.

Parent objects

AutoTileGeoFacet
AutoTileGeoV2Facet
CategoryFacet
ConcatValueFacet
DateFacet
DynamicDateFacet
DynamicRangesNumericalFacet
ExplicitGeoFacet
ExplicitRangesNumericalFacet
Facet
FixedRangesNumericalFacet
Hierarchical2DFacet
LuaFacet
MultiDimensionFacet
NumericalFacet
OptimizedEnumFacet
RelatedTermFacet
ValueFacet
VirtualFacet

Attributes

id string
expr string
functionName string

Name	Type	Default value	Description
id	string	-	Unique identifier of this function.
expr	string	-	The expression on which to perform the synthesis. It can reference fields or virtual fields.
functionName	string	-	Aggregation function name. We support the following aggregation functions: AVG: average field value MAX: maximum field value MIN: minimum field value SUM: sum of field values COUNT: number of values STDDEV: standard deviation of field values CENTILE(X): X-centile of field values. X is a double between 0.0 and 100.0. MAXDATE(format): maximum field value. The expression must be an index time. The output is parsed according to the specified <i>format</i> MINDATE(format): minimum field value. The expression must be an index time. The output is parsed according to the specified <i>format</i>

XML representation

<AggregationFunction id="" expr="" functionName="" ></AggregationFunction>

ExplicitRangesNumericalFacet com.exalead.search.v30.ExplicitRangesNumericalFacet

Facet on a numerical expression

Parent objects

FacetsConfig

Attributes

expr string
min nullabledouble
max nullabledouble
lsb int
msb int
maxValues int
defaultPrecision int
virtualRoot string
id string
type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
refinementPolicy enum(exclusive, disjunctive, norefine)
implementation enum(autodetect, favor_cpu, favor_memory)
inNavigation boolean
inHits boolean
maxSliceCategories int
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean
index int
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
expr (UI: Expression)	string	-	Virtual field expression for this facet.
min (UI: Exclude values below)	nullabledouble	-	Excludes values below this threshold from the facet.
max (UI: Exclude values above)	nullabledouble	-	Excludes values above this threshold from the facet.
lsb (UI: Least significant bit)	int	0	Least significant bit of the numerical value to use. Only used for integral types.
msb (UI: Most significant bit)	int	63	Most significant bit of the numerical value to use. Only used for integral types.
maxValues (UI: Max. categories)	int	100	Maximum number of displayed values.
defaultPrecision (UI: Default precision)	int	0	Default precision, in number of trailing digits, for float-typed expressions with double precision (e.g. "#random()"). For example, if defaultPrecision is 2, 0.011 and 0.012 will be considered similar, and categories will be displayed with two significant digits (e.g. [0.00;0.99])
virtualRoot (UI: Virtual root)	string	-	Path for the resulting root 'virtual' category. When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path. For example: Top/VirtualRoot/[0;5]
id	string	-	Unique identifier of this facet. Used in Mashup-Builder and client applications to identify the facet.
type	enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)		Used for convenience in various introspection code
refinementPolicy (UI: Refinement policy)	enum(exclusive, disjunctive, norefine)	exclusive	Defines the Refinements panel behavior. Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected. Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected. No refine: Allows you to display all categories without enabling refinements.
implementation (UI: Implementation)	enum(autodetect, favor_cpu, favor_memory)	autodetect	Internal algorithm for implementation of this facet. Value can be null or one of autodetect favor_cpu favor_memory . "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index). "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set). "autodetect" automatically selects a suitable policy based on the result set and facet data.
inNavigation (UI: Use for navigation)	boolean	True	Computes navigation on this facet.
inHits (UI: Use for hit content)	boolean	True	Computes the value of this facet for each hit.
maxSliceCategories (UI: Max categories per slices)	int	0	Maximum number of categories returned by each slice for one facet. 0 means no limit. Warning: Results merged by the search server may be incorrect.
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.
index	int	-	Internal.
dataModelState	string	-
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

Nested elements

fromDataModelFacet
AdditionalFacetView*
AggregationFunction*
Range*

Type	Wrapper	Description
Facet	fromDataModel
AdditionalFacetView*	-
AggregationFunction*	-	A numerical computation to perform on hits matching each value of the facet.
Range*	-

XML representation

<ExplicitRangesNumericalFacet expr="" min="" max="" lsb="0" msb="63" maxValues="100" defaultPrecision="0" virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></ExplicitRangesNumericalFacet>

Range com.exalead.search.v30.Range

No documentation for this element.

Parent objects

ExplicitRangesNumericalFacet

Attributes

min nullabledouble
max nullabledouble
lsb int
msb int
title string

Name	Type	Default value	Description
min (UI: Min)	nullabledouble	-	The lower bound of the range (null means minus infinity).
max (UI: Max)	nullabledouble	-	The upper bound of the range (null means plus infinity).
lsb (UI: Least significant bit)	int	0	Least significant bit of the numerical value to use. Only used for integral types.
msb (UI: Most significant bit)	int	63	Most significant bit of the numerical value to use. Only used for integral types.
title	string	-

XML representation

<Range min="" max="" lsb="0" msb="63" title="" ></Range>

FixedRangesNumericalFacet com.exalead.search.v30.FixedRangesNumericalFacet

Facet on a numerical expression

Parent objects

FacetsConfig

Attributes

expr string
min nullabledouble
max nullabledouble
lsb int
msb int
maxValues int
defaultPrecision int
virtualRoot string
id string
type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
refinementPolicy enum(exclusive, disjunctive, norefine)
implementation enum(autodetect, favor_cpu, favor_memory)
inNavigation boolean
inHits boolean
maxSliceCategories int
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
rangeSize double
aboveMax boolean
belowMin boolean
rangeTitleFormat string
aboveTitleFormat string
belowTitleFormat string
singletonTitleFormat string
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean
index int
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
expr (UI: Expression)	string	-	Virtual field expression for this facet.
min (UI: Exclude values below)	nullabledouble	-	Excludes values below this threshold from the facet.
max (UI: Exclude values above)	nullabledouble	-	Excludes values above this threshold from the facet.
lsb (UI: Least significant bit)	int	0	Least significant bit of the numerical value to use. Only used for integral types.
msb (UI: Most significant bit)	int	63	Most significant bit of the numerical value to use. Only used for integral types.
maxValues (UI: Max. categories)	int	100	Maximum number of displayed values.
defaultPrecision (UI: Default precision)	int	0	Default precision, in number of trailing digits, for float-typed expressions with double precision (e.g. "#random()"). For example, if defaultPrecision is 2, 0.011 and 0.012 will be considered similar, and categories will be displayed with two significant digits (e.g. [0.00;0.99])
virtualRoot (UI: Virtual root)	string	-	Path for the resulting root 'virtual' category. When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path. For example: Top/VirtualRoot/[0;5]
id	string	-	Unique identifier of this facet. Used in Mashup-Builder and client applications to identify the facet.
type	enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)		Used for convenience in various introspection code
refinementPolicy (UI: Refinement policy)	enum(exclusive, disjunctive, norefine)	exclusive	Defines the Refinements panel behavior. Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected. Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected. No refine: Allows you to display all categories without enabling refinements.
implementation (UI: Implementation)	enum(autodetect, favor_cpu, favor_memory)	autodetect	Internal algorithm for implementation of this facet. Value can be null or one of autodetect favor_cpu favor_memory . "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index). "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set). "autodetect" automatically selects a suitable policy based on the result set and facet data.
inNavigation (UI: Use for navigation)	boolean	True	Computes navigation on this facet.
inHits (UI: Use for hit content)	boolean	True	Computes the value of this facet for each hit.
maxSliceCategories (UI: Max categories per slices)	int	0	Maximum number of categories returned by each slice for one facet. 0 means no limit. Warning: Results merged by the search server may be incorrect.
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
rangeSize (UI: Range size)	double	0.0	Size of each range. 0 means that the resulting facet will contain one range for each unique value.
aboveMax (UI: Above max range)	boolean	False	Creates a range [max;] that includes all values above the max.
belowMin (UI: Below min range)	boolean	False	Creates a range [;min] that includes all values below the min.
rangeTitleFormat (UI: Range title format)	string	[$;$]	Format for the generated range titles. The first dollar sign will be replaced with the range's lower bound, and the second dollar sign will be replaced with the range's upper bound. To write a dollar sign, use \$.
aboveTitleFormat (UI: Above max range title format)	string	[$;]	Format for the generated "above max" range. The dollar sign will be replaced with the range's lower bound. To write a dollar sign, use \$.
belowTitleFormat (UI: Below min range title format)	string	[;$]	Format for the generated "below min" range. The dollar sign will be replaced with the range's upper bound. To write a dollar sign, use \$.
singletonTitleFormat (UI: Singleton range title format)	string	$	Format for the generated single value range titles (like [3;3]). The dollar sign will be replaced with the value. To write a dollar sign, use \$.
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.
index	int	-	Internal.
dataModelState	string	-
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

Nested elements

fromDataModelFacet
AdditionalFacetView*
AggregationFunction*

Type	Wrapper	Description
Facet	fromDataModel
AdditionalFacetView*	-
AggregationFunction*	-	A numerical computation to perform on hits matching each value of the facet.

XML representation

<FixedRangesNumericalFacet expr="" min="" max="" lsb="0" msb="63" maxValues="100" defaultPrecision="0" virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" rangeSize="0.0" aboveMax="false" belowMin="false" rangeTitleFormat="[$;$]" aboveTitleFormat="[$;]" belowTitleFormat="[;$]" singletonTitleFormat="$" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></FixedRangesNumericalFacet>

DynamicRangesNumericalFacet com.exalead.search.v30.DynamicRangesNumericalFacet

Facet on a numerical expression

Parent objects

FacetsConfig

Attributes

expr string
min nullabledouble
max nullabledouble
lsb int
msb int
maxValues int
defaultPrecision int
virtualRoot string
id string
type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
refinementPolicy enum(exclusive, disjunctive, norefine)
implementation enum(autodetect, favor_cpu, favor_memory)
inNavigation boolean
inHits boolean
maxSliceCategories int
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
minimumRangeSize double
nbRanges int
policy enum(linear, geometrical, auto)
adjustRanges boolean
moreAccurate boolean
rangeTitleFormat string
exclusiveRightBracket boolean
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean
index int
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
expr (UI: Expression)	string	-	Virtual field expression for this facet.
min (UI: Exclude values below)	nullabledouble	-	Excludes values below this threshold from the facet.
max (UI: Exclude values above)	nullabledouble	-	Excludes values above this threshold from the facet.
lsb (UI: Least significant bit)	int	0	Least significant bit of the numerical value to use. Only used for integral types.
msb (UI: Most significant bit)	int	63	Most significant bit of the numerical value to use. Only used for integral types.
maxValues (UI: Max. categories)	int	100	Maximum number of displayed values.
defaultPrecision (UI: Default precision)	int	0	Default precision, in number of trailing digits, for float-typed expressions with double precision (e.g. "#random()"). For example, if defaultPrecision is 2, 0.011 and 0.012 will be considered similar, and categories will be displayed with two significant digits (e.g. [0.00;0.99])
virtualRoot (UI: Virtual root)	string	-	Path for the resulting root 'virtual' category. When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path. For example: Top/VirtualRoot/[0;5]
id	string	-	Unique identifier of this facet. Used in Mashup-Builder and client applications to identify the facet.
type	enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)		Used for convenience in various introspection code
refinementPolicy (UI: Refinement policy)	enum(exclusive, disjunctive, norefine)	exclusive	Defines the Refinements panel behavior. Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected. Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected. No refine: Allows you to display all categories without enabling refinements.
implementation (UI: Implementation)	enum(autodetect, favor_cpu, favor_memory)	autodetect	Internal algorithm for implementation of this facet. Value can be null or one of autodetect favor_cpu favor_memory . "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index). "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set). "autodetect" automatically selects a suitable policy based on the result set and facet data.
inNavigation (UI: Use for navigation)	boolean	True	Computes navigation on this facet.
inHits (UI: Use for hit content)	boolean	True	Computes the value of this facet for each hit.
maxSliceCategories (UI: Max categories per slices)	int	0	Maximum number of categories returned by each slice for one facet. 0 means no limit. Warning: Results merged by the search server may be incorrect.
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
minimumRangeSize	double	1.0	Size of each bin used to compute cardinality.
nbRanges	int	5	The maximal number of ranges to output.
policy	enum(linear, geometrical, auto)	linear	The policy to generate the ranges Value can be null or one of linear geometrical auto
adjustRanges	boolean	True	Tries to adjust the ranges on multiples of 10.
moreAccurate	boolean	False	Better linear/geometrical ranges, but slower.
rangeTitleFormat	string	[$;$]	Format for the generated range titles. The first dollar sign will be replaced with the range's lower bound, and the second dollar sign will be replaced with the range's upper bound. To write a dollar sign, use \$.
exclusiveRightBracket (UI: Exclusive right bracket [a;b[)	boolean	False	If exclusiveRightBracket is set to true then the facet will be represented by an open range [a,b[, that means it will not contain hits with value equal to b. otherwise the facet will be in inclusive mode [a,b].
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.
index	int	-	Internal.
dataModelState	string	-
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

Nested elements

fromDataModelFacet
AdditionalFacetView*
AggregationFunction*

Type	Wrapper	Description
Facet	fromDataModel
AdditionalFacetView*	-
AggregationFunction*	-	A numerical computation to perform on hits matching each value of the facet.

XML representation

<DynamicRangesNumericalFacet expr="" min="" max="" lsb="0" msb="63" maxValues="100" defaultPrecision="0" virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" minimumRangeSize="1.0" nbRanges="5" policy="linear" adjustRanges="true" moreAccurate="false" rangeTitleFormat="[$;$]" exclusiveRightBracket="false" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></DynamicRangesNumericalFacet>

ExplicitGeoFacet com.exalead.search.v30.ExplicitGeoFacet

Facet on geographical points based on disk and/or polygons

Parent objects

FacetsConfig

Attributes

virtualRoot string
id string
type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
refinementPolicy enum(exclusive, disjunctive, norefine)
implementation enum(autodetect, favor_cpu, favor_memory)
inNavigation boolean
inHits boolean
maxSliceCategories int
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
field string
hitsMaxCategories int
useGeometryAsDomainId boolean
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean
index int
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
virtualRoot (UI: Virtual root)	string	-	Path for the resulting root 'virtual' category. When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path. For example: Top/VirtualRoot/[0;5]
id	string	-	Unique identifier of this facet. Used in Mashup-Builder and client applications to identify the facet.
type	enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)		Used for convenience in various introspection code
refinementPolicy (UI: Refinement policy)	enum(exclusive, disjunctive, norefine)	exclusive	Defines the Refinements panel behavior. Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected. Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected. No refine: Allows you to display all categories without enabling refinements.
implementation (UI: Implementation)	enum(autodetect, favor_cpu, favor_memory)	autodetect	Internal algorithm for implementation of this facet. Value can be null or one of autodetect favor_cpu favor_memory . "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index). "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set). "autodetect" automatically selects a suitable policy based on the result set and facet data.
inNavigation (UI: Use for navigation)	boolean	True	Computes navigation on this facet.
inHits (UI: Use for hit content)	boolean	True	Computes the value of this facet for each hit.
maxSliceCategories (UI: Max categories per slices)	int	0	Maximum number of categories returned by each slice for one facet. 0 means no limit. Warning: Results merged by the search server may be incorrect.
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
field	string	-	The point field on which to perform the synthesis.
hitsMaxCategories	int	100	Maximum number of categories displayed in hits.
useGeometryAsDomainId	boolean	True	The domain id will be computed with the geometric parameters. Otherwise, an unique id will be used. (recommanded value: false, except for backward compatibility)
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.
index	int	-	Internal.
dataModelState	string	-
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

Nested elements

fromDataModelFacet
AdditionalFacetView*
AggregationFunction*
GeoDomain*

Type	Wrapper	Description
Facet	fromDataModel
AdditionalFacetView*	-
AggregationFunction*	-	A numerical computation to perform on hits matching each value of the facet.
GeoDomain*	-	List of GeoDomains

XML representation

<ExplicitGeoFacet virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" field="" hitsMaxCategories="100" useGeometryAsDomainId="true" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></ExplicitGeoFacet>

AutoTileGeoFacet com.exalead.search.v30.AutoTileGeoFacet

Configuration for a facet.

Parent objects

FacetsConfig

Attributes

virtualRoot string
id string
type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
refinementPolicy enum(exclusive, disjunctive, norefine)
implementation enum(autodetect, favor_cpu, favor_memory)
inNavigation boolean
inHits boolean
maxSliceCategories int
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
field string
xMin double
yMin double
xMax double
yMax double
xBinSize double
yBinSize double
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean
index int
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
virtualRoot (UI: Virtual root)	string	-	Path for the resulting root 'virtual' category. When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path. For example: Top/VirtualRoot/[0;5]
id	string	-	Unique identifier of this facet. Used in Mashup-Builder and client applications to identify the facet.
type	enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)		Used for convenience in various introspection code
refinementPolicy (UI: Refinement policy)	enum(exclusive, disjunctive, norefine)	exclusive	Defines the Refinements panel behavior. Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected. Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected. No refine: Allows you to display all categories without enabling refinements.
implementation (UI: Implementation)	enum(autodetect, favor_cpu, favor_memory)	autodetect	Internal algorithm for implementation of this facet. Value can be null or one of autodetect favor_cpu favor_memory . "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index). "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set). "autodetect" automatically selects a suitable policy based on the result set and facet data.
inNavigation (UI: Use for navigation)	boolean	True	Computes navigation on this facet.
inHits (UI: Use for hit content)	boolean	True	Computes the value of this facet for each hit.
maxSliceCategories (UI: Max categories per slices)	int	0	Maximum number of categories returned by each slice for one facet. 0 means no limit. Warning: Results merged by the search server may be incorrect.
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
field	string	-	The point field on which to perform the synthesis.
xMin	double	-90	Minimum bounding box X value.
yMin	double	-180	Minimum bounding box Y value.
xMax	double	90	Maximum bounding box X value.
yMax	double	180	Maximum bounding box Y value.
xBinSize	double	0.1	Size of a tile in the X direction.
yBinSize	double	0.1	Size of a tile in the Y direction.
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.
index	int	-	Internal.
dataModelState	string	-
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

Nested elements

fromDataModelFacet
AdditionalFacetView*
AggregationFunction*

Type	Wrapper	Description
Facet	fromDataModel
AdditionalFacetView*	-
AggregationFunction*	-	A numerical computation to perform on hits matching each value of the facet.

XML representation

<AutoTileGeoFacet virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" field="" xMin="-90" yMin="-180" xMax="90" yMax="180" xBinSize="0.1" yBinSize="0.1" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></AutoTileGeoFacet>

AutoTileGeoV2Facet com.exalead.search.v30.AutoTileGeoV2Facet

Configuration for a facet.

Parent objects

FacetsConfig

Attributes

virtualRoot string
id string
type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
refinementPolicy enum(exclusive, disjunctive, norefine)
implementation enum(autodetect, favor_cpu, favor_memory)
inNavigation boolean
inHits boolean
maxSliceCategories int
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
field string
xMin double
yMin double
xMax double
yMax double
xNbCells int
yNbCells int
hitsMaxCategories int
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean
index int
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
virtualRoot (UI: Virtual root)	string	-	Path for the resulting root 'virtual' category. When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path. For example: Top/VirtualRoot/[0;5]
id	string	-	Unique identifier of this facet. Used in Mashup-Builder and client applications to identify the facet.
type	enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)		Used for convenience in various introspection code
refinementPolicy (UI: Refinement policy)	enum(exclusive, disjunctive, norefine)	exclusive	Defines the Refinements panel behavior. Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected. Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected. No refine: Allows you to display all categories without enabling refinements.
implementation (UI: Implementation)	enum(autodetect, favor_cpu, favor_memory)	autodetect	Internal algorithm for implementation of this facet. Value can be null or one of autodetect favor_cpu favor_memory . "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index). "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set). "autodetect" automatically selects a suitable policy based on the result set and facet data.
inNavigation (UI: Use for navigation)	boolean	True	Computes navigation on this facet.
inHits (UI: Use for hit content)	boolean	True	Computes the value of this facet for each hit.
maxSliceCategories (UI: Max categories per slices)	int	0	Maximum number of categories returned by each slice for one facet. 0 means no limit. Warning: Results merged by the search server may be incorrect.
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
field	string	-	The geov2 field on which to perform the synthesis.
xMin	double	-90	Minimum bounding box X value.
yMin	double	-180	Minimum bounding box Y value.
xMax	double	90	Maximum bounding box X value.
yMax	double	180	Maximum bounding box Y value.
xNbCells	int	10	Number of tiles in the X direction.
yNbCells	int	10	Number of tiles in the Y direction.
hitsMaxCategories	int	100	Maximum number of categories displayed in hits.
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.
index	int	-	Internal.
dataModelState	string	-
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

Nested elements

fromDataModelFacet
AdditionalFacetView*
AggregationFunction*

Type	Wrapper	Description
Facet	fromDataModel
AdditionalFacetView*	-
AggregationFunction*	-	A numerical computation to perform on hits matching each value of the facet.

XML representation

<AutoTileGeoV2Facet virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" field="" xMin="-90" yMin="-180" xMax="90" yMax="180" xNbCells="10" yNbCells="10" hitsMaxCategories="100" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></AutoTileGeoV2Facet>

DynamicDateFacet com.exalead.search.v30.DynamicDateFacet

A DynamicDateFacet will generate max 'maxCategories' categories, scaling the resolution to a day, week, month or year level.

Parent objects

FacetsConfig

Attributes

virtualRoot string
id string
type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
refinementPolicy enum(exclusive, disjunctive, norefine)
implementation enum(autodetect, favor_cpu, favor_memory)
inNavigation boolean
inHits boolean
maxSliceCategories int
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
expr string
yearOutputFormat string
quarterOutputFormat string
monthOutputFormat string
weekOutputFormat string
dayOutputFormat string
hourOutputFormat string
minuteOutputFormat string
secondOutputFormat string
enableYearLevel boolean
enableQuarterLevel boolean
enableMonthLevel boolean
enableWeekLevel boolean
enableDayLevel boolean
enableHourLevel boolean
enableMinuteLevel boolean
enableSecondLevel boolean
enableISO8601Compliance boolean
generateMissingIntervals boolean
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean
index int
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
virtualRoot (UI: Virtual root)	string	-	Path for the resulting root 'virtual' category. When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path. For example: Top/VirtualRoot/[0;5]
id	string	-	Unique identifier of this facet. Used in Mashup-Builder and client applications to identify the facet.
type	enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)		Used for convenience in various introspection code
refinementPolicy (UI: Refinement policy)	enum(exclusive, disjunctive, norefine)	exclusive	Defines the Refinements panel behavior. Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected. Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected. No refine: Allows you to display all categories without enabling refinements.
implementation (UI: Implementation)	enum(autodetect, favor_cpu, favor_memory)	autodetect	Internal algorithm for implementation of this facet. Value can be null or one of autodetect favor_cpu favor_memory . "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index). "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set). "autodetect" automatically selects a suitable policy based on the result set and facet data.
inNavigation (UI: Use for navigation)	boolean	True	Computes navigation on this facet.
inHits (UI: Use for hit content)	boolean	True	Computes the value of this facet for each hit.
maxSliceCategories (UI: Max categories per slices)	int	0	Maximum number of categories returned by each slice for one facet. 0 means no limit. Warning: Results merged by the search server may be incorrect.
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
expr	string	-	The date field or virtual expr to use.
yearOutputFormat	string	%Y	The output format used to generate year-based categories.
quarterOutputFormat	string	%Y %Q	The output format used to generate quarter-based categories (use %Q to specify the quarter number).
monthOutputFormat	string	%b %Y	The output format used to generate month-based categories.
weekOutputFormat	string	week %V %Y	The output format used to generate week-based categories.
dayOutputFormat	string	%Y/%m/%d	The output format used to generate day-based categories.
hourOutputFormat	string	%Y/%m/%d %H	The output format used to generate hour-based categories.
minuteOutputFormat	string	%Y/%m/%d %H:%M	The output format used to generate minute-based categories.
secondOutputFormat	string	%Y/%m/%d %H:%M:%S	The output format used to generate second-based categories.
enableYearLevel	boolean	True	Enables the year level
enableQuarterLevel	boolean	False	Enables the quarter level
enableMonthLevel	boolean	True	Enables the month level
enableWeekLevel	boolean	False	Enables the week level
enableDayLevel	boolean	True	Enables the day level
enableHourLevel	boolean	False	Enables the hour level
enableMinuteLevel	boolean	False	Enables the minute level
enableSecondLevel	boolean	False	Enables the second level
enableISO8601Compliance	boolean	True	Enables the ISO 8601 standard instead of the American standard. In ISO 8601, weeks start on Monday and a few rules determine the first and last weeks of each year. Use compatible Output formats. Typically, use `%V` for ISO 8601 instead of `%U` for American, otherwise week numbering will not be consistent with grouping.
generateMissingIntervals	boolean	True	Generates missing dates setting their count to 0 and aggregation values to NO_VALUE
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.
index	int	-	Internal.
dataModelState	string	-
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

Nested elements

fromDataModelFacet
AdditionalFacetView*
AggregationFunction*

Type	Wrapper	Description
Facet	fromDataModel
AdditionalFacetView*	-
AggregationFunction*	-	A numerical computation to perform on hits matching each value of the facet.

XML representation

<DynamicDateFacet virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" expr="" yearOutputFormat="%Y" quarterOutputFormat="%Y %Q" monthOutputFormat="%b %Y" weekOutputFormat="week %V %Y" dayOutputFormat="%Y/%m/%d" hourOutputFormat="%Y/%m/%d %H" minuteOutputFormat="%Y/%m/%d %H:%M" secondOutputFormat="%Y/%m/%d %H:%M:%S" enableYearLevel="true" enableQuarterLevel="false" enableMonthLevel="true" enableWeekLevel="false" enableDayLevel="true" enableHourLevel="false" enableMinuteLevel="false" enableSecondLevel="false" enableISO8601Compliance="true" generateMissingIntervals="true" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></DynamicDateFacet>

DateFacet com.exalead.search.v30.DateFacet

Configuration for a facet.

Parent objects

FacetsConfig

Attributes

virtualRoot string
id string
type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
refinementPolicy enum(exclusive, disjunctive, norefine)
implementation enum(autodetect, favor_cpu, favor_memory)
inNavigation boolean
inHits boolean
maxSliceCategories int
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
expr string
startExpr string
endExpr string
beforeStart boolean
afterEnd boolean
withYear boolean
yearDesc boolean
withMonth boolean
monthDesc boolean
withWeek boolean
weekDesc boolean
withDay boolean
dayDesc boolean
withHour boolean
hourDesc boolean
withMinute boolean
minuteDesc boolean
withSecond boolean
secondDesc boolean
useAsFirstLevel int
maxCategoryTreeDepth int
maxCategoryTreeDepthFromRoot int
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean
index int
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
virtualRoot (UI: Virtual root)	string	-	Path for the resulting root 'virtual' category. When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path. For example: Top/VirtualRoot/[0;5]
id	string	-	Unique identifier of this facet. Used in Mashup-Builder and client applications to identify the facet.
type	enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)		Used for convenience in various introspection code
refinementPolicy (UI: Refinement policy)	enum(exclusive, disjunctive, norefine)	exclusive	Defines the Refinements panel behavior. Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected. Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected. No refine: Allows you to display all categories without enabling refinements.
implementation (UI: Implementation)	enum(autodetect, favor_cpu, favor_memory)	autodetect	Internal algorithm for implementation of this facet. Value can be null or one of autodetect favor_cpu favor_memory . "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index). "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set). "autodetect" automatically selects a suitable policy based on the result set and facet data.
inNavigation (UI: Use for navigation)	boolean	True	Computes navigation on this facet.
inHits (UI: Use for hit content)	boolean	True	Computes the value of this facet for each hit.
maxSliceCategories (UI: Max categories per slices)	int	0	Maximum number of categories returned by each slice for one facet. 0 means no limit. Warning: Results merged by the search server may be incorrect.
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
expr	string	-	The date field or virtual expr on which the synthesis must be performed.
startExpr	string	#datetime(0)	A CONSTANT virtual expression that evaluates the time when the synthesis starts (e.g. #now(), #fromunixts(1304865423) or #datetime(1983,2,28)).
endExpr	string	#datetime(10000)	A CONSTANT virtual expression that evaluates the time when the synthesis ends.
beforeStart	boolean	False	Creates a virtual category for all dates before the start.
afterEnd	boolean	False	Creates a virtual category for all dates after the end.
withYear	boolean	True	Creates a virtual category for each year.
yearDesc	boolean	True
withMonth	boolean	True	Creates a virtual category for each month.
monthDesc	boolean	False
withWeek	boolean	False	Creates a virtual category for each week.
weekDesc	boolean	False
withDay	boolean	True	Creates a virtual category for each day.
dayDesc	boolean	False
withHour	boolean	False	Creates a virtual category for each hour.
hourDesc	boolean	False
withMinute	boolean	False	Creates a virtual category for each minute.
minuteDesc	boolean	False
withSecond	boolean	False	Creates a virtual category for each second.
secondDesc	boolean	False
useAsFirstLevel	int	0	Used internally by multidim refinements to tell which level is the first level below root.
maxCategoryTreeDepth (UI: Max. tree depth from current refinement)	int	0	Maximum depth of the returned category tree. Whatever the level of refinement, there will never be more levels retrieved (this ensures we are not overtaking the maxCategoryTreeDepthFromRoot).
maxCategoryTreeDepthFromRoot (UI: Max. tree depth from root)	int	0	Maximum depth of the returned category tree, starting from the global root of the category. Setting this attribute to N means that no categories will be kept below this level.
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.
index	int	-	Internal.
dataModelState	string	-
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

Nested elements

fromDataModelFacet
AdditionalFacetView*
AggregationFunction*

Type	Wrapper	Description
Facet	fromDataModel
AdditionalFacetView*	-
AggregationFunction*	-	A numerical computation to perform on hits matching each value of the facet.

XML representation

<DateFacet virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" expr="" startExpr="#datetime(0)" endExpr="#datetime(10000)" beforeStart="false" afterEnd="false" withYear="true" yearDesc="true" withMonth="true" monthDesc="false" withWeek="false" weekDesc="false" withDay="true" dayDesc="false" withHour="false" hourDesc="false" withMinute="false" minuteDesc="false" withSecond="false" secondDesc="false" useAsFirstLevel="0" maxCategoryTreeDepth="0" maxCategoryTreeDepthFromRoot="0" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></DateFacet>

ValueFacet com.exalead.search.v30.ValueFacet

Configuration for a facet.

Parent objects

FacetsConfig

Attributes

virtualRoot string
id string
type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
refinementPolicy enum(exclusive, disjunctive, norefine)
implementation enum(autodetect, favor_cpu, favor_memory)
inNavigation boolean
inHits boolean
maxSliceCategories int
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
field string
hitsMaxCategories int
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean
index int
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
virtualRoot (UI: Virtual root)	string	-	Path for the resulting root 'virtual' category. When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path. For example: Top/VirtualRoot/[0;5]
id	string	-	Unique identifier of this facet. Used in Mashup-Builder and client applications to identify the facet.
type	enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)		Used for convenience in various introspection code
refinementPolicy (UI: Refinement policy)	enum(exclusive, disjunctive, norefine)	exclusive	Defines the Refinements panel behavior. Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected. Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected. No refine: Allows you to display all categories without enabling refinements.
implementation (UI: Implementation)	enum(autodetect, favor_cpu, favor_memory)	autodetect	Internal algorithm for implementation of this facet. Value can be null or one of autodetect favor_cpu favor_memory . "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index). "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set). "autodetect" automatically selects a suitable policy based on the result set and facet data.
inNavigation (UI: Use for navigation)	boolean	True	Computes navigation on this facet.
inHits (UI: Use for hit content)	boolean	True	Computes the value of this facet for each hit.
maxSliceCategories (UI: Max categories per slices)	int	0	Maximum number of categories returned by each slice for one facet. 0 means no limit. Warning: Results merged by the search server may be incorrect.
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
field	string	-	The Value field on which to perform the synthesis.
hitsMaxCategories	int	100	Maximum number of categories displayed in hits.
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.
index	int	-	Internal.
dataModelState	string	-
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

Nested elements

fromDataModelFacet
AdditionalFacetView*
AggregationFunction*

Type	Wrapper	Description
Facet	fromDataModel
AdditionalFacetView*	-
AggregationFunction*	-	A numerical computation to perform on hits matching each value of the facet.

XML representation

<ValueFacet virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" field="" hitsMaxCategories="100" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></ValueFacet>

RelatedTermFacet com.exalead.search.v30.RelatedTermFacet

A facet implementing a related-term specific synthesis.

Parent objects

FacetsConfig

Attributes

field string
hitsMaxCategories int
virtualRoot string
id string
type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
refinementPolicy enum(exclusive, disjunctive, norefine)
implementation enum(autodetect, favor_cpu, favor_memory)
inNavigation boolean
inHits boolean
maxSliceCategories int
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
dictionary string
minCorpusFrequency int
corpusLowPassFilter double
localLowPassFilter double
blacklist string
maxRelatedTermsHitsPerSlice int
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean
index int
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
field	string	-	The Value field on which to perform the synthesis.
hitsMaxCategories	int	100	Maximum number of categories displayed in hits.
virtualRoot (UI: Virtual root)	string	-	Path for the resulting root 'virtual' category. When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path. For example: Top/VirtualRoot/[0;5]
id	string	-	Unique identifier of this facet. Used in Mashup-Builder and client applications to identify the facet.
type	enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)		Used for convenience in various introspection code
refinementPolicy (UI: Refinement policy)	enum(exclusive, disjunctive, norefine)	exclusive	Defines the Refinements panel behavior. Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected. Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected. No refine: Allows you to display all categories without enabling refinements.
implementation (UI: Implementation)	enum(autodetect, favor_cpu, favor_memory)	autodetect	Internal algorithm for implementation of this facet. Value can be null or one of autodetect favor_cpu favor_memory . "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index). "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set). "autodetect" automatically selects a suitable policy based on the result set and facet data.
inNavigation (UI: Use for navigation)	boolean	True	Computes navigation on this facet.
inHits (UI: Use for hit content)	boolean	True	Computes the value of this facet for each hit.
maxSliceCategories (UI: Max categories per slices)	int	0	Maximum number of categories returned by each slice for one facet. 0 means no limit. Warning: Results merged by the search server may be incorrect.
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
dictionary	string	-	The dictionary containing keywords.
minCorpusFrequency	int	2	Minimum number of occurrences in the whole index for a term to be possibly selected for synthesis.
corpusLowPassFilter	double	0.5	Filters out terms occurring more than this threshold in the whole index.
localLowPassFilter	double	0.9	Filters out terms occurring more than this threshold in the result set.
blacklist	string	-	The related-terms blocklist.
maxRelatedTermsHitsPerSlice (UI: Max hits per slice)	int	5000	Maximum number of hits per slice used to compute related terms synthesis. Set to 0 for exhaustive. Note that the number of hits processed per slice to compute related terms synthesis will never exceed FacetsConfig's maxHits.
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.
index	int	-	Internal.
dataModelState	string	-
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

Nested elements

fromDataModelFacet
AdditionalFacetView*
AggregationFunction*

Type	Wrapper	Description
Facet	fromDataModel
AdditionalFacetView*	-
AggregationFunction*	-	A numerical computation to perform on hits matching each value of the facet.

XML representation

<RelatedTermFacet field="" hitsMaxCategories="100" virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" dictionary="" minCorpusFrequency="2" corpusLowPassFilter="0.5" localLowPassFilter="0.9" maxRelatedTermsHitsPerSlice="5000" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></RelatedTermFacet>

ConcatValueFacet com.exalead.search.v30.ConcatValueFacet

Configuration for a facet.

Parent objects

FacetsConfig

Attributes

virtualRoot string
id string
type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
refinementPolicy enum(exclusive, disjunctive, norefine)
implementation enum(autodetect, favor_cpu, favor_memory)
inNavigation boolean
inHits boolean
maxSliceCategories int
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
fieldA string
fieldB string
hitsMaxCategories int
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean
index int
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
virtualRoot (UI: Virtual root)	string	-	Path for the resulting root 'virtual' category. When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path. For example: Top/VirtualRoot/[0;5]
id	string	-	Unique identifier of this facet. Used in Mashup-Builder and client applications to identify the facet.
type	enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)		Used for convenience in various introspection code
refinementPolicy (UI: Refinement policy)	enum(exclusive, disjunctive, norefine)	exclusive	Defines the Refinements panel behavior. Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected. Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected. No refine: Allows you to display all categories without enabling refinements.
implementation (UI: Implementation)	enum(autodetect, favor_cpu, favor_memory)	autodetect	Internal algorithm for implementation of this facet. Value can be null or one of autodetect favor_cpu favor_memory . "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index). "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set). "autodetect" automatically selects a suitable policy based on the result set and facet data.
inNavigation (UI: Use for navigation)	boolean	True	Computes navigation on this facet.
inHits (UI: Use for hit content)	boolean	True	Computes the value of this facet for each hit.
maxSliceCategories (UI: Max categories per slices)	int	0	Maximum number of categories returned by each slice for one facet. 0 means no limit. Warning: Results merged by the search server may be incorrect.
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
fieldA	string	-	The Value field on which to perform the synthesis.
fieldB	string	-	The Value field on which to perform the synthesis.
hitsMaxCategories	int	100	Maximum number of categories displayed in hits.
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.
index	int	-	Internal.
dataModelState	string	-
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

Nested elements

fromDataModelFacet
AdditionalFacetView*
AggregationFunction*

Type	Wrapper	Description
Facet	fromDataModel
AdditionalFacetView*	-
AggregationFunction*	-	A numerical computation to perform on hits matching each value of the facet.

XML representation

<ConcatValueFacet virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" fieldA="" fieldB="" hitsMaxCategories="100" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></ConcatValueFacet>

LuaFacet com.exalead.search.v30.LuaFacet

A virtual facet implemented using the Lua scripting language.

Parent objects

FacetsConfig

Attributes

virtualRoot string
id string
type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
refinementPolicy enum(exclusive, disjunctive, norefine)
implementation enum(autodetect, favor_cpu, favor_memory)
inNavigation boolean
inHits boolean
maxSliceCategories int
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
luaCode string
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean
index int
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
virtualRoot (UI: Virtual root)	string	-	Path for the resulting root 'virtual' category. When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path. For example: Top/VirtualRoot/[0;5]
id	string	-	Unique identifier of this facet. Used in Mashup-Builder and client applications to identify the facet.
type	enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)		Used for convenience in various introspection code
refinementPolicy (UI: Refinement policy)	enum(exclusive, disjunctive, norefine)	exclusive	Defines the Refinements panel behavior. Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected. Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected. No refine: Allows you to display all categories without enabling refinements.
implementation (UI: Implementation)	enum(autodetect, favor_cpu, favor_memory)	autodetect	Internal algorithm for implementation of this facet. Value can be null or one of autodetect favor_cpu favor_memory . "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index). "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set). "autodetect" automatically selects a suitable policy based on the result set and facet data.
inNavigation (UI: Use for navigation)	boolean	True	Computes navigation on this facet.
inHits (UI: Use for hit content)	boolean	True	Computes the value of this facet for each hit.
maxSliceCategories (UI: Max categories per slices)	int	0	Maximum number of categories returned by each slice for one facet. 0 means no limit. Warning: Results merged by the search server may be incorrect.
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
luaCode	string	-	The lua code for the facet.
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.
index	int	-	Internal.
dataModelState	string	-
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

Nested elements

fromDataModelFacet
AdditionalFacetView*
AggregationFunction*

Type	Wrapper	Description
Facet	fromDataModel
AdditionalFacetView*	-
AggregationFunction*	-	A numerical computation to perform on hits matching each value of the facet.

XML representation

<LuaFacet virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" luaCode="" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></LuaFacet>

Hierarchical2DFacet com.exalead.search.v30.Hierarchical2DFacet

Configuration for a facet.

Parent objects

FacetsConfig

Attributes

virtualRoot string
id string
type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
refinementPolicy enum(exclusive, disjunctive, norefine)
implementation enum(autodetect, favor_cpu, favor_memory)
inNavigation boolean
inHits boolean
maxSliceCategories int
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
id1 string
id2 string
withDimSwitch boolean
secondarySortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
secondaryReverse boolean
secondarySortAggregationFunction string
secondaryExplicitSortOrderValues string
hierarchical2DFacetRepresentation boolean
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean
index int
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
virtualRoot (UI: Virtual root)	string	-	Path for the resulting root 'virtual' category. When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path. For example: Top/VirtualRoot/[0;5]
id	string	-	Unique identifier of this facet. Used in Mashup-Builder and client applications to identify the facet.
type	enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)		Used for convenience in various introspection code
refinementPolicy (UI: Refinement policy)	enum(exclusive, disjunctive, norefine)	exclusive	Defines the Refinements panel behavior. Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected. Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected. No refine: Allows you to display all categories without enabling refinements.
implementation (UI: Implementation)	enum(autodetect, favor_cpu, favor_memory)	autodetect	Internal algorithm for implementation of this facet. Value can be null or one of autodetect favor_cpu favor_memory . "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index). "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set). "autodetect" automatically selects a suitable policy based on the result set and facet data.
inNavigation (UI: Use for navigation)	boolean	True	Computes navigation on this facet.
inHits (UI: Use for hit content)	boolean	True	Computes the value of this facet for each hit.
maxSliceCategories (UI: Max categories per slices)	int	0	Maximum number of categories returned by each slice for one facet. 0 means no limit. Warning: Results merged by the search server may be incorrect.
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
id1	string	-	id of the first facet on which to perform the synthesis
id2	string	-	id of the second facet on which to perform the synthesis
withDimSwitch	boolean	True	Adds a '__dimension_switch__' virtual category between categories of the first and second dimensions.
secondarySortFunction	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	The function to use to sort on the second dimension (see 'sortFunction' in Facet).
secondaryReverse	boolean	False	Reverses the secondary sort (see 'reverse' in Facet).
secondarySortAggregationFunction	string	-	The aggregation function used to sort by aggregation values.
secondaryExplicitSortOrderValues	string	-	The ordered list of values used to sort, comma-separated (use the backslash to escape a comma)
hierarchical2DFacetRepresentation	boolean	True	Enables the use of asHierarchical2DFacet() in search client.
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.
index	int	-	Internal.
dataModelState	string	-
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

Nested elements

fromDataModelFacet
AdditionalFacetView*
AggregationFunction*

Type	Wrapper	Description
Facet	fromDataModel
AdditionalFacetView*	-
AggregationFunction*	-	A numerical computation to perform on hits matching each value of the facet.

XML representation

<Hierarchical2DFacet virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" id1="" id2="" withDimSwitch="true" secondarySortFunction="count" secondaryReverse="false" hierarchical2DFacetRepresentation="true" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></Hierarchical2DFacet>

OptimizedEnumFacet com.exalead.search.v30.OptimizedEnumFacet

Configuration for a facet.

Parent objects

FacetsConfig

Attributes

id string
type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
refinementPolicy enum(exclusive, disjunctive, norefine)
implementation enum(autodetect, favor_cpu, favor_memory)
inNavigation boolean
inHits boolean
maxSliceCategories int
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
virtualRoot string
enumFacetId string
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean
index int
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
id	string	-	Unique identifier of this facet. Used in Mashup-Builder and client applications to identify the facet.
type	enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)		Used for convenience in various introspection code
refinementPolicy (UI: Refinement policy)	enum(exclusive, disjunctive, norefine)	exclusive	Defines the Refinements panel behavior. Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected. Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected. No refine: Allows you to display all categories without enabling refinements.
implementation (UI: Implementation)	enum(autodetect, favor_cpu, favor_memory)	autodetect	Internal algorithm for implementation of this facet. Value can be null or one of autodetect favor_cpu favor_memory . "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index). "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set). "autodetect" automatically selects a suitable policy based on the result set and facet data.
inNavigation (UI: Use for navigation)	boolean	True	Computes navigation on this facet.
inHits (UI: Use for hit content)	boolean	True	Computes the value of this facet for each hit.
maxSliceCategories (UI: Max categories per slices)	int	0	Maximum number of categories returned by each slice for one facet. 0 means no limit. Warning: Results merged by the search server may be incorrect.
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
virtualRoot	string	-	The root of the resulting 'virtual' category.
enumFacetId	string	-	Id of the referred EnumFacet.
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.
index	int	-	Internal.
dataModelState	string	-
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

Nested elements

fromDataModelFacet
AdditionalFacetView*
AggregationFunction*

Type	Wrapper	Description
Facet	fromDataModel
AdditionalFacetView*	-
AggregationFunction*	-	A numerical computation to perform on hits matching each value of the facet.

XML representation

<OptimizedEnumFacet id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" virtualRoot="" enumFacetId="" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></OptimizedEnumFacet>

MultiDimensionFacet com.exalead.search.v30.MultiDimensionFacet

Configuration for a facet.

Parent objects

FacetsConfig

Attributes

id string
type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
refinementPolicy enum(exclusive, disjunctive, norefine)
implementation enum(autodetect, favor_cpu, favor_memory)
inNavigation boolean
inHits boolean
maxSliceCategories int
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
aggregateScores boolean
virtualRoot string
additionalTreeRepresentation boolean
withAdditionalTreeRepresentationDimSwitch boolean
minDocsPerCategory int
maxCategories int
maxCategoriesPerLevel int
sortAggregationFunction string
explicitSortOrderValues string
reverse boolean
index int
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
id	string	-	Unique identifier of this facet. Used in Mashup-Builder and client applications to identify the facet.
type	enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)		Used for convenience in various introspection code
refinementPolicy (UI: Refinement policy)	enum(exclusive, disjunctive, norefine)	exclusive	Defines the Refinements panel behavior. Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected. Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected. No refine: Allows you to display all categories without enabling refinements.
implementation (UI: Implementation)	enum(autodetect, favor_cpu, favor_memory)	autodetect	Internal algorithm for implementation of this facet. Value can be null or one of autodetect favor_cpu favor_memory . "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index). "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set). "autodetect" automatically selects a suitable policy based on the result set and facet data.
inNavigation (UI: Use for navigation)	boolean	True	Computes navigation on this facet.
inHits (UI: Use for hit content)	boolean	True	Computes the value of this facet for each hit.
maxSliceCategories (UI: Max categories per slices)	int	0	Maximum number of categories returned by each slice for one facet. 0 means no limit. Warning: Results merged by the search server may be incorrect.
sortFunction (UI: Sort by)	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	Specifies a sorting function: Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending. Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending. Count: Sorts the categories by decreasing order, with the number of documents matching the query and having this facet. Date: Sorts the categories by: Default with decreasing year, increasing month, increasing day Reverse with increasing year, decreasing month, decreasing day Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values. Latitude: Sorts the categories by latitude, using the average of points. Longitude: Sorts the categories by longitude, using the average of points. Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number it will parse the prefix. In case of failure, it fallbacks to lexicographical sorting. Default sorting direction is ascending. Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending. Relevance: Sorts the categories by decreasing relevance. Relevance is defined by taking into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a method of ponderation similar to the classical TF-IDF.
aggregateScores (UI: Aggregate scores)	boolean	False	If selected, the category score will be the sum of the scores of all its hits.
virtualRoot	string	-	The virtual root used for the tree representation.
additionalTreeRepresentation	boolean	False
withAdditionalTreeRepresentationDimSwitch	boolean	False	Adds a '__dimension_switch__' virtual category between each dimension of the tree reprentation. This is required if you're working with hierarhical dimensions.
minDocsPerCategory (UI: Min. docs per category)	int	1	Minimum number of docs in a category for it to be included in the Refinements panel.
maxCategories (UI: Max. categories)	int	0	Maximum number of categories to display in the facet. 0 means all categories will be displayed.
maxCategoriesPerLevel (UI: Max. categories per level)	int	100	Maximum number of facet values in each level of the categories hierarchy. 0 means that all values are selected. The sortFunction is used to select the "best" facet values for each level.
sortAggregationFunction (UI: Aggregation function)	string	-	Aggregation function used to sort by aggregation values.
explicitSortOrderValues (UI: Explicit order)	string	-	Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
reverse (UI: Reverse)	boolean	False	Reverse sorting order.
index	int	-	Internal.
dataModelState	string	-
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

Nested elements

fromDataModelFacet
AdditionalFacetView*
AggregationFunction*
MultiFacetDimension*

Type	Wrapper	Description
Facet	fromDataModel
AdditionalFacetView*	-
AggregationFunction*	-	A numerical computation to perform on hits matching each value of the facet.
MultiFacetDimension*	-

XML representation

<MultiDimensionFacet id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" virtualRoot="" additionalTreeRepresentation="false" withAdditionalTreeRepresentationDimSwitch="false" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></MultiDimensionFacet>

MultiFacetDimension com.exalead.search.v30.MultiFacetDimension

Determines the features of one dimension for a MultiDimensionFacet.

Parent objects

MultiDimensionFacet

Attributes

id string
sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
reverse boolean
sortAggregationFunction string
explicitSortOrderValues string
maxCategories int

Name	Type	Default value	Description
id	string	-	The existing 1d facet id to use for this dimension.
sortFunction	enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)	count	The sort function on this dimension (does not apply to other dimensions).
reverse	boolean	False	Reverses the sort (see 'reverse' in Facet).
sortAggregationFunction	string	-	The aggregation function used to sort by aggregation values.
explicitSortOrderValues	string	-	The ordered list of values used to sort, comma-separated (use the backslash to escape a comma).
maxCategories	int	0	The maximum number of categories after sorting in this dimension.

XML representation

<MultiFacetDimension id="" sortFunction="count" reverse="false" maxCategories="0" ></MultiFacetDimension>

HitConfig com.exalead.search.v30.HitConfig

No documentation for this element.

Parent objects

SearchLogic

Attributes

fullHits int

Name	Type	Default value	Description
fullHits	int	10	Number of full hits to retrieve per page. -1 means unlimited.

Nested elements

AdvancedPipeline
DataModelClassMetaInformation
Meta*
Metas*

Type	Wrapper	Description
AdvancedPipeline	-
DataModelClassMetaInformation	-
Meta*	-
Metas*	-

XML representation

<HitConfig fullHits="10" ></HitConfig>

AdvancedPipeline com.exalead.search.v30.AdvancedPipeline

No documentation for this element.

Nested elements

CSVMultiField*
HitOperation*
MultiContextField*
RankingField*
RetrievedField*

Type	Wrapper	Description
CSVMultiField*	-
HitOperation*	-
MultiContextField*	-
RankingField*	-
RetrievedField*	-

XML representation

<AdvancedPipeline ></AdvancedPipeline>

CSVMultiField com.exalead.search.v30.CSVMultiField

No documentation for this element.

Parent objects

AdvancedPipeline

Attributes

indexField string

Name	Type	Default value	Description
indexField	string	-	CSV index field to retrieve.

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-

XML representation

<CSVMultiField indexField="" ></CSVMultiField>

StandardDecodingOperation com.exalead.search.v30.StandardDecodingOperation

Decodes url or idna encoded strings

Parent objects

Meta

Attributes

name string
encoding enum(url, idna)

Name	Type	Default value	Description
name	string	-	Name of the meta subject of the operation. Not required when this is inside a 'Meta' tag.
encoding	enum(url, idna)		Value can be one of url idna

XML representation

<StandardDecodingOperation name="" encoding="" ></StandardDecodingOperation>

ValueDisplaySelectionOperation com.exalead.search.v30.ValueDisplaySelectionOperation

No documentation for this element.

Parent objects

Meta

Attributes

name string
minValues int
maxValues int
maxBytesToProcess int
relaxAndNodesPolicy enum(never, toplevel, always)
facetIds string
extraPrefixHandlers string
analysisConfig string
analysisPipeline string

Name	Type	Default value	Description
name	string	-	Name of the meta subject of the operation. Not required when this is inside a 'Meta' tag.
minValues	int	0	Minimum number of values to select.
maxValues	int	50	Maximum number of values to select.
maxBytesToProcess	int	131072	Maximum number of bytes to process in the input text. The remaining bytes are not taken into account for the value selection operation. (0=deactivated)
relaxAndNodesPolicy	enum(never, toplevel, always)	never	Relax query AND nodes (handle them as if they were OR nodes). Value can be null or one of never toplevel always
facetIds	string
extraPrefixHandlers	string
analysisConfig	string	-	Analysis config in which analysisPipeline is searched for tokenization config and extra semantic processors.
analysisPipeline	string	-	Pipeline in analysisConfig searched for tokenization config and extra semantic processors.

XML representation

<ValueDisplaySelectionOperation name="" minValues="0" maxValues="50" maxBytesToProcess="131072" relaxAndNodesPolicy="never" facetIds="" extraPrefixHandlers="" analysisConfig="" analysisPipeline="" ></ValueDisplaySelectionOperation>

SnippetOperation com.exalead.search.v30.SnippetOperation

No documentation for this element.

Parent objects

Meta

Attributes

name string
minLength int
maxLength int
maxSentenceSegmentLength int
minNbWordsInRelevantSentence int
maxSentenceSegments int
maxBytesToProcess int
removeDuplicateSegments boolean
maxConsecutiveSeparators int
splitOnSentences boolean
highlight boolean
minimalPatternHighlight boolean
relaxAndNodesPolicy enum(never, toplevel, always)
ifMeta string
ifMetasMatch string
highlightFacetIds string
highlightExtraPrefixHandlers string
analysisConfig string
analysisPipeline string

Name	Type	Default value	Description
name	string	-	Name of the meta subject of the operation. Not required when this is inside a 'Meta' tag.
minLength	int	100	Minimum length of the snippet (in characters).
maxLength	int	150	Maximum length of the snippet (in characters).
maxSentenceSegmentLength	int	150	Maximum length of sentence segments in the snippet (in characters).
minNbWordsInRelevantSentence	int	10	Minimum number of words for a sentence to be considered for snippetization
maxSentenceSegments	int	3	Maximum number of sentence segments in the snippet.
maxBytesToProcess	int	131072	Maximum number of bytes to process in the input text. The remaining bytes are not taken into account for the snippet operation. (0=deactivated)
removeDuplicateSegments	boolean	False	Remove duplicate segments from the snippet. Using this option with a large number of segments may have a negative impact on performance.
maxConsecutiveSeparators	int	0	Maximum number of consecutive separators. When there are more than this number of separators, they are replaced with a single space character.
splitOnSentences	boolean	True	Don't span over sentence boundaries.
highlight	boolean	True	Highlight the snippet's tokens matching the query.
minimalPatternHighlight	boolean	False	If true, only highlights the alphanumerical part of the pattern. For example, foo will only highlight foo in the word football.
relaxAndNodesPolicy	enum(never, toplevel, always)	toplevel	Relax query AND nodes (handle them as if they were OR nodes). Value can be null or one of never toplevel always
ifMeta	string	-	Fallback meta if one of the ifMetasMatch meta matches the query.
ifMetasMatch	string		Comma-separated list of metas to test against the user query. If a match is found, the summary is filled with the ifMeta value.
highlightFacetIds	string		Comma-separated list of facets id for which you want to highlight the values of the refines in the current meta.
highlightExtraPrefixHandlers	string		Comma-separated list of prefix handlers for which you want to highlight the query in the current meta. eg: if highlightExtraPrefixHandlers="spellslike" spellslike:foo will highlight foo and its alternate forms in the current meta
analysisConfig	string	-	Analysis config in which analysisPipeline is searched for tokenization config and extra semantic processors.
analysisPipeline	string	-	Pipeline in analysisConfig searched for tokenization config and extra semantic processors.

XML representation

<SnippetOperation name="" minLength="100" maxLength="150" maxSentenceSegmentLength="150" minNbWordsInRelevantSentence="10" maxSentenceSegments="3" maxBytesToProcess="131072" removeDuplicateSegments="false" maxConsecutiveSeparators="0" splitOnSentences="true" highlight="true" minimalPatternHighlight="false" relaxAndNodesPolicy="toplevel" ifMetasMatch="" highlightFacetIds="" highlightExtraPrefixHandlers="" analysisConfig="" analysisPipeline="" ></SnippetOperation>

OctreeJSONOperation com.exalead.search.v30.OctreeJSONOperation

No documentation for this element.

Parent objects

Meta

Attributes

name string

Name	Type	Default value	Description
name	string	-	Name of the meta subject of the operation. Not required when this is inside a 'Meta' tag.

XML representation

<OctreeJSONOperation name="" ></OctreeJSONOperation>

MatrixOperation com.exalead.search.v30.MatrixOperation

No documentation for this element.

Parent objects

Meta

Attributes

name string

Name	Type	Default value	Description
name	string	-	Name of the meta subject of the operation. Not required when this is inside a 'Meta' tag.

XML representation

<MatrixOperation name="" ></MatrixOperation>

DateFormatOperation com.exalead.search.v30.DateFormatOperation

No documentation for this element.

Parent objects

Meta

Attributes

name string
outputFormat string

Name	Type	Default value	Description
name	string	-	Name of the meta subject of the operation. Not required when this is inside a 'Meta' tag.
outputFormat	string	%Y/%m/%d	Date format

XML representation

<DateFormatOperation name="" outputFormat="%Y/%m/%d" ></DateFormatOperation>

TimeFormatOperation com.exalead.search.v30.TimeFormatOperation

No documentation for this element.

Parent objects

Meta

Attributes

name string
outputFormat string

Name	Type	Default value	Description
name	string	-	Name of the meta subject of the operation. Not required when this is inside a 'Meta' tag.
outputFormat	string	%Y/%m/%d %H:%M:%S	Datetime format

XML representation

<TimeFormatOperation name="" outputFormat="%Y/%m/%d %H:%M:%S" ></TimeFormatOperation>

PrintfOperation com.exalead.search.v30.PrintfOperation

No documentation for this element.

Parent objects

Meta

Attributes

name string
outputFormat string

Name	Type	Default value	Description
name	string	-	Name of the meta subject of the operation. Not required when this is inside a 'Meta' tag.
outputFormat	string	-	Prints Meta according to this format, controls the output as in C printf.

XML representation

<PrintfOperation name="" outputFormat="" ></PrintfOperation>

HighlightOperation com.exalead.search.v30.HighlightOperation

No documentation for this element.

Parent objects

Meta

Attributes

name string
highlightFacetIds string
highlightExtraPrefixHandlers string
minNbWordsInRelevantSentence int
maxBytesToProcess int
maxConsecutiveSeparators int
relaxAndNodesPolicy enum(never, toplevel, always)
minimalPatternHighlight boolean
analysisConfig string
analysisPipeline string

Name	Type	Default value	Description
name	string	-	Name of the meta subject of the operation. Not required when this is inside a 'Meta' tag.
highlightFacetIds	string
highlightExtraPrefixHandlers	string
minNbWordsInRelevantSentence	int	10	Minimum number of words for a sentence to be considered for snippetization
maxBytesToProcess	int	131072	Maximum number of bytes to process in the input text. The remaining bytes are not taken into account for the operation. (0=deactivated)
maxConsecutiveSeparators	int	0	Maximum number of consecutive separators. When there are more than this number of separators, they are replaced with a single space character.
relaxAndNodesPolicy	enum(never, toplevel, always)	toplevel	Relax query AND nodes (handle them as if they were OR nodes). Value can be null or one of never toplevel always
minimalPatternHighlight	boolean	False	If true, only highlights the alphanumerical part of the pattern. For example, foo will only highlight foo in the word football.
analysisConfig	string	-	Analysis config in which analysisPipeline is searched for tokenization config and extra semantic processors.
analysisPipeline	string	-	Pipeline in analysisConfig searched for tokenization config and extra semantic processors.

XML representation

<HighlightOperation name="" highlightFacetIds="" highlightExtraPrefixHandlers="" minNbWordsInRelevantSentence="10" maxBytesToProcess="131072" maxConsecutiveSeparators="0" relaxAndNodesPolicy="toplevel" minimalPatternHighlight="false" analysisConfig="" analysisPipeline="" ></HighlightOperation>

TruncateOperation com.exalead.search.v30.TruncateOperation

Truncates a meta value if it exceeds a configured threshold value.

Parent objects

Meta

Attributes

name string
maxLength int
strict boolean

Name	Type	Default value	Description
name	string	-	Name of the meta subject of the operation. Not required when this is inside a 'Meta' tag.
maxLength	int	150	Max character length after which the content will be truncated.
strict	boolean	False	Truncates even if a blank character is not met.

XML representation

<TruncateOperation name="" maxLength="150" strict="false" ></TruncateOperation>

CustomMetaOperation com.exalead.search.v30.CustomMetaOperation

No documentation for this element.

Parent objects

Meta

Attributes

name string
classId string

Name	Type	Default value	Description
name	string	-	Name of the meta subject of the operation. Not required when this is inside a 'Meta' tag.
classId	string	-	The fully-qualified name of the class performing the operation.

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-

XML representation

<CustomMetaOperation name="" classId="" ></CustomMetaOperation>

InsertHitInfoFromFacets com.exalead.search.v30.InsertHitInfoFromFacets

Insert hit information retrieved from facets.

Nested elements

replaceKeyValue*
addKeyValue*

Type	Wrapper	Description
KeyValue*	replace	Key = hit info key to create. Value = facet id to fetch from then remove
KeyValue*	add	Key = hit info key to create. Value = facet id to fetch from

XML representation

<InsertHitInfoFromFacets ></InsertHitInfoFromFacets>

SourceFetchabilityInfosInserter com.exalead.search.v30.SourceFetchabilityInfosInserter

Inserts in the hit information whether the source supports fetch, thumbnails and preview

XML representation

<SourceFetchabilityInfosInserter ></SourceFetchabilityInfosInserter>

CustomHitOperation com.exalead.search.v30.CustomHitOperation

No documentation for this element.

Parent objects

AdvancedPipeline

Attributes

classId string

Name	Type	Default value	Description
classId	string	-	The fully-qualified name of the class performing the operation.

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-

XML representation

<CustomHitOperation classId="" ></CustomHitOperation>

SimilarDocumentOperation com.exalead.search.v30.SimilarDocumentOperation

No documentation for this element.

Parent objects

AdvancedPipeline

Attributes

Name	Type	Default value	Description
prefixHandlerName	string	similar	The prefix handler name used in the generated query.
metaName	string	similardoc_query	Name of the generated meta containing the resulting query.
minNumberOfKeywordPerDoc	long	4	Documents with fewer keywords than this constant are ignored. Value must be higher or equal to 0.
similarityVirtualField	string	similarity	Name of the virtual field computing the final similarity score.
similarityThreshold	double	-	The mininum similarity score for two documents to be considered similar. Value must be between 0 and 1.
languageConstraint	boolean	True	Forces similar documents to be in the same language.
keywordsMetaName	string	keyword	Name of the retrieved meta containing the keywords.

XML representation

<SimilarDocumentOperation prefixHandlerName="similar" metaName="similardoc_query" minNumberOfKeywordPerDoc="4" similarityVirtualField="similarity" similarityThreshold="" languageConstraint="true" keywordsMetaName="keyword" ></SimilarDocumentOperation>

MultiContextField com.exalead.search.v30.MultiContextField

No documentation for this element.

Parent objects

AdvancedPipeline

Attributes

indexField string
name string
type string
contextName string
keepMetaName boolean

Name	Type	Default value	Description
indexField	string	-	Index field or Virtual Expression to retrieve.
name	string	-	Name of the meta to create from this field. If not set, the index field name is used.
type	string	-	The type of the field to retrieve FIXME
contextName	string	-
keepMetaName	boolean	True	When set to true, a key:value stored in the field will create a meta "key" which value is "value". Otherwise, this will generate a meta "name" which value is "value".

Nested elements

MatchRule*

Type	Wrapper	Description
MatchRule*	-	List of rules to define which document metas should be mapped to this dynamic property.

XML representation

<MultiContextField indexField="" name="" type="" contextName="" keepMetaName="true" ></MultiContextField>

MatchRule com.exalead.search.v30.MatchRule

No documentation for this element.

Parent objects

DynamicFieldElementSource
MultiContextField

Attributes

pattern string
mode enum(prefix, suffix, substring, exact)

Name	Type	Default value	Description
pattern	string	-	If null, property name is used
mode	enum(prefix, suffix, substring, exact)

XML representation

<MatchRule pattern="" mode="" ></MatchRule>

RankingField com.exalead.search.v30.RankingField

No documentation for this element.

Parent objects

AdvancedPipeline

Attributes

expr string

Name	Type	Default value	Description
expr	string	-

XML representation

<RankingField expr="" ></RankingField>

RetrievedField com.exalead.search.v30.RetrievedField

No documentation for this element.

Parent objects

AdvancedPipeline

Attributes

indexField string
name string
type string
multivalued boolean
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
indexField	string	-	Index field or Virtual Expression to retrieve.
name	string	-	Name of the meta to create from this field. If not set, the index field name is used.
type	string	-	The type of the field to retrieve FIXME
multivalued	boolean	False	Is the field to retrieve multivalued? FIXME
dataModelState	string	-	Is this retrieved field managed by a data model? @enum{null,auto,customized, error}. If null, this retrieved field is not related to a data model. If "auto", this retrieved field is auto-generated by a data model If "customized", this retrieved field was auto-generated by a data model and then customized. If "error", there is a conflict between this retrieved field and the data model.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this RetrievedField.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this RetrievedField.

Nested elements

fromDataModelRetrievedField

Type	Wrapper	Description
RetrievedField	fromDataModel	If dataModelState is "customized", you will find here the original retrieved field generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<RetrievedField indexField="" name="" type="" multivalued="false" dataModelState="" dataModelClass="" dataModelProperty="" ></RetrievedField>

DataModelClassMetaInformation com.exalead.search.v30.DataModelClassMetaInformation

No documentation for this element.

Parent objects

HitConfig

Attributes

categoryFieldName string
categoryRoot string
categoryForm enum(exact, lowercase, normalized)

Name	Type	Default value	Description
categoryFieldName	string	categories
categoryRoot	string	Top/dataModelClass
categoryForm	enum(exact, lowercase, normalized)	normalized

XML representation

<DataModelClassMetaInformation categoryFieldName="categories" categoryRoot="Top/dataModelClass" categoryForm="normalized" ></DataModelClassMetaInformation>

Meta com.exalead.search.v30.Meta

No documentation for this element.

Parent objects

HitConfig

Attributes

name string
dataModelState string
dataModelClass string
dataModelProperty string

Name	Type	Default value	Description
name	string	-
dataModelState	string	-	Is this meta managed by a data model? @enum{null,auto,customized, error}. If null, this hit category group is not related to a data model. If "auto", this hit category group is auto-generated by a data model If "customized", this hit category group was auto-generated by a data model and then customized. If "error", there is a conflict between this hit category group and the data model
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this meta.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this meta.

Nested elements

fromDataModelMeta
MetaSource*
MetaSpecificOperation*

Type	Wrapper	Description
Meta	fromDataModel
MetaSource*	-
MetaSpecificOperation*	-

XML representation

<Meta name="" dataModelState="" dataModelClass="" dataModelProperty="" ></Meta>

FieldSource com.exalead.search.v30.FieldSource

Specify from which index field the meta will be retrieved.

Parent objects

Meta

Attributes

indexField string

Name	Type	Default value	Description
indexField	string	-	Index field, Virtual field or Virtual Expression for this meta source.

XML representation

<FieldSource indexField="" ></FieldSource>

DynamicFieldElementSource com.exalead.search.v30.DynamicFieldElementSource

Specifies from which index dynamic field the meta will be retrieved.

Parent objects

Meta

Attributes

indexField string
name string
keepMetaName boolean

Name	Type	Default value	Description
indexField	string	-	Dynamic index field for this meta source.
name	string	-	Specifies the meta name used to retrieve the value in the Dynamic Index Field. If null, it retrieves all meta/value pairs.
keepMetaName	boolean	True	When set to true, a key:value stored in the field will create a meta "key" which value is "value". Otherwise, this will generate a meta "name" which value is "value".

Nested elements

MatchRule*

Type	Wrapper	Description
MatchRule*	-	List of rules to define which metas contained in this dynamic field will be processed and displayed.

XML representation

<DynamicFieldElementSource indexField="" name="" keepMetaName="true" ></DynamicFieldElementSource>

MultiFieldElementSource com.exalead.search.v30.MultiFieldElementSource

Specifies from which CSV-encoded index field the meta will be retrieved.

Parent objects

Meta

Attributes

indexField string
metaNameInMultiField string

Name	Type	Default value	Description
indexField	string	-	CSV-encoded index field for this meta source.
metaNameInMultiField	string	-	CSV column name.

XML representation

<MultiFieldElementSource indexField="" metaNameInMultiField="" ></MultiFieldElementSource>

Metas com.exalead.search.v30.Metas

No documentation for this element.

Nested elements

MetasSource*

Type	Wrapper	Description
MetasSource*	-

XML representation

<Metas ></Metas>

MultiFieldSource com.exalead.search.v30.MultiFieldSource

Specifies from which CSV-encoded index field the metas will be retrieved.
Retrieves all columns once.

Parent objects

Metas

Attributes

indexField string

Name	Type	Default value	Description
indexField	string	-	CSV-encoded index field for this meta source.

XML representation

<MultiFieldSource indexField="" ></MultiFieldSource>

LimitsConfig com.exalead.search.v30.LimitsConfig

No documentation for this element.

Parent objects

SearchLogic

Attributes

Name	Type	Default value	Description
maxKeptHits	int	-	Number of "best" hits that will be kept for navigation and full hits. It limits the number of hits that can be used for navigation and full hits fetch. Reducing this parameter can slightly increase performance in some situations (many slices, IO-bound). The "nhits" value in answer is at most equal to this limit. 0 means no limit. Note: Heapsort executor is used if this setting is defined in ranked mode. If not in ranked mode, this setting is ignored.
maxHitsPerSlice	int	-	Maximum number of hits that a single index slice can fetch.
mainHeapFlushInterval	int	-	Unused setting
sliceHeapFlushInterval	int	-	Each time the slice heap receives this number of hits, it will be flushed to the main heap. This parameter only impacts performance. If value is 0, the system will pick up an appropriate value. Note: Only used in unranked and heapsort. Not used in localsort.
maxQueryTimeMS	int	-	Maximum time spent in the "hits matching" part of the search internal "query" part of the search. After this delay, hits matching is interrupted, partial results will be returned. 0 means no limit.
maxFetchedHits	int	-	When this total number of hits matching the query have been found (on any slice), the query is interrupted. These hits are ranked and the "best" ones are selected for navigation and full hits. Reducing this parameter improves performance but it means that a potentially "better" hit located further in the index will not be found. Above this limit, the "nmatches" value in answer will be estimated. 0 means no limit.
enoughSlicesPercentage	int	100	The query is interrupted if at least this percentage of active slices finished fetching hits.
interruptGraceDelayMS	int	100	After the query is interrupted, the slices have a small time period to flush their heaps. Hits that arrive to the main heap after this time period are dropped.
maxTotalTimeMS	int	-	Maximum time for the entire search (hits matching, navigation and hits computation). After this delay, the search query is aborted. 0 means no limit.
fullGraceDelayMS	int	500	Additional time after maxTotalTime before forcing the full hits processor to stop. Note that this elapsed time is only checked between two full hits.
hitsSampling	double	100.0	Only fetch a sample of the matches. This is useful to improve search performance while still getting a good hits distribution.
usesConsecutiveQueries	boolean	False	Set this optio to True when you plan to use consecutive queries, then the dedup will occur.
maxPathsToConsider	long	20000000	If this value is not zero, when doing expand queries without using the "expand=true" mode, the slices will only consider the first maxExploredPaths paths they find for deciding which DIDs are contained in the graph.
needSecuBlockSize	long	0	If this value is not zero, when doing attrgraph queries, each document that need to be postfiltered will have its Fullhit rewritten once said number of documents to postfilter has been reached.

Nested elements

ContentGroupLimitsConfig*

Type	Wrapper	Description
ContentGroupLimitsConfig*	-	Optional specific per content group limits.

XML representation

<LimitsConfig maxKeptHits="" maxHitsPerSlice="" mainHeapFlushInterval="" sliceHeapFlushInterval="" maxQueryTimeMS="" maxFetchedHits="" enoughSlicesPercentage="100" interruptGraceDelayMS="100" maxTotalTimeMS="" fullGraceDelayMS="500" hitsSampling="100.0" usesConsecutiveQueries="false" maxPathsToConsider="20000000" needSecuBlockSize="0" ></LimitsConfig>

ContentGroupLimitsConfig com.exalead.search.v30.ContentGroupLimitsConfig

Specific search limits for a content group. May override global limits.

Parent objects

LimitsConfig

Attributes

contentGroup string
maxHitsPerSlice int

Name	Type	Default value	Description
contentGroup	string	-	Name of the content group.
maxHitsPerSlice	int	-	On each slice, when this number of hits matching the query have been found, the query is interrupted. These hits are returned to the searcher, ranked, and the "best" ones are selected for navigation and full hits. Reducing this parameter improves performance but it means that a potentially "better" hit located further in the slice would not be found. If this limit triggers on any slice, the "nmatches" value in answer will be estimated. 0 means no limit.

XML representation

<ContentGroupLimitsConfig contentGroup="" maxHitsPerSlice="" ></ContentGroupLimitsConfig>

PostFilterConfig com.exalead.mercury.mami.search.v20.PostFilterConfig

No documentation for this element.

Parent objects

SearchLogic

Attributes

Name	Type	Default value	Description
additionalHits	int	-	The number of additional hits to retrieve
blacklistUrl	string	-
maxQueryStart	int	100

XML representation

<PostFilterConfig additionalHits="" blacklistUrl="" maxQueryStart="100" ></PostFilterConfig>

QueryRestrictionConfig com.exalead.mercury.mami.search.v20.QueryRestrictionConfig

No documentation for this element.

Parent objects

SearchLogic

Attributes

blackListURL string

Name	Type	Default value	Description
blackListURL	string	-	URL to a query-blocklist resource

Nested elements

NodeRestrictionConfig
OperatorRestrictionConfig
QueryTermCountRestrictionConfig

Type	Wrapper	Description
NodeRestrictionConfig	-
OperatorRestrictionConfig	-
QueryTermCountRestrictionConfig	-

XML representation

<QueryRestrictionConfig blackListURL="" ></QueryRestrictionConfig>

NodeRestrictionConfig com.exalead.mercury.mami.search.v20.NodeRestrictionConfig

Configure node restriction policy for search queries

Nested elements

RestrictedNode*

Type	Wrapper	Description
RestrictedNode*	-

XML representation

<NodeRestrictionConfig ></NodeRestrictionConfig>

RestrictedNode com.exalead.mercury.mami.search.v20.RestrictedNode

Restrict occurences of a given node within a search query

Parent objects

NodeRestrictionConfig

Attributes

node enum(AlphanumValue, AlphanumBWPattern, NumericalValue, CategoryValue, TrueNode, FalseNode, NoOp)
maximumCount int
action enum(discard, cleanup)

Name	Type	Default value	Description
node	enum(AlphanumValue, AlphanumBWPattern, NumericalValue, CategoryValue, TrueNode, FalseNode, NoOp)
maximumCount	int	-1	Maximum number of occurences allowed for a given node.
action	enum(discard, cleanup)	discard

XML representation

<RestrictedNode node="" maximumCount="-1" action="discard" ></RestrictedNode>

OperatorRestrictionConfig com.exalead.mercury.mami.search.v20.OperatorRestrictionConfig

Configure operator restriction policy for search queries

Nested elements

OperatorCountRestrictionConfig
RestrictedOperator*

Type	Wrapper	Description
OperatorCountRestrictionConfig	-
RestrictedOperator*	-

XML representation

<OperatorRestrictionConfig ></OperatorRestrictionConfig>

OperatorCountRestrictionConfig com.exalead.mercury.mami.search.v20.OperatorCountRestrictionConfig

Configure operator count restriction policy for search queries

Parent objects

OperatorRestrictionConfig

Attributes

Name	Type	Default value	Description
maximumOperatorCount	int	-1	Maximum number of operators allowed in a query.
maximumProximityOperatorCount	int	-1	Maximum number of proximity operators allowed in a query. The proximity operators are: NEAR NEXT AFTER BEFORE
action	enum(discard, cleanup)	discard	Action when maximumOperatorCount is reached: drop query or remove extra operators. The 'cleanup' mode does not guarantee the order in which extra operators are discarded.

XML representation

<OperatorCountRestrictionConfig maximumOperatorCount="-1" maximumProximityOperatorCount="-1" action="discard" ></OperatorCountRestrictionConfig>

RestrictedOperator com.exalead.mercury.mami.search.v20.RestrictedOperator

Restrict occurrences of a given operator within a search query

Parent objects

OperatorRestrictionConfig

Attributes

operator enum(AND, AT, FUZZYAND, OR, XOR, BOR, NEAR, NOT, NEXT, SEQUENCE, SPLIT, BUTNOT, INNERJOIN, ALL, OPT)
maximumArity int
action enum(discard, cleanup, textual)

Name	Type	Default value	Description
operator	enum(AND, AT, FUZZYAND, OR, XOR, BOR, NEAR, NOT, NEXT, SEQUENCE, SPLIT, BUTNOT, INNERJOIN, ALL, OPT)
maximumArity	int	-1	Maximum number of children allowed for a given operator.
action	enum(discard, cleanup, textual)	discard

XML representation

<RestrictedOperator operator="" maximumArity="-1" action="discard" ></RestrictedOperator>

QueryTermCountRestrictionConfig com.exalead.mercury.mami.search.v20.QueryTermCountRestrictionConfig

Configure term count restriction policy for search queries

Parent objects

QueryRestrictionConfig

Attributes

maximumTermCount int
action enum(discard, cleanup)

Name	Type	Default value	Description
maximumTermCount	int	-1	Maximum number of alphanumeric tokens allowed in a query.
action	enum(discard, cleanup)	discard	Action if maximumTermCount is reached: drop query or cleanup extra values.

XML representation

<QueryTermCountRestrictionConfig maximumTermCount="-1" action="discard" ></QueryTermCountRestrictionConfig>

RelatedTermsSynthesisConfig com.exalead.search.v30.RelatedTermsSynthesisConfig

No documentation for this element.

Parent objects

SearchLogic

Attributes

dictionaryName string
enabled boolean
number int
minCorpusFrequencyFilter int
corpusLowPassFilter double
optimizedForExhaustiveRankedSynthesis boolean
sortFunction enum(count, alphanum)
localLowPassFilter double
blacklist string
indexField string
maxSliceCategories int
maxRelatedTermsHitsPerSlice int

Name	Type	Default value	Description
dictionaryName	string	-	The dictionary containing keywords.
enabled	boolean	False	Enables related terms synthesis.
number	int	10	Maximum number of related terms computed for a query.
minCorpusFrequencyFilter	int	2	Minimum number of occurrences in the whole index for a term to be possibly selected for synthesis.
corpusLowPassFilter	double	0.5	Filters out terms occurring more than this threshold in the whole index.
optimizedForExhaustiveRankedSynthesis	boolean	False	If true, each slice will sort the DIDs before computing the synthesis to improve attributes fetching locality.
sortFunction	enum(count, alphanum)	count	Sort function Value can be one of count alphanum
localLowPassFilter	double	0.9	Filters out terms occurring more than this threshold in the result set.
blacklist	string	-	References the related terms blocklist resource.
indexField	string	keywordv2	The index field where related-terms have been indexed.
maxSliceCategories (UI: Max categories per slices)	int	200	Maximum number of categories returned by each slice for one facet. 0 means no limit. Warning: Results merged by the search server may be incorrect.
maxRelatedTermsHitsPerSlice (UI: Max hits per slice)	int	0	Maximum number of hits per slice used to compute related terms synthesis. Set to 0 for exhaustive. Note that the number of hits processed per slice to compute related terms synthesis will never exceed FacetsConfig's maxHits.

XML representation

<RelatedTermsSynthesisConfig dictionaryName="" enabled="false" number="10" minCorpusFrequencyFilter="2" corpusLowPassFilter="0.5" optimizedForExhaustiveRankedSynthesis="false" sortFunction="count" localLowPassFilter="0.9" blacklist="" maxSliceCategories="200" maxRelatedTermsHitsPerSlice="0" ></RelatedTermsSynthesisConfig>

RelevanceConfig com.exalead.search.v30.RelevanceConfig

How to score and rank documents.
A document rank, like a query result, can depend on many parameters:

the static document rank
the query term relative frequency
the form, position and rank of the query term in the matching document
proximity of query terms in the matching document

...

Parent objects

SearchLogic

Attributes

enabled boolean
exactPhraseScoreMergePolicy enum(SUM, MAX)

Name	Type	Default value	Description
enabled	boolean	True	Enables all relevance calculation. Globally disabling relevance calculation disables the following features: term scoring proximity scoring sorting node properties using ranking elements for faceting using ranking elements for facet aggregations retrieving ranking elements <note>If your query contains grouping, either by a Search API parameter or in the product configuration set in the Administration Console > Search Logic, the relevance feature will be forced to `true`. </note> <note>If your query contains a refinement on a disjunctive facet, the relevance feature will be forced to `true`.</note> In some cases, it provides significant performance boosts, notably for exhaustive counting and faceting.
exactPhraseScoreMergePolicy	enum(SUM, MAX)	SUM	Specifies how to merge the underlying word scores of an exact phrase. Setting the policy to 'MAX' will score the phrase as a single term.

Nested elements

GroupConfig*
ProximityBoostConfig
SortConfig
TermScoreConfig

Type	Wrapper	Description
GroupConfig*	-	Configures collapsing TODO.
ProximityBoostConfig	-	How to boost a document score depending on the proximity of the query terms.
SortConfig	-	Configures the fields to use to sort documents in the result list.
TermScoreConfig	-	How to compute the score of a term in a matching document.

XML representation

<RelevanceConfig enabled="true" exactPhraseScoreMergePolicy="SUM" ></RelevanceConfig>

GroupConfig com.exalead.search.v30.GroupConfig

Collaspes hits in search results, by grouping hits with the same values, then sorting them within a group to keep only the top N hits. The other hits for the group do not display in the search results.

The GroupBy child elemement specifies the grouping criteria. The order of GroupBy child elements does not affect sort order. The SortHitsInGroupBy child elements defines lexicographical order to sort hits within the group. By default, it uses SortConfiguration order.

The Aggregation child elements specify the summary operation (SUM, MIN, MAX, and so on) to perform on all hits (discarded + top/bottom N) within a group.

Parent objects

RelevanceConfig

Attributes

topN int

Name	Type	Default value	Description
topN	int	1	Maximum number of hits to display for a group. The remaining hits do not display in search results. Must be strictly positive.

Nested elements

Aggregation*
GroupBy*
SortHitsInGroupBy*

Type	Wrapper	Description
Aggregation*	-	Aggregation(s) to be evaluated sequentially on all the hits within a group, and stored as a virtual expression meta. To display the aggregation value in the grouped hit(s), add this meta name as a Hit content meta.
GroupBy*	-	Virtual expressions used to group, or collapse, hits in search results. Hits with the same value for an expression form one group. Hits with undefined values for the expression are never grouped. If empty, displays all hits in the search results. As with any virtual expression, it requires RAM-based index fields.
SortHitsInGroupBy*	-	Ordered list of expressions for sorting hits. The sort direction identifies the top N hits of a group. Hits with defined values are always listed higher than hits with undefined values. If empty, it uses the `SortConfig` by default. As with any virtual expression, it requires RAM-based index fields.

XML representation

<GroupConfig topN="1" ></GroupConfig>

Aggregation com.exalead.search.v30.Aggregation

Combines the values of the specified expr for all the hits of the group and output the value to the specified meta.
For example, if you have hits:

H1 with values: sort_field=100, group_field=12, value_field=57
H2 with values: sort_field=50, group_field=12, value_field=23

With a sort by sort_field and a group by group_field, the GroupBy will only keep the hit H1 because its sort_field has the best value.

with a function on value_field of type "MIN", the value of value_field in the grouped hit will be 23 (minimum value over grouped hits)
with a function on value_field of type "MAX", the value of value_field in the grouped hit will be 57 (maximum value over grouped hits)
with a function on value_field of type "SUM", the value of value_field in the grouped hit will be 80 (sum of values for grouped hits)
with a function on value_field of type "AVG", the value of value_field in the grouped hit will be 40 (average of values for grouped hits)

Parent objects

GroupConfig

Attributes

expr string
aggregationField string
function enum(AVG, MAX, MIN, SUM, STDDEV, CONCAT)
separator string

Name	Type	Default value	Description
expr	string	-	Field to compute on the grouped hits.
aggregationField	string	-
function	enum(AVG, MAX, MIN, SUM, STDDEV, CONCAT)	MAX	Aggregation function to apply on the specified expression. MIN/MAX use the lowest/highest value of all hits in the group SUM adds the values of all hits in the group AVG calculates the mean of values of all hits in the group STDDEV calculates the standard deviation of values of all hits in the group CONCAT joins all the values from all the hits in the group. Only works with alphanumerical values.
separator	string	-	Character to separate concatenated strings.

XML representation

<Aggregation expr="" function="MAX" separator="" ></Aggregation>

GroupBy com.exalead.search.v30.GroupBy

A comparative element used to identify group of documents.

Parent objects

GroupConfig

Attributes

expr string
LSB int
MSB int
limit int

Name	Type	Default value	Description
expr	string	-	Virtual expression to compare.
LSB	int	0	Position of the least significant bit to be considered. Applies to integer values only, ignored otherwize.
MSB	int	63	Position of the most significant bit to be considered. Applies to integer values only, ignored otherwize.
limit	int	0	Maximum number of characters to consider when comparing string values. 0 means the string is compared till the end. Applies to alphanumerical values only, ignored otherwize.

XML representation

<GroupBy expr="" LSB="0" MSB="63" limit="0" ></GroupBy>

SortHitsInGroupBy com.exalead.search.v30.SortHitsInGroupBy

A comparative element used to sort documents of a group.

Parent objects

SortConfig

Attributes

name string
expr string
LSB int
MSB int
limit int
min long
order enum(asc, desc)
active boolean

Name	Type	Default value	Description
name	string	-	Unique sort name.
expr	string	-	Virtual expression to compare.
LSB	int	0	Position of the least significant bit to be considered. Applies to integer values only.
MSB	int	63	Position of the most significant bit to be considered. Applies to integer values only.
limit	int	0	Maximum number of characters to use when matching string values. 0 means the entire string is used for matching. Applies to alphanumerical values only.
min	long	0	Minimum value needed for a the document to be returned by the query. Applies to integer and double values only. 0 means keep all documents. Discards documents with a value below the minimum. Keeps documents with undefined value for this field.
order	enum(asc, desc)	desc	Sort order Value can be null or one of asc desc
active	boolean	True	Defines if the sort is applied by default.

XML representation

<SortHitsInGroupBy name="" expr="" LSB="0" MSB="63" limit="0" min="0" order="desc" active="true" ></SortHitsInGroupBy>

ProximityBoostConfig com.exalead.search.v30.ProximityBoostConfig

Controls the behavior of the @proximity element

Parent objects

RelevanceConfig

Attributes

enabled boolean
maxDistance int

Name	Type	Default value	Description
enabled	boolean	True	Enables proximity calculation globally. When disabled, @proximity is always 1.0
maxDistance	int	100	Do not apply proximity boost if words are separated by more than 'maxDistance' words.

XML representation

<ProximityBoostConfig enabled="true" maxDistance="100" ></ProximityBoostConfig>

SortConfig com.exalead.search.v30.SortConfig

Define the order of returned documents.

Parent objects

RelevanceConfig

Attributes

discardInvalidSort boolean

Name	Type	Default value	Description
discardInvalidSort	boolean	False	Auto discard sortBy not allowed rather than sending an error.

Nested elements

SortBy*

Type	Wrapper	Description
SortBy*	-	Ordered list of expresion that defines a lexicographical order (documents having the same value for the first criteria are then ordered using the second critera, and so on).

XML representation

<SortConfig discardInvalidSort="false" ></SortConfig>

SortBy com.exalead.search.v30.SortBy

Configures a sort criterion. An expression is calculated for every document matching the query and the results are then sorted according to calculated values.

Parent objects

SortConfig

Attributes

name string
expr string
LSB int
MSB int
limit int
min long
order enum(asc, desc)
active boolean

Name	Type	Default value	Description
name	string	-	Unique sort name.
expr	string	-	Virtual expression to compare.
LSB	int	0	Position of the least significant bit to be considered. Applies to integer values only.
MSB	int	63	Position of the most significant bit to be considered. Applies to integer values only.
limit	int	0	Maximum number of characters to use when matching string values. 0 means the entire string is used for matching. Applies to alphanumerical values only.
min	long	0	Minimum value needed for a the document to be returned by the query. Applies to integer and double values only. 0 means keep all documents. Discards documents with a value below the minimum. Keeps documents with undefined value for this field.
order	enum(asc, desc)	desc	Sort order Value can be null or one of asc desc
active	boolean	True	Defines if the sort is applied by default.

XML representation

<SortBy name="" expr="" LSB="0" MSB="63" limit="0" min="0" order="desc" active="true" ></SortBy>

TermScoreConfig com.exalead.search.v30.TermScoreConfig

No documentation for this element.

Parent objects

RelevanceConfig

Attributes

type enum(RANK_TFIDF, TFIDF, RANK, BM25, BM25F, NO_RANKING, IDF, RANK_IDF, CUSTOM, CUSTOM_LUA)
scoreMap string
customExpr string
customLuaScript string

Name	Type	Default value	Description
type	enum(RANK_TFIDF, TFIDF, RANK, BM25, BM25F, NO_RANKING, IDF, RANK_IDF, CUSTOM, CUSTOM_LUA)	NO_RANKING	Select the ranking policy ( Value can be null or one of RANK_TFIDF TFIDF RANK BM25 BM25F NO_RANKING IDF RANK_IDF CUSTOM CUSTOM_LUA ) for the terms
scoreMap	string	-	Comma-separated list of target weights for the ranking scores. XXX: Should be cleanly documented elsewhere.
customExpr	string	-
customLuaScript	string	-

XML representation

<TermScoreConfig type="NO_RANKING" scoreMap="" customExpr="" customLuaScript="" ></TermScoreConfig>

Sort com.exalead.search.v30.Sort

No documentation for this element.

Parent objects

SearchLogic

Attributes

secondarySortKey boolean
indexField string
ascending boolean
LSB int
MSB int
hasInvalidValue boolean
invalidValue long

Name	Type	Default value	Description
secondarySortKey	boolean	False	Allows disambiguating documents having the same score, on a configurable key. The rest of the comparator configuration is similar to the primary sort. If this option is enabled, the documents will be compared by score, and in case of equality, only by the key extracted from the configurable index field.
indexField	string	-	Index field on which we want to sort. You can only sort on an unsigned index field.
ascending	boolean	False	Inverse the total order relation.
LSB	int	0	Least Significant Bit of the key inside the field.
MSB	int	63	Most Significant Bit of the key inside the field.
hasInvalidValue	boolean	False	Should one of the keys be considered as excluded from the set and always put at the end
invalidValue	long	-	If hasInvalidValue is true, what is the invalid value?

XML representation

<Sort secondarySortKey="false" indexField="" ascending="false" LSB="0" MSB="63" hasInvalidValue="false" invalidValue="" ></Sort>

SpellCheckerConfig com.exalead.search.v30.SpellCheckerConfig

No documentation for this element.

Parent objects

SearchLogic

Attributes

Name	Type	Default value	Description
enabled (UI: Enable)	boolean	False	Enables spellcheck on queries.
blacklistedPrefixHandlers	string		A list of comma-separated prefix handler names defining which parts of the query will NOT be spell checked .
enabledRefine (UI: Use with facets)	boolean	False	Suggests alternate expressions even if facets have been used to filter hits.
computeCorrectedNHits (UI: Calculate no. hits)	boolean	False	Calculates and displays the number of hits that will be returned for each suggested expression. This can impact performance as CloudView must perform the query for each suggestion. Warning: This option is not compatible with disjunctive facets. The count will not report any values.
hitThreshold (UI: Disable at N hits)	int	0	Only performs spell check if the query returns less than N hits. However, if N=0, spell check is always performed. For example, if N=4, spell check is only activated for queries that return 3 or less hits.
cancelBadCorrections (UI: Remove weak suggestions)	boolean	False	Leaves out alternate expressions that would return less hits than the original query. This requires computeCorrectedNHits to be true.
autoCorrect (UI: Automatically correct)	boolean	False	Automatically performs (uncorrected query) OR (corrected query 1) OR (corrected query 2) ... If cancelBadCorrections and autoCorrect are both true, the query is performed three times: The uncorrected query is performed to calculate the number of hits. The corrected queries are performed to calculate the number of hits. The corrected queries that yield more hits than the uncorrected query are added to the uncorrected query, and the resulting query is performed and returned to the user.
graceDelayMS (UI: Timeout(ms))	int	50	Maximum time to wait for spell checking after query processing has finished. If the spell check exceeds this time, it is cancelled and a warning displays. After this time expires, spellcheck is cancelled. If computeCorrectedNHits is enabled, the additional time this takes will leave less time for spell check to finish. If 'Automatically correct' is selected, this timeout is ignored.
querySizeLimit	int	16	Queries with more than this count of chunks are considered computer-generated queries for which the spellchecking is disabled.
dictionaryName (UI: Dictionary)	string	-	Dictionary used for spell checking and allow/block lists

Nested elements

SpellCheckParameters

Type	Wrapper	Description
SpellCheckParameters	-	Low-level parameters.

XML representation

<SpellCheckerConfig enabled="false" blacklistedPrefixHandlers="" enabledRefine="false" computeCorrectedNHits="false" hitThreshold="0" cancelBadCorrections="false" autoCorrect="false" graceDelayMS="50" querySizeLimit="16" dictionaryName="" ></SpellCheckerConfig>

SpellCheckParameters com.exalead.linguistic.v10.SpellCheckParameters

No documentation for this element.

Parent objects

SpellCheckerConfig

Attributes

maxApprox int
insertion double
deletion double
substitution double
transposition double
first double
blank double
phoneticDist double
precision double
minLengthCandidate int
minLengthCandidate2 int
minLengthMultiWordCandidate int
minLengthMultiWordCandidate2 int
minLengthInsideMultiWordCandidate int
minLengthInsideMultiWordCandidate2 int
minNGramLengthForDist2 int
filterWhiteList boolean
filterPhonetic boolean
tokenizeResults boolean
maxSuggestions int
allSuggestionsMustAlwaysBeRetrieved boolean
whitelist string
blacklist string
timeoutMS int

Name	Type	Default value	Description
maxApprox (UI: Max spellings per word)	int	40	Maximum number of alternate spellings to retrieve for each word.
insertion (UI: Insertion)	double	0.8	Damerau-Levenshtein distance incurred for inserting a letter.
deletion (UI: Deletion)	double	0.9	Damerau-Levenshtein distance incurred for deleting a letter.
substitution (UI: Substitution)	double	0.9	Damerau-Levenshtein distance incurred for substituting a letter.
transposition (UI: Transposing)	double	0.9	Damerau-Levenshtein distance incurred for transposing letters.
first (UI: First letter)	double	0.1	Additional Damerau-Levenshtein distance incurred for any operation on the first letter of a word.
blank (UI: Blank space)	double	0.5	Damerau-Levenshtein distance incurred for inserting or deleting a space.
phoneticDist (UI: Phonetic matches)	double	0.1	Damerau-Levenshtein distance incurred for phonetic matches.
precision (UI: Precision)	double	1.0	The higher the precision, the less often alternatives are proposed; the lower, the more recall.
minLengthCandidate (UI: Min word length for distance 1)	int	4	Minimum length for a word to trigger a distance 1 spell check.
minLengthCandidate2 (UI: Min word length for distance 2)	int	8	Minimum length for a word to trigger a distance 2 spell check.
minLengthMultiWordCandidate (UI: Min word length anywhere for distance 1)	int	2	Minimum length for a word anywhere in a multi-word expression to trigger a distance 1 spell check.
minLengthMultiWordCandidate2 (UI: Min word length anywhere for distance 2)	int	8	Minimum length for a word anywhere in a multi-word expression to trigger a distance 2 spell check.
minLengthInsideMultiWordCandidate (UI: Min word length mid-expression for distance 1)	int	1	Minimum length for a word in the middle of a multi-word expression to trigger a distance 1 spell check.
minLengthInsideMultiWordCandidate2 (UI: Min word length mid-expression for distance 2)	int	6	Minimum length for a word in the middle of a multi-word expression to trigger a distance 2 spell check.
minNGramLengthForDist2 (UI: Min expression length for distance 2)	int	12	Minimum length for a multi-word expression to trigger a distance 2 spell check.
filterWhiteList (UI: Apply min lengths to allow list)	boolean	False	Applies minimum length criteria on allow list entries.
filterPhonetic (UI: Apply min lengths to phonetic)	boolean	True	Applies the minimum length criteria on phonetic matches.
tokenizeResults (UI: Tokenize results)	boolean	False	Tokenizes the suggested expression, and annotates each token with the corresponding word from the original query. This allows for additional processing after spellchecking, such as lemmatization, and normalization. For example, for a suggested expression of 3 tokens: - If true: each token has an annotation representing the associated word from the original user query. - If false: The first token is annotated with 3 tokens and a length of 3.
maxSuggestions (UI: Max. suggestions)	int	1	Suggests no more than N alternate expressions
allSuggestionsMustAlwaysBeRetrieved (UI: Keep poor suggestions)	boolean	False	Suggests alternate expressions even when they have a lower score than the corresponding expression from the original query. The score is determined by an algorithm that considers both the Damerau-Levenshtein distance and frequency in the corpus.
whitelist	string	-	Reference a dictionary's spellcheck allow list resource. Used to always suggest expressions for the listed words.
blacklist	string	-	Reference a dictionary's spellcheck block list resource. Used to never suggest expressions for the listed words.
timeoutMS (UI: timeout)	int	0	Time allotted to the computing of spell suggestions.

Nested elements

SpellCheckBlackList
SpellCheckWhiteList

Type	Wrapper	Description
SpellCheckBlackList	-	Never suggest expressions for these words.
SpellCheckWhiteList	-	Always suggest expressions for these words.

XML representation

<SpellCheckParameters maxApprox="40" insertion="0.8" deletion="0.9" substitution="0.9" transposition="0.9" first="0.1" blank="0.5" phoneticDist="0.1" precision="1.0" minLengthCandidate="4" minLengthCandidate2="8" minLengthMultiWordCandidate="2" minLengthMultiWordCandidate2="8" minLengthInsideMultiWordCandidate="1" minLengthInsideMultiWordCandidate2="6" minNGramLengthForDist2="12" filterWhiteList="false" filterPhonetic="true" tokenizeResults="false" maxSuggestions="1" allSuggestionsMustAlwaysBeRetrieved="false" whitelist="" blacklist="" timeoutMS="0" ></SpellCheckParameters>

SpellCheckBlackList com.exalead.linguistic.v10.SpellCheckBlackList

No documentation for this element.

Nested elements

Entry*

Type	Wrapper	Description
Entry*	-

XML representation

<SpellCheckBlackList ></SpellCheckBlackList>

Entry com.exalead.linguistic.v10.Entry

No documentation for this element.

Parent objects

SpellCheckBlackList

Attributes

lang iso code
expression string
level enum(exact, lowercase, normalized)

Name	Type	Default value	Description
lang	iso code	-
expression	string	-
level	enum(exact, lowercase, normalized)	normalized

XML representation

<Entry lang="" expression="" level="normalized" ></Entry>

SpellCheckWhiteList com.exalead.linguistic.v10.SpellCheckWhiteList

No documentation for this element.

Nested elements

SpellCheckEntry*

Type	Wrapper	Description
SpellCheckEntry*	-

XML representation

<SpellCheckWhiteList ></SpellCheckWhiteList>

SpellCheckEntry com.exalead.linguistic.v10.SpellCheckEntry

No documentation for this element.

Parent objects

SpellCheckWhiteList

Attributes

lang iso code
expression string
level enum(exact, lowercase, normalized)
suggestion string

Name	Type	Default value	Description
lang	iso code	-
expression	string	-
level	enum(exact, lowercase, normalized)	normalized
suggestion	string	-

XML representation

<SpellCheckEntry lang="" expression="" level="normalized" suggestion="" ></SpellCheckEntry>

TimezoneConfig com.exalead.mercury.mami.search.v20.TimezoneConfig

No documentation for this element.

Parent objects

SearchLogic

Attributes

add boolean
hours int
minutes int

Name	Type	Default value	Description
add	boolean	True	If true, the hours and minutes will be added. Otherwise, they will be substracted.
hours	int	0	Difference of hours to apply.
minutes	int	0	Difference of minutes to apply.

XML representation

<TimezoneConfig add="true" hours="0" minutes="0" ></TimezoneConfig>

UQLConfig com.exalead.mercury.mami.search.v20.UQLConfig

No documentation for this element.

Parent objects

SearchLogic

Attributes

defaultPrefixHandler string
missingPrefixBehaviour enum(error, remove, ignore, and, replace_by_false)
phraseQueryTriggerChars string
queryTemplate string
defaultLanguage string

Name	Type	Default value	Description
defaultPrefixHandler	string	text	Prefix handler to use for content that is not contained within a specific prefix handler.
missingPrefixBehaviour	enum(error, remove, ignore, and, replace_by_false)	error	Action to perform when a prefix:value query does not reference a valid prefix: error: fails the query remove: removes the whole prefix node ignore: does as if the query was a "value" without prefix and: performs a query "prefix AND value" replace_by_false: replaces the whole node by a false node
phraseQueryTriggerChars	string	.-&	List of chars inside a word that trigger a sequence match, for example, a.b will be rewritten as "a b".
queryTemplate	string	#and(#query(_default_) #query(refine) #query(security) #query(restriction) #query(date_restriction) #query(geo_restriction))	Default query template to use (conjunction of query, refine, security and content restriction by default)
defaultLanguage	string	xx	Default language of the query

Nested elements

LogicRunnerCustomization
ParsingConfig
PatternsExpansionConfig
QueryExpansionConfig
QueryExpansionModule*
QueryPrefixHandler*
SWConfig
SemanticQueryAnalysisConfig

Type	Wrapper	Description
LogicRunnerCustomization	-
ParsingConfig	-
PatternsExpansionConfig	-
QueryExpansionConfig	-
QueryExpansionModule*	-
QueryPrefixHandler*	-
SWConfig	-
SemanticQueryAnalysisConfig	-	Semantic analysis and rewriting of queries

XML representation

<UQLConfig defaultPrefixHandler="text" missingPrefixBehaviour="error" phraseQueryTriggerChars=".-&" queryTemplate="#and(#query(_default_) #query(refine) #query(security) #query(restriction) #query(date_restriction) #query(geo_restriction))" defaultLanguage="xx" ></UQLConfig>

LogicRunnerCustomization com.exalead.mercury.mami.search.v20.LogicRunnerCustomization

No documentation for this element.

Nested elements

globalPreParseProcessorsCustomProcessor*
preParseProcessorsCustomProcessor*
preLinguisticProcessorsCustomProcessor*
preTransformProcessorsCustomProcessor*
preMapProcessorsCustomProcessor*
postMapProcessorsCustomProcessor*
globalPostParseProcessorsCustomProcessor*
globalFinalProcessorsCustomProcessor*

Type	Wrapper	Description
CustomProcessor*	globalPreParseProcessors
CustomProcessor*	preParseProcessors
CustomProcessor*	preLinguisticProcessors
CustomProcessor*	preTransformProcessors
CustomProcessor*	preMapProcessors
CustomProcessor*	postMapProcessors
CustomProcessor*	globalPostParseProcessors
CustomProcessor*	globalFinalProcessors

XML representation

<LogicRunnerCustomization ></LogicRunnerCustomization>

CustomProcessor com.exalead.mercury.mami.search.v20.CustomProcessor

No documentation for this element.

Parent objects

globalFinalProcessors LogicRunnerCustomization
globalPostParseProcessors LogicRunnerCustomization
globalPreParseProcessors LogicRunnerCustomization
postMapProcessors LogicRunnerCustomization
preLinguisticProcessors LogicRunnerCustomization
preMapProcessors LogicRunnerCustomization
preParseProcessors LogicRunnerCustomization
preTransformProcessors LogicRunnerCustomization

Attributes

classId string

Name	Type	Default value	Description
classId	string	-	Must implement com.exalead.search.query.processors.CustomQueryProcessor

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-

XML representation

<CustomProcessor classId="" ></CustomProcessor>

ParsingConfig com.exalead.search.v30.ParsingConfig

No documentation for this element.

Parent objects

UQLConfig

Attributes

allowOptions boolean
nearDistance int
fallbackOnError boolean
fallbackReportMode enum(quiet, local, global)
maxQuerySize int
oversizedQueryReportMode enum(quiet, local, global)
implicitSequenceOperator enum(AND, OR)

Name	Type	Default value	Description
allowOptions	boolean	False	Allows word options in user query.
nearDistance	int	16	Default distance for NEAR queries
fallbackOnError	boolean	False	If the standard parsing fails, parses the query with a relaxed grammar, meaning it will split words on white spaces. Note that in some cases, operators and prefix handlers will not be taken into account for the query. For example, a query that ends with an AND will normally return an error. If this option is selected, CloudView will search for "AND" instead of considering it as an operator. See also missingPrefixBehaviour in UQLConfig
fallbackReportMode	enum(quiet, local, global)	local	Specifies how to report a search error in 'Troubleshooting > Logs': quiet: No log. local: Error is reported in the Process log. global: Error is reported in both Process and Global logs.
maxQuerySize	int	-	Queries larger than N are rejected. 0 means no limit. @unimplemented.
oversizedQueryReportMode	enum(quiet, local, global)	local	Specifies how to report an oversized query error in 'Troubleshooting > Logs': quiet: No log. local: Error is reported in the Process log. global: Error is reported in both Process and Global logs.
implicitSequenceOperator	enum(AND, OR)	AND	The default query operator. If a user enters this query: exalead cloudview (without quotes), by default, it will be parsed as "exalead AND cloudview".

XML representation

<ParsingConfig allowOptions="false" nearDistance="16" fallbackOnError="false" fallbackReportMode="local" maxQuerySize="" oversizedQueryReportMode="local" implicitSequenceOperator="AND" ></ParsingConfig>

PatternsExpansionConfig com.exalead.search.v30.PatternsExpansionConfig

No documentation for this element.

Parent objects

UQLConfig

Attributes

performRegexpExpansion boolean
minPrefixSize int
minSuffixSize int
expansionTimeout long
allLanguages boolean
rejectedPatternAction enum(normalize, remove, error, keep)
rejectedPatternReportMode enum(quiet, local, global)
errorAction enum(normalize, remove, error)
errorReportMode enum(quiet, local, global)

Name	Type	Default value	Description
performRegexpExpansion	boolean	False	Allows query expansion on regular expressions.
minPrefixSize	int	3	Minimum number of characters required before wilcard (*) for pattern expansion to be carried out.
minSuffixSize	int	3	Minimum number of characters required after wilcard (*) for pattern expansion to be carried out.
expansionTimeout	long	-	Query expansion timeout for each pattern.
allLanguages	boolean	False	Performs query expansion on all languages, whatever the language of the orignal query.
rejectedPatternAction	enum(normalize, remove, error, keep)	error	Action to perform when a pattern is rejected (because the prefix or suffix is too short, or the regular expression is disabled): normalize: Transforms the pattern into a simple string match. remove: Removes the pattern from the query. error: Returns an error.
rejectedPatternReportMode	enum(quiet, local, global)	local	Specifies how to report a rejected pattern error in 'Troubleshooting > Logs': quiet: No log. local: Error is reported in the Process log. global: Error is reported in both Process and Global logs.
errorAction	enum(normalize, remove, error)	error	Action to take when expansion fails: normalize: Transforms the pattern into a simple string match. remove: Removes the pattern from the query. error: Returns an error.
errorReportMode	enum(quiet, local, global)	local	Specifies how to report an expansion error in 'Troubleshooting > Logs': quiet: No log. local: Error is reported in the Process log. global: Error is reported in both Process and Global logs.

XML representation

<PatternsExpansionConfig performRegexpExpansion="false" minPrefixSize="3" minSuffixSize="3" expansionTimeout="" allLanguages="false" rejectedPatternAction="error" rejectedPatternReportMode="local" errorAction="error" errorReportMode="local" ></PatternsExpansionConfig>

QueryExpansionConfig com.exalead.search.v30.QueryExpansionConfig

No documentation for this element.

Parent objects

UQLConfig

Attributes

timeoutMS int
unavailableDictionaryAction enum(error, ignore)
unavailableDictionaryReportMode enum(quiet, global, local)
detectCJ boolean
langDetect boolean
langDetectThreshold double
longFormsPruningThreshold int
performExactPhraseExpansion boolean
exactPhraseExpansionModulesBlackList string

Name	Type	Default value	Description
timeoutMS	int	500
unavailableDictionaryAction	enum(error, ignore)	error	Action to execute if the dictionary cannot be contacted for expansion. error: fails the query ignore: does not perform expansion, then continues
unavailableDictionaryReportMode	enum(quiet, global, local)	global	Value can be null or one of quiet global local
detectCJ	boolean	False	Tries to detect Chinese and Japanese in the query and override the language of the request accordingly. If false, it uses the user-specified language for expansion.
langDetect	boolean	True	Tries to detect the text language from its unicode range. If false, it uses the user-specified language for expansion (xx is the default one).
langDetectThreshold	double	0.6	If langDetect is True, it tries to detect the text language. That language will be used only if its proportion is superior to langDetectThreshold of the string length
longFormsPruningThreshold	int	20	Prunes forms spanning several shorter forms from query nodes larger than N tokens, to avoid combination.
performExactPhraseExpansion	boolean	False	Activates query expansion on expressions contained within quotation marks. In this case, a number of query expansion modules can be disabled by exactPhraseExpansionModulesBlackList.
exactPhraseExpansionModulesBlackList	string		If performExactPhraseExpansion is true, this list of comma-separated query expansion module names defines which modules are disabled when expanding exact phrases.

Nested elements

LangDetectMapping

Type	Wrapper	Description
LangDetectMapping	-

XML representation

<QueryExpansionConfig timeoutMS="500" unavailableDictionaryAction="error" unavailableDictionaryReportMode="global" detectCJ="false" langDetect="true" langDetectThreshold="0.6" longFormsPruningThreshold="20" performExactPhraseExpansion="false" exactPhraseExpansionModulesBlackList="" ></QueryExpansionConfig>

LangDetectMapping com.exalead.linguistic.v10.LangDetectMapping

Maps unicode range to a default language for its automatic atribution

Parent objects

QueryExpansionConfig

Attributes

defaultDefault iso code
defaultArabic iso code
defaultCyrillic iso code

Name	Type	Default value	Description
defaultDefault	iso code	xx
defaultArabic	iso code	ar
defaultCyrillic	iso code	ru

XML representation

<LangDetectMapping defaultDefault="xx" defaultArabic="ar" defaultCyrillic="ru" ></LangDetectMapping>

CustomQueryExpansionModule com.exalead.mercury.mami.search.v20.CustomQueryExpansionModule

Serve as a function for query expansion config in query prefix handlers. @see QueryPrefixHandler#queryExpansionConfig

Parent objects

UQLConfig

Attributes

name string
classId string

Name	Type	Default value	Description
name	string	-	Name of the module. Referenced to target this module in query expansion config.
classId	string	-	Full name of the Java class implementing the custom module.

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-

XML representation

<CustomQueryExpansionModule name="" classId="" ></CustomQueryExpansionModule>

PhoneticModule com.exalead.mercury.mami.search.v20.PhoneticModule

Expands the query using a phonetic resource.

Parent objects

UQLConfig

Attributes

name string
language string
resourceFile string

Name	Type	Default value	Description
name	string	-	Name of the module. Referenced to target this module in query expansion config.
language	string	-	Optional language code. If not specified, all are supported.
resourceFile	string	-	Optional phonetic rules file. If not specified, the default rules file is used.

XML representation

<PhoneticModule name="" language="en" resourceFile="" ></PhoneticModule>

ApproximateModule com.exalead.mercury.mami.search.v20.ApproximateModule

Serve as a function for query expansion config in query prefix handlers. @see QueryPrefixHandler#queryExpansionConfig

Parent objects

UQLConfig

Attributes

name string

Name	Type	Default value	Description
name	string	-	Name of the module. Referenced to target this module in query expansion config.

XML representation

<ApproximateModule name="" ></ApproximateModule>

StemmingModule com.exalead.mercury.mami.search.v20.StemmingModule

Extracts the root of a word.

Parent objects

UQLConfig

Attributes

name string
resourceDir string

Name	Type	Default value	Description
name	string	-	Name of the module. Referenced to target this module in query expansion config.
resourceDir	string	-	Optional resource directory. @unimplemented

XML representation

<StemmingModule name="" resourceDir="" ></StemmingModule>

RelatedTermModule com.exalead.mercury.mami.search.v20.RelatedTermModule

Expands the query using related terms.
Related terms are popular ngrams detected at indexing time. They allow to match popular concepts spanning several words.

Parent objects

UQLConfig

Attributes

name string
minSpan int
maxSpan int
keepLongestMatch boolean
withPartOfSpeech boolean

Name	Type	Default value	Description
name	string	-	Name of the module. Referenced to target this module in query expansion config.
minSpan	int	3	Minimum number of words (excluding stop words) in a generated related term.
maxSpan	int	6	Maximal number of words (excluding stop words) in a generated related term.
keepLongestMatch	boolean	True	Keeps only the longest match. For example, if you have 5 tokens ('a', 'b', 'c', 'd', 'e') and 4 related terms 'a', 'a-c', 'b-c-d' and 'd-e'. This option will only keep 'b-c-d' and will remove all other related terms.
withPartOfSpeech	boolean	True	Automatically adds a PartOfSpeechTagger to the list of processors.

XML representation

<RelatedTermModule name="" minSpan="3" maxSpan="6" keepLongestMatch="true" withPartOfSpeech="true" ></RelatedTermModule>

NGramModule com.exalead.mercury.mami.search.v20.NGramModule

Serve as a function for query expansion config in query prefix handlers. @see QueryPrefixHandler#queryExpansionConfig

Parent objects

UQLConfig

Attributes

name string
minSpan int
maxSpan int
keepLongestMatch boolean
withLemmatizer boolean
withStopWordDict boolean
addCanonicalForm boolean

Name	Type	Default value	Description
name	string	-	Name of the module. Referenced to target this module in query expansion config.
minSpan	int	2	Minimum number of words (excluding stop words) in a generated related term.
maxSpan	int	3	Maximal number of words (excluding stop words) in a generated related term.
keepLongestMatch	boolean	True	Keeps only the longest match. For example, if you have 5 tokens ('a', 'b', 'c', 'd', 'e') and 4 related terms 'a', 'a-c', 'b-c-d' and 'd-e'. This option will only keep 'b-c-d' and will remove all other related terms.
withLemmatizer	boolean	True	Adds a Lemmatizer to the list of processors automatically.
withStopWordDict	boolean	True	Adds a StopWord dictionary to the list of processors automatically.
addCanonicalForm	boolean	False

XML representation

<NGramModule name="" minSpan="2" maxSpan="3" keepLongestMatch="true" withLemmatizer="true" withStopWordDict="true" addCanonicalForm="false" ></NGramModule>

LemmatizationModule com.exalead.mercury.mami.search.v20.LemmatizationModule

Semantic module expanding the query using the lemmatized forms of terms.
The lemmatized form is the semantic root of a word.

Parent objects

UQLConfig

Attributes

name string
language string
resourceDir string

Name	Type	Default value	Description
name	string	-	Name of the module. Referenced to target this module in query expansion config.
language	string	-	Language in which the lemmatization should be enabled. If null, all languages are enabled.
resourceDir	string	-	Optional resource dir, default value is resource:///lemmatizer/LANGUAGE

XML representation

<LemmatizationModule name="" language="en" resourceDir="" ></LemmatizationModule>

SemanticProcessorModule com.exalead.mercury.mami.search.v20.SemanticProcessorModule

Query expansion module embedding a semantic processing pipeline.

The resulting annotation can be used to expand the query.

Parent objects

UQLConfig

Attributes

name string
annotation string
indexKind int

Name	Type	Default value	Description
name	string	-	Name of the module. Referenced to target this module in query expansion config.
annotation	string	-	The annotation name to use for expansion.
indexKind	int	2	The targeted index kind (0: exact search, 1: lowercase search, 2: normalized search).

Nested elements

SemanticProcessor*

Type	Wrapper	Description
SemanticProcessor*	-	The underlying semantic processors plugged in the pipeline.

XML representation

<SemanticProcessorModule name="" annotation="" indexKind="2" ></SemanticProcessorModule>

SynonymsModule com.exalead.mercury.mami.search.v20.SynonymsModule

Synonyms module broadening the scope of a user query.

Parent objects

UQLConfig

Attributes

name string
resourceFile string
scoreMultiWordsExpressionsAsSingleWords boolean

Name	Type	Default value	Description
name	string	-	Name of the module. Referenced to target this module in query expansion config.
resourceFile	string	-	The compiled synonyms configuration file, cannot be null.
scoreMultiWordsExpressionsAsSingleWords	boolean	False	If true, the multi-words synonym will have the maximum score of the underlying words multiplied by the number of original words spanned. By default, the sum is calculated.

XML representation

<SynonymsModule name="" resourceFile="" scoreMultiWordsExpressionsAsSingleWords="false" ></SynonymsModule>

NormalizationExceptionsModule com.exalead.mercury.mami.search.v20.NormalizationExceptionsModule

Defines a list of words that should not be normalized.

Parent objects

UQLConfig

Attributes

name string
resourceFile string

Name	Type	Default value	Description
name	string	-	Name of the module. Referenced to target this module in query expansion config.
resourceFile	string	resource://normalization-exceptions/synonyms.bin	The compiled normalization exceptions, cannot be null.

XML representation

<NormalizationExceptionsModule name="" resourceFile="resource://normalization-exceptions/synonyms.bin" ></NormalizationExceptionsModule>

URLMatcherModule com.exalead.mercury.mami.search.v20.URLMatcherModule

Serve as a function for query expansion config in query prefix handlers. @see QueryPrefixHandler#queryExpansionConfig

Parent objects

UQLConfig

Attributes

name string

Name	Type	Default value	Description
name	string	-	Name of the module. Referenced to target this module in query expansion config.

XML representation

<URLMatcherModule name="" ></URLMatcherModule>

TemplatePrefixHandler com.exalead.mercury.mami.search.v20.TemplatePrefixHandler

Replaces its content with the expansion of its template attribute, in which the __QUERY__ placeholder has itself been replaced by the content.
This prefix handler can be used as default prefix handler with some limitations (not compatible with linguistic prefix handlers like spellslike:).

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
template string

Name	Type	Default value	Description
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
template	string	-	Template to use to replace the content of this prefix handler. Occurrences of __QUERY__ inside of this template are replaced by the content itself.

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<TemplatePrefixHandler name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" template="" ></TemplatePrefixHandler>

CategoryPrefixHandler com.exalead.mercury.mami.search.v20.CategoryPrefixHandler

Searches for paths in a category field

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

indexField string
name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
root string
form enum(exact, lowercase, normalized)
tokenizationConfig string

Name	Type	Default value	Description
indexField	string	-	Index field to be targeted by this prefix.
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
root	string	-	Root category for this prefix
form	enum(exact, lowercase, normalized)	normalized	Form of the category path. Value can be null or one of exact lowercase normalized
tokenizationConfig	string	-

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<CategoryPrefixHandler indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" root="" form="normalized" tokenizationConfig="tok0" ></CategoryPrefixHandler>

RefinePrefixHandler com.exalead.mercury.mami.search.v20.RefinePrefixHandler

Base class to configure a prefix handler targeting an index field.

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

indexField string
name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
excluded boolean

Name	Type	Default value	Description
indexField	string	-	Index field to be targeted by this prefix.
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
excluded	boolean	False	Specifies whether the category must be included or excluded.

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<RefinePrefixHandler indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" excluded="false" ></RefinePrefixHandler>

DynamicNumericalPrefixHandler com.exalead.mercury.mami.search.v20.DynamicNumericalPrefixHandler

Searches for numerical values in a dynamic field.

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

meta string
invalidValueAction enum(remove, ignore, error)
indexField string
name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
defaultOperator enum(eq, le, lt, ge, gt, ne, range)

Name	Type	Default value	Description
meta	string	-	Meta name to search within. If null, you need to use the syntax prefix:meta:VALUE
invalidValueAction	enum(remove, ignore, error)	error	What to do when an invalid numerical value is passed: remove: removes the whole prefix XXX ignore: ? XXX error: fails the query
indexField	string	-	Index field to be targeted by this prefix.
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
defaultOperator	enum(eq, le, lt, ge, gt, ne, range)	eq	Default numerical operator to use if the field is given with ':'

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<DynamicNumericalPrefixHandler invalidValueAction="error" indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" defaultOperator="eq" ></DynamicNumericalPrefixHandler>

DynamicDatePrefixHandler com.exalead.mercury.mami.search.v20.DynamicDatePrefixHandler

Searches for dates and/or times in a dynamic field.

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

meta string
invalidValueAction enum(remove, ignore, error)
indexField string
name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
time boolean
defaultOperator enum(eq, le, lt, ge, gt, ne, range)

Name	Type	Default value	Description
meta	string	-	Meta name to search within. If null, you need to use the syntax prefix:meta:VALUE
invalidValueAction	enum(remove, ignore, error)	error	What to do when an invalid numerical value is passed: remove: removes the whole prefix XXX ignore: ? XXX error: fails the query
indexField	string	-	Index field to be targeted by this prefix.
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
time	boolean	False	Allows a "date+time" input.
defaultOperator	enum(eq, le, lt, ge, gt, ne, range)	eq	Default numerical operator to use if the field is given with ':'

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<DynamicDatePrefixHandler invalidValueAction="error" indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" time="false" defaultOperator="eq" ></DynamicDatePrefixHandler>

DynamicFullTextPrefixHandler (deprecated) com.exalead.mercury.mami.search.v20.DynamicFullTextPrefixHandler

Root class for prefix handlers searching for alphanumerical values in a dynamic field.

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

semanticContent boolean
indexFields string
performMAX boolean
patternMaxMatches int
tokenizationConfig string
meta string
invalidValueAction enum(remove, ignore, error)
indexField string
name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
matchingMode enum(exact, lowercase, normalized, exact_lowercase, exact_lowercase_normalized, lowercase_normalized)

Name	Type	Default value	Description
semanticContent	boolean	true	Is the content of this prefix "semantic"? If false, the content will not be tokenized nor passed through the linguistic expansion phase.
indexFields	string	-	Comma-separated list of index fields to be targeted by this prefix handler.
performMAX	boolean	False	Should we perform a MAX-like query rather than simple OR when targeting multiple fields?
patternMaxMatches	int	50	Max number of matches for the patterns under this prefix handler.
tokenizationConfig	string	-	Tokenization Config to use.
meta	string	-	Meta name to search within. If null, you need to use the syntax prefix:meta:VALUE
invalidValueAction	enum(remove, ignore, error)	error	What to do when an invalid numerical value is passed: remove: removes the whole prefix XXX ignore: ? XXX error: fails the query
indexField	string	-	Index field to be targeted by this prefix.
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
matchingMode	enum(exact, lowercase, normalized, exact_lowercase, exact_lowercase_normalized, lowercase_normalized)	normalized	The matching mode defines how the query words are looked up in the index. Value can be null or one of exact lowercase normalized exact_lowercase exact_lowercase_normalized lowercase_normalized

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<DynamicFullTextPrefixHandler semanticContent="true" indexFields="" performMAX="false" patternMaxMatches="50" tokenizationConfig="tok0" invalidValueAction="error" indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" matchingMode="normalized" ></DynamicFullTextPrefixHandler>

DynamicRawTextPrefixHandler com.exalead.mercury.mami.search.v20.DynamicRawTextPrefixHandler

Root class for prefix handlers searching for alphanumerical values in a dynamic field.

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

semanticContent boolean
indexFields string
performMAX boolean
patternMaxMatches int
tokenizationConfig string
meta string
invalidValueAction enum(remove, ignore, error)
indexField string
name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
kind int

Name	Type	Default value	Description
semanticContent	boolean	true	Is the content of this prefix "semantic"? If false, the content will not be tokenized nor passed through the linguistic expansion phase.
indexFields	string	-	Comma-separated list of index fields to be targeted by this prefix handler.
performMAX	boolean	False	Should we perform a MAX-like query rather than simple OR when targeting multiple fields?
patternMaxMatches	int	50	Max number of matches for the patterns under this prefix handler.
tokenizationConfig	string	-	Tokenization Config to use.
meta	string	-	Meta name to search within. If null, you need to use the syntax prefix:meta:VALUE
invalidValueAction	enum(remove, ignore, error)	error	What to do when an invalid numerical value is passed: remove: removes the whole prefix XXX ignore: ? XXX error: fails the query
indexField	string	-	Index field to be targeted by this prefix.
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
kind	int	-

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<DynamicRawTextPrefixHandler semanticContent="true" indexFields="" performMAX="false" patternMaxMatches="50" tokenizationConfig="tok0" invalidValueAction="error" indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" kind="" ></DynamicRawTextPrefixHandler>

NumericalPrefixHandler com.exalead.mercury.mami.search.v20.NumericalPrefixHandler

Searches for numerical values in a numerical field (integer or double)

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

defaultOperator enum(eq, le, lt, ge, gt, ne, range)
invalidValueAction enum(remove, ignore, error)
useRetrieved boolean
lsb int
msb int
indexField string
name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean

Name	Type	Default value	Description
defaultOperator	enum(eq, le, lt, ge, gt, ne, range)	eq	Default numerical operator to use if the field is given with ':'
invalidValueAction	enum(remove, ignore, error)	error	What to do when an invalid numerical value is passed: remove: removes the whole prefix XXX ignore: ? XXX error: fails the query
useRetrieved	boolean	False	Search using the "retrieved" part of the index. Searching in the retrieved part of a numerical field is faster when this condition is not very restrictive (i.e. when it does not divide the result set by more than 5). It also provides the ability to restrict search to a range of bits in the numerical field. However, it can only be used on retrievable fields. Note that using this prefix handler is the only way to search in virtual fields.
lsb	int	0	Least significant bit to search in (only if using retrieved)
msb	int	63	Most significant bit to search in (only if using retrieved)
indexField	string	-	Index field to be targeted by this prefix.
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<NumericalPrefixHandler defaultOperator="eq" invalidValueAction="error" useRetrieved="false" lsb="0" msb="63" indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" ></NumericalPrefixHandler>

DatePrefixHandler com.exalead.mercury.mami.search.v20.DatePrefixHandler

Searches for dates and/or times in a date field.

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

defaultOperator enum(eq, le, lt, ge, gt, ne, range)
invalidValueAction enum(remove, ignore, error)
useRetrieved boolean
lsb int
msb int
indexField string
name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
time boolean
format string

Name	Type	Default value	Description
defaultOperator	enum(eq, le, lt, ge, gt, ne, range)	eq	Default numerical operator to use if the field is given with ':'
invalidValueAction	enum(remove, ignore, error)	error	What to do when an invalid numerical value is passed: remove: removes the whole prefix XXX ignore: ? XXX error: fails the query
useRetrieved	boolean	False	Search using the "retrieved" part of the index. Searching in the retrieved part of a numerical field is faster when this condition is not very restrictive (i.e. when it does not divide the result set by more than 5). It also provides the ability to restrict search to a range of bits in the numerical field. However, it can only be used on retrievable fields. Note that using this prefix handler is the only way to search in virtual fields.
lsb	int	0	Least significant bit to search in (only if using retrieved)
msb	int	63	Most significant bit to search in (only if using retrieved)
indexField	string	-	Index field to be targeted by this prefix.
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
time	boolean	False	Allows a "date+time" input.
format	string	-

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<DatePrefixHandler defaultOperator="eq" invalidValueAction="error" useRetrieved="false" lsb="0" msb="63" indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" time="false" format="" ></DatePrefixHandler>

UnixTimestampPrefixHandler com.exalead.mercury.mami.search.v20.UnixTimestampPrefixHandler

Searches for dates and/or times in a date field, in Unix Timestamp format.

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

defaultOperator enum(eq, le, lt, ge, gt, ne, range)
invalidValueAction enum(remove, ignore, error)
useRetrieved boolean
lsb int
msb int
indexField string
name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
time boolean

Name	Type	Default value	Description
defaultOperator	enum(eq, le, lt, ge, gt, ne, range)	eq	Default numerical operator to use if the field is given with ':'
invalidValueAction	enum(remove, ignore, error)	error	What to do when an invalid numerical value is passed: remove: removes the whole prefix XXX ignore: ? XXX error: fails the query
useRetrieved	boolean	False	Search using the "retrieved" part of the index. Searching in the retrieved part of a numerical field is faster when this condition is not very restrictive (i.e. when it does not divide the result set by more than 5). It also provides the ability to restrict search to a range of bits in the numerical field. However, it can only be used on retrievable fields. Note that using this prefix handler is the only way to search in virtual fields.
lsb	int	0	Least significant bit to search in (only if using retrieved)
msb	int	63	Most significant bit to search in (only if using retrieved)
indexField	string	-	Index field to be targeted by this prefix.
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
time	boolean	True	Generates a DateTime. Otherwise, it generates a Date.

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<UnixTimestampPrefixHandler defaultOperator="eq" invalidValueAction="error" useRetrieved="false" lsb="0" msb="63" indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" time="true" ></UnixTimestampPrefixHandler>

PositionPrefixHandler com.exalead.mercury.mami.search.v20.PositionPrefixHandler

Searches in an alphanumerical field using the anchoring position (i.e. restricting the match to be at a specific position).

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

indexField string
name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
matchingMode enum(exact, lowercase, normalized, exact_lowercase, exact_lowercase_normalized, lowercase_normalized)
positionMatchMode enum(exact, startsWith, endsWith)
tokenizationConfig string

Name	Type	Default value	Description
indexField	string	-	Index field to be targeted by this prefix.
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
matchingMode	enum(exact, lowercase, normalized, exact_lowercase, exact_lowercase_normalized, lowercase_normalized)	normalized	The matching mode defines how the query words are looked up in the index. Value can be null or one of exact lowercase normalized exact_lowercase exact_lowercase_normalized lowercase_normalized
positionMatchMode	enum(exact, startsWith, endsWith)	exact	Choose between 'exact', 'startsWith' and 'endsWith'
tokenizationConfig	string	-

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<PositionPrefixHandler indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" matchingMode="normalized" positionMatchMode="exact" tokenizationConfig="tok0" ></PositionPrefixHandler>

SitePrefixHandler com.exalead.mercury.mami.search.v20.SitePrefixHandler

Searches for parts of URLs.

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

indexField string
name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
additionalIndexField string
groupsByToRemove string

Name	Type	Default value	Description
indexField	string	-	Index field to be targeted by this prefix.
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
additionalIndexField	string	-
groupsByToRemove	string		Names of the GroupBy nodes to remove (comma-separated).

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<SitePrefixHandler indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" additionalIndexField="" groupsByToRemove="" ></SitePrefixHandler>

GeoPrefixHandler com.exalead.mercury.mami.search.v20.GeoPrefixHandler

Sets the geographic field for WITHIN and DISTANCE searches

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

indexField string
name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean

Name	Type	Default value	Description
indexField	string	-	Index field to be targeted by this prefix.
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<GeoPrefixHandler indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" ></GeoPrefixHandler>

FullTextPrefixHandler com.exalead.mercury.mami.search.v20.FullTextPrefixHandler

Searches in alphanumerical fields (both static and dynamic ones).

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

semanticContent boolean
indexFields string
indexFieldWeights string
performMAX boolean
patternMaxMatches int
usePrefixedMappings boolean
patternSearch boolean
tokenizationConfig string
name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
matchingMode enum(exact, lowercase, normalized, exact_lowercase, exact_lowercase_normalized, lowercase_normalized, custom)
customKind int
useNestedPrefixAsDynamicMetaName boolean
forcedDynamicMetaName string
dictionaryName string

Name	Type	Default value	Description
semanticContent	boolean	true	Is the content of this prefix handler "semantic"? If false, the content will neither be tokenized nor passed through the linguistic expansion phase.
indexFields (UI: Index fields)	string	-	Comma-separated list of index fields targeted by this prefix handler.
indexFieldWeights (UI: Index fields weights)	string	-	Optional comma-separated list of weights applied to each index field targeted by this prefix handler.
performMAX (UI: Perform MAX)	boolean	False	Should we perform a MAX-like query rather than a simple OR when targeting multiple fields?
patternMaxMatches (UI: Max pattern matches)	int	50	Max number of matches for the patterns under this prefix handler.
usePrefixedMappings	boolean	False	Prefixes the value with the prefix and #. This must be used with the "prefixWithContext" option of analysis mappings. For example: <FullTextPrefixHandler name='stuff' prefixMatch='true' usePrefixedMappings='true' indexField='everything'/> will transform: stuff_number1:test into: everything:stuff_number1#test
patternSearch (UI: Use field dictionaries for pattern search)	boolean	False	Uses index field dictionaries for pattern expansion rather than linguistic dictionaries.
tokenizationConfig	string	-
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
matchingMode (UI: Matching mode)	enum(exact, lowercase, normalized, exact_lowercase, exact_lowercase_normalized, lowercase_normalized, custom)	normalized	Normalization level for lookup of the words associated with this prefix handler. Using the 'custom' matching mode requires the custom kind to be set. Value can be null or one of exact lowercase normalized exact_lowercase exact_lowercase_normalized lowercase_normalized custom
customKind (UI: Custom kind)	int	-	Custom indexing kind for lookup of the words associated with this prefix handler. This requires the matching mode to be set to 'custom'. Custom kinds must be used together with custom content targets at indexing time.
useNestedPrefixAsDynamicMetaName (UI: Use nested prefix as dyn. meta)	boolean	False	Uses a nested prefix handler as dynamic meta name to search within a dynamic alphanumerical field. This requires the Store meta names option to be enabled on the dynamic field. This is incompatible with the "forcedDynamicMetaName" option. For example, on a prefix handler named "myprefix" configured to target a "mydynfield" index field, a "myprefix:mymeta:myvalue" query will search for "myvalue" into the "mymeta" meta of "mydynfield". Note that a "myprefix:myvalue" query will not lookup into any meta, and will not find anything.
forcedDynamicMetaName (UI: Dyn. meta name)	string	-	Dynamic meta to search when targeting a dynamic alphanumerical field. Use this to restrict the search in a dynamic field to text that appears within this meta. This requires the Store meta names option to be enabled on the dynamic field. This is incompatible with the Use nested prefix as dyn. meta option. For example, with this option set to "mymeta", on a prefix handler named "myprefix" configured to target index field "mydynfield", a query "myprefix:myvalue" will search for "myvalue" into the "mymeta" meta of "mydynfield". It is possible to set a list of meta separated by ',' to mimic the behavior of a list of index fields (OR on the different metas)
dictionaryName (UI: Dictionary)	string	-	Dictionary to target

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<FullTextPrefixHandler semanticContent="true" indexFields="" indexFieldWeights="" performMAX="false" patternMaxMatches="50" usePrefixedMappings="false" patternSearch="false" tokenizationConfig="tok0" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" matchingMode="normalized" customKind="" useNestedPrefixAsDynamicMetaName="false" forcedDynamicMetaName="" dictionaryName="" ></FullTextPrefixHandler>

RawTextPrefixHandler com.exalead.mercury.mami.search.v20.RawTextPrefixHandler

No documentation for this element.

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

semanticContent boolean
indexFields string
indexFieldWeights string
performMAX boolean
patternMaxMatches int
usePrefixedMappings boolean
patternSearch boolean
tokenizationConfig string
name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
kind int

Name	Type	Default value	Description
semanticContent	boolean	true	Is the content of this prefix handler "semantic"? If false, the content will neither be tokenized nor passed through the linguistic expansion phase.
indexFields (UI: Index fields)	string	-	Comma-separated list of index fields targeted by this prefix handler.
indexFieldWeights (UI: Index fields weights)	string	-	Optional comma-separated list of weights applied to each index field targeted by this prefix handler.
performMAX (UI: Perform MAX)	boolean	False	Should we perform a MAX-like query rather than a simple OR when targeting multiple fields?
patternMaxMatches (UI: Max pattern matches)	int	50	Max number of matches for the patterns under this prefix handler.
usePrefixedMappings	boolean	False	Prefixes the value with the prefix and #. This must be used with the "prefixWithContext" option of analysis mappings. For example: <FullTextPrefixHandler name='stuff' prefixMatch='true' usePrefixedMappings='true' indexField='everything'/> will transform: stuff_number1:test into: everything:stuff_number1#test
patternSearch (UI: Use field dictionaries for pattern search)	boolean	False	Uses index field dictionaries for pattern expansion rather than linguistic dictionaries.
tokenizationConfig	string	-
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
kind	int	-

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<RawTextPrefixHandler semanticContent="true" indexFields="" indexFieldWeights="" performMAX="false" patternMaxMatches="50" usePrefixedMappings="false" patternSearch="false" tokenizationConfig="tok0" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" kind="" ></RawTextPrefixHandler>

LinguisticOptionsPrefixHandler com.exalead.mercury.mami.search.v20.LinguisticOptionsPrefixHandler

Sets a query expansion config on the prefix handler content, but does not actually perform search.

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
dictionaryName string

Name	Type	Default value	Description
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
dictionaryName (UI: Dictionary)	string	-	Dictionary to target

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<LinguisticOptionsPrefixHandler name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" dictionaryName="" ></LinguisticOptionsPrefixHandler>

SimilarDocumentPrefixHandler com.exalead.mercury.mami.search.v20.SimilarDocumentPrefixHandler

Manually looks up similar values in documents.

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

Name	Type	Default value	Description
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
similarityThreshold	double	0.1	The mininum similarity score for two documents to be considered similar. Value must be between 0 and 1.
minNumberOfKeywordPerDoc	long	5	Documents with fewer keywords than this constant are ignored. Value must be higher or equal to 0.
minNumberOfMatchedKeywords	long	4	Documents that do not share at least this number of keywords with the reference document are ignored. Value must be higher or equal to 0.
similarityVirtualField	string	similarity	Name of the virtual field computing the final similarity score.
dictionary	string	dict0	Related-terms dictionary to use.
keywordField	string	keywordv2	Searchable value field indexing the related-terms.

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<SimilarDocumentPrefixHandler name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" similarityThreshold="0.1" minNumberOfKeywordPerDoc="5" minNumberOfMatchedKeywords="4" similarityVirtualField="similarity" dictionary="dict0" keywordField="keywordv2" ></SimilarDocumentPrefixHandler>

SplitPrefixHandler com.exalead.mercury.mami.search.v20.SplitPrefixHandler

Searches for expressions, within the bounds of separators.

For example, with a Split prefix handler named "mypage" using separator INPAGE, mypage:(a AND b) will only match if "a" and "b" appear on the same page in the document.

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
sep enum(INPARAGRAPH, INSENTENCE, INPAGE)

Name	Type	Default value	Description
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
sep	enum(INPARAGRAPH, INSENTENCE, INPAGE)	INPAGE	One of the separators: Value can be null or one of INPARAGRAPH INSENTENCE INPAGE .

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<SplitPrefixHandler name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" sep="INPAGE" ></SplitPrefixHandler>

KeywordPrefixHandler com.exalead.mercury.mami.search.v20.KeywordPrefixHandler

Searches for a specific Related Term resolving its indexed identifier (internal) by querying the dictionary.

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
keywordField string

Name	Type	Default value	Description
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
keywordField	string	keywordv2	Name of the value-field where keywords are indexed.

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<KeywordPrefixHandler name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" keywordField="keywordv2" ></KeywordPrefixHandler>

EnumFacetPrefixHandler com.exalead.mercury.mami.search.v20.EnumFacetPrefixHandler

No documentation for this element.

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
enumFacetId string
mode enum(exact, tolower, toupper)
invalidValueAction enum(remove, ignore, error)

Name	Type	Default value	Description
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
enumFacetId	string	-	The EnumFacet id this prefix handler refers to. Value must be any non-empty string
mode	enum(exact, tolower, toupper)	exact	What to do with the prefix handler value: "exact": Does nothing "tolower": Converts the value to lower case "toupper": Converts the value to upper case { @code enum ("exact", "tolower", "toupper")}
invalidValueAction	enum(remove, ignore, error)	error	What to do when an invalid value is passed: remove: removes the whole prefix XXX ignore: ? XXX error: fails the query

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<EnumFacetPrefixHandler name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" enumFacetId="" mode="exact" invalidValueAction="error" ></EnumFacetPrefixHandler>

MultiQueryPrefixHandler com.exalead.mercury.mami.search.v20.MultiQueryPrefixHandler

Prefix handler dispatching a query to several other prefix handlers.

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
uqlQueries string

Name	Type	Default value	Description
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
uqlQueries	string	-	Comma-separated list of additional UQL queries to build. Each UQL query is mapped to a specific prefix handler.

Nested elements

fromDataModelQueryPrefixHandler
QueryPrefixHandler*

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".
QueryPrefixHandler*	-	List of prefix handlers to query, identified by their names.

XML representation

<MultiQueryPrefixHandler name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" uqlQueries="" ></MultiQueryPrefixHandler>

CustomPrefixHandler com.exalead.mercury.mami.search.v20.CustomPrefixHandler

Prefix handler implemented using custom Java code.

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

name string
aliases string
queryExpansionConfig string
dataModelState string
dataModelClass string
dataModelProperty string
prefixMatch boolean
classId string

Name	Type	Default value	Description
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
classId	string	-	Reference of the Java class that implements the query prefix handler. This class must implement the interface com.exalead.search.query.prefix.CustomPrefixHandler

Nested elements

fromDataModelQueryPrefixHandler
KeyValue*

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".
KeyValue*	-

XML representation

<CustomPrefixHandler name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" classId="" ></CustomPrefixHandler>

UnitsOfMeasurementPrefixHandler com.exalead.mercury.mami.search.v20.UnitsOfMeasurementPrefixHandler

Searches for documents by resolving and converting values in the required unit of measurement

Parent objects

MultiQueryPrefixHandler
UQLConfig

Attributes

Name	Type	Default value	Description
name	string	-	Name of the prefix handler that should be handled. This is the prefix name that can be used in user queries. For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
aliases	string	-	Comma-separated list of aliases for this prefix handler. Any of these aliases can be used in the query for this prefix handler.
queryExpansionConfig	string	-	Specifies how to expand the text content of this prefix handler. Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
dataModelState	string	-	Is this query prefix handler managed by a data model? @enum{null,auto,customized}. If null, this prefix handler is not related to a data model. If "auto", this prefix handler is auto-generated by a data model. If "customized", this prefix handler was auto-generated by a data model and then customized.
dataModelClass	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
dataModelProperty	string	-	If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
prefixMatch	boolean	False	The prefix handler matches all strings beginning with "name". For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
indexFieldTarget	string	-	The index field
indexFieldUnitSymbol	string	-	The output unit symbol
defaultUnitSymbol	string	-	(Optional) A default unit symbol

Nested elements

fromDataModelQueryPrefixHandler

Type	Wrapper	Description
QueryPrefixHandler	fromDataModel	If dataModelState is "customized", you will find here the original prefix handler generated by the data model. Use this to easily revert to "auto" state from "customized".

XML representation

<UnitsOfMeasurementPrefixHandler name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" indexFieldTarget="" indexFieldUnitSymbol="" ></UnitsOfMeasurementPrefixHandler>

SWConfig com.exalead.search.v30.SWConfig

No documentation for this element.

Parent objects

UQLConfig

Attributes

stopWordsOnlyQueryAction enum(perform, error, empty)
stopWordsOnlyQueryReportMode enum(global, local, quiet)

Name	Type	Default value	Description
stopWordsOnlyQueryAction	enum(perform, error, empty)	perform	What to do when a query contains only stop words. One of: "perform": Performs the full query. "error": Sends an error. "empty": Sends an empty answer.
stopWordsOnlyQueryReportMode	enum(global, local, quiet)	local	Specifies how to report a stop-word only query error in 'Troubleshooting > Logs': quiet: No log. local: Error is reported in the Process log. global: Error is reported in both Process and Global logs.

Nested elements

SWList*

Type	Wrapper	Description
SWList*	-

XML representation

<SWConfig stopWordsOnlyQueryAction="perform" stopWordsOnlyQueryReportMode="local" ></SWConfig>

SWList com.exalead.search.v30.SWList

No documentation for this element.

Parent objects

SWConfig

Attributes

language iso code
resourceFile string
action enum(opt, remove)
level enum(normalized, exact, lowercase)

Name	Type	Default value	Description
language	iso code	-	Language code
resourceFile	string	-
action	enum(opt, remove)	opt
level	enum(normalized, exact, lowercase)	exact

XML representation

<SWList language="en" resourceFile="" action="opt" level="exact" ></SWList>

SemanticQueryAnalysisConfig com.exalead.mercury.mami.search.v20.SemanticQueryAnalysisConfig

No documentation for this element.

Parent objects

UQLConfig

Attributes

resourceDir string
unusedWordPolicy enum(mandatory, optional, remove, as-is)
language string
queryNames string
htmlLogPath string
singleMatch boolean
overlappingMatches boolean

Name	Type	Default value	Description
resourceDir	string	-	URL for the directory containing the semantic extractor binaries (data://, file;// or resource://).
unusedWordPolicy	enum(mandatory, optional, remove, as-is)	mandatory	Unused query word policy
language	string	-	The query language for which the analyzer is activated. If null, it is activated whatever the language.
queryNames	string	_default_	A list of comma-separated query names defining which parts of the query will be processed (default is _default_).
htmlLogPath	string	-	Path to a debug HTML file
singleMatch	boolean	True	If true, only the best match is selected for rewriting, if false, several rules may trigger simultaneously.
overlappingMatches	boolean	True	If true, allows rules to match on overlapping query parts. Only makes sense when singleMatch is false.

Nested elements

SemanticProcessor*

Type	Wrapper	Description
SemanticProcessor*	-	A list of semantic processor to apply before the semantic extractor.

XML representation

<SemanticQueryAnalysisConfig resourceDir="" unusedWordPolicy="mandatory" queryNames="_default_" singleMatch="true" overlappingMatches="true" ></SemanticQueryAnalysisConfig>

VirtualFieldDefinition com.exalead.search.v30.VirtualFieldDefinition

No documentation for this element.

Parent objects

SearchLogic

Attributes

name string
expr string

Name	Type	Default value	Description
name	string	-	Name of this virtual field
expr	string	-	Virtual field expression

XML representation

<VirtualFieldDefinition name="" expr="" ></VirtualFieldDefinition>

LocalOnlySearchTarget com.exalead.mercury.mami.search.v20.LocalOnlySearchTarget

No documentation for this element.

Attributes

Name	Type	Default value	Description
targetName	string	-	The name of this search target. This is the name to use in the search-api to reference a particular search target. Value must be a string matching '[A-Za-z0-9_]+'.
watchDogConnectTimeoutMS	int	1000	Global 'connect' timeout for connection to index slices maintained by this search target. This is an advanced parameter that should generally not be modified.
watchDogReadTimeoutMS	int	5000	Global 'read' timeout for connection to index slices maintainted by this search target. This is an advanced parameter that should generally not be modified.
nbVirtualPartitionPerSlice	int	1	Split every slices of the target in nbVirtualPartitionPerSlice virtual partitions. Emulate multislice behavior on a single slice, each partition acting on a subpart of the physical slice.
buildGroups	string	-	Comma-separated list of build groups. All local instances of the slices of this build group "will be queried

XML representation

<LocalOnlySearchTarget targetName="" watchDogConnectTimeoutMS="1000" watchDogReadTimeoutMS="5000" nbVirtualPartitionPerSlice="1" buildGroups="" ></LocalOnlySearchTarget>

SimpleSearchTarget com.exalead.mercury.mami.search.v20.SimpleSearchTarget

The search server evenly distributes search requests among all available index replicas. It also checks the health of all index replicas periodically, to know whether to ignore those that are not available.

Attributes

Name	Type	Default value	Description
targetName	string	-	The name of this search target. This is the name to use in the search-api to reference a particular search target. Value must be a string matching '[A-Za-z0-9_]+'.
watchDogConnectTimeoutMS	int	1000	Global 'connect' timeout for connection to index slices maintained by this search target. This is an advanced parameter that should generally not be modified.
watchDogReadTimeoutMS	int	5000	Global 'read' timeout for connection to index slices maintainted by this search target. This is an advanced parameter that should generally not be modified.
nbVirtualPartitionPerSlice	int	1	Split every slices of the target in nbVirtualPartitionPerSlice virtual partitions. Emulate multislice behavior on a single slice, each partition acting on a subpart of the physical slice.

Nested elements

TargetSlices*

Type	Wrapper	Description
TargetSlices*	-	Slice instances to query in this target

XML representation

<SimpleSearchTarget targetName="" watchDogConnectTimeoutMS="1000" watchDogReadTimeoutMS="5000" nbVirtualPartitionPerSlice="1" ></SimpleSearchTarget>

TargetSlices com.exalead.mercury.mami.search.v20.TargetSlices

No documentation for this element.

Parent objects

SimpleSearchTarget

Attributes

buildGroups string
slices string
instances string
power int

Name	Type	Default value	Description
buildGroups	string	-	Comma-separated list of build groups. If null, all build groups are considered
slices	string	-	Comma-separated list of index slices. If null, all slices are considered
instances	string	-	Comma-separated list of slice instances. If null, all instances are considered
power	int	1	The power ratio defines the ratio of the inbound trafic to divert to a given instance (among all instances of a given slice). A slice instance with power=2 will receive twice as much traffic as a slice instance with power=1. If power=0, this slice instance is used as failover only if all other instances of this slice have failed.

XML representation

<TargetSlices buildGroups="" slices="" instances="" power="1" ></TargetSlices>

LegacySearchTarget (deprecated) com.exalead.mercury.mami.search.v20.LegacySearchTarget

CloudView 5 compatible legacy search target. This target should generally not be used in new deployments.

Attributes

Name	Type	Default value	Description
targetName	string	-	The name of this search target. This is the name to use in the search-api to reference a particular search target. Value must be a string matching '[A-Za-z0-9_]+'.
watchDogConnectTimeoutMS	int	1000	Global 'connect' timeout for connection to index slices maintained by this search target. This is an advanced parameter that should generally not be modified.
watchDogReadTimeoutMS	int	5000	Global 'read' timeout for connection to index slices maintainted by this search target. This is an advanced parameter that should generally not be modified.
nbVirtualPartitionPerSlice	int	1	Split every slices of the target in nbVirtualPartitionPerSlice virtual partitions. Emulate multislice behavior on a single slice, each partition acting on a subpart of the physical slice.

Nested elements

ContentGroupConfig*

Type	Wrapper	Description
ContentGroupConfig*	-

XML representation

<LegacySearchTarget targetName="" watchDogConnectTimeoutMS="1000" watchDogReadTimeoutMS="5000" nbVirtualPartitionPerSlice="1" ></LegacySearchTarget>

ContentGroupConfig com.exalead.mercury.mami.search.v20.ContentGroupConfig

Content group configuration. A content group is a list of build groups

Parent objects

LegacySearchTarget

Attributes

contentGroup string
type string

Name	Type	Default value	Description
contentGroup	string	-	Name of the content group
type	string	-	Type of the content group. Must be used only as reference by custom code for complex multi-content groups installations.

Nested elements

BuildGroupConfig*

Type	Wrapper	Description
BuildGroupConfig*	-

XML representation

<ContentGroupConfig contentGroup="" type="" ></ContentGroupConfig>

BuildGroupConfig com.exalead.mercury.mami.search.v20.BuildGroupConfig

Build group configuration.

Parent objects

ContentGroupConfig

Attributes

buildGroup string

Name	Type	Default value	Description
buildGroup	string	-	Name of the build group.

Nested elements

BuildGroupException*

Type	Wrapper	Description
BuildGroupException*	-	List of build group exceptions.

XML representation

<BuildGroupConfig buildGroup="" ></BuildGroupConfig>

BuildGroupException com.exalead.mercury.mami.search.v20.BuildGroupException

No documentation for this element.

Parent objects

BuildGroupConfig

Attributes

slice int
sliceInstance string
enabled boolean
power int

Name	Type	Default value	Description
slice	int	-	slice number
sliceInstance	string	-	Name of the instance
enabled	boolean	False	Enables search for this instance.
power	int	1	The power ratio defines the ratio of the inbound trafic to divert to a given instance (among all instances of a given slice). A slice instance with power=2 will receive twice as much traffic as a slice instance with power=1. If power=0, this slice instance is used as failover only if all other instances of this slice have failed.

XML representation

<BuildGroupException slice="" sliceInstance="" enabled="false" power="1" ></BuildGroupException>

SearchAPIConfig com.exalead.mercury.mami.search.v20.SearchAPIConfig

No documentation for this element.

Attributes

name string
extraClassPath string

Name	Type	Default value	Description
name	string	-	Name of this API config
extraClassPath	string	-	Extra classpath for the web application. Comma or semicolon separated path of filenames or URLs pointing to directories or jar files. Directories should end with '/'. Used for JDBC for reporting and custom dependencies.

Nested elements

Command*

Type	Wrapper	Description
Command*	-

XML representation

<SearchAPIConfig name="" extraClassPath="" ></SearchAPIConfig>

SearchCommand com.exalead.mercury.mami.search.v20.SearchCommand

Search API command

Parent objects

SearchAPIConfig

Attributes

Name	Type	Default value	Description
defaultLogic	string	-	Default search logic (can be overriden using API parameters).
defaultTarget	string	-	Default search target (can be overriden using API parameters).
targetReporting	string	-	Target reporting to use for the search queries.
customRunnerFactoryClassId	string	-
maxConcurrentQueries	int	0	The maximum number of concurrent queries processed by the search server. If this limit is reached, incoming queries will be queued. 0 means the number of core of the search server times two.
maxWaitingQueries	int	200	When the maximum number of concurrent queries is reached the queries are put in a queue. This setting configures the maximum length of this queue. When this maximum is reached the query is rejected.
base	string	/search-api	HTTP path for this command
enableSOAP	boolean	False
isAliveSynchronous	boolean	True	Should the query be performed synchronously on isAlive call or should it run in a background thread, and cached result be returned.
isAliveAsynchronousDelayS	int	5	Elapsed time between two queries if the query is performed asynchronously.

Nested elements

customRunnerFactoryParamsKeyValue*
isAliveQueriesStringValue*
SearchCache*

Type	Wrapper	Description
KeyValue*	customRunnerFactoryParams
StringValue*	isAliveQueries
SearchCache*	-	List of caches for this search command. Cache entries will be looked up in all caches, in the order in which they are given. Incoming queries will be inserted in the first cache which accepts them. This allows complex configurations where you warmup some queries, and don't want them to be evicted by regular traffic `<TimeBasedSearchCache name="time" cacheQueries="false"> <WarmUpQuery args="q=#all" /> </TimeBasedSearchCache> <ConsistentSearchCache name="default" /> <SearchCommand caches="time,default" />`

XML representation

<SearchCommand defaultLogic="" defaultTarget="" targetReporting="" customRunnerFactoryClassId="" maxConcurrentQueries="0" maxWaitingQueries="200" base="/search-api" enableSOAP="false" isAliveSynchronous="true" isAliveAsynchronousDelayS="5" ></SearchCommand>

ConsistentSearchCache com.exalead.mercury.mami.search.v20.ConsistentSearchCache

This cache is always fully evicted each time a new generation of the index is available.

Warmup starts shortly afterwards, and is restarted for each new generation.

Parent objects

SearchCommand

Attributes

Name	Type	Default value	Description
name	string	-
warmupWorkRatio	double	1.0	Maximum ratio of time each warm-up thread will work.
warmupThreads	int	2	Maximum number of threads for warmup
cacheQueries	boolean	True	Should this cache handle caching of incoming queries. If this is false, only warmup queries will be cached. Incoming queries with the "cache=no" argument will never be cached. Incoming queries with the "cache=force:1" argument will override this setting. Incoming queries with the "cache=evict" argument will evict the query from all caches if possible.
maxCachedQueries	int	1000	Maximum number of queries to cache. Additional queries are evicted using a LRU algorithm.
ignoreFromCacheKey	string	cache,user_id,userquery_id,usersession_id	List of url query parameters to ignore from the cache key construction. Separate each field with a come ",".
warmupDelayS	int	10	Elapsed time in seconds between imports on one of the build group slices, then the warmup starts.

Nested elements

WarmUpQuery*

Type	Wrapper	Description
WarmUpQuery*	-	Queries that will be automatically cached

XML representation

<ConsistentSearchCache name="" warmupWorkRatio="1.0" warmupThreads="2" cacheQueries="true" maxCachedQueries="1000" ignoreFromCacheKey="cache,user_id,userquery_id,usersession_id" warmupDelayS="10" ></ConsistentSearchCache>

WarmUpQuery com.exalead.mercury.mami.search.v20.WarmUpQuery

Query that will automatically be cached.

Parent objects

ConsistentSearchCache
SearchCache
TimeBasedSearchCache

Attributes

args string

Name	Type	Default value	Description
args	string	-	Arguments of the query to autocache, in URL form. For example, q=test%20stuff&logic=sl0

XML representation

<WarmUpQuery args="" ></WarmUpQuery>

TimeBasedSearchCache com.exalead.mercury.mami.search.v20.TimeBasedSearchCache

The elements of this cache are evicted based on time policy.

They are not evicted each time a new generation of the index is available. Therefore, this cache can contain stale data.
Warmup queries are refreshed before their cache expiry

Parent objects

SearchCommand

Attributes

Name	Type	Default value	Description
name	string	-
warmupWorkRatio	double	1.0	Maximum ratio of time each warm-up thread will work.
warmupThreads	int	2	Maximum number of threads for warmup
cacheQueries	boolean	True	Should this cache handle caching of incoming queries. If this is false, only warmup queries will be cached. Incoming queries with the "cache=no" argument will never be cached. Incoming queries with the "cache=force:1" argument will override this setting. Incoming queries with the "cache=evict" argument will evict the query from all caches if possible.
maxCachedQueries	int	1000	Maximum number of queries to cache. Additional queries are evicted using a LRU algorithm.
ignoreFromCacheKey	string	cache,user_id,userquery_id,usersession_id	List of url query parameters to ignore from the cache key construction. Separate each field with a come ",".
expiryTimeS	int	3600	Expiry time for queries in the cache

Nested elements

WarmUpQuery*

Type	Wrapper	Description
WarmUpQuery*	-	Queries that will be automatically cached

XML representation

<TimeBasedSearchCache name="" warmupWorkRatio="1.0" warmupThreads="2" cacheQueries="true" maxCachedQueries="1000" ignoreFromCacheKey="cache,user_id,userquery_id,usersession_id" expiryTimeS="3600" ></TimeBasedSearchCache>

QueryExpansionCommand com.exalead.mercury.mami.search.v20.QueryExpansionCommand

A BaseSearchCommand used for query expansion

Parent objects

SearchAPIConfig

Attributes

Name	Type	Default value	Description
defaultLogic	string	-	Default search logic (can be overriden using API parameters).
defaultTarget	string	-	Default search target (can be overriden using API parameters).
targetReporting	string	-	Target reporting to use for the search queries.
customRunnerFactoryClassId	string	-
maxConcurrentQueries	int	0	The maximum number of concurrent queries processed by the search server. If this limit is reached, incoming queries will be queued. 0 means the number of core of the search server times two.
maxWaitingQueries	int	200	When the maximum number of concurrent queries is reached the queries are put in a queue. This setting configures the maximum length of this queue. When this maximum is reached the query is rejected.
base	string	/search-api	HTTP path for this command

Nested elements

customRunnerFactoryParamsKeyValue*

Type	Wrapper	Description
KeyValue*	customRunnerFactoryParams

XML representation

<QueryExpansionCommand defaultLogic="" defaultTarget="" targetReporting="" customRunnerFactoryClassId="" maxConcurrentQueries="0" maxWaitingQueries="200" base="/search-api" ></QueryExpansionCommand>

SpellCheckCommand com.exalead.mercury.mami.search.v20.SpellCheckCommand

Search API command

Parent objects

SearchAPIConfig

Attributes

Name	Type	Default value	Description
defaultLogic	string	-	Default search logic (can be overriden using API parameters).
defaultTarget	string	-	Default search target (can be overriden using API parameters).
targetReporting	string	-	Target reporting to use for the search queries.
customRunnerFactoryClassId	string	-
maxConcurrentQueries	int	0	The maximum number of concurrent queries processed by the search server. If this limit is reached, incoming queries will be queued. 0 means the number of core of the search server times two.
maxWaitingQueries	int	200	When the maximum number of concurrent queries is reached the queries are put in a queue. This setting configures the maximum length of this queue. When this maximum is reached the query is rejected.
base	string	/search-api	HTTP path for this command

Nested elements

customRunnerFactoryParamsKeyValue*

Type	Wrapper	Description
KeyValue*	customRunnerFactoryParams

XML representation

<SpellCheckCommand defaultLogic="" defaultTarget="" targetReporting="" customRunnerFactoryClassId="" maxConcurrentQueries="0" maxWaitingQueries="200" base="/search-api" ></SpellCheckCommand>

SecurityCommand com.exalead.mercury.mami.search.v20.SecurityCommand

Search API command

Parent objects

SearchAPIConfig

Attributes

base string

Name	Type	Default value	Description
base	string	/search-api	HTTP path for this command

XML representation

<SecurityCommand base="/search-api" ></SecurityCommand>

SuggestCommand com.exalead.mercury.mami.search.v20.SuggestCommand

Search API command

Parent objects

SearchAPIConfig

Attributes

base string
targetReporting string
tokenizationConfig string
autocomplete boolean
recursive boolean

Name	Type	Default value	Description
base	string	/search-api	HTTP path for this command
targetReporting	string	-
tokenizationConfig	string	-	Tokenization config name used to parse the query.
autocomplete	boolean	False	Appends suggest results to the last query word being typed in the search field to autocomplete it. It only works with a suggest built with the Subexpr matching or Substring matching build options.
recursive	boolean	False	Discards the leftmost word of the query progressively. It sends each new subquery to the suggest(s) until the max number of suggestions is reached, or until there is no more word to use. For example, for a query "A B C", the suggest will be called 3 times, with "A B C", "B C", and "C".

XML representation

<SuggestCommand base="/search-api" targetReporting="" tokenizationConfig="tok0" autocomplete="false" recursive="false" ></SuggestCommand>

DictionaryCommand com.exalead.mercury.mami.search.v20.DictionaryCommand

Search API command

Parent objects

SearchAPIConfig

Attributes

base string

Name	Type	Default value	Description
base	string	/search-api	HTTP path for this command

XML representation

<DictionaryCommand base="/search-api" ></DictionaryCommand>

GeoCommand com.exalead.mercury.mami.search.v20.GeoCommand

Search API command

Parent objects

SearchAPIConfig

Attributes

base string

Name	Type	Default value	Description
base	string	/search-api	HTTP path for this command

XML representation

<GeoCommand base="/search-api" ></GeoCommand>

IsAliveCommand com.exalead.mercury.mami.search.v20.IsAliveCommand

Search API command

Parent objects

SearchAPIConfig

Attributes

Name	Type	Default value	Description
base	string	/search-api	HTTP path for this command
queryToPerform	string	q=IsAliveQuery	The isAlive can optionally perform a query to check that all components are ready. If you want this behavior, enter the search parameters to query for here. For example "q=test&logic=sl0". If this is null, the query won't be performed.
targetQueryCommand	string	/search-api	Command path on which the query will be performed.
performQuerySynchronously	boolean	False	Should the query be performed synchronously on isAlive call or should it run in a background thread, and cached result be returned.
asynchronousQueryDelayS	int	5	Elapsed between two queries if the query is performed asynchronously.

XML representation

<IsAliveCommand base="/search-api" queryToPerform="q=IsAliveQuery" targetQueryCommand="/search-api" performQuerySynchronously="false" asynchronousQueryDelayS="5" ></IsAliveCommand>

DocumentFetchCommand com.exalead.mercury.mami.search.v20.DocumentFetchCommand

Abstract class for a fetch command.

Parent objects

SearchAPIConfig

Attributes

base string

Name	Type	Default value	Description
base	string	/search-api	HTTP path for this command

XML representation

<DocumentFetchCommand base="/search-api" ></DocumentFetchCommand>

PreviewCommand com.exalead.mercury.mami.search.v20.PreviewCommand

Retrieves rich HTML preview of the documents.

Parent objects

SearchAPIConfig

Attributes

Name	Type	Default value	Description
base	string	/search-api	HTTP path for this command
defaultLogic	string	-
defaultHighlightPrefixHandlers	string		Comma-separated list of prefix handlers that target the preview for highlight. The default prefix handler is "text".
defaultHighlightFacetIds	string		Comma-separated list of facets. Refinements on categories are highlighted in the preview if they have a parent category in this attribute.
convertAddresses	string	-	Semicolon separated list of convert instance names or urls to use.

XML representation

<PreviewCommand base="/search-api" defaultLogic="" defaultHighlightPrefixHandlers="" defaultHighlightFacetIds="" ></PreviewCommand>

ThumbnailCommand com.exalead.mercury.mami.search.v20.ThumbnailCommand

Abstract class for a fetch command.

Parent objects

SearchAPIConfig

Attributes

base string
maxConcurrentComputations int
defaultWidth int
defaultHeight int
allowUnsecurePrecomputedThumbnails boolean
convertAddresses string

Name	Type	Default value	Description
base	string	/search-api	HTTP path for this command
maxConcurrentComputations	int	4	Maximum number of concurrent thumbnail computations. Thumbnail queries that do not require a computation (cache hits) are not considered.
defaultWidth	int	120	Width of the computed thumbnail, if not specified in request
defaultHeight	int	90	Height of the computed thumbnail, if not specified in request
allowUnsecurePrecomputedThumbnails	boolean	False	When security is enforced on the fetch request, should a precomputed thumbnail be returned, if any. Security is not enforced on precomputed thumbnails: any logged user knowing the URL of a document could retrieve its precomputed thumbnail.
convertAddresses	string	-	Semicolon separated list of convert instance names or urls to use.

Nested elements

ThumbnailsCache

Type	Wrapper	Description
ThumbnailsCache	-	Cache for the thumbnails computed on the fly

XML representation

<ThumbnailCommand base="/search-api" maxConcurrentComputations="4" defaultWidth="120" defaultHeight="90" allowUnsecurePrecomputedThumbnails="false" ></ThumbnailCommand>

LocalThumbnailsCache com.exalead.mercury.mami.search.v20.LocalThumbnailsCache

The local thumbnails cache configuration.

Parent objects

ThumbnailCommand

Attributes

Name	Type	Default value	Description
secure	boolean	True	Thumbnails are only cached and retrieved if enforce_security is set to false for the request. If false, thumbnails are always cached and retrieved. Warning: any logged user knowing the URL of a document could then retrieve a thumbnail for it.
maxHeight	int	120	Maximum height of thumbnails for them to be cacheable
maxWidth	int	120	Maximum width of thumbnails for them to be cacheable
memoryEntries	int	20	Number of thumbnails to keep in memory cache
diskEntries	int	2000	Number of thumbnails to keep in disk cache
timeToLiveS	int	7200	Time period in seconds during which a thumbnail remains in cache. Note that when there is a cache, we also send an expires HTTP header for the thumbnails, with the same time period.

XML representation

<LocalThumbnailsCache secure="true" maxHeight="120" maxWidth="120" memoryEntries="20" diskEntries="2000" timeToLiveS="7200" ></LocalThumbnailsCache>

SuggestConfig com.exalead.mercury.mami.search.v20.SuggestConfig

List of Suggest and SuggestDispatcher

Attributes

version long

Name	Type	Default value	Description
version	long	-	-

Nested elements

Suggest*
SuggestDispatcher*

Type	Wrapper	Description
Suggest*	-	List of Suggest.
SuggestDispatcher*	-	List of SuggestDispatcher.

XML representation

<SuggestConfig version="" ></SuggestConfig>

QueryReportingSuggest com.exalead.mercury.mami.search.v20.QueryReportingSuggest

Takes the user queries stored for query reporting to build a suggest automatically.

Parent objects

SuggestConfig

Attributes

Name	Type	Default value	Description
suggestName	string	-	The mandatory suggest name. Must match [A-z0-9_]+
whiteListURL	string	-	URL to a suggest-allow list resource
blackListURL	string	-	URL to a suggest-block list resource
discardComplexQuery	boolean	False	Discards queries containing prefix handlers and operators.
applicationId	string	-	Retrieves logs for a specific application.

Nested elements

SuggestBuildConfig
SuggestBuildTrigger
SuggestQueryConfig

Type	Wrapper	Description
SuggestBuildConfig	-	Build options
SuggestBuildTrigger	-	Optional build trigger
SuggestQueryConfig	-

XML representation

<QueryReportingSuggest suggestName="" whiteListURL="" blackListURL="" discardComplexQuery="false" applicationId="" ></QueryReportingSuggest>

SuggestBuildConfig com.exalead.mercury.mami.search.v20.SuggestBuildConfig

Suggest build options

Parent objects

BuildableSuggest
CSVIndexFieldSuggest
CategoryPathSuggest
CategorySuggest
CategoryTitleSuggest
DynamicFieldSuggest
EnoviaSecureDynamicFieldSuggest
IndexFieldSuggest
QueryReportingSuggest
RelatedTermsSuggest
SearchBasedSuggest

Attributes

subExpr boolean
subString boolean
splitSentence boolean
splitNGrams int
computePermutation boolean
maxEntryLength int
maxSuggestions int
nbUrlsPerEntry int
tokenizationConfig string
sanitizeEntries boolean
buildAfterIndexImport boolean
secure boolean
securityModel enum(default, enovia)

Name	Type	Default value	Description
subExpr	boolean	False	Finds matches on every start of word. For example, "first test" will be returned both for "fir" and for "tes".
subString	boolean	False	Finds matches on every letter. For example, "first test" will be returned for "fir", for "rs", for "es", ...
splitSentence	boolean	False	Breaks up a suggest entry into several entries, and performs matches independently on the chunks. If the entry is multi-sentence, an entry will be created for each sentence.
splitNGrams	int	0	Breaks up a suggest entry into several entries, and performs matches independently on the chunks. A sliding window of ngrams of a given size will be created, and an entry will be created for each step of the window. For example, "a b c d e f" with a split on 4-grams will give entries "a b c d", "b c d e" and "c d e f". 0 means no splitting. This action is performed after the sentence split if the Sentence split option is selected.
computePermutation	boolean	False	Computes all permutations for an entry and adds them as separate entries. This action is performed after the sentence split if the Sentence split option is selected. Note that entries longer than 8 words will not be permuted for performance reasons (would generate n! entries)
maxEntryLength	int	50	Maximum number of characters in a suggest entry. This is a security measure to prevent overly-long entries. They are automatically truncated after the specified length. 0 means no limit.
maxSuggestions	int	10	The maximum number of suggestions that can be shown to the user for a given input string. You cannot change this dynamically.
nbUrlsPerEntry	int	3	Defines the maximum number of URL to be stored for each entry.
tokenizationConfig	string	-	Tokenization Config to use.
sanitizeEntries	boolean	False	Strips the entry of punctuation, and encloses any UQL operator between quotes.
buildAfterIndexImport	boolean	False	Triggers a build automatically after the index refreshes.
secure (UI: Enable security)	boolean	False	Makes use of documents and users' security tokens to restrict suggestions.
securityModel (UI: Security model)	enum(default, enovia)	default	Enables the enovia-specific handling of security tokens.

XML representation

<SuggestBuildConfig subExpr="false" subString="false" splitSentence="false" splitNGrams="0" computePermutation="false" maxEntryLength="50" maxSuggestions="10" nbUrlsPerEntry="3" tokenizationConfig="tok0" sanitizeEntries="false" buildAfterIndexImport="false" secure="false" securityModel="default" ></SuggestBuildConfig>

SimpleSuggestBuildTrigger com.exalead.mercury.mami.search.v20.SimpleSuggestBuildTrigger

Triggers a suggest build at a given time interval

Parent objects

BuildableSuggest
CSVIndexFieldSuggest
CategoryPathSuggest
CategorySuggest
CategoryTitleSuggest
DynamicFieldSuggest
EnoviaSecureDynamicFieldSuggest
IndexFieldSuggest
QueryReportingSuggest
RelatedTermsSuggest
SearchBasedSuggest

Attributes

Name	Type	Default value	Description
startTimestamp	long	-	Unix timestamp
endTimestamp	long	-	Unix timestamp
repeatInterval	long	-	Repeat interval in milliseconds

XML representation

<SimpleSuggestBuildTrigger startTimestamp="" endTimestamp="" repeatInterval="" ></SimpleSuggestBuildTrigger>

CronSuggestBuildTrigger com.exalead.mercury.mami.search.v20.CronSuggestBuildTrigger

Triggers a suggest build based on a Quartz Cron expression

Parent objects

BuildableSuggest
CSVIndexFieldSuggest
CategoryPathSuggest
CategorySuggest
CategoryTitleSuggest
DynamicFieldSuggest
EnoviaSecureDynamicFieldSuggest
IndexFieldSuggest
QueryReportingSuggest
RelatedTermsSuggest
SearchBasedSuggest

Attributes

Name	Type	Default value	Description
startTimestamp	long	-	Unix timestamp
endTimestamp	long	-	Unix timestamp
cronExpression	string	-	Quartz Cron expression: seconds minutes hours day_of_month month day of week [year]

XML representation

<CronSuggestBuildTrigger startTimestamp="" endTimestamp="" cronExpression="" ></CronSuggestBuildTrigger>

SuggestQueryConfig com.exalead.mercury.mami.search.v20.SuggestQueryConfig

Defines default suggest-time options

Parent objects

BuildableSuggest
CSVIndexFieldSuggest
CategoryPathSuggest
CategorySuggest
CategoryTitleSuggest
DynamicFieldSuggest
EnoviaSecureDynamicFieldSuggest
IndexFieldSuggest
QueryReportingSuggest
RelatedTermsSuggest
SearchBasedSuggest
StaticResourceSuggest
StaticXMLSuggest
Suggest

Attributes

Name	Type	Default value	Description
distance	int	2	Allows approximate matching. The higher the distance the more approximate the match. 0 for exact match.
minLengthForDistance1	int	3	Only searches for distance 1 fuzzy matches if the original word in the query is at least N characters long. This avoids too much approximation on very short words. The default value is 3.
minLengthForDistance2	int	6	Only searches for distance 2 fuzzy matches if the original word in the query is at least N characters long. This avoids too much approximation on very short words. The default value is 6.
autocomplete	boolean	False	Only works with a suggest built with subString or subExpr options. If set, the query is not replaced by the suggest results, instead, the results are appended to the query being typed in order to auto complete it.
recursive	boolean	False	Discards the leftmost word of the query progressively. Each new subquery, is sent to the suggest(s), until maxSuggestions is reached, or until there is no more word to use. For example, for a query "A B C", the suggests will be called 3 times, with "A B C", "B C", and "C".
secureSuggestTimeoutMS	int	200	Aborts matching in a secure suggest when this time limit is reached.

XML representation

<SuggestQueryConfig distance="2" minLengthForDistance1="3" minLengthForDistance2="6" autocomplete="false" recursive="false" secureSuggestTimeoutMS="200" ></SuggestQueryConfig>

IndexFieldSuggest com.exalead.mercury.mami.search.v20.IndexFieldSuggest

Takes the value of an index field to build a suggest.

Parent objects

SuggestConfig

Attributes

logic string
target string
indexField string
query string
searchServerInstance string
suggestName string
whiteListURL string
blackListURL string

Name	Type	Default value	Description
logic	string	-	The search logic to use
target	string	-	The search target to use
indexField	string	-	The index field to retrieve
query	string	#all	The query: may be used as a filter for the exported index data
searchServerInstance	string	-	The search server instance to target
suggestName	string	-	The mandatory suggest name. Must match [A-z0-9_]+
whiteListURL	string	-	URL to a suggest-allow list resource
blackListURL	string	-	URL to a suggest-block list resource

Nested elements

SuggestBuildConfig
SuggestBuildTrigger
SuggestQueryConfig

Type	Wrapper	Description
SuggestBuildConfig	-	Build options
SuggestBuildTrigger	-	Optional build trigger
SuggestQueryConfig	-

XML representation

<IndexFieldSuggest logic="" target="" indexField="" query="#all" searchServerInstance="" suggestName="" whiteListURL="" blackListURL="" ></IndexFieldSuggest>

DynamicFieldSuggest com.exalead.mercury.mami.search.v20.DynamicFieldSuggest

Takes the value from a subset of a dynamic index field to build a suggest.

Parent objects

SuggestConfig

Attributes

logic string
target string
indexField string
query string
searchServerInstance string
suggestName string
whiteListURL string
blackListURL string
mode enum(all, exact, prefix, suffix, regexp)
pattern string
keepContext boolean

Name	Type	Default value	Description
logic	string	-	The search logic to use
target	string	-	The search target to use
indexField	string	-	The index field to retrieve
query	string	#all	The query: may be used as a filter for the exported index data
searchServerInstance	string	-	The search server instance to target
suggestName	string	-	The mandatory suggest name. Must match [A-z0-9_]+
whiteListURL	string	-	URL to a suggest-allow list resource
blackListURL	string	-	URL to a suggest-block list resource
mode	enum(all, exact, prefix, suffix, regexp)	all	Matching mode. If 'all' is selected, all the content of the index field will be used.
pattern	string	-	The string to look for.
keepContext	boolean	True	When set to True, the context is kept in suggest entry.

Nested elements

SuggestBuildConfig
SuggestBuildTrigger
SuggestQueryConfig

Type	Wrapper	Description
SuggestBuildConfig	-	Build options
SuggestBuildTrigger	-	Optional build trigger
SuggestQueryConfig	-

XML representation

<DynamicFieldSuggest logic="" target="" indexField="" query="#all" searchServerInstance="" suggestName="" whiteListURL="" blackListURL="" mode="all" pattern="" keepContext="true" ></DynamicFieldSuggest>

EnoviaSecureDynamicFieldSuggest com.exalead.mercury.mami.search.v20.EnoviaSecureDynamicFieldSuggest

Retrieve enovia-specific tokens from the index and uses the value of an index field and the associated security tokens to build a suggest.

Parent objects

SuggestConfig

Attributes

mode enum(all, exact, prefix, suffix, regexp)
pattern string
keepContext boolean
logic string
target string
indexField string
query string
searchServerInstance string
suggestName string
whiteListURL string
blackListURL string
mxentryQuery string
securityTokensMeta string

Name	Type	Default value	Description
mode	enum(all, exact, prefix, suffix, regexp)	all	Matching mode. If 'all' is selected, all the content of the index field will be used.
pattern	string	-	The string to look for.
keepContext	boolean	True	When set to True, the context is kept in suggest entry.
logic	string	-	The search logic to use
target	string	-	The search target to use
indexField	string	-	The index field to retrieve
query	string	#all	The query: may be used as a filter for the exported index data
searchServerInstance	string	-	The search server instance to target
suggestName	string	-	The mandatory suggest name. Must match [A-z0-9_]+
whiteListURL	string	-	URL to a suggest-allow list resource
blackListURL	string	-	URL to a suggest-block list resource
mxentryQuery	string	availability:aclfts	This query retrieves mxov, mxentry and mxpass.
securityTokensMeta	string	mxov	Meta associated to each suggest entry used as base for security token computing.

Nested elements

SuggestBuildConfig
SuggestBuildTrigger
SuggestQueryConfig

Type	Wrapper	Description
SuggestBuildConfig	-	Build options
SuggestBuildTrigger	-	Optional build trigger
SuggestQueryConfig	-

XML representation

<EnoviaSecureDynamicFieldSuggest mode="all" pattern="" keepContext="true" logic="" target="" indexField="" query="#all" searchServerInstance="" suggestName="" whiteListURL="" blackListURL="" mxentryQuery="availability:aclfts" securityTokensMeta="mxov" ></EnoviaSecureDynamicFieldSuggest>

RelatedTermsSuggest com.exalead.mercury.mami.search.v20.RelatedTermsSuggest

Takes the value of the 'keyword' index field.

Parent objects

SuggestConfig

Attributes

logic string
target string
indexField string
query string
searchServerInstance string
suggestName string
whiteListURL string
blackListURL string
dictionaryName string

Name	Type	Default value	Description
logic	string	-	The search logic to use
target	string	-	The search target to use
indexField	string	-	The index field to retrieve
query	string	#all	The query: may be used as a filter for the exported index data
searchServerInstance	string	-	The search server instance to target
suggestName	string	-	The mandatory suggest name. Must match [A-z0-9_]+
whiteListURL	string	-	URL to a suggest-allow list resource
blackListURL	string	-	URL to a suggest-block list resource
dictionaryName	string	dict0

Nested elements

SuggestBuildConfig
SuggestBuildTrigger
SuggestQueryConfig

Type	Wrapper	Description
SuggestBuildConfig	-	Build options
SuggestBuildTrigger	-	Optional build trigger
SuggestQueryConfig	-

XML representation

<RelatedTermsSuggest logic="" target="" indexField="" query="#all" searchServerInstance="" suggestName="" whiteListURL="" blackListURL="" dictionaryName="dict0" ></RelatedTermsSuggest>

CSVIndexFieldSuggest com.exalead.mercury.mami.search.v20.CSVIndexFieldSuggest

Takes one value from a multi-encoded CSV field.

Parent objects

SuggestConfig

Attributes

logic string
target string
indexField string
query string
searchServerInstance string
suggestName string
whiteListURL string
blackListURL string
meta string

Name	Type	Default value	Description
logic	string	-	The search logic to use
target	string	-	The search target to use
indexField	string	-	The index field to retrieve
query	string	#all	The query: may be used as a filter for the exported index data
searchServerInstance	string	-	The search server instance to target
suggestName	string	-	The mandatory suggest name. Must match [A-z0-9_]+
whiteListURL	string	-	URL to a suggest-allow list resource
blackListURL	string	-	URL to a suggest-block list resource
meta	string	-	The CSV encoded meta to retrieve

Nested elements

SuggestBuildConfig
SuggestBuildTrigger
SuggestQueryConfig

Type	Wrapper	Description
SuggestBuildConfig	-	Build options
SuggestBuildTrigger	-	Optional build trigger
SuggestQueryConfig	-

XML representation

<CSVIndexFieldSuggest logic="" target="" indexField="" query="#all" searchServerInstance="" suggestName="" whiteListURL="" blackListURL="" meta="" ></CSVIndexFieldSuggest>

CategoryTitleSuggest com.exalead.mercury.mami.search.v20.CategoryTitleSuggest

Takes category titles of one category index field.

Parent objects

SuggestConfig

Attributes

root string
rootForm enum(exact, lowercase, normalized)
logic string
target string
indexField string
query string
searchServerInstance string
suggestName string
whiteListURL string
blackListURL string

Name	Type	Default value	Description
root	string	-	Path to retrieve
rootForm	enum(exact, lowercase, normalized)	normalized	Form of the category root.
logic	string	-	The search logic to use
target	string	-	The search target to use
indexField	string	-	The index field to retrieve
query	string	#all	The query: may be used as a filter for the exported index data
searchServerInstance	string	-	The search server instance to target
suggestName	string	-	The mandatory suggest name. Must match [A-z0-9_]+
whiteListURL	string	-	URL to a suggest-allow list resource
blackListURL	string	-	URL to a suggest-block list resource

Nested elements

SuggestBuildConfig
SuggestBuildTrigger
SuggestQueryConfig

Type	Wrapper	Description
SuggestBuildConfig	-	Build options
SuggestBuildTrigger	-	Optional build trigger
SuggestQueryConfig	-

XML representation

<CategoryTitleSuggest root="" rootForm="normalized" logic="" target="" indexField="" query="#all" searchServerInstance="" suggestName="" whiteListURL="" blackListURL="" ></CategoryTitleSuggest>

CategoryPathSuggest com.exalead.mercury.mami.search.v20.CategoryPathSuggest

Takes category paths of one category index field.

Parent objects

SuggestConfig

Attributes

root string
rootForm enum(exact, lowercase, normalized)
logic string
target string
indexField string
query string
searchServerInstance string
suggestName string
whiteListURL string
blackListURL string
removeRoot boolean

Name	Type	Default value	Description
root	string	-	Path to retrieve
rootForm	enum(exact, lowercase, normalized)	normalized	Form of the category root.
logic	string	-	The search logic to use
target	string	-	The search target to use
indexField	string	-	The index field to retrieve
query	string	#all	The query: may be used as a filter for the exported index data
searchServerInstance	string	-	The search server instance to target
suggestName	string	-	The mandatory suggest name. Must match [A-z0-9_]+
whiteListURL	string	-	URL to a suggest-allow list resource
blackListURL	string	-	URL to a suggest-block list resource
removeRoot	boolean	False	Removes the category root from results.

Nested elements

SuggestBuildConfig
SuggestBuildTrigger
SuggestQueryConfig

Type	Wrapper	Description
SuggestBuildConfig	-	Build options
SuggestBuildTrigger	-	Optional build trigger
SuggestQueryConfig	-

XML representation

<CategoryPathSuggest root="" rootForm="normalized" logic="" target="" indexField="" query="#all" searchServerInstance="" suggestName="" whiteListURL="" blackListURL="" removeRoot="false" ></CategoryPathSuggest>

StaticXMLSuggest com.exalead.mercury.mami.search.v20.StaticXMLSuggest

Takes a suggest XML file as parameter and compiles it.

This suggest cannot be scheduled.

Build options are read directly from the suggest XML file. Entries are compiled as found in the file.

Parent objects

SuggestConfig

Attributes

suggestName string
whiteListURL string
blackListURL string
url string
tokenizationConfig string

Name	Type	Default value	Description
suggestName	string	-	The mandatory suggest name. Must match [A-z0-9_]+
whiteListURL	string	-	URL to a suggest-allow list resource
blackListURL	string	-	URL to a suggest-block list resource
url	string	-	URL to the Suggest XML file.
tokenizationConfig	string	-	Tokenization Config to use.

Nested elements

SuggestQueryConfig

Type	Wrapper	Description
SuggestQueryConfig	-

XML representation

<StaticXMLSuggest suggestName="" whiteListURL="" blackListURL="" url="" tokenizationConfig="tok0" ></StaticXMLSuggest>

StaticResourceSuggest com.exalead.mercury.mami.search.v20.StaticResourceSuggest

Takes an already compiled resource as parameter.

This suggest can neither be scheduled nor built.

Parent objects

SuggestConfig

Attributes

suggestName string
whiteListURL string
blackListURL string
url string
tokenizationConfig string

Name	Type	Default value	Description
suggestName	string	-	The mandatory suggest name. Must match [A-z0-9_]+
whiteListURL	string	-	URL to a suggest-allow list resource
blackListURL	string	-	URL to a suggest-block list resource
url	string	-	URL to the directory where the compiled suggest data is stored.
tokenizationConfig	string	-	Tokenization Config to use.

Nested elements

SuggestQueryConfig

Type	Wrapper	Description
SuggestQueryConfig	-

XML representation

<StaticResourceSuggest suggestName="" whiteListURL="" blackListURL="" url="" tokenizationConfig="tok0" ></StaticResourceSuggest>

SuggestDispatcher com.exalead.mercury.mami.search.v20.SuggestDispatcher

A SuggestDispatcher associates prefix handlers to Suggests.

It may be used by the SuggestCommand to dispatch a relevant part of a query to the right suggest.

For example, given a query "title: test", it will send "test" to the Suggest attached to the "title" prefix handler.

Parent objects

SuggestConfig

Attributes

name string
matchWholeQuery boolean
useWithUnknownPrefix boolean
addQuotes boolean
displayPrefixInSuggest boolean
maxNbResults int
boostVariety boolean

Name	Type	Default value	Description
name	string	-	The mandatory suggest dispatcher name. Must match [A-z0-9_]+
matchWholeQuery	boolean	False	Sends the whole query to the default suggest if the cursor is outside a prefix handler. If unselected, it sends a subquery delimited by surrounding prefixes.
useWithUnknownPrefix	boolean	False	Sends the query to the default suggest if the cursor is within an undefined prefix handler. If unselected, undefined prefix handlers are ignored and there will be no suggestions.
addQuotes	boolean	True	Adds quotes where necessary so that the whole suggestion is included in the prefix handler.
displayPrefixInSuggest	boolean	False	When activated, the suggest is prefixed by the corresponding prefix handler if defined.
maxNbResults	int	0	Max number of suggest entries returned by this suggest dispatcher (0 returns all available entries).
boostVariety	boolean	False	Allows to retrieve the best matches for each suggest according to the maximum number of suggestions. Note that this mode does not always return the best global results.

Nested elements

designedForLogicsStringValue*
PrefixHandlerSuggestPair*

Type	Wrapper	Description
StringValue*	designedForLogics	SearchLogic names. Prefix handler names will be checked against these logics
PrefixHandlerSuggestPair*	-	Association of a prefix handler and a suggest

XML representation

<SuggestDispatcher name="" matchWholeQuery="false" useWithUnknownPrefix="false" addQuotes="true" displayPrefixInSuggest="false" maxNbResults="0" boostVariety="false" ></SuggestDispatcher>

PrefixHandlerSuggestPair com.exalead.mercury.mami.search.v20.PrefixHandlerSuggestPair

Maps a prefix handler to a suggest dictionary. You can map as many pairs as needed.

Parent objects

SuggestDispatcher

Attributes

Name	Type	Default value	Description
prefixHandler	string	-	A prefix handler name or alias
suggestName	string	-	An existing suggest name
maxNbResults	int	0	Max number of suggest entries returned by this suggest
useInDefaultSuggest	boolean	False	Select Default to specify the suggest dictionary to use by default for a specific prefix handler. Note that you must specify at least one default suggest to use the following options: Match whole query Use default suggest for non configured prefix Add prefix handler to suggestion

XML representation

<PrefixHandlerSuggestPair prefixHandler="" suggestName="" maxNbResults="0" useInDefaultSuggest="false" ></PrefixHandlerSuggestPair>

LoggingConfig com.exalead.mercury.mami.master.v10.LoggingConfig

Data model
General Logging configuration, for generic process logs.
Specific system (crawler, query) have their specific log policy.

Attributes

Name	Type	Default value	Description
version	long	-	-
enableTimeBasedLogRotation	boolean	False	Rotates process log files each day at midnight.
logRotationCronExpression	string	-	Custom Quartz Cron expression to trigger log rotation. Only taken into account if enableTimeBasedLogRotation is false.
logRotationMaxSizeKB	long	-	If set, process log files are rotated when bigger than the specified number of kilobytes.
logPurgeMaxSizeKB	long	-	If set, process log files are purged (older first) when their total size is above the specified number.
logExpirationDays	long	-	If set, process log files are purged when older than the specified number of days.
logInSyslog	boolean	False	If set, log are also copied in syslog with type=daemon, ident=cloudview, message="AppenderName: logMessage" This producs this type of line in /var/log/daemon.log: "Mar 19 12:10:47 madpc125 cloudview: index.clustermanager: Building slice list for st0"

Nested elements

GlobalLoggerConfig
LoggingLevel*

Type	Wrapper	Description
GlobalLoggerConfig	-
LoggingLevel*	-

XML representation

<LoggingConfig version="" enableTimeBasedLogRotation="false" logRotationCronExpression="" logRotationMaxSizeKB="" logPurgeMaxSizeKB="" logExpirationDays="" logInSyslog="false" ></LoggingConfig>

GlobalLoggerConfig com.exalead.mercury.mami.master.v10.GlobalLoggerConfig

Global logger module configuration.

Parent objects

LoggingConfig

Attributes

maxSize int
enableGlobalLogEntryStore boolean

Name	Type	Default value	Description
maxSize	int	100	Max size of the RAM buffer that stores latest messages.
enableGlobalLogEntryStore	boolean	False	Keeps all logs (in these case, an additional structure is created on disk and GetGlobalLogEntryList can be called).

XML representation

<GlobalLoggerConfig maxSize="100" enableGlobalLogEntryStore="false" ></GlobalLoggerConfig>

LoggingLevel com.exalead.mercury.mami.master.v10.LoggingLevel

Logging level configuration.

Parent objects

LoggingConfig

Attributes

level string
host string
install string
process string
logger string

Name	Type	Default value	Description
level	string	-	Logging level. The following values are available: `off`, `fatal`, `error`, `warning`, `info`, `notice`, `debug`, and `trace`.
host	string	-	Host for which this logging level should be applied. If null, the configuration applies to every host of the cluster.
install	string	-	Instance for which this logging level should be applied. If null, the configuration applies to every instance of the cluster.
process	string	-	Process for which this logging level should be applied. If null, the configuration applies to every process of the selected host (or every host if `host` is null).
logger	string	-	Logger for which this logging level should be applied. If null, the configuration applies to every logger.

XML representation

<LoggingLevel level="" host="" install="" process="" logger="" ></LoggingLevel>

SetLoggingLevel com.exalead.mercury.mami.master.v10.SetLoggingLevel

Dynamically updates the logging level of a given process.

Parent objects

LoggingConfig

Attributes

level string
host string
install string
process string
logger string

Name	Type	Default value	Description
level	string	-	Logging level. The following values are available: `off`, `fatal`, `error`, `warning`, `info`, `notice`, `debug`, and `trace`.
host	string	-	Host for which this logging level should be applied. If null, the configuration applies to every host of the cluster.
install	string	-	Instance for which this logging level should be applied. If null, the configuration applies to every instance of the cluster.
process	string	-	Process for which this logging level should be applied. If null, the configuration applies to every process of the selected host (or every host if `host` is null).
logger	string	-	Logger for which this logging level should be applied. If null, the configuration applies to every logger.

XML representation

<SetLoggingLevel level="" host="" install="" process="" logger="" ></SetLoggingLevel>

ReportingConfig com.exalead.cloudview.reporting.v10.ReportingConfig

No documentation for this element.

Attributes

Name	Type	Default value	Description
httpProxyHost	string	-	Address of the HTTP proxy to send usage data report and diagnostics to Exalead
httpProxyPort	int	-	Port of the HTTP proxy to send usage data report and diagnostics to Exalead
httpProxyLogin	string	-	Login of the HTTP proxy to send usage data report and diagnostics to Exalead
httpProxyPassword	string	-	Password of the HTTP proxy to send usage data report and diagnostics to Exalead

Nested elements

CustomReportField*
NotificationsConfig
PerformanceMonitoringConfig
SearchDynamicReportingConfig
SearchStaticReportingConfig
ServiceReportingConfig
UsageDataReportConfig

Type	Wrapper	Description
CustomReportField*	-
NotificationsConfig	-
PerformanceMonitoringConfig	-
SearchDynamicReportingConfig	-
SearchStaticReportingConfig	-
ServiceReportingConfig	-
UsageDataReportConfig	-

XML representation

<ReportingConfig httpProxyHost="" httpProxyPort="" httpProxyLogin="" httpProxyPassword="" ></ReportingConfig>

HttpHeaderField com.exalead.cloudview.reporting.v10.HttpHeaderField

No documentation for this element.

Parent objects

ReportingConfig

Attributes

fieldName string
headerName string

Name	Type	Default value	Description
fieldName	string	-
headerName	string	-

XML representation

<HttpHeaderField fieldName="" headerName="" ></HttpHeaderField>

HttpParamField com.exalead.cloudview.reporting.v10.HttpParamField

No documentation for this element.

Parent objects

ReportingConfig

Attributes

fieldName string
paramName string

Name	Type	Default value	Description
fieldName	string	-
paramName	string	-

XML representation

<HttpParamField fieldName="" paramName="" ></HttpParamField>

NotificationsConfig com.exalead.cloudview.reporting.v10.NotificationsConfig

No documentation for this element.

Parent objects

ReportingConfig

Attributes

Name	Type	Default value	Description
enabled	boolean	True
licenseNotificationEnabled	boolean	True
processesNotificationEnabled	boolean	True
indexingNotificationEnabled	boolean	True
searchNotificationEnabled	boolean	True
tokenNotificatioEnabled	boolean	True
tokenNotificationThreshold	double	0.8
sender	string	-
recipient	string	-
subjectPrefix	string	[CloudView]

Nested elements

SMTPServer

Type	Wrapper	Description
SMTPServer	-

XML representation

<NotificationsConfig enabled="true" licenseNotificationEnabled="true" processesNotificationEnabled="true" indexingNotificationEnabled="true" searchNotificationEnabled="true" tokenNotificatioEnabled="true" tokenNotificationThreshold="0.8" subjectPrefix="[CloudView] " ></NotificationsConfig>

SMTPServer com.exalead.cloudview.reporting.v10.SMTPServer

SMTP Server used to send mails concerning reporting of your Cloudview instance.

Parent objects

NotificationsConfig

Attributes

username string
password string
host string
port int
tls boolean

Name	Type	Default value	Description
username	string	-	Username used to connect to the SMTP server.
password	string	-	Password associated to the username.
host	string	localhost	Hostname of the SMTP server.
port	int	25	Port of the targeted SMTP server.
tls	boolean	False	Enables the TLS (Transport Layer Security) protocol for this server.

XML representation

<SMTPServer username="" password="" host="localhost" port="25" tls="false" ></SMTPServer>

PerformanceMonitoringConfig com.exalead.cloudview.reporting.v10.PerformanceMonitoringConfig

Monitoring-Console configuration

Parent objects

ReportingConfig

Attributes

monitoringIntervalS int
timeout int
collectorThreads int
enabled boolean
rrddir string
configdir string
tmpdir string
libspath string
autocreate boolean
onlyForInstallName string

Name	Type	Default value	Description
monitoringIntervalS	int	60	Interval in seconds between two monitoring probes. Value must be higher or equal to 4
timeout	int	20	Timeout value in seconds for network activity
collectorThreads	int	6
enabled	boolean	True	Enables monitoring
rrddir	string	${dataDir}/perfmonitoring/probe	Directory where probes files (rrd) are stored.
configdir	string	${dataDir}/perfmonitoring/config	Directory parsed to find xml configuration files.
tmpdir	string	${dataDir}/perfmonitoring/tmp	Temporary directory used internally.
libspath	string	${installDir}/extrajava/jrds/	List of jar files or directory containing probes.
autocreate	boolean	True	Enables the auto-creation of configdir and rrddir.
onlyForInstallName	string		If set, restricts monitoring to a specific install name.

Nested elements

monitoredPageCachesStringValue*
MonitoredHost*
MonitoredIndexField*
MonitoredIndexFieldConfig
PerformanceMonitoringTimer*

Type	Wrapper	Description
StringValue*	monitoredPageCaches	List of PageCache to monitor.
MonitoredHost*	-	Additional host probes
MonitoredIndexField*	-	List of index field to monitor.
MonitoredIndexFieldConfig	-	Global index fied monitoring config
PerformanceMonitoringTimer*	-	Additional timers

XML representation

<PerformanceMonitoringConfig monitoringIntervalS="60" timeout="20" collectorThreads="6" enabled="true" rrddir="${dataDir}/perfmonitoring/probe" configdir="${dataDir}/perfmonitoring/config" tmpdir="${dataDir}/perfmonitoring/tmp" libspath="${installDir}/extrajava/jrds/" autocreate="true" onlyForInstallName="" ></PerformanceMonitoringConfig>

MonitoredHost com.exalead.cloudview.reporting.v10.MonitoredHost

No documentation for this element.

Parent objects

PerformanceMonitoringConfig

Attributes

name string

Name	Type	Default value	Description
name	string	-

Nested elements

MonitoredHostConnection*
MonitoredHostProbe*

Type	Wrapper	Description
MonitoredHostConnection*	-
MonitoredHostProbe*	-

XML representation

<MonitoredHost name="" ></MonitoredHost>

MonitoredHostConnection com.exalead.cloudview.reporting.v10.MonitoredHostConnection

How to connect to a monitored host.

Parent objects

MonitoredHost

Attributes

type string
name string

Name	Type	Default value	Description
type	string	-	Connection type. Must not be empty.
name	string	-	Connection name. Must not be empty.

Nested elements

MonitoredHostElementArg*

Type	Wrapper	Description
MonitoredHostElementArg*	-	List of arguments for connection. Currently unused.

XML representation

<MonitoredHostConnection type="" name="" ></MonitoredHostConnection>

MonitoredHostElementArg com.exalead.cloudview.reporting.v10.MonitoredHostElementArg

No documentation for this element.

Parent objects

MonitoredHostConnection
MonitoredHostProbe
argList MonitoredHostProbe

Attributes

type string
value string

Name	Type	Default value	Description
type	string	-
value	string	-

XML representation

<MonitoredHostElementArg type="" value="" ></MonitoredHostElementArg>

MonitoredHostProbe com.exalead.cloudview.reporting.v10.MonitoredHostProbe

A monitoring probe

Parent objects

MonitoredHost

Attributes

type string
label string
connection string

Name	Type	Default value	Description
type	string	-	The probe type
label	string	-	The descriptive label of the probe
connection	string	-	The name of the MonitoredHostConnection used by the probe.

Nested elements

argListMonitoredHostElementArg*
MonitoredHostElementArg*

Type	Wrapper	Description
MonitoredHostElementArg*	argList	The list of MonitoredHostElementArg (one per argument of the probe)
MonitoredHostElementArg*	-	The list of MonitoredHostElementArg (one per argument of the probe).

XML representation

<MonitoredHostProbe type="" label="" connection="" ></MonitoredHostProbe>

MonitoredIndexField com.exalead.cloudview.reporting.v10.MonitoredIndexField

No documentation for this element.

Parent objects

PerformanceMonitoringConfig

Attributes

name string

Name	Type	Default value	Description
name	string	-

XML representation

<MonitoredIndexField name="" ></MonitoredIndexField>

MonitoredIndexFieldConfig com.exalead.cloudview.reporting.v10.MonitoredIndexFieldConfig

No documentation for this element.

Parent objects

PerformanceMonitoringConfig

Attributes

monitorAllFacetFields boolean

Name	Type	Default value	Description
monitorAllFacetFields	boolean	True	If true, monitor all the Value and Category fields

XML representation

<MonitoredIndexFieldConfig monitorAllFacetFields="true" ></MonitoredIndexFieldConfig>

PerformanceMonitoringTimer com.exalead.cloudview.reporting.v10.PerformanceMonitoringTimer

Timer configuration. A timer is identified by a name and is composed of one or several probe types.

Parent objects

PerformanceMonitoringConfig

Attributes

Name	Type	Default value	Description
monitoringIntervalS	int	60	Interval in seconds between two monitoring probes. Value must be higher or equal to 4
timeout	int	20	Timeout value in seconds for network activity
collectorThreads	int	6
name	string	-	Identifier name of the timer

Nested elements

probeTypesStringValue*

Type	Wrapper	Description
StringValue*	probeTypes	List of probe types using this timer. Probe types that are not referenced by any timer use the default one.

XML representation

<PerformanceMonitoringTimer monitoringIntervalS="60" timeout="20" collectorThreads="6" name="" ></PerformanceMonitoringTimer>

SearchDynamicReportingConfig com.exalead.cloudview.reporting.v10.SearchDynamicReportingConfig

Dynamic Reporting configuration

Parent objects

ReportingConfig

Attributes

enabled boolean

Name	Type	Default value	Description
enabled	boolean	False	Enables automatic report generations

Nested elements

SearchDynamicReportingConfigNames

Type	Wrapper	Description
SearchDynamicReportingConfigNames	-	Generated reporting configuration element names.

XML representation

<SearchDynamicReportingConfig enabled="false" ></SearchDynamicReportingConfig>

SearchDynamicReportingConfigNames com.exalead.cloudview.reporting.v10.SearchDynamicReportingConfigNames

Contains names of configuration elements used by the dynamic reporting

Parent objects

SearchDynamicReportingConfig

Attributes

Name	Type	Default value	Description
hostName	string	-
installName	string	-
papiPort	int	-
reportingConnector	string	-
reportingBuildGroup	string	-
reportingDataModel	string	-
reportingTaskQueue	string	-
reportingIndexBuilder	string	-
reportingSearchTarget	string	-
reportingSearchLogic	string	-

XML representation

<SearchDynamicReportingConfigNames hostName="" installName="" papiPort="" reportingConnector="" reportingBuildGroup="" reportingDataModel="" reportingTaskQueue="" reportingIndexBuilder="" reportingSearchTarget="" reportingSearchLogic="" ></SearchDynamicReportingConfigNames>

SearchStaticReportingConfig com.exalead.cloudview.reporting.v10.SearchStaticReportingConfig

Static Reporting configuration

Parent objects

ReportingConfig

Attributes

Name	Type	Default value	Description
enabled	boolean	True	Enables automatic report generations
reportCron	string	-	Quartz Cron expression triggering global static report generation.
reportApplicationCron	string	0 10 0 * * ?	Quartz Cron expression triggering static report generation on applications. The default setting is every day at 00:10. Reports are generated for each deployed application.
topLimit	int	100	Maximum number of rows to retrieve while computing ordered results (for example, top queries).
maxVersionsToKeep	int	30	Maximum number of resource versions to keep in the StaticReportList. 0 means no limit is enforced, whereas 1 will keep only the most recent version.

Nested elements

ApplicationReportScheduling*

Type	Wrapper	Description
ApplicationReportScheduling*	-	List of application reports scheduling. Each application can have a scheduled search static reporting. Each application report scheduling is defined by the application id and a Quartz Cron expression.

XML representation

<SearchStaticReportingConfig enabled="true" reportCron="" reportApplicationCron="0 10 0 * * ?" topLimit="100" maxVersionsToKeep="30" ></SearchStaticReportingConfig>

ApplicationReportScheduling com.exalead.cloudview.reporting.v10.ApplicationReportScheduling

Associate a Quartz Cron expression to a specific application in order to trigger its reporting generation

Parent objects

SearchStaticReportingConfig

Attributes

cron string
applicationId string

Name	Type	Default value	Description
cron	string	-
applicationId	string	-

XML representation

<ApplicationReportScheduling cron="" applicationId="" ></ApplicationReportScheduling>

ServiceReportingConfig com.exalead.cloudview.reporting.v10.ServiceReportingConfig

No documentation for this element.

Nested elements

ReporterConfig*

Type	Wrapper	Description
ReporterConfig*	-

XML representation

<ServiceReportingConfig ></ServiceReportingConfig>

ReporterConfig com.exalead.cloudview.reporting.v10.ReporterConfig

No documentation for this element.

Parent objects

ServiceReportingConfig

Attributes

name string

Name	Type	Default value	Description
name	string	-

Nested elements

ReportingPublisher*

Type	Wrapper	Description
ReportingPublisher*	-

XML representation

<ReporterConfig name="" ></ReporterConfig>

PAPIReportingPublisher com.exalead.cloudview.reporting.v10.PAPIReportingPublisher

Base class for PAPI Publishers.

Parent objects

ReporterConfig

Attributes

Name	Type	Default value	Description
synchronous	boolean	False
closeTimeoutMS	int	0
batchTimeoutMS	int	1000
minBatchSize	int	50
maxBatchSize	int	250
maxQueueSize	int	250
publishedFields	string	-
host	string	-	Hostname of the reporting publisher
port	int	-	Port of the reporting publisher
connectorName	string	-	Name of the reporting publisher

XML representation

<PAPIReportingPublisher synchronous="false" closeTimeoutMS="0" batchTimeoutMS="1000" minBatchSize="50" maxBatchSize="250" maxQueueSize="250" publishedFields="" host="" port="" connectorName="" ></PAPIReportingPublisher>

CSVReportingPublisher com.exalead.cloudview.reporting.v10.CSVReportingPublisher

Base class for CSV Publishers.

Parent objects

ReporterConfig

Attributes

Name	Type	Default value	Description
synchronous	boolean	False
closeTimeoutMS	int	0
batchTimeoutMS	int	1000
minBatchSize	int	50
maxBatchSize	int	250
maxQueueSize	int	250
publishedFields	string	-
file	string	-
maxFileSizeMB	int	50	0 means no rotation is triggered by the file size
rotateEveryNMonths	int	-	0 means no rotation is triggered by the file age in month
rotateEveryNDays	int	1	0 means no rotation is triggered by the file age in day
rotateEveryNHours	int	-	0 means no rotation is triggered by the file age in hour
maxSizeToKeepMB	int	2500	Maximum csv reporting files volume in megaoctet. The oldest files are discarded at rotation time. 0 means no limit is enforced.
maxFileToKeep	int	50	Maximum csv reporting files to keep. The oldest files are discarded at rotation time. 0 means no limit is enforced, whereas 1 will discards all rotated files.
maxDayToKeep	int	30	Maximum csv reporting file age in day to keep. The oldest files are discarded at rotation time. 0 means no limit is enforced, whereas 1 only keep todays files.

XML representation

<CSVReportingPublisher synchronous="false" closeTimeoutMS="0" batchTimeoutMS="1000" minBatchSize="50" maxBatchSize="250" maxQueueSize="250" publishedFields="" file="" maxFileSizeMB="50" rotateEveryNMonths="" rotateEveryNDays="1" rotateEveryNHours="" maxSizeToKeepMB="2500" maxFileToKeep="50" maxDayToKeep="30" ></CSVReportingPublisher>

JDBCReportingPublisher com.exalead.cloudview.reporting.v10.JDBCReportingPublisher

Base class for JDBC Publishers.

Parent objects

ReporterConfig

Attributes

synchronous boolean
closeTimeoutMS int
batchTimeoutMS int
minBatchSize int
maxBatchSize int
maxQueueSize int
publishedFields string
driver string
connectionString string
login string
password string
table string
insertQuery string

Name	Type	Default value	Description
synchronous	boolean	False
closeTimeoutMS	int	0
batchTimeoutMS	int	1000
minBatchSize	int	50
maxBatchSize	int	250
maxQueueSize	int	250
publishedFields	string	-
driver	string	-	JDBC driver fully qualified Java class name.
connectionString	string	-	JDBC connection string. This is vendor specific.
login	string	-	Login for the database connection.
password	string	-	Password for the database connection.
table	string	-	Table where data must be inserted.
insertQuery	string	-	Query used to insert data in the table.

XML representation

<JDBCReportingPublisher synchronous="false" closeTimeoutMS="0" batchTimeoutMS="1000" minBatchSize="50" maxBatchSize="250" maxQueueSize="250" publishedFields="" driver="" connectionString="" login="" password="" table="" insertQuery="" ></JDBCReportingPublisher>

ReportingStorePublisher com.exalead.cloudview.reporting.v10.ReportingStorePublisher

Base class for PAPI Publishers.

Parent objects

ReporterConfig

Attributes

Name	Type	Default value	Description
synchronous	boolean	False
closeTimeoutMS	int	0
batchTimeoutMS	int	1000
minBatchSize	int	50
maxBatchSize	int	250
maxQueueSize	int	250
publishedFields	string	-
schema	string	-
rotationCron	string	0 0 0 * * ?	Quartz Cron expression to force a rotation. A rotation is also triggered every time a collection is queried.
maxRowCount	long	100000	Maximum count of line in the database. When this size is reached the oldest records are discarded. 0 means there is no limit to the database size.

XML representation

<ReportingStorePublisher synchronous="false" closeTimeoutMS="0" batchTimeoutMS="1000" minBatchSize="50" maxBatchSize="250" maxQueueSize="250" publishedFields="" schema="" rotationCron="0 0 0 * * ?" maxRowCount="100000" ></ReportingStorePublisher>

UsageDataReportConfig com.exalead.cloudview.reporting.v10.UsageDataReportConfig

Configuration of the Usage Data Reporting (UDR). UDR allows 3DS EXALEAD to improve its products thanks to anonymous information on their use.

Parent objects

ReportingConfig

Attributes

enabled boolean

Name	Type	Default value	Description
enabled	boolean	False	Should usage data be sent regularly to 3DS EXALEAD?

XML representation

<UsageDataReportConfig enabled="false" ></UsageDataReportConfig>

DeploymentConfig exa.bee.deploy.v10.DeploymentConfig

Deployment configuration.
Defines the default {@link PortRange} used by hosts.

Defines the list of {@link Host} part of the cluster configuration.

Defines the list of virtual services defined for this installation.

Attributes

Name	Type	Default value	Description
version	long	-	-
description	string	-	A descriptive name for this deployment, that will appear in the Admin-UI, for identification purposes
hostAgentPort	int	-	Default port for the hostagent if not defined in each host

Nested elements

Host*
PortRange
VirtualService*

Type	Wrapper	Description
Host*	-	-
PortRange	-	-
VirtualService*	-	-

XML representation

<DeploymentConfig version="" description="" hostAgentPort="" ></DeploymentConfig>

Host exa.bee.deploy.v10.Host

Defines a {@link PortRange} which overrides the one defined at {@link DeploymentConfig} level. Note that if this host's port range has a null start port then, default port range start port is used. Same for last port.

Defines the set of roles implemented by this host. See Management API reference guide for a complete list of available roles.

Parent objects

DeploymentConfig

Attributes

hostname string
install string
architecture string
hostAgentPort int

Name	Type	Default value	Description
hostname	string	-	Network name of the physical host. The host must be reachable by all other hosts of the CloudView install using this name.
install	string	-	Name of this CloudView cluster host on the physical host. It is possible to deploy several CloudView installs in the same CloudView cluster on the same physical host.
architecture	string	-	System architecture of the host. Optional.
hostAgentPort	int	-	Fixed Host agent port (optional). Overrides hostAgentPort set in DeploymentConfig

Nested elements

PortRange
Role*

Type	Wrapper	Description
PortRange	-	Range of TCP ports used for this host
Role*	-	CloudView roles hosted by this host

XML representation

<Host hostname="" install="" architecture="" hostAgentPort="" ></Host>

PortRange exa.bee.deploy.v10.PortRange

Port range attributed to the product

Parent objects

DeploymentConfig
Host

Attributes

startPort int
lastPort int

Name	Type	Default value	Description
startPort	int	-	First TCP port
lastPort	int	-	Last TCP port

XML representation

<PortRange startPort="" lastPort="" ></PortRange>

Role exa.bee.deploy.v10.Role

A Role in the CloudView process. The list of valid Roles can be found in the CloudView deployment guide

Parent objects

Host

Attributes

name string

Name	Type	Default value	Description
name	string	-	$CONSTRAINTS

Nested elements

RoleAttribute*

Type	Wrapper	Description
RoleAttribute*	-	-

XML representation

<Role name="" ></Role>

RoleAttribute exa.bee.deploy.v10.RoleAttribute

Parameter of a role. The list of valid role attributes depends on the
Role.

Parent objects

Role

Attributes

name string
value string

Name	Type	Default value	Description
name	string	-	$CONSTRAINTS
value	string	-	$CONSTRAINTS

XML representation

<RoleAttribute name="" value="" ></RoleAttribute>

VirtualService exa.bee.deploy.v10.VirtualService

Virtual services are internal high availability points. Not used in general
CloudView deployments..
Defines a single access point to a pool of services. {@link VirtualServiceRef} can be used to override the default behavior of {@link VirtualServiceGroupRef}.

Parent objects

DeploymentConfig

Attributes

name string

Name	Type	Default value	Description
name	string	-	Name of the virtual service (use in Directory.dispatch(...).

Nested elements

VirtualServiceGroupRef*
VirtualServiceRef*

Type	Wrapper	Description
VirtualServiceGroupRef*	-	-
VirtualServiceRef*	-	-

XML representation

<VirtualService name="" ></VirtualService>

VirtualServiceGroupRef exa.bee.deploy.v10.VirtualServiceGroupRef

Internal advanced configuration.
Reference to a group of services.
The set of services referenced by a group is built from the roles "Process" whose attribute virtualGroup match this group's name.
Default behaviour is simple round-robin among all the services.

Parent objects

VirtualService

Attributes

name string
service string

Name	Type	Default value	Description
name	string	-	-
service	string	-	Name of the service which is virtualized

XML representation

<VirtualServiceGroupRef name="" service="" ></VirtualServiceGroupRef>

VirtualServiceRef exa.bee.deploy.v10.VirtualServiceRef

Internal advanced configuration. Reference to a concrete service implementation.

Parent objects

VirtualService

Attributes

name string
enabled boolean
power int
maxConnection int

Name	Type	Default value	Description
name	string	-	Fully qualified path to the service implementation.
enabled	boolean	true	Use this to easily enable or disable.
power	int	1	-
maxConnection	int	100	-

XML representation

<VirtualServiceRef name="" enabled="true" power="1" maxConnection="100" ></VirtualServiceRef>

CloudviewDeploymentInternalConfig com.exalead.mercury.mami.deploy.v10.CloudviewDeploymentInternalConfig

Advanced internal settings.

Attributes

Name	Type	Default value	Description
enableServiceList	boolean	true	Enable services listing on Java processes
version	long	-	-
collapseIndexSlices	boolean	True	Merges the processes for all slices (in a given index replica) into only one process.
collapseSearchServers	boolean	False	Merges searchserver processes in the gateway process.
collapseMaster	boolean	False	Merges the Exascript master process into the hostagent.
debug	boolean	False	Activate debugger access on java process.

Nested elements

globalBeegistryKeysKeyValue*
ProcessInternalConfig*
SearchInternalParameters
ServerCiphers
ServerCryptographicProtocols
TickInternalParameters

Type	Wrapper	Description
KeyValue*	globalBeegistryKeys	Additional global beegistry keys
ProcessInternalConfig*	-	Configuration sets for the process. Configuration are matched in a first-match fashion. For example: <ProcessInternalConfig name="foobar" runtime="java" /> <ProcessInternalConfig runtime="java" /> the second one will be used for all java processes except foobar.
SearchInternalParameters	-
ServerCiphers	-	Defines the cipher suite by a sequence on include and exclude statement.
ServerCryptographicProtocols	-	Set the protocol that will be supported by servers (SSL, TLS..., see javax.net.ssl.SSLEngine)
TickInternalParameters	-

XML representation

<CloudviewDeploymentInternalConfig enableServiceList="true" version="" collapseIndexSlices="true" collapseSearchServers="false" collapseMaster="false" debug="false" ></CloudviewDeploymentInternalConfig>

ProcessInternalConfig exa.bee.deploy.v10.ProcessInternalConfig

Internal configuration for a set of processes.

Parent objects

CloudviewDeploymentInternalConfig

Attributes

Name	Type	Default value	Description
name	string	-	Name of the process on which this applies. Applies to all processes if null
hostname	string	-	Name of the host on which this applies. Applies to all hosts if null
installName	string	-	Name of the install on which this applies. Applies to all installs if null
runtime	string	-	Runtime on which this applies. One of exa, java, dotnet. Applies to all runtimes if null
exaBinary	string	-	EXA binary to use in case of an EXA process.
httpMinThreads	int	5	Minimum number of alive threads for servicing HTTP queries
httpMaxThreads	int	500	Maximum number of alive threads for servicing HTTP queries
httpMaxStopTime	int	100	Maximum total time(s) allowed for current queries to be treated before really stopping the server
httpMaxGETQuerySize	int	32768	Default max size in bytes of HTTP GET queries on this process. Note that this might be overriden by specific settings (like the SearchAPI max size)
httpMaxFormPOSTQuerySize	int	10000000	Default max size in bytes of HTTP POST queries on this process. Note that this might be overriden by specific settings (like the SearchAPI max size)
httpMaxIdleTimeMS	int	86400000	Maximum timeout for an HTTP connection, either while waiting for new data, or while processing a query
httpAcceptorThreads	int	1	Number of threads waiting for new HTTP connections
httpAcceptQueueSize	int	1200	Maximum size for the TCP acception queue

Nested elements

paramsKeyValue*
argsStringValue*
envOverrideKeyValue*
envAppendKeyValue*
envPrependKeyValue*

Type	Wrapper	Description
KeyValue*	params	Various advanced process params: stopTimeout: Time, in milliseconds, to wait for the process to stop before being killed startTimeout: Time, in milliseconds, to wait for the process to start before being killed startDelay: Time, in milliseconds, to delay the start of this process after the others killIfNotResponding: (true/false) If the process fails to answer to pings, indicate if it should be killed or not
StringValue*	args	Command line arguments. If empty, default arguments are used. List of possible params for java processes: -Dcv.showServletStacks=false : do not print java stacks when web app fails For linux: Overriding ulimits -Dulimit.NAME=XXXX with NAME one of 'core', 'cpu', 'data', 'fsize', 'nofile', 'stack', 'as', 'rss', 'nproc', 'memlock', 'locks', 'sigpending', 'msgqueue', 'nice', 'rtprio', 'rttime' See linux ulimits doc for precise documentation XXXX is a number expressed in octets for sizes Overriding malloc options -DMALLOPT_NAME=YYYY with MALLOPT_NAME one of 'mmap_threshold', 'mmap_max', 'mxfast', 'top_pad', 'trim_threshold' YYYY is a number expressed in octets See linux mallopt documentation for more details.
KeyValue*	envOverride	Environment variables to set/replace for this process
KeyValue*	envAppend	Additional content to append at the end of environment variables (for PATH-like variables)
KeyValue*	envPrepend	Additional content to prepend at the beginning of environment variables (for PATH-like variables)

XML representation

<ProcessInternalConfig name="" hostname="" installName="" runtime="" exaBinary="" httpMinThreads="5" httpMaxThreads="500" httpMaxStopTime="100" httpMaxGETQuerySize="32768" httpMaxFormPOSTQuerySize="10000000" httpMaxIdleTimeMS="86400000" httpAcceptorThreads="1" httpAcceptQueueSize="1200" ></ProcessInternalConfig>

SearchInternalParameters com.exalead.mercury.mami.deploy.v10.SearchInternalParameters

Search internal parameters to tune performance. Warning: Experts only, this can have a huge performance impact.

Parent objects

CloudviewDeploymentInternalConfig

Attributes

fullHitsContextMaxLifeSpan int
rankedSearchFullHitsRecyclingMaxSize int
fullHitMetasRecyclingMaxSize int
messagesAllocatorMaxSizeMB int
messagesAllocatorMaxLifeSpan int
workerThreadMaxQueries int
runnerMaxLifeSpan int
runnerMaxQueries int
runnerKeepAtLeastWhenIdle int
runnerKeepAtLeastWhenInBurst int
runnerDestroyIdleAfter int

Name	Type	Default value	Description
fullHitsContextMaxLifeSpan	int	1000	When fullhits and their associated metas are released, they are put back into a memory pool for reuse. A full free is performed each 'fullHitsContextMaxLifeSpan' queries.
rankedSearchFullHitsRecyclingMaxSize	int	500	When fullhits are released, they are put back into a memory pool for reuse. A full free is performed once the pool has allocated more than 'rankedSearchFullHitsRecyclingMaxSize' fullhits.
fullHitMetasRecyclingMaxSize	int	50000	When fullhit metas are released, they are put back into a memory pool for reuse. A full free is performed once the pool has allocated more than 'fullHitMetasRecyclingMaxSize' metas.
messagesAllocatorMaxSizeMB	int	2000	A full reinitialization of the FLEA messages allocator will be performed once the pool of flea message has allocated more than 'messagesAllocatorMaxSizeMB' bytes.
messagesAllocatorMaxLifeSpan	int	1000	A full reinitialization of the FLEA messages allocator will be performed once 'messagesAllocatorMaxLifeSpan' queries have been done.
workerThreadMaxQueries	int	100	A worker (query container at slice level) is entirely destroyed once a worker thread has served 'workerThreadMaxQueries' queries.
runnerMaxLifeSpan	int	300	A runner (query container at searchserver level) is entirely destroyed once a runner has served queries during 'runnerMaxLifeSpan' seconds.
runnerMaxQueries	int	1000	A runner is entirely destroyed once a runner has served 'runnerMaxQueries' queries.
runnerKeepAtLeastWhenIdle	int	0	Keep at least 'runnerKeepAtLeastWhenIdle' runner(s) ready to serve queries even without traffic.
runnerKeepAtLeastWhenInBurst	int	5	Be sure you keep at least 'runnerKeepAtLeastWhenInBurst' live runners, whatever the fullfilment of 'runnerMaxQueries' 'runnerMaxLifeSpan' when your are in a peak of activity.
runnerDestroyIdleAfter	int	60	Destroy idle runners after 'runnerDestroyIdleAfter' seconds of inactivity.

XML representation

<SearchInternalParameters fullHitsContextMaxLifeSpan="1000" rankedSearchFullHitsRecyclingMaxSize="500" fullHitMetasRecyclingMaxSize="50000" messagesAllocatorMaxSizeMB="2000" messagesAllocatorMaxLifeSpan="1000" workerThreadMaxQueries="100" runnerMaxLifeSpan="300" runnerMaxQueries="1000" runnerKeepAtLeastWhenIdle="0" runnerKeepAtLeastWhenInBurst="5" runnerDestroyIdleAfter="60" ></SearchInternalParameters>

ServerCiphers exa.bee.deploy.v10.ServerCiphers

Defines the cipher suite by a sequence on include and exclude statement.
When no cipher suite is included, all supported cipher suites are accepted.,

Nested elements

Exclude*
Include*

Type	Wrapper	Description
Exclude*	-	Names of the cipher that can not be used for HTTPS requests.
Include*	-	Names of the cipher that can be used for HTTPS requests. If empty then all ciphers can be used.

XML representation

<ServerCiphers ></ServerCiphers>

Exclude exa.bee.deploy.v10.Exclude

Names of a cipher suite that can not be used for HTTPS request.

Parent objects

ServerCiphers

Attributes

name string

Name	Type	Default value	Description
name	string	-

XML representation

<Exclude name="" ></Exclude>

Include exa.bee.deploy.v10.Include

Names of a cipher suite that can be used for HTTPS request.

Parent objects

ServerCiphers

Attributes

name string

Name	Type	Default value	Description
name	string	-

XML representation

<Include name="" ></Include>

ServerCryptographicProtocols exa.bee.deploy.v10.ServerCryptographicProtocols

Set the protocol that will be supported by servers (SSL, TLS..., see javax.net.ssl.SSLEngine)

Nested elements

ExcludeProtocol*
IncludeProtocol*

Type	Wrapper	Description
ExcludeProtocol*	-	Name of a crypograhic protocol as defined in javax.net.ssl.SSLEngine) to exclude
IncludeProtocol*	-	Name of a crypograhic protocol (as defined in javax.net.ssl.SSLEngine) to include

XML representation

<ServerCryptographicProtocols ></ServerCryptographicProtocols>

ExcludeProtocol exa.bee.deploy.v10.ExcludeProtocol

Name of a crypograhic protocol as defined in javax.net.ssl.SSLEngine) to exclude

Parent objects

ServerCryptographicProtocols

Attributes

name string

Name	Type	Default value	Description
name	string	-

XML representation

<ExcludeProtocol name="" ></ExcludeProtocol>

IncludeProtocol exa.bee.deploy.v10.IncludeProtocol

Name of a crypograhic protocol (as defined in javax.net.ssl.SSLEngine) to include

Parent objects

ServerCryptographicProtocols

Attributes

name string

Name	Type	Default value	Description
name	string	-

XML representation

<IncludeProtocol name="" ></IncludeProtocol>

TickInternalParameters com.exalead.mercury.mami.deploy.v10.TickInternalParameters

Tick message parameters to tune performance. Warning: Experts only, this can have a huge performance impact.

Parent objects

CloudviewDeploymentInternalConfig

Attributes

bufferRecyclingMaxSize int
bufferRecyclingMaxLifeSpan int

Name	Type	Default value	Description
bufferRecyclingMaxSize	int	1000	When a tick message is released, it is put back into a memory pool for reuse. The attached payload is kept if its size is smaller than bufferRecyclingMaxSize, and freed if larger.
bufferRecyclingMaxLifeSpan	int	64	To avoid a tick message pool full of messages with a large payload (==bufferRecyclingMaxSize), the payload is released every time the message has been used bufferRecyclingMaxLifeSpan times.

XML representation

<TickInternalParameters bufferRecyclingMaxSize="1000" bufferRecyclingMaxLifeSpan="64" ></TickInternalParameters>

ProcessConfigList exa.bee.deploy.v10.ProcessConfigList

Alternative way for defining a service in a cloudview product.

Attributes

version long

Name	Type	Default value	Description
version	long	-	-

Nested elements

paramsKeyValue*
ProcessConfig*

Type	Wrapper	Description
KeyValue*	params	Parameters added to the global beegistry
ProcessConfig*	-	-

XML representation

<ProcessConfigList version="" ></ProcessConfigList>

SimpleJavaProcessConfig exa.bee.deploy.v10.SimpleJavaProcessConfig

Simple java process definition which only exposes a set of web applications.

Parent objects

ProcessConfigList

Attributes

name string

Name	Type	Default value	Description
name	string	-	Process name. Referenced by the attribute "process" of a "Process\ role.

Nested elements

JavaWebApp*

Type	Wrapper	Description
JavaWebApp*	-	-

XML representation

<SimpleJavaProcessConfig name="" ></SimpleJavaProcessConfig>

JavaWebApp exa.bee.deploy.v10.JavaWebApp

A Java Web application, to be embedded in a custom process.
A Webapp can optionally expose some services which will be accessible by other parts of the CloudView instance through the Directory mechanism

Parent objects

ComplexJavaProcessConfig
JavaServer
SimpleJavaProcessConfig

Attributes

name string
url string
webAppLoaderClassId string
extraClassPath string
isGlobal boolean

Name	Type	Default value	Description
name	string	-	Name of this webapp
url	string	-	Webapp location (WAR file or directory)
webAppLoaderClassId	string	-	Custom code for overloading the default webapp classloader
extraClassPath	string	-	Elements to add to the webapp class path, additionally to the default CloudView libraries. extraClassPath is a comma-separated list of elements. Each element must be one of: extrajava:folder/STAR.jar : Adds all jar files in $KIT/extrajava/folder and $DATA/extrajava/folder extrajava:folder/STARSTAR/STAR.jar : Add recursively all jar files in $KIT/extrajava/folder and $DATA/extrajava/folder Absolute path to a JAR file Absolute path to a class folder
isGlobal	boolean	true	Global applications are deployed under the context path "/name". Not global applications are deployed under "/name/hostname-installname/processname". Note that global applications can not expose any service

Nested elements

initParamsKeyValue*
JavaWebAppService*

Type	Wrapper	Description
KeyValue*	initParams	Webapp init parameters
JavaWebAppService*	-	-

XML representation

<JavaWebApp name="" url="" webAppLoaderClassId="" extraClassPath="" isGlobal="true" ></JavaWebApp>

JavaWebAppService exa.bee.deploy.v10.JavaWebAppService

Describe the hosted Java WebApp service.

Parent objects

JavaWebApp

Attributes

name string
schemaName string

Name	Type	Default value	Description
name	string	-	Name of the service (relative path) in the web app.
schemaName	string	-	Optional schema name when published in the directory

XML representation

<JavaWebAppService name="" schemaName="" ></JavaWebAppService>

ComplexJavaProcessConfig exa.bee.deploy.v10.ComplexJavaProcessConfig

Base class for describing processes which can be hosted in a cloudview installation.

Parent objects

ProcessConfigList

Attributes

name string

Name	Type	Default value	Description
name	string	-	Process name. Referenced by the attribute "process" of a "Process\ role.

Nested elements

paramsKeyValue*
ConfigFile*
JavaServer*
JavaService*
JavaWebApp*

Type	Wrapper	Description
KeyValue*	params	Parameters added to the process local configuration registry.
ConfigFile*	-	-
JavaServer*	-	-
JavaService*	-	-
JavaWebApp*	-	-

XML representation

<ComplexJavaProcessConfig name="" ></ComplexJavaProcessConfig>

ConfigFile exa.bee.deploy.v10.ConfigFile

No documentation for this element.

Parent objects

ComplexExaProcessConfig
ComplexJavaProcessConfig

Attributes

url string
extraClassPath string

Name	Type	Default value	Description
url	string	-	-
extraClassPath	string	-	Extra class path elements to add to the context class loader when loading this configuration file. Follows the JavaWebApp.extraClassPath syntax. This class path will be used to load the Java objects that might be contained in this configuration file

XML representation

<ConfigFile url="" extraClassPath="" ></ConfigFile>

JavaServer exa.bee.deploy.v10.JavaServer

A named server containing services and/or web apps.

Parent objects

ComplexJavaProcessConfig

Attributes

name string
protocol string

Name	Type	Default value	Description
name	string	-	Server name
protocol	string	-	Server protocol

Nested elements

JavaService*
JavaWebApp*

Type	Wrapper	Description
JavaService*	-	List of services deployed in this server
JavaWebApp*	-	List of web apps deployed in this server

XML representation

<JavaServer name="" protocol="" ></JavaServer>

JavaService exa.bee.deploy.v10.JavaService

No documentation for this element.

Parent objects

ComplexJavaProcessConfig
JavaServer

Attributes

name string
schemaName string
beeKey string
type string
publicName string
extraClassPath string

Name	Type	Default value	Description
name	string	-	Service name. Exposed full prefix name will be automatically prefixed
schemaName	string	-	Optional schema name used when publishing the directory
beeKey	string	-	service implementation
type	string	-	service type
publicName	string	-	Optional public name. If null, name + host + processName is used
extraClassPath	string	-	Extra class path elements to add to the context class loader when performing queries on this service. Follows the JavaWebApp.extraClassPath syntax.

XML representation

<JavaService name="" schemaName="" beeKey="" type="" publicName="" extraClassPath="" ></JavaService>

SimpleDotNetProcessConfig exa.bee.deploy.v10.SimpleDotNetProcessConfig

Describe the hosted .NET process configuration.

Parent objects

ProcessConfigList

Attributes

name string

Name	Type	Default value	Description
name	string	-	Process name. Referenced by the attribute "process" of a "Process\ role.

XML representation

<SimpleDotNetProcessConfig name="" ></SimpleDotNetProcessConfig>

ComplexExaProcessConfig exa.bee.deploy.v10.ComplexExaProcessConfig

Base class for describing processes which can be hosted in a cloudview installation.

Parent objects

ProcessConfigList

Attributes

name string

Name	Type	Default value	Description
name	string	-	Process name. Referenced by the attribute "process" of a "Process\ role.

Nested elements

paramsKeyValue*
ConfigFile*
ExaServer*
ExaService*

Type	Wrapper	Description
KeyValue*	params	Parameters added to the process local configuration registry.
ConfigFile*	-	-
ExaServer*	-	-
ExaService*	-	-

XML representation

<ComplexExaProcessConfig name="" ></ComplexExaProcessConfig>

ExaServer exa.bee.deploy.v10.ExaServer

No documentation for this element.

Parent objects

ComplexExaProcessConfig

Attributes

name string
protocol string

Name	Type	Default value	Description
name	string	-	-
protocol	string	-	-

Nested elements

ExaService*

Type	Wrapper	Description
ExaService*	-	-

XML representation

<ExaServer name="" protocol="" ></ExaServer>

ExaService exa.bee.deploy.v10.ExaService

No documentation for this element.

Parent objects

ComplexExaProcessConfig
ExaServer

Attributes

name string
schemaName string
beeKey string
type string
publicName string

Name	Type	Default value	Description
name	string	-	-
schemaName	string	-	-
beeKey	string	-	-
type	string	-	-
publicName	string	-	Optional public name. If null, name + host + processName is used

XML representation

<ExaService name="" schemaName="" beeKey="" type="" publicName="" ></ExaService>

LinguisticConfig com.exalead.linguistic.v10.LinguisticConfig

No documentation for this element.

Attributes

version long

Name	Type	Default value	Description
version	long	-	-

Nested elements

TokenizationConfig*

Type	Wrapper	Description
TokenizationConfig*	-

XML representation

<LinguisticConfig version="" ></LinguisticConfig>

TokenizationConfig com.exalead.linguistic.v10.TokenizationConfig

How to tokenize documents, ie split the input strings in tokens. Tokens are usually words.

Parent objects

LinguisticConfig

Attributes

name string

Name	Type	Default value	Description
name	string	-	The tokenization configuration's name.

Nested elements

FormIndexingConfig
NormalizerConfig
Tokenizer*

Type	Wrapper	Description
FormIndexingConfig	-	How to index different word forms (exact, normalized, lemmatized) for each language.
NormalizerConfig	-	How to normalize words.
Tokenizer*	-	List of enabled tokenizers. Tokenizer choice depends on the document's language.

XML representation

<TokenizationConfig name="" ></TokenizationConfig>

FormIndexingConfig com.exalead.linguistic.v10.FormIndexingConfig

No documentation for this element.

Nested elements

Form*

Type	Wrapper	Description
Form*	-

XML representation

<FormIndexingConfig ></FormIndexingConfig>

Form com.exalead.linguistic.v10.Form

No documentation for this element.

Parent objects

FormIndexingConfig

Attributes

tag string
indexKind short
trustLevel int

Name	Type	Default value	Description
tag	string	-
indexKind	short	-
trustLevel	int	100

XML representation

<Form tag="" indexKind="" trustLevel="100" ></Form>

NormalizerConfig com.exalead.linguistic.v10.NormalizerConfig

No documentation for this element.

Parent objects

TokenizationConfig

Attributes

transliteration boolean
useNormalizationExceptions boolean
normalizationExceptionsDir string
disableBasisTechNormalizerForLanguages string
enableBasisTechNormalizerForLanguages string
useGermanExceptions boolean

Name	Type	Default value	Description
transliteration	boolean	True	When normalizing, convert some characters to their latin equivalent
useNormalizationExceptions	boolean	True	Uses the normalization exceptions, even if the resource path is not set (in that case, the default resource is used).
normalizationExceptionsDir	string	-	The path to the normalization-exceptions resource directory. Use the default resource if no path is set.
disableBasisTechNormalizerForLanguages	string	-	Removes BasisTech's processor "Encoding and Character Normalizer" for these languages.
enableBasisTechNormalizerForLanguages	string	ja	Adds BasisTech's processor "Encoding and Character Normalizer" only for these languages.
useGermanExceptions	boolean	False

Nested elements

NormalizerCharOverride*
NormalizerIndexLower*

Type	Wrapper	Description
NormalizerCharOverride*	-
NormalizerIndexLower*	-

XML representation

<NormalizerConfig transliteration="true" useNormalizationExceptions="true" normalizationExceptionsDir="" enableBasisTechNormalizerForLanguages="ja" useGermanExceptions="false" ></NormalizerConfig>

NormalizerCharOverride com.exalead.linguistic.v10.NormalizerCharOverride

Introduces custom normalization rule

Parent objects

NormalizerConfig

Attributes

chr string
replace string
language iso code

Name	Type	Default value	Description
chr	string	-
replace	string	-
language	iso code	-

XML representation

<NormalizerCharOverride chr="" replace="" language="en" ></NormalizerCharOverride>

NormalizerIndexLower com.exalead.linguistic.v10.NormalizerIndexLower

The list of words to index in lowercase instead of normalized form.

Parent objects

NormalizerConfig

Attributes

word string
language iso code

Name	Type	Default value	Description
word	string	-
language	iso code	-

XML representation

<NormalizerIndexLower word="" language="en" ></NormalizerIndexLower>

StandardTokenizer com.exalead.linguistic.v10.StandardTokenizer

Sets a specific configuration for the standard tokenizer. If this object is not in the Semantic processors list, standard tokenization is used.

Parent objects

TokenizationConfig

Attributes

language iso code
concatAlphaNum boolean
concatNumAlpha boolean

Name	Type	Default value	Description
language	iso code	-	The language handled by this tokenizer. This can be null for all unhandled languages.
concatAlphaNum	boolean	True	Concat alpha and num chars
concatNumAlpha	boolean	True	Concat num and alpha chars

Nested elements

charOverridesStandardTokenizerOverride*
patternOverridesStandardTokenizerOverride*
TokenizerPlugin*

Type	Wrapper	Description
StandardTokenizerOverride*	charOverrides	Set of rules that allows the forcing of a type for specific character or for a character set defined by a unicode property.
StandardTokenizerOverride*	patternOverrides	Set of rules that allows the forcing of specific regexp type.
TokenizerPlugin*	-

XML representation

<StandardTokenizer language="en" concatAlphaNum="true" concatNumAlpha="true" ></StandardTokenizer>

BasisTechTokenizationCompatibility com.exalead.linguistic.v10.BasisTechTokenizationCompatibility

No documentation for this element.

Parent objects

StandardTokenizer

Attributes

languages string

Name	Type	Default value	Description
languages	string	en,de,fr,sv,es,it,nl,pt,no,fi,da,bg,ca,cs,el,hr,hu,pl,ru,sk,sl,sr	Postprocesses BasisTech's analyzer output in order to generate a tokenization as close as possible to that of this standard tokenizer for these languages.

XML representation

<BasisTechTokenizationCompatibility languages="en,de,fr,sv,es,it,nl,pt,no,fi,da,bg,ca,cs,el,hr,hu,pl,ru,sk,sl,sr" ></BasisTechTokenizationCompatibility>

GermanDisagglutiner com.exalead.linguistic.v10.GermanDisagglutiner

This class performs disagglutinations for German.
Use with a Standard tokenizer.

Parent objects

StandardTokenizer

Attributes

resourceDir string

Name	Type	Default value	Description
resourceDir	string	-	URL for the resource (data://, file;// or resource://) if null, use standard resource of product.

XML representation

<GermanDisagglutiner resourceDir="" ></GermanDisagglutiner>

DutchDisagglutiner com.exalead.linguistic.v10.DutchDisagglutiner

This class performs disagglutinations for Dutch.
Use with a Standard tokenizer.

Parent objects

StandardTokenizer

Attributes

resourceDir string

Name	Type	Default value	Description
resourceDir	string	-	URL for the resource (data://, file;// or resource://) if null, use standard resource of product.

XML representation

<DutchDisagglutiner resourceDir="" ></DutchDisagglutiner>

NorwegianDisagglutiner com.exalead.linguistic.v10.NorwegianDisagglutiner

This class performs disagglutinations for Norwegian.
Use with a Standard tokenizer.

Parent objects

StandardTokenizer

Attributes

resourceDir string

Name	Type	Default value	Description
resourceDir	string	-	URL for the resource (data://, file;// or resource://) if null, use standard resource of product.

XML representation

<NorwegianDisagglutiner resourceDir="" ></NorwegianDisagglutiner>

ChineseWordFinder com.exalead.linguistic.v10.ChineseWordFinder

This class performs words detection for Chinese.
Use with a Standard tokenizer.

Parent objects

StandardTokenizer

Attributes

Name	Type	Default value	Description
resourceDir	string	-	URL for the resource (data://, file;// or resource://) if null, use standard resource of product.
addSimplified	boolean	False	Annotates each token with its simplified Chinese transliteration, when applicable.
addNGrams	boolean	False	Annotates with NGrams
minNGramLength	int	1	Minimum length of NGrams (1: monograms, 2: bigrams...)
maxNGramLength	int	2	Maximum length of NGrams (1: monograms, 2: bigrams...)

XML representation

<ChineseWordFinder resourceDir="" addSimplified="false" addNGrams="false" minNGramLength="1" maxNGramLength="2" ></ChineseWordFinder>

JapaneseWordFinder com.exalead.linguistic.v10.JapaneseWordFinder

Performs words detection for Japanese.
Use with a Standard tokenizer.

Parent objects

StandardTokenizer

Attributes

resourceDir string
processRomaji boolean
addRomaji boolean
addCharset boolean
addNGrams boolean
minNGramLength int
maxNGramLength int
addTransliterations boolean
addPartOfSpeech boolean

Name	Type	Default value	Description
resourceDir	string	-	URL for the resource (data://, file;// or resource://) if null, use standard resource of product.
processRomaji	boolean	False	Processes romaji tokens
addRomaji	boolean	False	Annotates each token with its romaji transliteration, when applicable.
addCharset	boolean	False	Annotates with charset
addNGrams	boolean	False	Annotates with NGrams
minNGramLength	int	1	Minimum length of NGrams (1: monograms, 2: bigrams...)
maxNGramLength	int	2	Maximum length of NGrams (1: monograms, 2: bigrams...)
addTransliterations	boolean	False	Annotates with transliterations (kanji, katakana, hiragana, romaji).
addPartOfSpeech	boolean	False	Annotates with part of speech.

XML representation

<JapaneseWordFinder resourceDir="" processRomaji="false" addRomaji="false" addCharset="false" addNGrams="false" minNGramLength="1" maxNGramLength="2" addTransliterations="false" addPartOfSpeech="false" ></JapaneseWordFinder>

StandardTokenizerOverride com.exalead.linguistic.v10.StandardTokenizerOverride

No documentation for this element.

Parent objects

charOverrides StandardTokenizer
patternOverrides StandardTokenizer

Attributes

type enum(token, separator, sentence, ignore, punct)
toOverride string

Name	Type	Default value	Description
type	enum(token, separator, sentence, ignore, punct)	token	Values = "token", "separator", "sentence" (will break related terms extraction, named entities, ...), "ignore" or "punct" (sentence is considered as a separator but it is also considered as an entity separator for semantic extractors)
toOverride	string	-

XML representation

<StandardTokenizerOverride type="token" toOverride="" ></StandardTokenizerOverride>

StandardTokenizerPatternOverride com.exalead.linguistic.v10.StandardTokenizerPatternOverride

No documentation for this element.

Parent objects

charOverrides StandardTokenizer
patternOverrides StandardTokenizer

Attributes

type enum(token, separator, sentence, ignore, punct)
toOverride string
separated boolean

Name	Type	Default value	Description
type	enum(token, separator, sentence, ignore, punct)	token	Values = "token", "separator", "sentence" (will break related terms extraction, named entities, ...), "ignore" or "punct" (sentence is considered as a separator but it is also considered as an entity separator for semantic extractors)
toOverride	string	-
separated	boolean	True	Pattern must be separated to match.

XML representation

<StandardTokenizerPatternOverride type="token" toOverride="" separated="true" ></StandardTokenizerPatternOverride>

ChineseTokenizer com.exalead.linguistic.v10.ChineseTokenizer

When set in the configuration, tokenizes Chinese documents.

Parent objects

TokenizationConfig

Attributes

addSimplified boolean
resourceDir string

Name	Type	Default value	Description
addSimplified	boolean	False	Annotates each token with its simplified Chinese transliteration, when applicable.
resourceDir	string	-	URL for the resource (data://, file;// or resource://) if null, use standard resource of product.

XML representation

<ChineseTokenizer addSimplified="false" resourceDir="" ></ChineseTokenizer>

JapaneseTokenizer com.exalead.linguistic.v10.JapaneseTokenizer

When set in the configuration, tokenizes Japanese documents.

Parent objects

TokenizationConfig

Attributes

addRomanji boolean
addMorphology boolean
resourceDir string
favor enum(precision, recall)

Name	Type	Default value	Description
addRomanji	boolean	False	Annotates each token with its romaji transliteration, when applicable.
addMorphology	boolean	False	Annotates each token with its part of speech, when applicable.
resourceDir	string	-	URL for the resource (data://, file;// or resource://) if null, use standard resource of product.
favor	enum(precision, recall)	recall	Favoring precision decreases the amount of unrelevant documents returned by queries, favoring recall decreases the amount of missing documents.

XML representation

<JapaneseTokenizer addRomanji="false" addMorphology="false" resourceDir="" favor="recall" ></JapaneseTokenizer>

BasisTechTokenizer com.exalead.linguistic.v10.BasisTechTokenizer

No documentation for this element.

Parent objects

TokenizationConfig

Attributes

language iso code

Name	Type	Default value	Description
language	iso code	-	The language handled by this tokenizer. This can be null for all unhandled languages.

XML representation

<BasisTechTokenizer language="en" ></BasisTechTokenizer>

CustomTokenizer com.exalead.linguistic.v10.CustomTokenizer

No documentation for this element.

Parent objects

TokenizationConfig

Attributes

classId string

Name	Type	Default value	Description
classId	string	-	The class identifier of the custom processor; this class must implement CustomTokenizerInterface.

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-

XML representation

<CustomTokenizer classId="" ></CustomTokenizer>

DictionaryConfig com.exalead.dictionary.v10.DictionaryConfig

Defines what a dictionary must contain.

Attributes

name string

Name	Type	Default value	Description
name	string	-

Nested elements

BuildPolicies
CompactPolicies
QueryConfig*
ResourceConfig*

Type	Wrapper	Description
BuildPolicies	-	The build policies used to trigger a build of a dictionary.
CompactPolicies	-	The compact policies used to trigger dictionary compaction.
QueryConfig*	-
ResourceConfig*	-	Resources that should be built for this dictionary, along with their configuration.

XML representation

<DictionaryConfig name="" ></DictionaryConfig>

BuildPolicies com.exalead.dictionary.v10.BuildPolicies

No documentation for this element.

Parent objects

DictionaryConfig

Attributes

disjunctives boolean

Name	Type	Default value	Description
disjunctives	boolean	True

Nested elements

BuildPolicy*

Type	Wrapper	Description
BuildPolicy*	-	Specifies the build policies.

XML representation

<BuildPolicies disjunctives="true" ></BuildPolicies>

CompactCountBuildPolicy com.exalead.dictionary.v10.CompactCountBuildPolicy

Triggers a build when a given number of compacts have been done

Parent objects

BuildPolicies

Attributes

countThreshold int

Name	Type	Default value	Description
countThreshold	int	1	Number of compacts that will trigger a compact.

XML representation

<CompactCountBuildPolicy countThreshold="1" ></CompactCountBuildPolicy>

PeriodicBuildPolicy com.exalead.dictionary.v10.PeriodicBuildPolicy

Triggers a build after 'delayInS' seconds have elapsed.

Parent objects

BuildPolicies

Attributes

delayInS long

Name	Type	Default value	Description
delayInS	long	600	Time in seconds between two builds.

XML representation

<PeriodicBuildPolicy delayInS="600" ></PeriodicBuildPolicy>

CompactPolicies com.exalead.dictionary.v10.CompactPolicies

No documentation for this element.

Parent objects

DictionaryConfig

Attributes

disjunctives boolean

Name	Type	Default value	Description
disjunctives	boolean	True

Nested elements

CompactFilter*
CompactPolicy*

Type	Wrapper	Description
CompactFilter*	-	Filtering rules for data imported from the stream to be compacted. * CompactFrequencyFilter...
CompactPolicy*	-	Specifies the compact policies.

XML representation

<CompactPolicies disjunctives="true" ></CompactPolicies>

FrequencyCompactFilter com.exalead.dictionary.v10.FrequencyCompactFilter

Filter resources while compacting according to the given frequencies / languages

Parent objects

CompactPolicies

Attributes

minFrequency long
language iso code

Name	Type	Default value	Description
minFrequency	long	1	Discards entries with frequency below this threshold
language	iso code	-	Language represented by its iso639-1 code, e.g: fr, en...

XML representation

<FrequencyCompactFilter minFrequency="1" language="en" ></FrequencyCompactFilter>

ImportCountCompactPolicy com.exalead.dictionary.v10.ImportCountCompactPolicy

Triggers a compact when a given number of imports have been done

Parent objects

CompactPolicies

Attributes

countThreshold int

Name	Type	Default value	Description
countThreshold	int	1	Number of imports that will trigger a compact.

XML representation

<ImportCountCompactPolicy countThreshold="1" ></ImportCountCompactPolicy>

PeriodicCompactPolicy com.exalead.dictionary.v10.PeriodicCompactPolicy

Triggers a compact after 'delayInS' seconds have elapsed.

Parent objects

CompactPolicies

Attributes

delayInS long

Name	Type	Default value	Description
delayInS	long	600	Time in seconds between two compacts.

XML representation

<PeriodicCompactPolicy delayInS="600" ></PeriodicCompactPolicy>

SpellcheckQueryConfig com.exalead.dictionary.v10.SpellcheckQueryConfig

No documentation for this element.

Parent objects

DictionaryConfig

Attributes

tokenizationConfig string
preAllocatedPipelines int

Name	Type	Default value	Description
tokenizationConfig	string	-
preAllocatedPipelines	int	-

Nested elements

blacklistBWList*
whitelistBWList*

Type	Wrapper	Description
BWList*	blacklist
BWList*	whitelist

XML representation

<SpellcheckQueryConfig tokenizationConfig="tok0" preAllocatedPipelines="" ></SpellcheckQueryConfig>

BWList com.exalead.dictionary.v10.BWList

No documentation for this element.

Parent objects

blacklist ApproxQueryConfig
blacklist ExpansionQueryConfig
blacklist NGramFrequencyQueryConfig
blacklist PhoneticQueryConfig
blacklist QueryConfig
blacklist RelatedTermsQueryConfig
blacklist SpellcheckQueryConfig
blacklist WordFrequencyQueryConfig
whitelist ApproxQueryConfig
whitelist ExpansionQueryConfig
whitelist NGramFrequencyQueryConfig
whitelist PhoneticQueryConfig
whitelist QueryConfig
whitelist RelatedTermsQueryConfig
whitelist SpellcheckQueryConfig
whitelist WordFrequencyQueryConfig

Attributes

resourceURL string
name string
defaultList boolean

Name	Type	Default value	Description
resourceURL	string	-
name	string	-
defaultList	boolean	False	Select the list to be used by default

XML representation

<BWList resourceURL="" name="" defaultList="false" ></BWList>

RelatedTermsQueryConfig com.exalead.dictionary.v10.RelatedTermsQueryConfig

Related-terms synthesis configuration. At query-time, the searchserver will ask the dictionary to perform the related-terms synthesis, according to their display forms and corpus frequency.

Parent objects

DictionaryConfig

Attributes

tokenizationConfig string
preAllocatedPipelines int

Name	Type	Default value	Description
tokenizationConfig	string	-
preAllocatedPipelines	int	-

Nested elements

blacklistBWList*
whitelistBWList*

Type	Wrapper	Description
BWList*	blacklist
BWList*	whitelist

XML representation

<RelatedTermsQueryConfig tokenizationConfig="tok0" preAllocatedPipelines="" ></RelatedTermsQueryConfig>

ExpansionQueryConfig com.exalead.dictionary.v10.ExpansionQueryConfig

No documentation for this element.

Parent objects

DictionaryConfig

Attributes

tokenizationConfig string
preAllocatedPipelines int

Name	Type	Default value	Description
tokenizationConfig	string	-
preAllocatedPipelines	int	-

Nested elements

blacklistBWList*
whitelistBWList*

Type	Wrapper	Description
BWList*	blacklist
BWList*	whitelist

XML representation

<ExpansionQueryConfig tokenizationConfig="tok0" preAllocatedPipelines="" ></ExpansionQueryConfig>

WordFrequencyQueryConfig com.exalead.dictionary.v10.WordFrequencyQueryConfig

Enables word frequency use at query-time. It can be used for things like linguistic expansion.

Parent objects

DictionaryConfig

Attributes

tokenizationConfig string
preAllocatedPipelines int

Name	Type	Default value	Description
tokenizationConfig	string	-
preAllocatedPipelines	int	-

Nested elements

blacklistBWList*
whitelistBWList*

Type	Wrapper	Description
BWList*	blacklist
BWList*	whitelist

XML representation

<WordFrequencyQueryConfig tokenizationConfig="tok0" preAllocatedPipelines="" ></WordFrequencyQueryConfig>

NGramFrequencyQueryConfig com.exalead.dictionary.v10.NGramFrequencyQueryConfig

Enables ngram frequency use at query-time.

Parent objects

DictionaryConfig

Attributes

tokenizationConfig string
preAllocatedPipelines int

Name	Type	Default value	Description
tokenizationConfig	string	-
preAllocatedPipelines	int	-

Nested elements

blacklistBWList*
whitelistBWList*

Type	Wrapper	Description
BWList*	blacklist
BWList*	whitelist

XML representation

<NGramFrequencyQueryConfig tokenizationConfig="tok0" preAllocatedPipelines="" ></NGramFrequencyQueryConfig>

ApproxQueryConfig com.exalead.dictionary.v10.ApproxQueryConfig

No documentation for this element.

Parent objects

DictionaryConfig

Attributes

tokenizationConfig string
preAllocatedPipelines int

Name	Type	Default value	Description
tokenizationConfig	string	-
preAllocatedPipelines	int	-

Nested elements

blacklistBWList*
whitelistBWList*

Type	Wrapper	Description
BWList*	blacklist
BWList*	whitelist

XML representation

<ApproxQueryConfig tokenizationConfig="tok0" preAllocatedPipelines="" ></ApproxQueryConfig>

PhoneticQueryConfig com.exalead.dictionary.v10.PhoneticQueryConfig

Enables phonetic search by looking in the dictionary for words matching the phonetic form of the query. The phonetic query config requires a tokenization config. It will work only if a PhonemesResourceConfig exists, in order to build the phonetic forms of the words at build-time.

Parent objects

DictionaryConfig

Attributes

tokenizationConfig string
preAllocatedPipelines int

Name	Type	Default value	Description
tokenizationConfig	string	-
preAllocatedPipelines	int	-

Nested elements

blacklistBWList*
whitelistBWList*

Type	Wrapper	Description
BWList*	blacklist
BWList*	whitelist

XML representation

<PhoneticQueryConfig tokenizationConfig="tok0" preAllocatedPipelines="" ></PhoneticQueryConfig>

WordsResourceConfig com.exalead.dictionary.v10.WordsResourceConfig

Should we store word dictionary for this dictionary.
The word dictionary is used for relevance purposes and term expansion.

Nested elements

LanguageConfig*

Type	Wrapper	Description
LanguageConfig*	-

XML representation

<WordsResourceConfig ></WordsResourceConfig>

LanguageConfig com.exalead.dictionary.v10.LanguageConfig

No documentation for this element.

Parent objects

CustomResourceConfig
NGramsResourceConfig
PhonemesResourceConfig
RelatedTermsResourceConfig
ResourceConfig
WordsResourceConfig

Attributes

language iso code
maxTerm long

Name	Type	Default value	Description
language	iso code	-	Language represented by its iso639-1 code, e.g: fr, en...
maxTerm	long	1000000	Maximum number of terms kept

Nested elements

Filter*

Type	Wrapper	Description
Filter*	-	Filtering rules for data imported from the stream to build this resource. * RegexpFilter, FrequencyFilter, CustomFilter ...

XML representation

<LanguageConfig language="en" maxTerm="1000000" ></LanguageConfig>

RegexpFilter com.exalead.dictionary.v10.RegexpFilter

Discards entries matching this regular expression

Parent objects

LanguageConfig

Attributes

regexp string

Name	Type	Default value	Description
regexp	string	-	Discards entries matching this regular expression

XML representation

<RegexpFilter regexp="" ></RegexpFilter>

FrequencyFilter com.exalead.dictionary.v10.FrequencyFilter

No documentation for this element.

Parent objects

LanguageConfig

Attributes

minFrequency long

Name	Type	Default value	Description
minFrequency	long	1	Discards entries with frequency below this threshold

XML representation

<FrequencyFilter minFrequency="1" ></FrequencyFilter>

PhonemesResourceConfig com.exalead.dictionary.v10.PhonemesResourceConfig

Should we build a phonetic resource for this dictionary.
Used for relevance purposes and term expansion.
Required for phonetic term expansion.

Nested elements

LanguageConfig*

Type	Wrapper	Description
LanguageConfig*	-

XML representation

<PhonemesResourceConfig ></PhonemesResourceConfig>

RelatedTermsResourceConfig com.exalead.dictionary.v10.RelatedTermsResourceConfig

Should we build related terms resource for this dictionary.
A related terms semantic processor must also be defined in the pipeline.

Parent objects

DictionaryConfig

Attributes

overallMaxTerm long

Name	Type	Default value	Description
overallMaxTerm	long	100000000	Overall maximum number of terms kept regardless of the language

Nested elements

LanguageConfig*

Type	Wrapper	Description
LanguageConfig*	-

XML representation

<RelatedTermsResourceConfig overallMaxTerm="100000000" ></RelatedTermsResourceConfig>

NGramsResourceConfig com.exalead.dictionary.v10.NGramsResourceConfig

Should we build ngrams resource for this dictionary.
Used to provide accurate spell checking.

Nested elements

LanguageConfig*

Type	Wrapper	Description
LanguageConfig*	-

XML representation

<NGramsResourceConfig ></NGramsResourceConfig>

CustomResourceConfig com.exalead.dictionary.v10.CustomResourceConfig

No documentation for this element.

Nested elements

LanguageConfig*

Type	Wrapper	Description
LanguageConfig*	-

XML representation

<CustomResourceConfig ></CustomResourceConfig>

ResourceManagerConfig com.exalead.mercury.mami.resources.v10.ResourceManagerConfig

Configuration of the resource manager.

The resource manager supervises resources and provides methods to fetch, convert and build them.

Attributes

version long
resourceDir string

Name	Type	Default value	Description
version	long	-	-
resourceDir	string	data:///build/resources	Path to master copy of all resources

Nested elements

ResourceGroup*

Type	Wrapper	Description
ResourceGroup*	-

XML representation

<ResourceManagerConfig version="" resourceDir="data:///build/resources" ></ResourceManagerConfig>

ResourceGroup com.exalead.mercury.mami.resources.v10.ResourceGroup

A ResourceGroup is a container for one or more resources.
Each time a ResourceGroup is published, all resources are compiled and published atomically.

Parent objects

ResourceManagerConfig

Attributes

name string
roles string
waitOnSync boolean

Name	Type	Default value	Description
name	string	-	Resource group name
roles	string	-	The list of roles on which the resource should be published (comma-separated).
waitOnSync	boolean	False

Nested elements

Resource*

Type	Wrapper	Description
Resource*	-

XML representation

<ResourceGroup name="" roles="" waitOnSync="false" ></ResourceGroup>

Resource com.exalead.mercury.mami.resources.v10.Resource

Base class for resources.

A resource belongs to a group

Parent objects

ResourceGroup

Attributes

name string
type string
description string
author string
applicationId string
converterClassId string

Name	Type	Default value	Description
name	string	-	Name of the resource.
type	string	-	Type of the resource.
description	string	-	Short description of the resource.
author	string	-	Author of the resource.
applicationId	string	-	Application Id of the resource. If defined, the resource can only be edited in this application.
converterClassId	string	-	Converter Class Id. If defined, it will be used instead of the default converter associated with the resource type.

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-	Configuration of the resource.

XML representation

<Resource name="" type="" description="" author="" applicationId="" converterClassId="" ></Resource>

SemanticResource com.exalead.mercury.mami.resources.v10.SemanticResource

A Semantic resource is a resource requiring a tokenization config

Parent objects

ResourceGroup

Attributes

name string
type string
description string
author string
applicationId string
converterClassId string
tokenizationConfig string

Name	Type	Default value	Description
name	string	-	Name of the resource.
type	string	-	Type of the resource.
description	string	-	Short description of the resource.
author	string	-	Author of the resource.
applicationId	string	-	Application Id of the resource. If defined, the resource can only be edited in this application.
converterClassId	string	-	Converter Class Id. If defined, it will be used instead of the default converter associated with the resource type.
tokenizationConfig	string	-	Specify the tokenization config to use

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-	Configuration of the resource.

XML representation

<SemanticResource name="" type="" description="" author="" applicationId="" converterClassId="" tokenizationConfig="tok0" ></SemanticResource>

ProductSecurity com.exalead.mercury.mami.master.v10.ProductSecurity

Defines the product security.
Specifies how product components are exposed to the rest of the world.

Also defines a set of identity provider configurations. Identity providers can be used to define authentication domain and to provide common authentication tools for different services (for example, admin UI and api console).

Attributes

version long
login string
password string
secureInternalConnections boolean

Name	Type	Default value	Description
version	long	-	-
login	string	-	Root login. Can be used to connect to every secured UI. Also used for securing inter process communications when required.
password	string	-	Root password. Encrypted using product RSA encryption key.
secureInternalConnections	boolean	False	Basic authentication using product login - password.

Nested elements

trustedHostStringValue*
IdentityProviderConfig*
MAMISecurity
PushAPISecurity
SearchAPISecurity

Type	Wrapper	Description
StringValue*	trustedHost	Hostnames allowed for redirection. "" means any characters. For example, .exalead.com would allow all hosts in the exalead.com domain.
IdentityProviderConfig*	-
MAMISecurity	-
PushAPISecurity	-
SearchAPISecurity	-

XML representation

<ProductSecurity version="" login="" password="" secureInternalConnections="false" ></ProductSecurity>

IdentityProviderConfig com.exalead.mercury.mami.master.v10.IdentityProviderConfig

Definition of an identity provider.
Used, for example, by the admin ui components to define a common login API.

Parent objects

ProductSecurity

Attributes

Name	Type	Default value	Description
name	string	-	Identity provider name.
filterFactoryClassId	string	-	Optional classId of a specific FilterFactory to use instead of the default one. It is used to create a specific SecurityFilter to change the authentication method.
securitySource	string	-	Associated security source.
sessionInactivityTimeoutS	long	21600	Sessions will time out after this time. -1 for no timeout

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-

XML representation

<IdentityProviderConfig name="" filterFactoryClassId="" securitySource="" sessionInactivityTimeoutS="21600" ></IdentityProviderConfig>

MAMISecurity com.exalead.mercury.mami.master.v10.MAMISecurity

Management APIs security (gateway).

Parent objects

ProductSecurity

Attributes

login string
password string
useHttps boolean
serverCertificate string

Name	Type	Default value	Description
login	string	-	Login. Set to null to disable authentication.
password	string	-	Password. Encrypted using product RSA encryption key.
useHttps	boolean	False	Secures connection with HTTPS.
serverCertificate	string	-	When using HTTPS, name of a sertificate inside the security folder of the data directory. When not provided the default certificate is used.

Nested elements

IPRule*

Type	Wrapper	Description
IPRule*	-

XML representation

<MAMISecurity login="" password="" useHttps="false" serverCertificate="" ></MAMISecurity>

IPRule (deprecated) com.exalead.mercury.mami.master.v10.IPRule

IP Rule.

Parent objects

MAMISecurity
SearchAPISecurity

Attributes

allow boolean
addr string
mask string

Name	Type	Default value	Description
allow	boolean	False	Allows the IP address access.
addr	string	-	Address.
mask	string	-	Mask.

XML representation

<IPRule allow="false" addr="" mask="" ></IPRule>

PushAPISecurity com.exalead.mercury.mami.master.v10.PushAPISecurity

PushAPI Security

Warning on contrary to MAMI Security and Search API Security, the login and password are configured at the connector config level.

Parent objects

ProductSecurity

Attributes

serverCertificate string
useHttps boolean

Name	Type	Default value	Description
serverCertificate	string	-	When using HTTPS, name of a sertificate inside the security folder of the data directory. When not provided the default certificate is used.
useHttps	boolean	False	Secures connection with HTTPS.

XML representation

<PushAPISecurity serverCertificate="" useHttps="false" ></PushAPISecurity>

SearchAPISecurity com.exalead.mercury.mami.master.v10.SearchAPISecurity

Class used for search API security (search, fetch).

Defines whether the connection to the search API should be authenticated and/or secured.

Parent objects

ProductSecurity

Attributes

login string
password string
useHttps boolean
serverCertificate string

Name	Type	Default value	Description
login	string	-	Login. Set to null to disable authentication.
password	string	-	Password. Encrypted using product RSA encryption key.
useHttps	boolean	False	Secures connection with HTTPS.
serverCertificate	string	-	When using HTTPS, name of a sertificate inside the security folder of the data directory. When not provided the default certificate is used.

Nested elements

IPRule*

Type	Wrapper	Description
IPRule*	-

XML representation

<SearchAPISecurity login="" password="" useHttps="false" serverCertificate="" ></SearchAPISecurity>

SecuritySourceList com.exalead.mercury.mami.master.v10.SecuritySourceList

List of security sources

Attributes

version long
strictConfig boolean

Name	Type	Default value	Description
version	long	-	-
strictConfig	boolean	False	Stops deployment if a security source failed to initialize.

Nested elements

SecuritySourceBase*

Type	Wrapper	Description
SecuritySourceBase*	-

XML representation

<SecuritySourceList version="" strictConfig="false" ></SecuritySourceList>

MetaSecuritySource com.exalead.mercury.mami.master.v10.MetaSecuritySource

Meta security source configuration

Parent objects

SecuritySourceList

Attributes

name string
deploy boolean
type enum(First, Merge, No Authentication)
forcedTokens string

Name	Type	Default value	Description
name	string	-	Security source name
deploy	boolean	True	Will this security source be deployed?
type	enum(First, Merge, No Authentication)	First	Type of action when an authentication succeeds Value can be null or one of First Merge No Authentication
forcedTokens	string		An optional comma-separated list of tokens which will be appended to all authenticated users.

Nested elements

AuthenticationSource*
AuthorizationSource*

Type	Wrapper	Description
AuthenticationSource*	-	List of sources on which authentication will be performed
AuthorizationSource*	-	List of sources for which token will be retrieved if an authentication has succeeded.

XML representation

<MetaSecuritySource name="" deploy="true" type="First" forcedTokens="" ></MetaSecuritySource>

AuthenticationSource com.exalead.mercury.mami.master.v10.AuthenticationSource

No documentation for this element.

Parent objects

MetaSecuritySource

Attributes

name string
rewrittenLogin string

Name	Type	Default value	Description
name	string	-	The name of the security source.
rewrittenLogin	string		Can be used to rewrite the login. '$login' will be replaced by the user's login. For example, OFFICE\$login

XML representation

<AuthenticationSource name="" ></AuthenticationSource>

AuthorizationSource com.exalead.mercury.mami.master.v10.AuthorizationSource

No documentation for this element.

Parent objects

MetaSecuritySource

Attributes

name string
rewrittenLogin string
passwordIsMandatory boolean

Name	Type	Default value	Description
name	string	-	The name of the security source.
rewrittenLogin	string		Can be used to rewrite the login. '$login' will be replaced by the user's login. For example, OFFICE\$login
passwordIsMandatory	boolean	False	Forces the password to match

XML representation

<AuthorizationSource name="" passwordIsMandatory="false" ></AuthorizationSource>

SecuritySource com.exalead.mercury.mami.master.v10.SecuritySource

Security source configuration

Parent objects

SecuritySourceList

Attributes

name string
deploy boolean
classId string
customClassId string
singleInstance boolean

Name	Type	Default value	Description
name	string	-	Security source name
deploy	boolean	True	Will this security source be deployed?
classId	string	-	Security source type
customClassId	string	-	Customized security source type
singleInstance	boolean	False	Will this source be deployed on each security command?

Nested elements

configKeyValue*

Type	Wrapper	Description
KeyValue*	config

XML representation

<SecuritySource name="" deploy="true" classId="" customClassId="" singleInstance="false" ></SecuritySource>

RemoteHttpSource com.exalead.mercury.mami.master.v10.RemoteHttpSource

Remote Http security source configuration

Parent objects

SecuritySourceList

Attributes

name string
deploy boolean
service string
isAlivePath string
maxRetries int

Name	Type	Default value	Description
name	string	-	Security source name
deploy	boolean	True	Will this security source be deployed?
service	string		Service path on the remote security source
isAlivePath	string	-	The path to determine if the service is available on hosts. Not used if null.
maxRetries	int	-	Number of retries before skipping an host

Nested elements

RemoteHttpConfig*

Type	Wrapper	Description
RemoteHttpConfig*	-	Hosts on which authentication will be performed

XML representation

<RemoteHttpSource name="" deploy="true" service="" isAlivePath="" maxRetries="" ></RemoteHttpSource>

RemoteHttpConfig com.exalead.mercury.mami.master.v10.RemoteHttpConfig

Security source config.
The list of uri used by RemoteHttpSource

Parent objects

RemoteHttpSource

Attributes

protocol enum(http, https)
host string
port int
power int

Name	Type	Default value	Description
protocol	enum(http, https)	http	Protocol of the remote security source host
host	string	-	Hostname of the remote security source
port	int	80	Port of the remote security source
power	int	1	Priority of this host

XML representation

<RemoteHttpConfig protocol="http" host="" port="80" power="1" ></RemoteHttpConfig>

AlertingConfig com.exalead.cloudview.alerting.v10.AlertingConfig

No documentation for this element.

Attributes

version long
dbConnectionString string
dbUsername string
dbPassword string

Name	Type	Default value	Description
version	long	-	-
dbConnectionString	string	jdbc:sqlite:${dataDir}/alerting/db.sql	JDBC connection string for the alerting database.
dbUsername	string	-
dbPassword	string	-

XML representation

<AlertingConfig version="" dbConnectionString="jdbc:sqlite:${dataDir}/alerting/db.sql" dbUsername="" dbPassword="" ></AlertingConfig>

ApplicationAlertingConfig com.exalead.cloudview.alerting.v10.ApplicationAlertingConfig

No documentation for this element.

Attributes

lastModifiedDate long
humanLastModifiedDate long

Name	Type	Default value	Description
lastModifiedDate	long	-
humanLastModifiedDate	long	0

Nested elements

AlertGroup*

Type	Wrapper	Description
AlertGroup*	-

XML representation

<ApplicationAlertingConfig lastModifiedDate="" humanLastModifiedDate="0" ></ApplicationAlertingConfig>

AlertGroup com.exalead.cloudview.alerting.v10.AlertGroup

Defines the scheduling and publication rules that apply to a set of scheduled alerts

Parent objects

ApplicationAlertingConfig

Attributes

name string
description string
securitySource string
application string
useAsDefault boolean
publishDocumentMetas boolean
storeAlertOccurences boolean
realTime boolean
autoBuild boolean
storeResults boolean
incremental boolean
groupAlertsByUser boolean

Name	Type	Default value	Description
name	string	-
description	string	-
securitySource	string	-	Name of the security source to check tokens against when executing user alerts
application	string	-	Mashup application for this alert group
useAsDefault	boolean	False	Automatically selects this alert group in the "Save as alert" Mashup Widget.
publishDocumentMetas	boolean	True	When enabled, document metas are also published on match. It may be useful to disable this option to reduce gateway heap usage.
storeAlertOccurences	boolean	True	If set, each matching alert will be stored in the underlying alert_occurence table.
realTime	boolean	False	Matches alerts in this alert group as soon as the document is pushed (real-time alerting). If false, matches alerts by executing scheduled queries (scheduled alerting). Real-time alerting offers very low latency alerting, but with fewer features. For details, see the CloudView Administration Guide.
autoBuild	boolean	True	Automatically rebuilds the real-time alerting matchers each time an alert is added, deleted, or updated. Autobuild should be disabled for large alerting groups (thousands of alerts).
storeResults	boolean	False	Stores the results of each alert. Required for using the real-time alerts displayer; calculating the difference since the last alert trigger; and ensuring transaction consistency if a publication fails.
incremental	boolean	False	Includes a timestamp criteria with each alert request so the alert only returns documents modified or added since the last alert trigger. Only applies to scheduled alerting groups, since real-time alerting groups are always incremental.
groupAlertsByUser	boolean	False	Groups all alerts for a user in a single publication. Only applies to scheduled alerting groups, since real-time alerting publications are never grouped.

Nested elements

AlertPublisher*
IncrementalModeConfig
Scheduling

Type	Wrapper	Description
AlertPublisher*	-	List of alert publisher configurations.
IncrementalModeConfig	-	Advanced configuration for the incrementality of scheduled alerting groups.
Scheduling	-	Defines the trigger rule for scheduled queries. Only applies to scheduled alerting groups.

XML representation

<AlertGroup name="" description="" securitySource="" application="" useAsDefault="false" publishDocumentMetas="true" storeAlertOccurences="true" realTime="false" autoBuild="true" storeResults="false" incremental="false" groupAlertsByUser="false" ></AlertGroup>

CustomPublisher com.exalead.cloudview.alerting.v10.CustomPublisher

Publishes push-mode alerts

Parent objects

AlertGroup

Attributes

name string
classId string

Name	Type	Default value	Description
name	string	-
classId	string	-	Publisher type

Nested elements

configKeyValue*
KeyValue

Type	Wrapper	Description
KeyValue*	config
KeyValue	-	Defined for XSD compatibility, do not use

XML representation

<CustomPublisher name="" classId="" ></CustomPublisher>

SimpleHTTPPublisher com.exalead.cloudview.alerting.v10.SimpleHTTPPublisher

Pushes the alerts to a remote HTTP service.
Each alert occurence triggers an HTTP POST query.
This publisher can only be used for scheduled alert groups.

Parent objects

AlertGroup

Attributes

name string
address string
format string

Name	Type	Default value	Description
name	string	-
address	string	-	Publication endpoint address
format	string	atom	Alert publishing format. The only supported format is 'atom'. This is the native atom format of the Mashup API. It can be deserialized in Java by using, for example, the AccessClient. Additional information is added on the root feed, with alerting-specific data.

XML representation

<SimpleHTTPPublisher name="" address="" format="atom" ></SimpleHTTPPublisher>

RealTimeHTTPPublisher com.exalead.cloudview.alerting.v10.RealTimeHTTPPublisher

Pushes the alerts to a remote HTTP service.
Each alert occurence triggers an HTTP POST query.
This publisher can only be used for real-time alert groups.

Parent objects

AlertGroup

Attributes

name string
address string
format string

Name	Type	Default value	Description
name	string	-
address	string	-	Publication endpoint address
format	string	json	Alert publishing format. The only supported format is 'json'

XML representation

<RealTimeHTTPPublisher name="" address="" format="json" ></RealTimeHTTPPublisher>

ConsolePublisher com.exalead.cloudview.alerting.v10.ConsolePublisher

Debug-only publisher that simply logs alerts

Parent objects

AlertGroup

Attributes

name string

Name	Type	Default value	Description
name	string	-

XML representation

<ConsolePublisher name="" ></ConsolePublisher>

SimpleMailPublisher com.exalead.cloudview.alerting.v10.SimpleMailPublisher

In the formats, the following entries are replaced: ${TOTAL_NB_RESULTS}, ${NB_ALERTS}, ${USER}, ${ALERT_NAME}, ${ALERT_DESCRIPTION}, ${ALERT_NB_RESULTS}, ${TITLE}, ${TEXT}, ${ID}

Parent objects

AlertGroup

Attributes

name string
fromAddress string
toAddressFormat string
titleFormat string
mailTemplateDirectory string
mailTemplate string
groupAlertsByUser boolean
feedToRender string

Name	Type	Default value	Description
name	string	-
fromAddress	string	-	E-mail address that will appear as "From" in alerting mails.
toAddressFormat	string	-	Format string used to build the recipient e-mail address. ${USER} is automatically substituted with the login of the user. For example, if the format is "${USER}@mycompany.com", and "user1" sets up an alert, mails for this alert are sent to "user1@mycompany.com".
titleFormat	string	${NB_RESULTS} results for your ${NB_ALERTS} alert(s)
mailTemplateDirectory	string	resource:///alerting/mail/templates
mailTemplate	string	default.ftl
groupAlertsByUser	boolean	True
feedToRender	string	cloudview	Name of the top-level feed to render.

Nested elements

metaToExposeStringValue*
SMTPServer

Type	Wrapper	Description
StringValue*	metaToExpose	List of meta exposed in mail template. Metas "publicurl" and "lastmodifieddate" are exposed by default
SMTPServer	-

XML representation

<SimpleMailPublisher name="" fromAddress="" toAddressFormat="" titleFormat="${NB_RESULTS} results for your ${NB_ALERTS} alert(s)" mailTemplateDirectory="resource:///alerting/mail/templates" mailTemplate="default.ftl" groupAlertsByUser="true" feedToRender="cloudview" ></SimpleMailPublisher>

SMTPServer com.exalead.cloudview.alerting.v10.SMTPServer

No documentation for this element.

Parent objects

SimpleMailPublisher

Attributes

host string
port int
username string
password string
tls boolean

Name	Type	Default value	Description
host	string	localhost
port	int	25
username	string	-
password	string	-
tls	boolean	False

XML representation

<SMTPServer host="localhost" port="25" username="" password="" tls="false" ></SMTPServer>

StandardIncrementalModeConfig com.exalead.cloudview.alerting.v10.StandardIncrementalModeConfig

A simple implementation of incremental mode. Using this mode may cause some documents that have already been alerted to trigger another alert.
When each document is analyzed, an analysis timestamp is attached to this document. When triggering the alert query, the criteria "analysis_timestamp > (PREVIOUS_ALERT_TRIGGER_TIMESTAMP - OFFSET)" is added to the user query.
PREVIOUS_ALERT_TRIGGER_TIMESTAMP is then set to the current timestamp.
The OFFSET is required to account for the time delta between the moment a document is analyzed and the moment it is available for queries.

Parent objects

AlertGroup

Attributes

analysisTimestampIndexField string
timestampRestrictionPageParameter string
timestampOffsetS long

Name	Type	Default value	Description
analysisTimestampIndexField	string	analysisdate
timestampRestrictionPageParameter	string	cloudview.eq.date_restriction	Name of the Mashup-API page parameter to send the ELLQL query for timestamp restriction
timestampOffsetS	long	900	Negative offset in seconds on the previous trigger timestamp, to account for the time delta between the moment a document is analyzed and the moment it is available for queries.

XML representation

<StandardIncrementalModeConfig analysisTimestampIndexField="analysisdate" timestampRestrictionPageParameter="cloudview.eq.date_restriction" timestampOffsetS="900" ></StandardIncrementalModeConfig>

FixedTimeScheduling com.exalead.cloudview.alerting.v10.FixedTimeScheduling

Sequentially executes all alerts at fixed times, according to cron rules.
Use this option when alerts need to be sent at fixed times, and when there are few alerts (in the low 1000s).
Because of the sequential processing, this option generates a significant load when alerts are triggered.

Parent objects

AlertGroup

Attributes

cronExpression string
workerThreads int

Name	Type	Default value	Description
cronExpression	string	-
workerThreads	int	1

XML representation

<FixedTimeScheduling cronExpression="" workerThreads="1" ></FixedTimeScheduling>

ConsolidationConfig com.exalead.mercury.mami.consolidation.v10.ConsolidationConfig

Consolidation Server configuration. The Consolidation Server collects documents from a push API.
The document goes through the following transformation steps before being sent to the Indexing Server:

Transformation processors
Aggregation processors

Attributes

name string
nbThreads int
maxNativeMemoryConsumptionThreshold enum(disabled, enabled, auto)
maxNativeMemoryConsumptionInMB int

Name	Type	Default value	Description
name	string	-	Identifies this consolidation configuration uniquely
nbThreads	int	4	Number of threads to allocate during the transformation and aggregation phases.
maxNativeMemoryConsumptionThreshold	enum(disabled, enabled, auto)	enabled	When reaching the memory consumption value specified, transformation is stopped and transformed documents are committed. Then the transformation starts again. Enabled: Commits when the memory usage reaches the threshold value specified (by default, 2048 MB). Auto: Commits when the memory usage reaches 2048 MB.'
maxNativeMemoryConsumptionInMB	int	2048	The maximum of native memory the transformation service can allocate. Reaching this limit triggers a commit.

Nested elements

CommitTriggerConditionsCommitTriggerCondition*
AggregationTriggerConditionsCommitTriggerCondition*
AdvancedConfig
AggregationForwardProcessorConfigList
AggregationProcessorConfigList
TransformationProcessorConfigList

Type	Wrapper	Description
CommitTriggerCondition*	CommitTriggerConditions
CommitTriggerCondition*	AggregationTriggerConditions
AdvancedConfig	-
AggregationForwardProcessorConfigList	-
AggregationProcessorConfigList	-
TransformationProcessorConfigList	-

XML representation

<ConsolidationConfig name="" nbThreads="4" maxNativeMemoryConsumptionThreshold="enabled" maxNativeMemoryConsumptionInMB="2048" ></ConsolidationConfig>

AdvancedConfig com.exalead.mercury.mami.consolidation.v10.AdvancedConfig

No documentation for this element.

Parent objects

ConsolidationConfig

Attributes

sequentialCompaction boolean

Name	Type	Default value	Description
sequentialCompaction	boolean	True

Nested elements

AdvancedAggregationConfig
CompactPolicies
DocumentStoreWarnConfig

Type	Wrapper	Description
AdvancedAggregationConfig	-
CompactPolicies	-
DocumentStoreWarnConfig	-

XML representation

<AdvancedConfig sequentialCompaction="true" ></AdvancedConfig>

AdvancedAggregationConfig com.exalead.mercury.mami.consolidation.v10.AdvancedAggregationConfig

No documentation for this element.

Parent objects

AdvancedConfig

Attributes

impactQueueSize int
aggregationQueueSize int
forwardQueueSize int
aggregatedVerticesCacheSize int
vertexCacheSize int
impactDetectionResultCacheMaxFileSizeInBytes int
logCountImpactedVertices int
metaCountLogThreshold int
disableCheckpointForwarding boolean
countCheckpointAsATask boolean
progressionSaveThresholdInMS long

Name	Type	Default value	Description
impactQueueSize	int	8
aggregationQueueSize	int	8
forwardQueueSize	int	1600
aggregatedVerticesCacheSize	int	65536	Deprecated since V6R2016xR4, a dedup on impact detection results is now performed before running the aggregation
vertexCacheSize	int	65536
impactDetectionResultCacheMaxFileSizeInBytes	int	10485760
logCountImpactedVertices	int	-1	Possible values: -1: disabled (default value) 0: log all N: log only if the number of vertices impacted is greater than or equals to N
metaCountLogThreshold	int	-1	Possible values: -1: disabled (default value) N: log only if the number of values for a meta is greater than or equals to N
disableCheckpointForwarding	boolean	False
countCheckpointAsATask	boolean	True
progressionSaveThresholdInMS	long	3600000

XML representation

<AdvancedAggregationConfig impactQueueSize="8" aggregationQueueSize="8" forwardQueueSize="1600" aggregatedVerticesCacheSize="65536" vertexCacheSize="65536" impactDetectionResultCacheMaxFileSizeInBytes="10485760" logCountImpactedVertices="-1" metaCountLogThreshold="-1" disableCheckpointForwarding="false" countCheckpointAsATask="true" progressionSaveThresholdInMS="3600000" ></AdvancedAggregationConfig>

DocumentStoreWarnConfig com.exalead.mercury.mami.consolidation.v10.DocumentStoreWarnConfig

No documentation for this element.

Parent objects

AdvancedConfig

Attributes

Name	Type	Default value	Description
thresholdInBytes	long	104857600
enableAtTransformation	boolean	True
enableAtAggregation	boolean	True

XML representation

<DocumentStoreWarnConfig thresholdInBytes="104857600" enableAtTransformation="true" enableAtAggregation="true" ></DocumentStoreWarnConfig>

AggregationForwardProcessorConfigList com.exalead.mercury.mami.consolidation.v10.AggregationForwardProcessorConfigList

No documentation for this element.

Nested elements

AggregationForwardProcessorConfig*

Type	Wrapper	Description
AggregationForwardProcessorConfig*	-

XML representation

<AggregationForwardProcessorConfigList ></AggregationForwardProcessorConfigList>

AggregationForwardProcessorConfig com.exalead.mercury.mami.consolidation.v10.AggregationForwardProcessorConfig

No documentation for this element.

Parent objects

AggregationForwardProcessorConfigList

Attributes

Name	Type	Default value	Description
pushAPIServer	string	-	Specify the target Push API server
documentTypes	string	-	Comma-separated list of document types to forward. Leave empty to match all documents. Deprecated since V6R2015x.SP4, replaced by ForwardedDocumentTypeList
triggerIndexing	boolean	True	If set to true (default behavior), it will trigger an indexing job on your remote Push API server when the aggregation job is complete

Nested elements

ForwardedDocumentTypeList

Type	Wrapper	Description
ForwardedDocumentTypeList	-

XML representation

<AggregationForwardProcessorConfig pushAPIServer="" documentTypes="" triggerIndexing="true" ></AggregationForwardProcessorConfig>

ForwardedDocumentTypeList com.exalead.mercury.mami.consolidation.v10.ForwardedDocumentTypeList

List of forwarded document types

Nested elements

ForwardedDocumentType*

Type	Wrapper	Description
ForwardedDocumentType*	-

XML representation

<ForwardedDocumentTypeList ></ForwardedDocumentTypeList>

ForwardedDocumentType com.exalead.mercury.mami.consolidation.v10.ForwardedDocumentType

No documentation for this element.

Parent objects

ForwardedDocumentTypeList

Attributes

type string

Name	Type	Default value	Description
type	string	-

XML representation

<ForwardedDocumentType type="" ></ForwardedDocumentType>

AggregationProcessorConfigList com.exalead.mercury.mami.consolidation.v10.AggregationProcessorConfigList

List of aggregation processors that yield aggregated documents

Nested elements

CommonProcessorConfig*

Type	Wrapper	Description
CommonProcessorConfig*	-	Declare an aggregation processor to be used by the Consolidation Server.

XML representation

<AggregationProcessorConfigList ></AggregationProcessorConfigList>

TransformationProcessorConfig com.exalead.mercury.mami.consolidation.v10.TransformationProcessorConfig

No documentation for this element.

Parent objects

AggregationProcessorConfigList

Attributes

source string
name string
description string
enabled boolean
mime string
perfMonitored boolean
context string
code string

Name	Type	Default value	Description
source	string	-
name	string	-	Identifies this processor uniquely in the processing step
description	string	-	Description of this processor
enabled	boolean	True
mime	string	-	Mime-type of the language used to define the processor. As of now, supported types are: text/x-groovy text/x-jar
perfMonitored	boolean	False	Show the processor statistics in the Monitoring Console.
context	string	-	Indicates the context in which the processing operation will occur create/update delete
code	string	-	Processor source code run on add/update operations.

XML representation

<TransformationProcessorConfig source="" name="" description="" enabled="true" mime="" perfMonitored="false" context="" code="" ></TransformationProcessorConfig>

CustomJavaTransformationProcessor com.exalead.mercury.mami.consolidation.v10.CustomJavaTransformationProcessor

No documentation for this element.

Parent objects

AggregationProcessorConfigList

Attributes

source string
name string
description string
enabled boolean
mime string
perfMonitored boolean
context string
classId string
pluginName string

Name	Type	Default value	Description
source	string	-
name	string	-	Identifies this processor uniquely in the processing step
description	string	-	Description of this processor
enabled	boolean	True
mime	string	-	Mime-type of the language used to define the processor. As of now, supported types are: text/x-groovy text/x-jar
perfMonitored	boolean	False	Show the processor statistics in the Monitoring Console.
context	string	-	Indicates the context in which the processing operation will occur create/update delete
classId	string	-
pluginName	string	-	Deprecated since V6R2016x.R2, not used by the product so not replaced.

Nested elements

configKeyValue*

Type	Wrapper	Description
KeyValue*	config

XML representation

<CustomJavaTransformationProcessor source="" name="" description="" enabled="true" mime="" perfMonitored="false" context="" classId="" pluginName="" ></CustomJavaTransformationProcessor>

AggregationProcessorConfig com.exalead.mercury.mami.consolidation.v10.AggregationProcessorConfig

No documentation for this element.

Parent objects

AggregationProcessorConfigList

Attributes

name string
description string
enabled boolean
mime string
perfMonitored boolean
context string
code string

Name	Type	Default value	Description
name	string	-	Identifies this processor uniquely in the processing step
description	string	-	Description of this processor
enabled	boolean	True
mime	string	-	Mime-type of the language used to define the processor. As of now, supported types are: text/x-groovy text/x-jar
perfMonitored	boolean	False	Show the processor statistics in the Monitoring Console.
context	string	-	Indicates the context in which the processing operation will occur create/update delete
code	string	-	Processor source code run on add/update operations.

XML representation

<AggregationProcessorConfig name="" description="" enabled="true" mime="" perfMonitored="false" context="" code="" ></AggregationProcessorConfig>

CustomJavaAggregationProcessor com.exalead.mercury.mami.consolidation.v10.CustomJavaAggregationProcessor

No documentation for this element.

Parent objects

AggregationProcessorConfigList

Attributes

name string
description string
enabled boolean
mime string
perfMonitored boolean
context string
classId string
pluginName string

Name	Type	Default value	Description
name	string	-	Identifies this processor uniquely in the processing step
description	string	-	Description of this processor
enabled	boolean	True
mime	string	-	Mime-type of the language used to define the processor. As of now, supported types are: text/x-groovy text/x-jar
perfMonitored	boolean	False	Show the processor statistics in the Monitoring Console.
context	string	-	Indicates the context in which the processing operation will occur create/update delete
classId	string	-
pluginName	string	-	Deprecated since V6R2016x.R2, not used by the product so not replaced.

Nested elements

configKeyValue*

Type	Wrapper	Description
KeyValue*	config

XML representation

<CustomJavaAggregationProcessor name="" description="" enabled="true" mime="" perfMonitored="false" context="" classId="" pluginName="" ></CustomJavaAggregationProcessor>

TransformationProcessorConfigList com.exalead.mercury.mami.consolidation.v10.TransformationProcessorConfigList

List of transformation processors that yield transformed documents

Nested elements

CommonTransformationProcessorConfig*

Type	Wrapper	Description
CommonTransformationProcessorConfig*	-	Declare a transformation processor to be used by the Consolidation Server.

XML representation

<TransformationProcessorConfigList ></TransformationProcessorConfigList>

CustomConfigList com.exalead.mercury.mami.master.v10.CustomConfigList

A list of custom configuration entries.

Attributes

version long

Name	Type	Default value	Description
version	long	-	-

Nested elements

CustomConfig*

Type	Wrapper	Description
CustomConfig*	-

XML representation

<CustomConfigList version="" ></CustomConfigList>

CustomConfig com.exalead.mercury.mami.master.v10.CustomConfig

Data model
A custom configuration entry.
Contains the custom configuration entries in a set of @c KeyValue.

Parent objects

CustomConfigList

Attributes

name string

Name	Type	Default value	Description
name	string	-	Name of the custom configuration entry.

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-

XML representation

<CustomConfig name="" ></CustomConfig>

SchedulingConfig com.exalead.mercury.mami.master.v10.SchedulingConfig

Scheduled jobs configuration. Allows to run jobs periodically at a certain date or time

Attributes

version long

Name	Type	Default value	Description
version	long	-	-

Nested elements

CalendarConfig*
JobConfigGroup*
TriggerConfigGroup*

Type	Wrapper	Description
CalendarConfig*	-
JobConfigGroup*	-
TriggerConfigGroup*	-

XML representation

<SchedulingConfig version="" ></SchedulingConfig>

CronCalendarConfig com.exalead.mercury.mami.master.v10.CronCalendarConfig

Set to exclude a time range defined with a Quartz Cron expression.

Parent objects

SchedulingConfig

Attributes

name string
cronExpression string

Name	Type	Default value	Description
name	string	-	Name of the calendar.
cronExpression	string	-	The Quartz Cron expression.

XML representation

<CronCalendarConfig name="" cronExpression="" ></CronCalendarConfig>

DailyCalendarConfig com.exalead.mercury.mami.master.v10.DailyCalendarConfig

Used to exclude a time range from a day.

Parent objects

SchedulingConfig

Attributes

name string
startTime string
endTime string

Name	Type	Default value	Description
name	string	-	Name of the calendar.
startTime	string	-	Time range start.
endTime	string	-	Time range end.

XML representation

<DailyCalendarConfig name="" startTime="" endTime="" ></DailyCalendarConfig>

MonthlyCalendarConfig com.exalead.mercury.mami.master.v10.MonthlyCalendarConfig

Used to exclude a set of days from a month.

Contains the range of days to exclude (1-31)

Parent objects

SchedulingConfig

Attributes

name string

Name	Type	Default value	Description
name	string	-	Name of the calendar.

Nested elements

IntValue*

Type	Wrapper	Description
IntValue*	-

XML representation

<MonthlyCalendarConfig name="" ></MonthlyCalendarConfig>

WeeklyCalendarConfig com.exalead.mercury.mami.master.v10.WeeklyCalendarConfig

Used to exclude a set of days in the week.

Contains the range of days to exclude (1-7).

Parent objects

SchedulingConfig

Attributes

name string

Name	Type	Default value	Description
name	string	-	Name of the calendar.

Nested elements

IntValue*

Type	Wrapper	Description
IntValue*	-

XML representation

<WeeklyCalendarConfig name="" ></WeeklyCalendarConfig>

JobConfigGroup com.exalead.mercury.mami.master.v10.JobConfigGroup

Configuration for a group of jobs.

Parent objects

SchedulingConfig

Attributes

name string

Name	Type	Default value	Description
name	string	-	Group name.

Nested elements

JobConfig*

Type	Wrapper	Description
JobConfig*	-

XML representation

<JobConfigGroup name="" ></JobConfigGroup>

DispatchJobConfig com.exalead.mercury.mami.master.v10.DispatchJobConfig

Generic job used to dispath a message to a service

Parent objects

JobConfigGroup

Attributes

name string

Name	Type	Default value	Description
name	string	-	Job name.

Nested elements

DispatchMessage

Type	Wrapper	Description
DispatchMessage	-

XML representation

<DispatchJobConfig name="" ></DispatchJobConfig>

TriggerConfigGroup com.exalead.mercury.mami.master.v10.TriggerConfigGroup

Configuration for a group of triggers

Parent objects

SchedulingConfig

Attributes

name string

Name	Type	Default value	Description
name	string	-	Group name.

Nested elements

TriggerConfig*

Type	Wrapper	Description
TriggerConfig*	-

XML representation

<TriggerConfigGroup name="" ></TriggerConfigGroup>

SimpleTriggerConfig com.exalead.mercury.mami.master.v10.SimpleTriggerConfig

Simple trigger 
The following values for {@link misfireInstruction} are available:

fire_now: immediately fires the trigger
reschedule_next_with_existing_count: reschedule to next execution with {@link repeatCount} left unchanged.
reschedule_next_with_remaining_count: reschedule to next execution with repeat count set to what it would be if it had not missed any firings.
reschedule_now_with_existing_repeat_count: reschedule now with {@link repeatCount} left unchanged.
reschedule_now_with_remaining_repeat_count: reschedule now with repeat count set to what it would be if it had not missed any firings.

If {@link misfireInstruction} is set to null, when the trigger couldn't be fired, the following scheme applies

If {@link repeatCount} is set to 0, the instruction will be interpreted as fire_now
If {link repeatCount} is set to <0, the instruction will be interpreted as reschedule_next_with_remaining_count
If {link repeatCount} is set to >0, the instruction will be interpreted as reschedule_now_with_existing_repeat_count

Parent objects

TriggerConfigGroup

Attributes

name string
startTime long
endTime long
jobGroupName string
jobName string
calendarName string
misfireInstruction string
repeatCount int
repeatInterval long

Name	Type	Default value	Description
name	string	-	Name of the trigger.
startTime	long	-	Start date for this trigger.
endTime	long	-	End date for this trigger.
jobGroupName	string	-	Job group for this trigger.
jobName	string	-	Job name for this trigger.
calendarName	string	-	Optional calendar name for this trigger (to exclude blocks of time), only applies to simple triggers
misfireInstruction	string	-	Action to execute when the trigger couldn't be fired. @refer to each trigger documentation for available misfire instructions.
repeatCount	int	-	Repeats count (if <0 , no limit).
repeatInterval	long	-	Repeats interval in milliseconds.

XML representation

<SimpleTriggerConfig name="" startTime="" endTime="" jobGroupName="" jobName="" calendarName="" misfireInstruction="" repeatCount="" repeatInterval="" ></SimpleTriggerConfig>

CronTriggerConfig com.exalead.mercury.mami.master.v10.CronTriggerConfig

Cron based trigger 
The following values for {@link misfireInstruction} are available:

do_nothing: nothing is done when a trigger has been misfired.
fire_once_now: the trigger is immediately executed when it has been misfired.

If {@link misfireInstruction} is set to null, when the trigger couldn't be fired, the following scheme applies:

The instruction will be interpreted as fire_once_now.

Parent objects

TriggerConfigGroup

Attributes

name string
startTime long
endTime long
jobGroupName string
jobName string
calendarName string
misfireInstruction string
cronExpression string

Name	Type	Default value	Description
name	string	-	Name of the trigger.
startTime	long	-	Start date for this trigger.
endTime	long	-	End date for this trigger.
jobGroupName	string	-	Job group for this trigger.
jobName	string	-	Job name for this trigger.
calendarName	string	-	Optional calendar name for this trigger (to exclude blocks of time), only applies to simple triggers
misfireInstruction	string	-	Action to execute when the trigger couldn't be fired. @refer to each trigger documentation for available misfire instructions.
cronExpression	string	-	Cron expression.

XML representation

<CronTriggerConfig name="" startTime="" endTime="" jobGroupName="" jobName="" calendarName="" misfireInstruction="" cronExpression="" ></CronTriggerConfig>

ValidatorConfig com.exalead.mercury.mami.master.v10.ValidatorConfig

No documentation for this element.

Nested elements

ValidatorGroup*

Type	Wrapper	Description
ValidatorGroup*	-

XML representation

<ValidatorConfig ></ValidatorConfig>

ValidatorGroup com.exalead.mercury.mami.master.v10.ValidatorGroup

List of Validators that have the same runtime or target the same configuration.

Allows to run specific validators at the same time.

Parent objects

ValidatorConfig

Attributes

name string
autorun boolean

Name	Type	Default value	Description
name	string	-	Name of this group of validators.
autorun	boolean	True	Default way to run all the group's validators. Default set to true (meaning the group will be ran at apply config time).

Nested elements

Validator*

Type	Wrapper	Description
Validator*	-	List of validators belonging to this group.

XML representation

<ValidatorGroup name="" autorun="true" ></ValidatorGroup>

Validator com.exalead.mercury.mami.master.v10.Validator

No documentation for this element.

Parent objects

ValidatorGroup

Attributes

classId string
name string
autorun boolean

Name	Type	Default value	Description
classId	string	-
name	string	-	Unique identifier of the validator.
autorun	boolean	True	True if validator is to be ran at apply configuration time. Otherwise, will only be ran manually.

XML representation

<Validator classId="" name="" autorun="true" ></Validator>

CustomValidator com.exalead.mercury.mami.master.v10.CustomValidator

No documentation for this element.

Parent objects

ValidatorGroup

Attributes

classId string
name string
autorun boolean

Name	Type	Default value	Description
classId	string	-
name	string	-	Unique identifier of the validator.
autorun	boolean	True	True if validator is to be ran at apply configuration time. Otherwise, will only be ran manually.

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-

XML representation

<CustomValidator classId="" name="" autorun="true" ></CustomValidator>

AdminUIConfig com.exalead.mercury.mami.adminui.v10.AdminUIConfig

No documentation for this element.

Attributes

version long
identityProvider string
useHttps boolean
serverCertificate string

Name	Type	Default value	Description
version	long	-	-
identityProvider	string	-
useHttps	boolean	False
serverCertificate	string	-

Nested elements

Role*

Type	Wrapper	Description
Role*	-

XML representation

<AdminUIConfig version="" identityProvider="" useHttps="false" serverCertificate="" ></AdminUIConfig>

Role com.exalead.mercury.mami.adminui.v10.Role

No documentation for this element.

Parent objects

AdminUIConfig

Attributes

name string

Name	Type	Default value	Description
name	string	-

Nested elements

Permission*

Type	Wrapper	Description
Permission*	-

XML representation

<Role name="" ></Role>

Permission com.exalead.mercury.mami.adminui.v10.Permission

No documentation for this element.

Parent objects

Role

Attributes

id string
permission string

Name	Type	Default value	Description
id	string	-
permission	string	-

XML representation

<Permission id="" permission="" ></Permission>

KeyValue exa.bee.KeyValue

No documentation for this element.

Parent objects

AlphanumProperty
CSVMultiField
ConvertInternalSettings
ConvertJavaPluginSetting
ConvertTextExtractor
CustomConfig
CustomDocumentProcessor
CustomHitOperation
CustomMetaOperation
CustomPostFilter
CustomPostProcessingPipeline
CustomPrefixHandler
CustomProcess
CustomProcessor
CustomPublisher
CustomQueryExpansionModule
CustomSemanticProcessor
CustomTokenizer
CustomTransform
CustomValidator
DataProperty
EnumProperty
Feed
GeoProperty
IdentityProviderConfig
KeyValue
NumericalProperty
ObjectProperty
Post
Property
ReplaceValues
Resource
ScratchHook
SemanticResource
SemanticType
SetDefaultValue
PushAPIFilter Crawler
PushAPIFilter FeedFetcher
PushAPIFilter ICrawler
add InsertHitInfoFromFacets
config Connector
config CustomJavaAggregationProcessor
config CustomJavaTransformationProcessor
config CustomPublisher
config CustomPublisher
config SecuritySource
customRunnerFactoryParams BaseSearchCommand
customRunnerFactoryParams QueryExpansionCommand
customRunnerFactoryParams SearchCommand
customRunnerFactoryParams SpellCheckCommand
envAppend ProcessInternalConfig
envOverride ProcessInternalConfig
envPrepend ProcessInternalConfig
forcedMeta Connector
globalBeegistryKeys CloudviewDeploymentInternalConfig
initParams JavaWebApp
messageContent DispatchMessage
parameters AddParameters
parameters Fetcher
params ComplexExaProcessConfig
params ComplexJavaProcessConfig
params ProcessConfigList
params ProcessInternalConfig
replace InsertHitInfoFromFacets
scanModeConfig ConnectorScheduledScan
tags AlphanumProperty
tags DataProperty
tags EnumProperty
tags GeoProperty
tags NumericalProperty
tags ObjectProperty
tags Property

Attributes

key string
value string
type string
description string

Name	Type	Default value	Description
key	string	-	The name of the key
value	string	-	-
type	string	-	-
description	string	-	-

Nested elements

KeyValue*

Type	Wrapper	Description
KeyValue*	-	-

XML representation

<KeyValue key="" value="" type="" description="" ></KeyValue>

StringConstantValue exa.bee.StringConstantValue

No documentation for this element.

Parent objects

mimeTypes Crawler
mimeTypes FeedFetcher
mimeTypes ICrawler
sessionIdBlacklist Crawler
sessionIdBlacklist FeedFetcher
sessionIdBlacklist ICrawler

Attributes

value string

Name	Type	Default value	Description
value	string	-	-

XML representation

<StringConstantValue value="" ></StringConstantValue>

StringValue exa.bee.StringValue

No documentation for this element.

Parent objects

ConvertInternalSettings
annotationsToCopy HTMLRelevantContentExtractor
args ProcessInternalConfig
classes HTMLCSSExtractor
classes HTMLCSSSelector
designedForLogics SuggestDispatcher
ids HTMLCSSExtractor
ids HTMLCSSSelector
idsAndClassesToIgnore HTMLRelevantContentExtractor
idsAndClassesToKeep HTMLRelevantContentExtractor
inputContexts ConcatValues
inputContexts ContentCleanup
inputContexts CoordinatesFormatter
inputContexts DebugProcessor
inputContexts LanguageDetector
inputContexts LanguageSetter
inputContexts MultiContextCSVEncoder
inputContexts MultiContextDocumentProcessor
inputContexts NumericalFormatter
inputContexts RemoteMOTAPIDocumentProcessor
inputContexts RemoveContexts
inputContexts StringHash
inputContexts StringHash32
inputContexts StringHash64
inputContexts StringTransform
inputContexts UTF8Checker
inputContexts ValueSelector
isAliveQueries SearchCommand
metaToExpose SimpleMailPublisher
mimes MimeCondition
monitoredPageCaches PerformanceMonitoringConfig
partSpecificContexts StandardPartsMerger
probeTypes PerformanceMonitoringTimer
targetInstances RemoteMOTAPIDocumentProcessor
trustedHost ProductSecurity
values SimilarStringToPart
values UniformRandomContextGenerator
values ZipfRandomContextGenerator

Attributes

value string

Name	Type	Default value	Description
value	string	-	-

XML representation

<StringValue value="" ></StringValue>

IntValue exa.bee.IntValue

No documentation for this element.

Parent objects

MonthlyCalendarConfig
WeeklyCalendarConfig

Attributes

value int

Name	Type	Default value	Description
value	int	-	-

XML representation

<IntValue value="" ></IntValue>

DispatchMessage exa.bee.DispatchMessage

No documentation for this element.

Parent objects

DispatchJobConfig

Attributes

serviceName string
messageName string
outputFormat string

Name	Type	Default value	Description
serviceName	string	-	-
messageName	string	-	-
outputFormat	string	-	-

Nested elements

messageContentKeyValue*

Type	Wrapper	Description
KeyValue*	messageContent	-

XML representation

<DispatchMessage serviceName="" messageName="" outputFormat="" ></DispatchMessage>