CloudView Configuration Reference
  • DataModel com.exalead.datamodel.v10.DataModel
    CloudView Data Model
    Attributes
    • name string
    • defaultClass string
    • possibleLanguages string
    • storeAllMetas boolean
    • traceAllMetas boolean
    • expandNamedEntitiesLikeIn2014 boolean
    Name Type Default value Description
    name string - Name of this Data Model.
    defaultClass string - Default class to which documents are pushed when there is no "datamodel_class" papi directive and no "TYPE" meta.
    Usually, all data model classes generate type-prefixed field names, like 'myclass_myproperty', except the defaultClass, which has unprefixed fields in the expansion.
    The analysis pipeline deduces the data model class of a document by applying the following rules in this order:
    1. Value of papi custom directive "datamodel_class" if any.
    2. Value of papi meta "TYPE" if any.
    3. Value of the default data model class of the connector pushing the document if any.
    4. Use the data model class marked by the 'defaultClass' property.
    possibleLanguages string - The list of languages that can appear in the corpus.

    Language detection will be restricted to this set.

    A null value means that all languages are possible.
    storeAllMetas
    (UI: Store all unprocessed metas)
    boolean False Saves all metas that were not mapped to an index field or category as "csv encoded metas".

    These metas are retrievable only. To make a meta searchable, create a data model property for it.
    traceAllMetas
    (UI: Trace all metas)
    boolean False Traces all metas pushed by connectors, to ease property configuration. Select this option before indexing, then click "Add Properties from Traced Metas" to simplify creating new properties.

    The traced metas are stored in a separate database. Once you have created properties based on traced metas, deselect this option to prevent the database from growing unnecessarily over time.
    expandNamedEntitiesLikeIn2014 boolean False

    Nested elements
    • DataModelClass*
    • DataModelEnum*
    • SemanticType*
    Type Wrapper Description
    DataModelClass* - List of classes of this data model.
    DataModelEnum* -
    SemanticType* - The list of semantic types of this data model.

    A semantic type defines the semantic analysis to perform, and can be assigned to 'alphanumeric' properties.

    The semantic types available out of the box are: text, identifier, meta, and url.
    XML representation
    <DataModel name="" defaultClass="" possibleLanguages="" storeAllMetas="false" traceAllMetas="false" expandNamedEntitiesLikeIn2014="false" ></DataModel>
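As an illustration of the defaultClass naming rule, the sketch below declares a data model with two classes. All names (catalog, document, product, title, price) are hypothetical, and placing properties directly under DataModelClass follows the parent/child relationships listed in this reference rather than a validated configuration.

```xml
<!-- Hypothetical data model: "document" is the default class. -->
<DataModel name="catalog" defaultClass="document" storeAllMetas="false" traceAllMetas="false">
  <!-- Default class: fields are expanded without a prefix, e.g. "title". -->
  <DataModelClass name="document">
    <AlphanumProperty name="title" semanticType="text" />
  </DataModelClass>
  <!-- Non-default class: fields are expanded with the class prefix, e.g. "product_price". -->
  <DataModelClass name="product">
    <NumericalProperty name="price" type="double" />
  </DataModelClass>
</DataModel>
```

With such a configuration, a document pushed without a "datamodel_class" directive, without a "TYPE" meta, and by a connector that has no default class would fall through to rule 4 and be handled as a "document".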
  • DataModelClass com.exalead.datamodel.v10.DataModelClass
    No documentation for this element.
    Parent objects
    • DataModel
    Attributes
    • name string
    • parentClass string
    Name Type Default value Description
    name string -
    parentClass string - The parent class name. May be null if this class has no parent class.

    Nested elements
    • DynamicProperty*
    • ExpansionParams
    • Label*
    • Property*
    Type Wrapper Description
    DynamicProperty* - The dynamic properties of the class.
    ExpansionParams -
    Label* - Labels for internationalization (not implemented).
    Property* - The properties of the class.
    XML representation
    <DataModelClass name="" parentClass="" ></DataModelClass>
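A minimal sketch of two related classes, assuming the hypothetical class names asset and video. The only link shown is the parentClass attribute; this reference does not detail what a child class takes from its parent, so nothing more is implied here.

```xml
<!-- Hypothetical parent class with one property. -->
<DataModelClass name="asset">
  <AlphanumProperty name="owner" />
</DataModelClass>
<!-- Hypothetical child class: parentClass names the class above. -->
<DataModelClass name="video" parentClass="asset">
  <NumericalProperty name="duration" type="integer" />
</DataModelClass>
```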
  • DynamicProperty com.exalead.datamodel.v10.DynamicProperty
    Allows you to define matching rules to map multiple document meta names to a single property, and consequently to a single index field or category facet (or output context, if neither index field nor category facet is selected).
    This is useful when you do not know all the metas available in your data source. It also reduces the number of fields needed in the index.
    Parent objects
    • DataModelClass
    Attributes
    • storeMetaNames boolean
    Name Type Default value Description
    storeMetaNames boolean False Stores the meta name associated with each value in the resulting index field (or category facet or output context).
    This is useful to search and retrieve specific metas within the field.

    If 'Store meta names' is not selected for a dynamic property, the resulting index field (or category facet or output context) is the same as any other alphanumerical or numerical field created with standard properties: it contains only meta values.

    Nested elements
    • MatchingRule*
    • Property
    Type Wrapper Description
    MatchingRule* - List of rules to define which document metas should be mapped to this dynamic property.
    Property - All property attributes apply to this dynamic property.
    XML representation
    <DynamicProperty storeMetaNames="false" ></DynamicProperty>
  • MatchingRule com.exalead.datamodel.v10.MatchingRule
    No documentation for this element.
    Parent objects
    • DynamicProperty
    Attributes
    • pattern string
    • mode enum(prefix, suffix, substring, pattern, exact)
    • unprefix boolean
    Name Type Default value Description
    pattern string - If null, the property name is used.
    mode enum(prefix, suffix, substring, pattern, exact)
    unprefix boolean False

    XML representation
    <MatchingRule mode="" unprefix="false" ></MatchingRule>
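The fragment below sketches a dynamic property that gathers several metas into a single field. The meta prefix dc_, the meta name keywords, and the property name dublincore are hypothetical. It assumes that mode="prefix" matches meta names starting with the pattern and that mode="exact" matches the full meta name; the effect of unprefix is not described in this reference, so it is left at its default.

```xml
<DynamicProperty storeMetaNames="true">
  <!-- Hypothetical rule: map metas named dc_title, dc_creator, ... to this property. -->
  <MatchingRule pattern="dc_" mode="prefix" unprefix="false" />
  <!-- Hypothetical rule: also map a meta named exactly "keywords". -->
  <MatchingRule pattern="keywords" mode="exact" unprefix="false" />
  <!-- The nested property carries the usual property attributes. -->
  <AlphanumProperty name="dublincore" searchable="true" retrievable="true" />
</DynamicProperty>
```

Because storeMetaNames is set to "true" here, each stored value keeps the name of the meta it came from, which is what makes individual metas retrievable within the shared field.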
  • AlphanumProperty com.exalead.datamodel.v10.AlphanumProperty
    Alphanumerical property.
    Parent objects
    • DataModelClass
    • DynamicProperty
    Attributes
    • overrideClassExpansionParams boolean
    • dedicatedField boolean
    • dynamicProperty string
    • searchable boolean
    • retrievable boolean
    • ramBased boolean
    • facet boolean
    • disjunctiveRefines boolean
    • defaultValue string
    • additionalMetaNames string
    • arity enum(zero_or_one, one, many)
    • name string
    • comment string
    • semanticType string
    • indexInTextField boolean
    • queryExpansionConfig string
    • enumerated boolean
    Name Type Default value Description
    overrideClassExpansionParams boolean False
    dedicatedField boolean True Should we create a dedicated index field for this property, or use a multi-field encoding?
    A dedicated field is recommended if more than a few percent of objects have this property.
    Internal: if there is no dedicated field, we store in the "metas" field and index using prefixed-words.
    dynamicProperty string - Storing this meta in a dynamic property decreases the index's memory footprint and improves its commit time, but faceting and sorting will be slower.
    searchable
    (UI: Searchable)
    boolean True Allows users to search for matches in this property using a prefix.
    For example, to search for "API" in the title of a document, enter the search query: title:API
    retrievable
    (UI: Retrievable)
    boolean True Displays the meta in the hit content of search results.
    If the property doesn't have a dedicated field, the value will be CSV-encoded in the 'metas' field.
    ramBased
    (UI: RAM based)
    boolean False Stores this property in RAM. Use this option if you need to:
    • sort search results on this property.
    • use this property in a virtual field.
    • use this property in a virtual facet.
    • use this property for an aggregation value.

    Only the "retrievable" part of the property is stored in RAM.
    facet boolean False Creates a navigation facet from the property values.
    disjunctiveRefines
    (UI: Disjunctive refines)
    boolean False Makes refinements on this facet disjunctive ("checkboxes") rather than exclusive (select one).
    defaultValue string - If there is no value for this property in the document, use this default value.
    additionalMetaNames string - Maps these metas (as a comma-separated list) to this index or facet field.
    Usable only for properties that have a dedicated field, have a facet, or are searchable without a prefix.
    arity
    (UI: Cardinality)
    enum(zero_or_one, one, many) zero_or_one The arity of the property: zero_or_one, one, or many.
    name string - Property name
    comment string - Optional comment to explain property use.
    semanticType string - Defines how the text is processed and indexed (tokenization, normalization and indexing-time semantic processing).
    This value must refer to a valid SemanticType object in the DataModel.
    indexInTextField boolean False Allows users to search for matches in this property without using a prefix, by mapping the values to the generic "text" index field, as invisible text.
    For example, to search for "API" in a document, enter the search query: API
    queryExpansionConfig string - Search-time query expansion config to use for this property.
    This follows the 'QEC' format specification.
    enumerated boolean False Is there a controlled and limited set of values for this property?
    Enumerated properties are indexed using a specific "Value" field, which stores entire values in an optimized way.
    Thus, performing faceting on this kind of field is faster than on a category field.
    It is not possible to perform any semantic processing on value fields.

    Nested elements
    • KeyValue* (wrapper: tags)
    • AdvancedParams
    • KeyValue*
    • Label*
    Type Wrapper Description
    KeyValue* tags User-defined tags for this property. This can be used for example to implement 6W faceting of the data model.
    AdvancedParams -
    KeyValue* -
    Label* - Labels for the internationalization of this property's name
    XML representation
    <AlphanumProperty overrideClassExpansionParams="false" dedicatedField="true" searchable="true" retrievable="true" ramBased="false" facet="false" disjunctiveRefines="false" defaultValue="" additionalMetaNames="" arity="zero_or_one" name="" comment="" semanticType="" indexInTextField="false" queryExpansionConfig="" enumerated="false" ></AlphanumProperty>
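A sketch of an alphanumerical property combining several of the attributes above. The property name author, the comment, and the label values are hypothetical; "text" is one of the semantic types listed as available out of the box.

```xml
<!-- Hypothetical "author" property: searchable with a prefix (author:...),
     searchable without a prefix through the generic text field,
     and exposed as a disjunctive navigation facet. -->
<AlphanumProperty name="author" semanticType="text"
                  searchable="true" retrievable="true"
                  indexInTextField="true"
                  facet="true" disjunctiveRefines="true"
                  arity="many" comment="Document authors">
  <Label language="en" value="Author" />
  <Label language="fr" value="Auteur" />
</AlphanumProperty>
```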
  • AdvancedParams com.exalead.datamodel.v10.AdvancedParams
    Advanced configuration options of a property
    Parent objects
    • DataModelClass
    Attributes
    • generateAnalysis boolean
    • generateIndexSchema boolean
    • generatePrefix boolean
    • generateFacet boolean
    • generateMeta boolean
    • expandedPipelines string
    • expandedSearchLogics string
    • facetSortFunction string
    • facetReverseSort boolean
    • facetExplicitSortOrderValues string
    • facetInHits boolean
    • facetInNavigation boolean
    • facetRoot string
    • categoryField string
    • categoryFacetMaxPerLevel int
    • facetMaxInHits int
    • numericalBits int
    • doublePrecision int
    • enableTrustedQueries boolean
    • dateInputFormat string
    • dateOutputFormat string
    • gzip boolean
    • unit string
    • patternSearchEnabled boolean
    • patternSearchOptimizationType enum(ngramv1, ngramv2)
    • forcedRankToTextMapping nullableint
    • detectTitle boolean
    Name Type Default value Description
    generateAnalysis
    (UI: Generate analysis config)
    boolean True Creates the required mappings and document processors for this property. Only set this option to "false" to take complete control over analysis expansion for this property.
    generateIndexSchema
    (UI: Generate index schema)
    boolean True Creates the required index fields for this property. Only set this option to "false" to take complete control over index schema expansion for this property.
    generatePrefix
    (UI: Generate prefix handler)
    boolean True Creates a prefix handler in the search logics for this property. Only set this option to "false" to take complete control over query prefix handler expansion for this property.
    generateFacet
    (UI: Generate category facet)
    boolean True Creates a category facet in the search logics for this property. Only set this option to "false" to take complete control over facet config expansion for this property.
    generateMeta
    (UI: Generate hit meta)
    boolean True Creates a hit meta in the search logics for this property. Only set this option to "false" to take complete control over meta expansion for this property.
    expandedPipelines string - A comma-separated list of analysis pipelines to generate mappings and document processors on. If empty, generate on all pipelines.
    expandedSearchLogics string - A comma-separated list of search logics to generate prefix handlers, facets and hit metas on. If empty, generate on all search logics.
    facetSortFunction string count Sorting method if a category facet is created for this property. Select "ReverseSort" to reverse the sorting order.
    facetReverseSort boolean False Reverse sorting order if a category facet is created for this property.
    facetExplicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    facetInHits
    (UI: Use for hit content)
    boolean True Displays this property in the hits content (if a category facet is created).
    facetInNavigation
    (UI: Use for navigation)
    boolean True Displays this property in the navigation (if a category facet is created).
    facetRoot
    (UI: Override root path)
    string - Replaces the default facet root path (Top/ClassProperties/<Property name>) (if a category facet is created).
    categoryField
    (UI: Override field)
    string - Replaces the category field for this property (if a category facet is created).
    categoryFacetMaxPerLevel
    (UI: Max. values)
    int 100 Displays a maximum of N values per level for this property (if a category facet is created).
    facetMaxInHits int 100 Displays a maximum of N values for this property (if a category facet is created).
    numericalBits
    (UI: Bits for numerical value)
    int 63 Number of bits used to store a numerical value.
    doublePrecision int 4 Number of figures after the decimal point (if this is a double property).
    enableTrustedQueries
    (UI: Trusted queries support)
    boolean False Precomputes the indexing of prefixes for the Trusted Queries widget.
    dateInputFormat string - Specifies an input format for DateFormatter for numerical properties with date or date_time type
    dateOutputFormat string - Specifies an output format for numerical properties with date or date_time type, used in full hits
    gzip boolean True Activates content compression (if an alphanum field is created)
    unit string - Unit Of Measurement in which the value is stored
    patternSearchEnabled boolean False Calculates additional dictionaries to perform pattern search within words (native prefix, suffix, substring search using #anumpattern) without relying on pattern expansion.
    This has a very high cost in indexing time and indexing size. Do not select unless advised by Exalead teams.
    patternSearchOptimizationType enum(ngramv1, ngramv2) ngramv1 Select algorithm based on dictionary ngrams or indexed ngrams (untokenized text only).
    forcedRankToTextMapping nullableint Sets the ranking value for chunks when mapping to the text field.
    -1 means that the chunk's internal ranking value is kept.
    detectTitle boolean False Detects words placed after # in the path and uses them as the title.

    XML representation
    <AdvancedParams generateAnalysis="true" generateIndexSchema="true" generatePrefix="true" generateFacet="true" generateMeta="true" expandedPipelines="" expandedSearchLogics="" facetSortFunction="count" facetReverseSort="false" facetInHits="true" facetInNavigation="true" facetRoot="" categoryField="" categoryFacetMaxPerLevel="100" facetMaxInHits="100" numericalBits="63" doublePrecision="4" enableTrustedQueries="false" dateInputFormat="" dateOutputFormat="" gzip="true" patternSearchEnabled="false" patternSearchOptimizationType="ngramv1" forcedRankToTextMapping="" detectTitle="false" ></AdvancedParams>
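For illustration, the fragment below overrides a few facet defaults for one property. The property name brand, the search logic name sl0, and the facet root path are hypothetical, and nesting AdvancedParams inside the property follows the nested elements listed for AlphanumProperty.

```xml
<AlphanumProperty name="brand" facet="true">
  <!-- Show at most 20 values per level, sorted by the default "count" function,
       under a custom root path; generate search-logic elements only on "sl0". -->
  <AdvancedParams facetSortFunction="count" facetReverseSort="false"
                  categoryFacetMaxPerLevel="20"
                  facetRoot="Top/Brands"
                  expandedSearchLogics="sl0" />
</AlphanumProperty>
```

Attributes that are not listed keep the default values shown in the table above.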
  • AdvancedPointParams com.exalead.datamodel.v10.AdvancedPointParams
    Advanced configuration options of a point property
    Parent objects
    • DataModelClass
    Attributes
    • facetSortFunction string
    • facetReverseSort boolean
    • facetExplicitSortOrderValues string
    • facetInHits boolean
    • facetInNavigation boolean
    • facetRoot string
    • categoryField string
    • categoryFacetMaxPerLevel int
    • facetMaxInHits int
    • numericalBits int
    • doublePrecision int
    • enableTrustedQueries boolean
    • dateInputFormat string
    • dateOutputFormat string
    • gzip boolean
    • unit string
    • patternSearchEnabled boolean
    • patternSearchOptimizationType enum(ngramv1, ngramv2)
    • forcedRankToTextMapping nullableint
    • detectTitle boolean
    • generateAnalysis boolean
    • generateIndexSchema boolean
    • generatePrefix boolean
    • generateFacet boolean
    • generateMeta boolean
    • expandedPipelines string
    • expandedSearchLogics string
    • generateFromSeparateMetas boolean
    • firstCoordinateMeta string
    • secondCoordinateMeta string
    Name Type Default value Description
    facetSortFunction string count Sorting method if a category facet is created for this property. Select "ReverseSort" to reverse the sorting order.
    facetReverseSort boolean False Reverse sorting order if a category facet is created for this property.
    facetExplicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    facetInHits
    (UI: Use for hit content)
    boolean True Displays this property in the hits content (if a category facet is created).
    facetInNavigation
    (UI: Use for navigation)
    boolean True Displays this property in the navigation (if a category facet is created).
    facetRoot
    (UI: Override root path)
    string - Replaces the default facet root path (Top/ClassProperties/<Property name>) (if a category facet is created).
    categoryField
    (UI: Override field)
    string - Replaces the category field for this property (if a category facet is created).
    categoryFacetMaxPerLevel
    (UI: Max. values)
    int 100 Displays a maximum of N values per level for this property (if a category facet is created).
    facetMaxInHits int 100 Displays a maximum of N values for this property (if a category facet is created).
    numericalBits
    (UI: Bits for numerical value)
    int 63 Number of bits used to store a numerical value.
    doublePrecision int 4 Number of figures after the decimal point (if this is a double property).
    enableTrustedQueries
    (UI: Trusted queries support)
    boolean False Precomputes the indexing of prefixes for the Trusted Queries widget.
    dateInputFormat string - Specifies an input format for DateFormatter for numerical properties with date or date_time type
    dateOutputFormat string - Specifies an output format for numerical properties with date or date_time type, used in full hits
    gzip boolean True Activates content compression (if an alphanum field is created)
    unit string - Unit Of Measurement in which the value is stored
    patternSearchEnabled boolean False Calculates additional dictionaries to perform pattern search within words (native prefix, suffix, substring search using #anumpattern) without relying on pattern expansion.
    This has a very high cost in indexing time and indexing size. Do not select unless advised by Exalead teams.
    patternSearchOptimizationType enum(ngramv1, ngramv2) ngramv1 Select algorithm based on dictionary ngrams or indexed ngrams (untokenized text only).
    forcedRankToTextMapping nullableint Sets the ranking value for chunks when mapping to the text field.
    -1 means that the chunk's internal ranking value is kept.
    detectTitle boolean False Detects words placed after # in the path and uses them as the title.
    generateAnalysis
    (UI: Generate analysis config)
    boolean True Creates the required mappings and document processors for this property. Only set this option to "false" to take complete control over analysis expansion for this property.
    generateIndexSchema
    (UI: Generate index schema)
    boolean True Creates the required index fields for this property. Only set this option to "false" to take complete control over index schema expansion for this property.
    generatePrefix
    (UI: Generate prefix handler)
    boolean True Creates a prefix handler in the search logics for this property. Only set this option to "false" to take complete control over query prefix handler expansion for this property.
    generateFacet
    (UI: Generate category facet)
    boolean True Creates a category facet in the search logics for this property. Only set this option to "false" to take complete control over facet config expansion for this property.
    generateMeta
    (UI: Generate hit meta)
    boolean True Creates a hit meta in the search logics for this property. Only set this option to "false" to take complete control over meta expansion for this property.
    expandedPipelines string - A comma-separated list of analysis pipelines to generate mappings and document processors on. If empty, generate on all pipelines.
    expandedSearchLogics string - A comma-separated list of search logics to generate prefix handlers, facets and hit metas on. If empty, generate on all search logics.
    generateFromSeparateMetas boolean False Content generated by several metas
    firstCoordinateMeta string - This meta will be used to define the first coordinate of the point
    secondCoordinateMeta string - This meta will be used to define the second coordinate of the point

    XML representation
    <AdvancedPointParams facetSortFunction="count" facetReverseSort="false" facetInHits="true" facetInNavigation="true" facetRoot="" categoryField="" categoryFacetMaxPerLevel="100" facetMaxInHits="100" numericalBits="63" doublePrecision="4" enableTrustedQueries="false" dateInputFormat="" dateOutputFormat="" gzip="true" patternSearchEnabled="false" patternSearchOptimizationType="ngramv1" forcedRankToTextMapping="" detectTitle="false" generateAnalysis="true" generateIndexSchema="true" generatePrefix="true" generateFacet="true" generateMeta="true" expandedPipelines="" expandedSearchLogics="" generateFromSeparateMetas="false" firstCoordinateMeta="" secondCoordinateMeta="" ></AdvancedPointParams>
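A sketch of a point built from two separate metas. The meta names lat and lon are hypothetical, and treating the first coordinate as the latitude is an assumption based on the "latitude,longitude" value format of point_geo properties. The element is shown on its own because this reference lists only DataModelClass as its parent.

```xml
<!-- Hypothetical metas "lat" and "lon" supply the two coordinates of the point. -->
<AdvancedPointParams generateFromSeparateMetas="true"
                     firstCoordinateMeta="lat"
                     secondCoordinateMeta="lon" />
```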
  • AdvancedGeoParams com.exalead.datamodel.v10.AdvancedGeoParams
    Advanced configuration options of a geo property
    Parent objects
    • DataModelClass
    Attributes
    • facetSortFunction string
    • facetReverseSort boolean
    • facetExplicitSortOrderValues string
    • facetInHits boolean
    • facetInNavigation boolean
    • facetRoot string
    • categoryField string
    • categoryFacetMaxPerLevel int
    • facetMaxInHits int
    • numericalBits int
    • doublePrecision int
    • enableTrustedQueries boolean
    • dateInputFormat string
    • dateOutputFormat string
    • gzip boolean
    • unit string
    • patternSearchEnabled boolean
    • patternSearchOptimizationType enum(ngramv1, ngramv2)
    • forcedRankToTextMapping nullableint
    • detectTitle boolean
    • generateAnalysis boolean
    • generateIndexSchema boolean
    • generatePrefix boolean
    • generateFacet boolean
    • generateMeta boolean
    • expandedPipelines string
    • expandedSearchLogics string
    • maxBlockSize int
    • precision int
    • bboxSuffix string
    Name Type Default value Description
    facetSortFunction string count Sorting method if a category facet is created for this property. Select "ReverseSort" to reverse the sorting order.
    facetReverseSort boolean False Reverse sorting order if a category facet is created for this property.
    facetExplicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    facetInHits
    (UI: Use for hit content)
    boolean True Displays this property in the hits content (if a category facet is created).
    facetInNavigation
    (UI: Use for navigation)
    boolean True Displays this property in the navigation (if a category facet is created).
    facetRoot
    (UI: Override root path)
    string - Replaces the default facet root path (Top/ClassProperties/<Property name>) (if a category facet is created).
    categoryField
    (UI: Override field)
    string - Replaces the category field for this property (if a category facet is created).
    categoryFacetMaxPerLevel
    (UI: Max. values)
    int 100 Displays a maximum of N values per level for this property (if a category facet is created).
    facetMaxInHits int 100 Displays a maximum of N values for this property (if a category facet is created).
    numericalBits
    (UI: Bits for numerical value)
    int 63 Number of bits used to store a numerical value.
    doublePrecision int 4 Number of figures after the decimal point (if this is a double property).
    enableTrustedQueries
    (UI: Trusted queries support)
    boolean False Precomputes the indexing of prefixes for the Trusted Queries widget.
    dateInputFormat string - Specifies an input format for DateFormatter for numerical properties with date or date_time type
    dateOutputFormat string - Specifies an output format for numerical properties with date or date_time type, used in full hits
    gzip boolean True Activates content compression (if an alphanum field is created)
    unit string - Unit Of Measurement in which the value is stored
    patternSearchEnabled boolean False Calculates additional dictionaries to perform pattern search within words (native prefix, suffix, substring search using #anumpattern) without relying on pattern expansion.
    This has a very high cost in indexing time and indexing size. Do not select unless advised by Exalead teams.
    patternSearchOptimizationType enum(ngramv1, ngramv2) ngramv1 Select algorithm based on dictionary ngrams or indexed ngrams (untokenized text only).
    forcedRankToTextMapping nullableint Sets the ranking value for chunks when mapping to the text field.
    -1 means that the chunk's internal ranking value is kept.
    detectTitle boolean False Detects words placed after # in the path and uses them as the title.
    generateAnalysis
    (UI: Generate analysis config)
    boolean True Creates the required mappings and document processors for this property. Only set this option to "false" to take complete control over analysis expansion for this property.
    generateIndexSchema
    (UI: Generate index schema)
    boolean True Creates the required index fields for this property. Only set this option to "false" to take complete control over index schema expansion for this property.
    generatePrefix
    (UI: Generate prefix handler)
    boolean True Creates a prefix handler in the search logics for this property. Only set this option to "false" to take complete control over query prefix handler expansion for this property.
    generateFacet
    (UI: Generate category facet)
    boolean True Creates a category facet in the search logics for this property. Only set this option to "false" to take complete control over facet config expansion for this property.
    generateMeta
    (UI: Generate hit meta)
    boolean True Creates a hit meta in the search logics for this property. Only set this option to "false" to take complete control over meta expansion for this property.
    expandedPipelines string - A comma-separated list of analysis pipelines to generate mappings and document processors on. If empty, generate on all pipelines.
    expandedSearchLogics string - A comma-separated list of search logics to generate prefix handlers, facets and hit metas on. If empty, generate on all search logics.
    maxBlockSize int 24 Maximum number of objects in lookup structure leaves.
    precision int 6 The number of decimals that will be used in geometrical representations and computations.
    bboxSuffix string _bbox Meta name suffix for BBox.

    XML representation
    <AdvancedGeoParams facetSortFunction="count" facetReverseSort="false" facetInHits="true" facetInNavigation="true" facetRoot="" categoryField="" categoryFacetMaxPerLevel="100" facetMaxInHits="100" numericalBits="63" doublePrecision="4" enableTrustedQueries="false" dateInputFormat="" dateOutputFormat="" gzip="true" patternSearchEnabled="false" patternSearchOptimizationType="ngramv1" forcedRankToTextMapping="" detectTitle="false" generateAnalysis="true" generateIndexSchema="true" generatePrefix="true" generateFacet="true" generateMeta="true" expandedPipelines="" expandedSearchLogics="" maxBlockSize="24" precision="6" bboxSuffix="_bbox" ></AdvancedGeoParams>
  • Label com.exalead.datamodel.v10.Label
    The translation of a data model label in a language
    Parent objects
    • AlphanumProperty
    • DataModelClass
    • DataModelEnum
    • DataModelEnumElement
    • DataProperty
    • EnumProperty
    • GeoProperty
    • NumericalProperty
    • ObjectProperty
    • Property
    Attributes
    • language string
    • value string
    Name Type Default value Description
    language string - Label language
    value string - Label value

    XML representation
    <Label language="en" value="" ></Label>
  • NumericalProperty com.exalead.datamodel.v10.NumericalProperty
    Numerical properties can optionally enable range comparisons (greater than, less than, ...) on their values.
    Parent objects
    • DataModelClass
    • DynamicProperty
    Attributes
    • overrideClassExpansionParams boolean
    • dedicatedField boolean
    • dynamicProperty string
    • searchable boolean
    • retrievable boolean
    • ramBased boolean
    • facet boolean
    • disjunctiveRefines boolean
    • defaultValue string
    • additionalMetaNames string
    • arity enum(zero_or_one, one, many)
    • name string
    • comment string
    • type enum(integer, double, date, date_time, point_geo, point_xy, unsigned, measure)
    Name Type Default value Description
    overrideClassExpansionParams boolean False
    dedicatedField boolean True Should we create a dedicated index field for this property, or use a multi-field encoding?
    A dedicated field is recommended if more than a few percent of objects have this property.
    Internal: if there is no dedicated field, we store in the "metas" field and index using prefixed-words.
    dynamicProperty string - Storing this meta in a dynamic property decreases the index's memory footprint and improves its commit time, but faceting and sorting will be slower.
    searchable
    (UI: Searchable)
    boolean True Allows users to search for matches in this property using a prefix.
    For example, to search for "API" in the title of a document, enter the search query: title:API
    retrievable
    (UI: Retrievable)
    boolean True Displays the meta in the hit content of search results.
    If the property doesn't have a dedicated field, the value will be CSV-encoded in the 'metas' field.
    ramBased
    (UI: RAM based)
    boolean False Stores this property in RAM. Use this option if you need to:
    • sort search results on this property.
    • use this property in a virtual field.
    • use this property in a virtual facet.
    • use this property for an aggregation value.

    Only the "retrievable" part of the property is stored in RAM.
    facet boolean False Creates a navigation facet from the property values.
    disjunctiveRefines
    (UI: Disjunctive refines)
    boolean False Makes refinements on this facet disjunctive ("checkboxes") rather than exclusive (select one).
    defaultValue string - If there is no value for this property in the document, use this default value.
    additionalMetaNames string - Maps these metas (as a comma-separated list) to this index or facet field.
    Usable only for properties that have a dedicated field, have a facet, or are searchable without a prefix.
    arity
    (UI: Cardinality)
    enum(zero_or_one, one, many) zero_or_one The arity of the property (zero_or_one, one, many).
    name string - Property name
    comment string - Optional comment to explain property use.
    type enum(integer, double, date, date_time, point_geo, point_xy, unsigned, measure) Type of this numerical property.
    Depending on the type, additional validation and formatting will be performed on the value.
    Valid values are:
    • integer: The value must be an integer (this is the default).
    • double: The value must be a double. It is possible to configure the precision in advanced parameters.
    • date: A date, with a day precision.
    • date_time: A date, with time (second precision).
    • point_geo: A point on a sphere, in GPS coordinates. The value must have a "latitude,longitude" format, with latitude being a double between -90 and +90, and longitude being a double between -180 and 180.
    • point_xy: A point on an XY plane. The value must have an "X,Y" format, with X and Y being positive integers.
    • unsigned: The value must be a positive integer.
    • measure: The value must be a double and can be associated with a unit-of-measurement symbol.
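
    The format rules listed above for point_geo and point_xy can be sketched as small validators. This is only an illustration of the documented constraints, not CloudView's actual validation code; the function names are hypothetical, and accepting 0 as a valid point_xy coordinate is an assumption.

```python
def parse_point_geo(value: str) -> tuple:
    """Parse a "latitude,longitude" string and check the documented ranges."""
    lat_text, lon_text = value.split(",")  # raises ValueError if not exactly two parts
    lat, lon = float(lat_text), float(lon_text)
    if not -90.0 <= lat <= 90.0:
        raise ValueError(f"latitude out of range: {lat}")
    if not -180.0 <= lon <= 180.0:
        raise ValueError(f"longitude out of range: {lon}")
    return lat, lon


def parse_point_xy(value: str) -> tuple:
    """Parse an "X,Y" string of integers; negative coordinates are rejected."""
    x_text, y_text = value.split(",")
    x, y = int(x_text), int(y_text)
    # Assumption: 0 is accepted; the reference only says "positive integers".
    if x < 0 or y < 0:
        raise ValueError(f"coordinates must not be negative: {x},{y}")
    return x, y
```

    For example, parse_point_geo("48.85,2.35") returns the pair (48.85, 2.35), while parse_point_geo("91,0") raises a ValueError because the latitude exceeds +90.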

    Nested elements
    • tags KeyValue*
    • AdvancedParams
    • KeyValue*
    • Label*
    Type Wrapper Description
    KeyValue* tags User-defined tags for this property. This can be used for example to implement 6W faceting of the data model.
    AdvancedParams -
    KeyValue* -
    Label* - Labels for the internationalization of this property's name
    XML representation
    <NumericalProperty overrideClassExpansionParams="false" dedicatedField="true" searchable="true" retrievable="true" ramBased="false" facet="false" disjunctiveRefines="false" defaultValue="" additionalMetaNames="" arity="zero_or_one" name="" comment="" type="" ></NumericalProperty>
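
    As a hedged illustration of how a NumericalProperty element might be filled in, the following sketch builds one with Python's standard xml.etree.ElementTree module. The helper name and the property name "price" are hypothetical; only the element name, the attribute names, and the allowed type values come from this reference.

```python
import xml.etree.ElementTree as ET

# Allowed values copied from the documented "type" enum.
NUMERICAL_TYPES = {
    "integer", "double", "date", "date_time",
    "point_geo", "point_xy", "unsigned", "measure",
}


def numerical_property(name: str, type_: str, **attributes: str) -> str:
    """Serialize a NumericalProperty element with the given attributes."""
    if type_ not in NUMERICAL_TYPES:
        raise ValueError(f"unsupported type: {type_}")
    element = ET.Element(
        "NumericalProperty", {"name": name, "type": type_, **attributes}
    )
    return ET.tostring(element, encoding="unicode")


# "price" is a hypothetical property name used only for illustration.
xml_text = numerical_property("price", "double", facet="true", ramBased="true")
```

    Attributes that are left out keep the default values listed in the table above.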
  • GeoProperty com.exalead.datamodel.v10.GeoProperty
    Geo property
    Parent objects
    • DataModelClass
    • DynamicProperty
    Attributes
    • overrideClassExpansionParams boolean
    • dedicatedField boolean
    • dynamicProperty string
    • searchable boolean
    • retrievable boolean
    • ramBased boolean
    • facet boolean
    • disjunctiveRefines boolean
    • defaultValue string
    • additionalMetaNames string
    • arity enum(zero_or_one, one, many)
    • name string
    • comment string
    • type enum(Planar)
    Name Type Default value Description
    overrideClassExpansionParams boolean False
    dedicatedField boolean True Should we create a dedicated index field for this property, or use a multi-field encoding?
    A dedicated field is recommended if more than a few percent of objects have this property.
    Internal: if there is no dedicated field, we store in the "metas" field and index using prefixed-words.
    dynamicProperty string - Storing this meta in a dynamic property decreases the index's memory footprint and improves its commit time. Faceting and sorting will be slower.
    searchable
    (UI: Searchable)
    boolean True Allows users to search for matches in this property using a prefix.
    For example, to search for "API" in the title of a document, enter the search query title:API
    retrievable
    (UI: Retrievable)
    boolean True Displays the meta in the hit content of search results.
    If the property doesn't have a dedicated field, the value will be CSV-encoded in the 'metas' field.
    ramBased
    (UI: RAM based)
    boolean False Stores this property in RAM. Use this option if you need to:
    • sort search results on this property.
    • use this property in a virtual field.
    • use this property in a virtual facet.
    • use this property for an aggregation value.

    Only the "retrievable" part of the property is stored in RAM.
    facet boolean False Creates a navigation facet from the property values.
    disjunctiveRefines
    (UI: Disjunctive refines)
    boolean False Makes refinements on this facet disjunctive ("checkboxes") rather than exclusive (select one).
    defaultValue string - If there is no value for this property in the document, use this default value.
    additionalMetaNames string - Maps these metas (as a comma-separated list) to this index or facet field.
    Usable only for properties that have a dedicated field, have a facet, or are searchable without a prefix.
    arity
    (UI: Cardinality)
    enum(zero_or_one, one, many) zero_or_one The arity of the property (zero_or_one, one, many).
    name string - Property name
    comment string - Optional comment to explain property use.
    type enum(Planar) Coordinate system of this geo property.
    For now, only cartesian/planar systems are supported.

    Nested elements
    • tags KeyValue*
    • AdvancedParams
    • KeyValue*
    • Label*
    Type Wrapper Description
    KeyValue* tags User-defined tags for this property. This can be used for example to implement 6W faceting of the data model.
    AdvancedParams -
    KeyValue* -
    Label* - Labels for the internationalization of this property's name
    XML representation
    <GeoProperty overrideClassExpansionParams="false" dedicatedField="true" searchable="true" retrievable="true" ramBased="false" facet="false" disjunctiveRefines="false" defaultValue="" additionalMetaNames="" arity="zero_or_one" name="" comment="" type="" ></GeoProperty>
  • ObjectProperty (deprecated) com.exalead.datamodel.v10.ObjectProperty
    Property storing a link to another DataModelClass.
    It can only store the identifier of the foreign object in the holding class. This only gives the ability to perform some high-level fetch of related objects. This does not enable searching for this object using foreign constraints.
    Parent objects
    • DataModelClass
    • DynamicProperty
    Attributes
    • arity enum(zero_or_one, one, many)
    • name string
    • comment string
    • type string
    • generateInnerjoinKey boolean
    • storeIdentifier boolean
    Name Type Default value Description
    arity
    (UI: Cardinality)
    enum(zero_or_one, one, many) zero_or_one The arity of the property (zero_or_one, one, many).
    name string - Property name
    comment string - Optional comment to explain property use.
    type string - Name of the related DataModelClass
    generateInnerjoinKey boolean False NOT IMPLEMENTED.
    Generates a field that can be used for performing INNERJOIN between this property and its class.
    Note that doing this incurs a memory cost.
    storeIdentifier boolean False Stores the unique identifiers of the property values for further retrieval.

    Nested elements
    • tags KeyValue*
    • KeyValue*
    • Label*
    Type Wrapper Description
    KeyValue* tags User-defined tags for this property. This can be used for example to implement 6W faceting of the data model.
    KeyValue* -
    Label* - Labels for the internationalization of this property's name
    XML representation
    <ObjectProperty arity="zero_or_one" name="" comment="" type="" generateInnerjoinKey="false" storeIdentifier="false" ></ObjectProperty>
  • EnumProperty (deprecated) com.exalead.datamodel.v10.EnumProperty
    A property referring to a data model enum.
    Internals: always indexed as a category.
    This is not recommended, as it has little added value over an alphanum property. An alphanum property can actually be faster when using the "enumerated" flag.
    Parent objects
    • DataModelClass
    • DynamicProperty
    Attributes
    • arity enum(zero_or_one, one, many)
    • name string
    • comment string
    • type string
    • disjunctiveRefines boolean
    Name Type Default value Description
    arity
    (UI: Cardinality)
    enum(zero_or_one, one, many) zero_or_one The arity of the property (zero_or_one, one, many).
    name string - Property name
    comment string - Optional comment to explain property use.
    type string - Name of the related DataModelEnum.
    disjunctiveRefines boolean False Are refinements on this facet disjunctive ("checkboxes") or exclusive (select one)?

    Nested elements
    • tags KeyValue*
    • KeyValue*
    • Label*
    Type Wrapper Description
    KeyValue* tags User-defined tags for this property. This can be used for example to implement 6W faceting of the data model.
    KeyValue* -
    Label* - Labels for the internationalization of this property's name
    XML representation
    <EnumProperty arity="zero_or_one" name="" comment="" type="" disjunctiveRefines="false" ></EnumProperty>
  • ExpansionParams com.exalead.datamodel.v10.ExpansionParams
    No documentation for this element.
    Parent objects
    • DataModelClass
    Attributes
    • generateAnalysis boolean
    • generateIndexSchema boolean
    • generatePrefix boolean
    • generateFacet boolean
    • generateMeta boolean
    • expandedPipelines string
    • expandedSearchLogics string
    Name Type Default value Description
    generateAnalysis
    (UI: Generate analysis config)
    boolean True Creates the required mappings and document processors for this property. Only set this option to "false" to take complete control over analysis expansion for this property.
    generateIndexSchema
    (UI: Generate index schema)
    boolean True Creates the required index fields for this property. Only set this option to "false" to take complete control over index schema expansion for this property.
    generatePrefix
    (UI: Generate prefix handler)
    boolean True Creates a prefix handler in the search logics for this property. Only set this option to "false" to take complete control over query prefix handler expansion for this property.
    generateFacet
    (UI: Generate category facet)
    boolean True Creates a category facet in the search logics for this property. Only set this option to "false" to take complete control over facet config expansion for this property.
    generateMeta
    (UI: Generate hit meta)
    boolean True Creates a hit meta in the search logics for this property. Only set this option to "false" to take complete control over meta expansion for this property.
    expandedPipelines string - A comma-separated list of analysis pipelines to generate mappings and document processors on. If empty, generate on all pipelines.
    expandedSearchLogics string - A comma-separated list of search logics to generate prefix handlers, facets and hit metas on. If empty, generate on all search logics.

    XML representation
    <ExpansionParams generateAnalysis="true" generateIndexSchema="true" generatePrefix="true" generateFacet="true" generateMeta="true" expandedPipelines="" expandedSearchLogics="" ></ExpansionParams>
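
    The "comma-separated list, empty means all" rule used by expandedPipelines and expandedSearchLogics can be sketched as follows. This is a minimal illustration, not CloudView's implementation; the function name and the pipeline names "ap0" and "ap1" are hypothetical, and silently ignoring names that do not exist is an assumption.

```python
def expansion_targets(attribute_value: str, available: list) -> list:
    """Return the names on which expansion is generated.

    An empty attribute selects every available pipeline or search logic.
    """
    requested = [name.strip() for name in attribute_value.split(",") if name.strip()]
    if not requested:
        return list(available)
    # Assumption: names that are not defined are simply ignored.
    return [name for name in available if name in requested]
```

    With available pipelines ["ap0", "ap1"], an empty value selects both, while "ap1" restricts the expansion to that single pipeline.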
  • DataModelEnum com.exalead.datamodel.v10.DataModelEnum
    A custom enum allows you to define a closed list of values for a property.
    Note that i18n is not implemented.
    This option is not recommended.
    Parent objects
    • DataModel
    Attributes
    • name string
    • strict boolean
    • defaultValue int
    Name Type Default value Description
    name string -
    strict boolean False If strict, this data type will only accept values listed in elements.
    Warning: This is not implemented for the moment.
    defaultValue int -1 Index of default value in the elements list. -1 = no default value for property

    Nested elements
    • DataModelEnumElement*
    • Label*
    Type Wrapper Description
    DataModelEnumElement* - Possible values for the enum.
    Label* - Labels for i18n (not implemented).
    XML representation
    <DataModelEnum name="" strict="false" defaultValue="-1" ></DataModelEnum>
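
    The defaultValue attribute is an index into the list of DataModelEnumElement entries, with -1 meaning that there is no default. A small sketch of that lookup, under the assumption that indexes are zero-based (the reference does not say so explicitly) and with a hypothetical function name:

```python
from typing import Optional


def enum_default(elements: list, default_index: int = -1) -> Optional[str]:
    """Return the default element name, or None when default_index is -1."""
    if default_index == -1:
        return None
    # Assumption: the index is zero-based and must point at an existing element.
    if not 0 <= default_index < len(elements):
        raise ValueError(f"defaultValue out of range: {default_index}")
    return elements[default_index]
```

    For an enum with the hypothetical elements ["low", "medium", "high"], a defaultValue of 1 selects "medium".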
  • DataModelEnumElement com.exalead.datamodel.v10.DataModelEnumElement
    No documentation for this element.
    Parent objects
    • DataModelEnum
    Attributes
    • name string
    Name Type Default value Description
    name string -

    Nested elements
    • Label*
    Type Wrapper Description
    Label* - Labels for i18n (not implemented).
    XML representation
    <DataModelEnumElement name="" ></DataModelEnumElement>
  • SemanticType com.exalead.datamodel.v10.SemanticType
    Parent objects
    • DataModel
    Attributes
    • name string
    • extraContexts string
    • tokenize boolean
    • urlProcessing boolean
    • indexNormalized boolean
    • indexLowercase boolean
    • indexExact boolean
    • indexSeparators boolean
    • detectLanguage boolean
    • extractWords boolean
    • extractNamedEntities boolean
    • extractRelatedTerms boolean
    • extractSpellCheckNGrams boolean
    • extractPhoneticForms boolean
    • tokenizationConfig string
    • rankForDedicatedMapping long
    • rankForTextMapping long
    • dictionaryName string
    Name Type Default value Description
    name string - Name for this type, to be used in the "type" field of the AlphanumProperty.
    extraContexts string - Extra analysis contexts (not datamodel-controlled) on which this text type must be applied.
    tokenize
    (UI: Tokenize)
    boolean True Splits phrases into individual words or tokens. Required for index-time semantic processing.
    urlProcessing
    (UI: Process as an URL)
    boolean False Creates 3 prefix handlers for this property, for the 'site', 'url', and 'inurl' features.
    indexNormalized
    (UI: Index as normalized)
    boolean True Indexes CaFé as cafe.
    indexLowercase
    (UI: Index as lowercase)
    boolean False Indexes CaFé as café.
    indexExact
    (UI: Index as exact)
    boolean False Indexes CaFé as CaFé.
    indexSeparators
    (UI: Index separators)
    boolean True Indexes the position of separators to enable search within a string. Select this option when using the "split" type prefix handler.
    detectLanguage
    (UI: Detect language)
    boolean True Determines the language of a document by analyzing its text.

    Required for extracting spell check ngrams, phonetic forms, named entities and related terms.

    For performance reasons, only select this option if the documents to be pushed do not already include a 'language' meta.

    Selecting this option creates a meta called 'language', as well as a Language facet for search results display in the Refinements panel.
    extractWords
    (UI: Extract words)
    boolean True Extracts the words of each document to the dictionary targeted by this semantic type.
    extractNamedEntities
    (UI: Extract named entities)
    boolean False Flags famous people, places, organizations or events, and annotates the corresponding index field with the prefix NE:<entity type>.

    This option adds:
    • A Named Entities Matcher processor to the semantic analysis pipeline.
    • Categories for each named entity annotation in a document.
    • Named entity facets in the search logic, to be displayed in the Refinements panel.
    extractRelatedTerms
    (UI: Extract related terms)
    boolean False Finds important concepts within the corpus and stores them in the dictionary targeted by this semantic type.

    To display related terms in the Refinements panel of your search application, you must enable them through the search logics.

    This option adds a Related Terms Extractor processor to the semantic analysis pipeline.
    extractSpellCheckNGrams
    (UI: Extract spell check ngrams)
    boolean False Calculates the probability of words or phrases occurring within the corpus and stores them in the dictionary targeted by this semantic type.

    This significantly improves the effectiveness of spell-checking.

    This option adds an NGrams Extractor processor to the semantic analysis pipeline.
    extractPhoneticForms
    (UI: Extract phonetic forms)
    boolean False Creates a phonetic version for each word and stores them in the dictionary targeted by this semantic type.

    This significantly improves the effectiveness of spell check and enables phonetic search (for example, soundslike: exaleed).

    This option adds a Phonetizer processor to the semantic analysis pipeline.
    tokenizationConfig
    (UI: Tokenization config)
    string - Defines the tokenization config to use for analysis and search. Found in the linguistic configuration.
    rankForDedicatedMapping long 4 Ranking value for the mapping to a dedicated index field.
    rankForTextMapping long 3 Ranking value for the mapping to the "text" index field.
    dictionaryName
    (UI: Dictionary)
    string - A dictionary is a structure, separate from the index, that stores all the words of the indexed documents and their number of occurrences in the corpus.

    It's used for linguistic expansion mechanisms such as spell-checking or regular expression matching.

    If the value is "_None_", words are not stored in a dictionary. In admin-ui, select "None" to get this behavior.

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* - Custom parameters.
    XML representation
    <SemanticType name="" extraContexts="" tokenize="true" urlProcessing="false" indexNormalized="true" indexLowercase="false" indexExact="false" indexSeparators="true" detectLanguage="true" extractWords="true" extractNamedEntities="false" extractRelatedTerms="false" extractSpellCheckNGrams="false" extractPhoneticForms="false" tokenizationConfig="tok0" rankForDedicatedMapping="4" rankForTextMapping="3" dictionaryName="" ></SemanticType>
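
    The three index forms controlled by indexNormalized, indexLowercase, and indexExact can be approximated as follows. This is a rough sketch that models normalization as lowercasing plus accent stripping, which matches the documented example but is only an assumption about what CloudView's normalizer does in general; the function name is hypothetical.

```python
import unicodedata


def index_forms(token: str) -> dict:
    """Return the exact, lowercase, and normalized forms of a token."""
    lowercase = token.lower()
    # Decompose accented characters, then drop the combining accent marks.
    decomposed = unicodedata.normalize("NFD", lowercase)
    normalized = "".join(c for c in decomposed if not unicodedata.combining(c))
    return {
        "indexExact": token,
        "indexLowercase": lowercase,
        "indexNormalized": normalized,
    }
```

    Applied to the token used in the attribute descriptions, the exact form keeps the original case and accent, the lowercase form keeps only the accent, and the normalized form keeps neither.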
  • ConnectorList com.exalead.mercury.mami.connect.v10.ConnectorList
    List of connectors defined in the system.
    Attributes
    • version long
    Name Type Default value Description
    version long - -

    Nested elements
    • Connector*
    Type Wrapper Description
    Connector* - List of connectors.
    XML representation
    <ConnectorList version="" ></ConnectorList>
  • Connector com.exalead.mercury.mami.connect.v10.Connector
    The configuration of a connector
    Parent objects
    • ConnectorList
    Attributes
    • name string
    • classId string
    • customClassId string
    • managed boolean
    • connectorServer string
    • buildGroup string
    • pushAPIServer string
    • authenticationMode enum(public, basic)
    • login string
    • password string
    • defaultDataModelClass string
    • documentsType string
    • generated boolean
    Name Type Default value Description
    name string - The name of the connector.
    classId string - Connector class identifier.
    This identifies the kind of data source this connector connects to, as well as the actual implementation.
    The class identifier can be a reference to an exascript class, a Java class or a .NET Class.
    This attribute must be null for unmanaged connectors.
    The value provided should be one of the values returned by listConnectorTypes.
    customClassId string - Implementation class identifier.
    Optional.
    Can be used to specify a custom implementation of the connector.
    managed boolean False Indicates that the connector is managed by a framework.

    A managed connector is launched and operated as a CloudView service, whereas an 'unmanaged connector' is handled by a third-party process.
    connectorServer string - Defines the connector server hosting this connector.

    For managed connectors only. Connectors are deployed in a connector server. There are 3 kinds of connector server: exascript, Java or .NET.
    The value of this attribute should refer to a connector server defined in the deployment configuration (for example, exa0, java0, dotnet0).
    buildGroup string - Defines the build group that will receive the documents.

    For managed connectors only. For a single connector server, different connectors can push to different build groups.
    pushAPIServer string - Defines the Push API server that will receive the documents.

    For managed connectors only. For a single connector server, different connectors can push to different Push API servers.
    authenticationMode enum(public, basic) public Connector authentication mode. Valid values are public and basic.
    If set to basic, the indexing-server will require authentication to push content with this connector name.

    The remaining descriptions apply to the config and forcedMeta nested elements:
    • config: The connector configuration parameters. Connector parameters, such as the data sources or the folders to index in the data source, are provided as key-values. No configuration parameters should be provided for unmanaged connectors.
    • forcedMeta: The set of metadata to be automatically added to indexed documents. These metadata items are inserted into each document from this connector when the document is analyzed.
    login string - User login if authenticationMode parameter is set to basic.
    password string - User password if authenticationMode parameter is set to basic.
    defaultDataModelClass string - Specifies the class in which the documents should be indexed if the connector does not provide a dataModelClass indication for its documents.

    The default value (null) means that the documents will go in the default class of the DataModel.
    documentsType string - Type of documents produced by this source.
    The type of documents must match one of the types declared in your CloudView license file.
    generated boolean False Indicates whether this connector was generated automatically by another component or added manually by the user.
    A generated connector must be configured by the component that generated it and is not editable in admin-ui.

    Nested elements
    • config KeyValue*
    • forcedMeta KeyValue*
    • ConnectorScheduledScan*
    • PostProcessingPipeline
    • PushConfig
    • SourceCachingConfig
    • SourceFetchConfig
    • SourcePreviewConfig
    • SourceThumbnailsConfig
    Type Wrapper Description
    KeyValue* config
    KeyValue* forcedMeta
    ConnectorScheduledScan* - List of scheduled scans for the connector.
    PostProcessingPipeline - Pipeline post-processing documents sent by this connector.
    PushConfig - Specifies how documents are sent to the indexing-server.
    SourceCachingConfig -
    SourceFetchConfig -
    SourcePreviewConfig -
    SourceThumbnailsConfig -
    XML representation
    <Connector name="" classId="" customClassId="" managed="false" connectorServer="" buildGroup="" pushAPIServer="" authenticationMode="public" login="" password="" defaultDataModelClass="" documentsType="" generated="false" ></Connector>
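
    The connector's defaultDataModelClass is the third step in the class resolution order documented for the DataModel defaultClass attribute. That fallback chain can be sketched as below; the function and parameter names are hypothetical, and the class names in the usage note are examples only.

```python
from typing import Optional


def resolve_data_model_class(
    directive: Optional[str] = None,          # papi custom directive "datamodel_class"
    type_meta: Optional[str] = None,          # papi meta "TYPE"
    connector_default: Optional[str] = None,  # Connector defaultDataModelClass
    datamodel_default: Optional[str] = None,  # DataModel defaultClass
) -> Optional[str]:
    """Return the first value that is set, following the documented order."""
    for candidate in (directive, type_meta, connector_default, datamodel_default):
        if candidate:
            return candidate
    return None
```

    A document pushed with a "TYPE" meta of "invoice" through a connector whose defaultDataModelClass is "file" resolves to "invoice"; without the meta or a directive, it resolves to "file".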
  • ConnectorScheduledScan com.exalead.mercury.mami.connect.v10.ConnectorScheduledScan
    Defines the connector scheduling
    Parent objects
    • Connector
    Attributes
    • scanMode string
    Name Type Default value Description
    scanMode string - The scan mode to trigger.

    Nested elements
    • scanModeConfig KeyValue*
    • ConnectorSchedulingTrigger
    Type Wrapper Description
    KeyValue* scanModeConfig The configuration given to the scheduled scan.
    ConnectorSchedulingTrigger -
    XML representation
    <ConnectorScheduledScan scanMode="" ></ConnectorScheduledScan>
  • ConnectorSchedulingTrigger com.exalead.mercury.mami.connect.v10.ConnectorSchedulingTrigger
    Defines the scheduled interval.
    Parent objects
    • ConnectorScheduledScan
    Attributes
    • startTimestamp long
    • endTimestamp long
    Name Type Default value Description
    startTimestamp long -
    endTimestamp long -

    XML representation
    <ConnectorSchedulingTrigger startTimestamp="" endTimestamp="" ></ConnectorSchedulingTrigger>
  • SimpleConnectorSchedulingTrigger com.exalead.mercury.mami.connect.v10.SimpleConnectorSchedulingTrigger
    Defines the scheduled interval.
    Parent objects
    • ConnectorScheduledScan
    Attributes
    • startTimestamp long
    • endTimestamp long
    • repeatInterval long
    Name Type Default value Description
    startTimestamp long -
    endTimestamp long -
    repeatInterval long -

    XML representation
    <SimpleConnectorSchedulingTrigger startTimestamp="" endTimestamp="" repeatInterval="" ></SimpleConnectorSchedulingTrigger>
  • CronConnectorSchedulingTrigger com.exalead.mercury.mami.connect.v10.CronConnectorSchedulingTrigger
    Configures the scheduled scan for a connector, given a start timestamp and an end timestamp.
    Parent objects
    • ConnectorScheduledScan
    Attributes
    • startTimestamp long
    • endTimestamp long
    • cronExpression string
    Name Type Default value Description
    startTimestamp long -
    endTimestamp long -
    cronExpression string - The Quartz Cron expression made of five time and date fields.

    XML representation
    <CronConnectorSchedulingTrigger startTimestamp="" endTimestamp="" cronExpression="" ></CronConnectorSchedulingTrigger>
  • CustomPostProcessingPipeline com.exalead.mercury.mami.connect.v10.CustomPostProcessingPipeline
    Post-processing pipeline based on a custom Java class. You must provide a component that implements the com.exalead.dataprocessing.processors.cloudview.papi.connect.ConnectorDataProcessingPipelineBuilder.java interface.
    The component must be packaged in a plugin.
    The ConnectorDataProcessingPipelineBuilder must create its DataProcessing API pipeline, and all documents will be sent through it.
    Parent objects
    • Connector
    Attributes
    • builderClassId string
    Name Type Default value Description
    builderClassId string - Java class id of the PipelineBuilder implementation.

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* - Configuration of the PipelineBuilder implementation.
    XML representation
    <CustomPostProcessingPipeline builderClassId="" ></CustomPostProcessingPipeline>
  • SimplePostProcessingPipeline com.exalead.mercury.mami.connect.v10.SimplePostProcessingPipeline
    Creates a configured post-processing pipeline
    Nested elements
    • ScanPipeline PipelineBranch
    • FetchPipeline PipelineBranch
    Type Wrapper Description
    PipelineBranch ScanPipeline A PAPI Source processor will be automatically added at the beginning of the branch on connector's scan.
    PipelineBranch FetchPipeline A PAPI Source processor will be automatically added at the beginning of the branch on connector's fetch.
    XML representation
    <SimplePostProcessingPipeline ></SimplePostProcessingPipeline>
  • BasicPipelineBranch com.exalead.mercury.mami.connect.v10.BasicPipelineBranch
    A Simple PipelineBranch
    Parent objects
    • BasicPipelineBranch
    • MultithreadedDispatchBranch
    • PipelineBranch
    Attributes
    • input string
    • output string
    • autolink boolean
    • name string
    Name Type Default value Description
    input string - Name of the BranchAction used as input (with the processor's input if it is a Process action)
    output string - Name of the BranchAction used as output (with the processor's output if it is a Process action)
    autolink boolean False Automatically links successive BranchActions on their first input and output when they have no explicit link.
    name string - Unique name for this pipeline branch

    Nested elements
    • BranchAction*
    • PipelineLink*
    Type Wrapper Description
    BranchAction* - Actions in the branch
    PipelineLink* - Links between actions
    XML representation
    <BasicPipelineBranch input="" output="" autolink="false" name="" ></BasicPipelineBranch>
  • PipelineLink com.exalead.mercury.mami.connect.v10.PipelineLink
    Link between two BranchActions
    Parent objects
    • BasicPipelineBranch
    • MultithreadedDispatchBranch
    • PipelineBranch
    Attributes
    • source string
    • target string
    Name Type Default value Description
    source string - Name of the output which is used as this link source
    target string - Name of the input which is used as this link target

    XML representation
    <PipelineLink source="" target="" ></PipelineLink>
  • MultithreadedDispatchBranch com.exalead.mercury.mami.connect.v10.MultithreadedDispatchBranch
    Replicates a branch multiple times and connects a dispatch to the input and a union to the output
    Parent objects
    • BasicPipelineBranch
    • MultithreadedDispatchBranch
    • PipelineBranch
    Attributes
    • input string
    • output string
    • autolink boolean
    • name string
    • nbThreads int
    Name Type Default value Description
    input string - Name of the BranchAction used as input (with the processor's input if it is a Process action)
    output string - Name of the BranchAction used as output (with the processor's output if it is a Process action)
    autolink boolean False Automatically links successive BranchActions on their first input and output when they have no explicit link.
    name string - Unique name for this pipeline branch
    nbThreads int 4 Number of replications of this branch

    Nested elements
    • BranchAction*
    • PipelineLink*
    Type Wrapper Description
    BranchAction* - Actions in the branch
    PipelineLink* - Links between actions
    XML representation
    <MultithreadedDispatchBranch input="" output="" autolink="false" name="" nbThreads="4" ></MultithreadedDispatchBranch>
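
    The "dispatch, replicate, union" shape described for this branch can be sketched in a few lines. This is a conceptual illustration only: it assumes a round-robin dispatch and a simple concatenating union, neither of which is specified by this reference, and the function name is hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import cycle


def run_replicated(records, branch, nb_threads=4):
    """Dispatch records over nb_threads copies of branch and union the results."""
    if nb_threads < 1:
        raise ValueError("nb_threads must be at least 1")
    # Dispatch: hand out records to the replicas in round-robin order (assumption).
    lanes = [[] for _ in range(nb_threads)]
    for lane, record in zip(cycle(lanes), records):
        lane.append(record)
    # Each replica processes its own lane in a separate thread.
    with ThreadPoolExecutor(max_workers=nb_threads) as pool:
        results = pool.map(lambda lane: [branch(r) for r in lane], lanes)
        # Union: merge the outputs of all replicas into one list.
        return [item for lane_result in results for item in lane_result]
```

    Every input record is processed exactly once, but the output order generally differs from the input order, which is why a union step may need to restore ordering.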
  • CustomProcess com.exalead.mercury.mami.connect.v10.CustomProcess
    Processes the records with a custom processor. The action name will be the processor name.
    Parent objects
    • BasicPipelineBranch
    • MultithreadedDispatchBranch
    • PipelineBranch
    Attributes
    • name string
    • classId string
    Name Type Default value Description
    name string - Unique name for this pipeline branch
    classId string - Processor's class

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* - Processor's configuration
    XML representation
    <CustomProcess name="" classId="" ></CustomProcess>
  • QueueProcess com.exalead.mercury.mami.connect.v10.QueueProcess
    Creates a QueueProcessor
    Parent objects
    • BasicPipelineBranch
    • MultithreadedDispatchBranch
    • PipelineBranch
    Attributes
    • name string
    • capacity int
    Name Type Default value Description
    name string - Unique name for this pipeline branch
    capacity int 0 Maximum number of elements in the queue. 0 means no limit.

    XML representation
    <QueueProcess name="" capacity="0" ></QueueProcess>
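
    The capacity convention (0 means unbounded) happens to match Python's standard queue.Queue, where maxsize=0 also means no limit, so it can be illustrated directly. What the QueueProcessor does when it is full (block, reject, or something else) is not stated in this reference; the sketch below simply reports whether an element was accepted, and its function names are hypothetical.

```python
import queue


def make_queue(capacity: int = 0) -> queue.Queue:
    """Create a queue; capacity 0 means no limit, as in the QueueProcess default."""
    if capacity < 0:
        raise ValueError("capacity must be 0 (no limit) or a positive number")
    return queue.Queue(maxsize=capacity)


def try_enqueue(q: queue.Queue, item) -> bool:
    """Add item without waiting; return False if the queue is full."""
    try:
        q.put_nowait(item)
        return True
    except queue.Full:
        return False
```

    A queue created with capacity 2 accepts two elements and refuses the third until one is consumed, whereas the default capacity of 0 never refuses an element.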
  • ForwardProcess com.exalead.mercury.mami.connect.v10.ForwardProcess
    Creates a ForwardProcessor
    Parent objects
    • BasicPipelineBranch
    • MultithreadedDispatchBranch
    • PipelineBranch
    Attributes
    • name string
    Name Type Default value Description
    name string - Unique name for this pipeline branch

    XML representation
    <ForwardProcess name="" ></ForwardProcess>
  • CloudViewPushAPITargetProcess com.exalead.mercury.mami.connect.v10.CloudViewPushAPITargetProcess
    Creates a CloudViewPushAPITargetProcessor. All instances will share the same PushAPI.
    Parent objects
    • BasicPipelineBranch
    • MultithreadedDispatchBranch
    • PipelineBranch
    Attributes
    • name string
    Name Type Default value Description
    name string - Unique name for this pipeline branch

    XML representation
    <CloudViewPushAPITargetProcess name="" ></CloudViewPushAPITargetProcess>
  • UnionProcess com.exalead.mercury.mami.connect.v10.UnionProcess
    Creates a UnionProcessor
    Parent objects
    • BasicPipelineBranch
    • MultithreadedDispatchBranch
    • PipelineBranch
    Attributes
    • name string
    • orderingByMarker boolean
    Name Type Default value Description
    name string - Unique name for this pipeline branch
    orderingByMarker boolean True This behavior enables the restoration of sessions when a stream has been split into multiple streams which are joined by this processor. Even if some split streams process faster, the elements from the previous session will all be sent before beginning to send the next one.

    XML representation
    <UnionProcess name="" orderingByMarker="true" ></UnionProcess>
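
    The effect of orderingByMarker can be pictured with a simplified, non-streaming model: each split stream carries elements tagged with a session number, and the union emits every element of one session before any element of the next. Representing the marker as an integer session number is an assumption made for this sketch, and the function name is hypothetical.

```python
def union_ordered(streams):
    """Merge streams of (session, element) pairs, one whole session at a time."""
    sessions = sorted({session for stream in streams for session, _ in stream})
    merged = []
    for session in sessions:
        # Drain this session from every stream before moving to the next one.
        for stream in streams:
            merged.extend(element for s, element in stream if s == session)
    return merged
```

    Even if one stream has already produced elements for session 2, they are held back until all session 1 elements from the other streams have been sent.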
  • RoundRobinProcess com.exalead.mercury.mami.connect.v10.RoundRobinProcess
    Creates a RoundRobinProcessor
    Parent objects
    • BasicPipelineBranch
    • MultithreadedDispatchBranch
    • PipelineBranch
    Attributes
    • name string
    Name Type Default value Description
    name string - Unique name for this pipeline branch

    XML representation
    <RoundRobinProcess name="" ></RoundRobinProcess>
  • SetFieldValue com.exalead.mercury.mami.connect.v10.SetFieldValue
    Sets the value of a field
    Parent objects
    • BasicPipelineBranch
    • MultithreadedDispatchBranch
    • PipelineBranch
    Attributes
    • name string
    • field string
    • value string
    Name Type Default value Description
    name string - Unique name for this pipeline branch
    field string - Field name
    value string - Value to set the field to

    XML representation
    <SetFieldValue name="" field="" value="" ></SetFieldValue>
  • RenameField com.exalead.mercury.mami.connect.v10.RenameField
    Renames a field
    Parent objects
    • BasicPipelineBranch
    • MultithreadedDispatchBranch
    • PipelineBranch
    Attributes
    • name string
    • origName string
    • newName string
    Name Type Default value Description
    name string - Unique name for this pipeline branch
    origName string - Original name of field.
    newName string - New name of field.

    XML representation
    <RenameField name="" origName="" newName="" ></RenameField>
  • DeleteFields com.exalead.mercury.mami.connect.v10.DeleteFields
    Deletes a set of fields
    Parent objects
    • BasicPipelineBranch
    • MultithreadedDispatchBranch
    • PipelineBranch
    Attributes
    • name string
    • fields string
    Name Type Default value Description
    name string - Unique name for this pipeline branch
    fields string - Comma-separated list of fields to remove.

    XML representation
    <DeleteFields name="" fields="" ></DeleteFields>
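
    As an illustration of the three field transforms above, the fragment below sets one field, renames another, and deletes two more. It is only a sketch: every name, field, and value is a hypothetical placeholder, and the enclosing pipeline branch is omitted.

    ```xml
    <!-- All names, fields, and values are hypothetical placeholders -->
    <SetFieldValue name="set-origin" field="origin" value="intranet" ></SetFieldValue>
    <RenameField name="rename-title" origName="doc_title" newName="title" ></RenameField>
    <!-- fields takes a comma-separated list -->
    <DeleteFields name="drop-temp" fields="tmp_a,tmp_b" ></DeleteFields>
    ```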
  • CustomTransform com.exalead.mercury.mami.connect.v10.CustomTransform
    Custom transformation
    Parent objects
    • BasicPipelineBranch
    • MultithreadedDispatchBranch
    • PipelineBranch
    Attributes
    • name string
    • classId string
    Name Type Default value Description
    name string - Unique name for this pipeline branch
    classId string -

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* -
    XML representation
    <CustomTransform name="" classId="" ></CustomTransform>
  • PushConfig com.exalead.mercury.mami.connect.v10.PushConfig
    Specifies how documents are sent to the indexing server.
    Parent objects
    • Connector
    Attributes
    • buffer boolean
    • triggerIndexingAfterScan boolean
    • profilePushOperations boolean
    • automaticRestartOnFailure boolean
    Name Type Default value Description
    buffer boolean True Enables buffering of documents in the connector to send them as batches to the indexing server.
    triggerIndexingAfterScan boolean False Always triggers an indexing job after a completed scan.
    profilePushOperations boolean False Enables profiling of push operations.

    The profiling can be costly on small documents.
    automaticRestartOnFailure boolean True Whether this connector automatically restarts when a scan throws a PushAPIException.
    This allows the connector to retry a scan operation when a component in the indexing chain crashes.
    Set this option to false to disable the retry behavior.

    XML representation
    <PushConfig buffer="true" triggerIndexingAfterScan="false" profilePushOperations="false" automaticRestartOnFailure="true" ></PushConfig>
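
    For example, a connector that should trigger an indexing job after each completed scan, and should not retry a scan after a PushAPIException, might be configured as sketched below. The combination of values is illustrative, not a recommendation.

    ```xml
    <!-- Buffering stays on (default); indexing is triggered after each scan;
         the automatic retry on PushAPIException is disabled -->
    <PushConfig buffer="true" triggerIndexingAfterScan="true" profilePushOperations="false" automaticRestartOnFailure="false" ></PushConfig>
    ```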
  • SourceCachingConfig com.exalead.mercury.mami.connect.v10.SourceCachingConfig
    Defines how source documents are put in the document cache
    Parent objects
    • Connector
    Attributes
    • storeInDocumentCache boolean
    • minSizeForCachingB long
    • maxSizeForCachingB long
    Name Type Default value Description
    storeInDocumentCache boolean True
    minSizeForCachingB long -
    maxSizeForCachingB long -

    XML representation
    <SourceCachingConfig storeInDocumentCache="true" minSizeForCachingB="" maxSizeForCachingB="" ></SourceCachingConfig>
  • SourceFetchConfig com.exalead.mercury.mami.connect.v10.SourceFetchConfig
    Defines how source documents are "fetched" for download, preview and thumbnails
    Parent objects
    • Connector
    Attributes
    • allowRawDocumentFetch boolean
    • customFetcherClass string
    • customFetcherUrl string
    • fetchProtocol string
    Name Type Default value Description
    allowRawDocumentFetch boolean True
    customFetcherClass string -
    customFetcherUrl string - Base URL used for retrieving documents from this connector.

    This is used for preview, thumbnails and raw fetch from the search results.

    For unmanaged connectors, this is always used.

    For managed connectors, if this parameter is given, it completely replaces the retriever within the connectors server.
    fetchProtocol string - Protocol implemented by the customFetcherUrl.
    One of v1, v2, rpv3, or networkRetriever.

    XML representation
    <SourceFetchConfig allowRawDocumentFetch="true" customFetcherClass="" customFetcherUrl="" fetchProtocol="" ></SourceFetchConfig>
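
    A minimal sketch of a connector whose documents are retrieved through a custom fetcher URL is shown below. The URL is a hypothetical placeholder, and v2 is simply one of the protocol values listed above, chosen for illustration.

    ```xml
    <!-- The URL is a hypothetical placeholder; fetchProtocol must match what that endpoint implements -->
    <SourceFetchConfig allowRawDocumentFetch="true" customFetcherUrl="http://fetcher.example.com:8080/fetch" fetchProtocol="v2" ></SourceFetchConfig>
    ```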
  • SourcePreviewConfig com.exalead.mercury.mami.connect.v10.SourcePreviewConfig
    Defines the configuration for image and HTML preview of the documents of a source.
    Parent objects
    • Connector
    Attributes
    • allowHTMLPreview boolean
    • allowImagePreview boolean
    Name Type Default value Description
    allowHTMLPreview boolean True
    allowImagePreview boolean True

    XML representation
    <SourcePreviewConfig allowHTMLPreview="true" allowImagePreview="true" ></SourcePreviewConfig>
  • SourceThumbnailsConfig com.exalead.mercury.mami.connect.v10.SourceThumbnailsConfig
    No documentation for this element.
    Parent objects
    • Connector
    Attributes
    • allowThumbnails boolean
    • precomputeThumbnails boolean
    • precomputedThumbnailsHeight int
    • precomputedThumbnailsWidth int
    • homePageOnly boolean
    Name Type Default value Description
    allowThumbnails boolean True
    precomputeThumbnails boolean False
    precomputedThumbnailsHeight int 120
    precomputedThumbnailsWidth int 120
    homePageOnly boolean False When crawling websites, compute and generate thumbnails for home pages only.

    XML representation
    <SourceThumbnailsConfig allowThumbnails="true" precomputeThumbnails="false" precomputedThumbnailsHeight="120" precomputedThumbnailsWidth="120" homePageOnly="false" ></SourceThumbnailsConfig>
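
    As a sketch, a web source that precomputes thumbnails for home pages only could look like the fragment below. The width value is an arbitrary illustration; the reference does not state the unit of the height and width attributes.

    ```xml
    <!-- Precompute thumbnails, but only for home pages; 160 is an arbitrary example width -->
    <SourceThumbnailsConfig allowThumbnails="true" precomputeThumbnails="true" precomputedThumbnailsHeight="120" precomputedThumbnailsWidth="160" homePageOnly="true" ></SourceThumbnailsConfig>
    ```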
  • CrawlConfig com.exalead.mercury.mami.crawl.v21.CrawlConfig
    The crawl configuration.
    Attributes
    • version long
    • verbose boolean
    Name Type Default value Description
    version long - -
    verbose boolean False

    Nested elements
    • ICrawler*
    Type Wrapper Description
    ICrawler* -
    XML representation
    <CrawlConfig version="" verbose="false" ></CrawlConfig>
  • Crawler com.exalead.mercury.mami.crawl.v21.Crawler
    A crawler configuration.
    A crawler may contain a CrawlSchedulerConfig to override the default FIFO priorities.
    A crawler may contain a CustomCrawlConfig to enable custom processors.
    Parent objects
    • CrawlConfig
    Attributes
    • name string
    • documentsType string
    • fetcher string
    • crawlerServer string
    • connectorServer string
    • buildGroup string
    • dataModel string
    • storeTextOnly boolean
    • nthreads int
    • aggressive boolean
    • throttleTimeMS int
    • ignoreRobotsTxt boolean
    • enableConvertProcessor boolean
    • nearDuplicateDetector boolean
    • patternsDetector boolean
    • crawlSitemaps boolean
    • disableConditionalGet boolean
    • defaultAccept boolean
    • defaultIndex boolean
    • defaultFollow boolean
    • defaultFollowRoots boolean
    • enableSimpleSiteCollapsing boolean
    • simpleSiteCollapsingDepth int
    • mimeTypesMode string
    • smartRefresh boolean
    • smartRefreshMinAgeS int
    • smartRefreshMaxAgeS int
    • archiveDocuments boolean
    • enableConsolidation boolean
    Name Type Default value Description
    name string - The crawler name. It must be unique across all crawlers.
    documentsType string - The type of documents pushed by this connector.
    The type of documents must match one of the types declared in your CloudView license file.
    fetcher string - Which fetcher to use.
    crawlerServer string - Crawler server hosting this crawler. See Deployment configuration.
    connectorServer string - Connector server hosting the indexing part of this crawler. See Deployment configuration.
    buildGroup string - Target build group.
    dataModel string - The default data model for documents indexed by this crawler.
    storeTextOnly boolean True Whether to store original binary documents, or only converted text.
    nthreads int 1 The number of crawl threads. Must be strictly positive.
    aggressive boolean False Whether to enable aggressive crawling, which never sleeps between two requests to the same host.
    throttleTimeMS int 2500 In the case of non-aggressive crawl, this defines the sleep interval between requests to the same host.
    ignoreRobotsTxt boolean False Whether to ignore robots.txt rules. Not recommended.
    enableConvertProcessor boolean True Whether to enable the remoteconvert-based processor for extracting links from binary documents.
    nearDuplicateDetector boolean True Whether to enable the near-duplicate content detector.
    patternsDetector boolean True Whether to enable patterns detection in pages.
    crawlSitemaps boolean True Whether to crawl sitemaps.
    disableConditionalGet boolean False Whether to always fetch documents, even if the server reports that they have not changed.
    defaultAccept boolean False Whether to crawl a url by default when it matches no other accept rule.
    defaultIndex boolean False Whether to index by default when a url matches no index rule.
    defaultFollow boolean False Whether to follow by default when a url matches no follow rule.
    defaultFollowRoots boolean True Whether to automatically follow root urls.
    enableSimpleSiteCollapsing boolean True Whether to generate a site ID suitable for document collapsing.
    simpleSiteCollapsingDepth int 0 How many path segments to use to generate the site collapsing ID.
    mimeTypesMode string exclude Whether the mimeTypes list is used as a white list or a black list.
    smartRefresh boolean True Whether to crawl a fraction of refreshed urls.
    smartRefreshMinAgeS int 3600 Age in seconds at which we may refresh old urls.
    smartRefreshMaxAgeS int 604800 Age in seconds at which we force the refresh of old urls.
    archiveDocuments boolean False When enabled, deleted documents are not deleted, but kept with their deletion date.
    enableConsolidation boolean True Whether to use a consolidation PAPI instead of a standard PAPI.

    Nested elements
    • mimeTypes StringConstantValue*
    • sessionIdBlacklist StringConstantValue*
    • PushAPIFilter KeyValue*
    • roots Root*
    • rootsets RootSet*
    • CrawlSchedulerConfig
    • CustomCrawlConfig
    • Rules*
    • UrlTesterData
    Type Wrapper Description
    StringConstantValue* mimeTypes
    StringConstantValue* sessionIdBlacklist SessionId blacklist. These parameters are removed from URLs with a path or query part containing them.
    KeyValue* PushAPIFilter
    Root* roots A list of root urls to start the crawl from.
    RootSet* rootsets A list of files to load urls/sites from.
    CrawlSchedulerConfig -
    CustomCrawlConfig -
    Rules* -
    UrlTesterData -
    XML representation
    <Crawler name="" documentsType="" fetcher="" crawlerServer="" connectorServer="" buildGroup="" dataModel="" storeTextOnly="true" nthreads="1" aggressive="false" throttleTimeMS="2500" ignoreRobotsTxt="false" enableConvertProcessor="true" nearDuplicateDetector="true" patternsDetector="true" crawlSitemaps="true" disableConditionalGet="false" defaultAccept="false" defaultIndex="false" defaultFollow="false" defaultFollowRoots="true" enableSimpleSiteCollapsing="true" simpleSiteCollapsingDepth="0" mimeTypesMode="exclude" smartRefresh="true" smartRefreshMinAgeS="3600" smartRefreshMaxAgeS="604800" archiveDocuments="false" enableConsolidation="true" ></Crawler>
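
    To show how the throttling and refresh attributes fit together, here is a sketch of a non-aggressive crawler that uses four threads, waits five seconds between requests to the same host, and forces a refresh of urls older than one day. The crawler name is hypothetical; attributes left out (fetcher, servers, build group, and so on) are omitted for brevity and assumed to keep their defaults or be set elsewhere.

    ```xml
    <CrawlConfig verbose="false" >
      <!-- "intranet" is a hypothetical crawler name -->
      <!-- throttleTimeMS only applies because aggressive is false -->
      <!-- smartRefreshMaxAgeS="86400" forces a refresh after 24 hours -->
      <Crawler name="intranet" nthreads="4" aggressive="false" throttleTimeMS="5000" ignoreRobotsTxt="false" smartRefresh="true" smartRefreshMinAgeS="3600" smartRefreshMaxAgeS="86400" ></Crawler>
    </CrawlConfig>
    ```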
  • CrawlSchedulerConfig com.exalead.mercury.mami.crawl.v21.CrawlSchedulerConfig
    Scheduler configuration.
    Use at your own risk.
    Parent objects
    • Crawler
    • FeedFetcher
    • ICrawler
    Attributes
    • priority0 int
    • priority1 int
    • priority2 int
    • priority3 int
    • priority4 int
    • refreshPriority int
    Name Type Default value Description
    priority0 int 100000 FIFO priorities. By default, only user-submitted urls.
    priority1 int 10000 By default, only redirected urls.
    priority2 int 1000 By default, only indexed and not followed urls.
    priority3 int 100 By default, only indexed and followed urls.
    priority4 int 10 By default, only followed and not indexed urls.
    refreshPriority int 1 Refresh source.

    XML representation
    <CrawlSchedulerConfig priority0="100000" priority1="10000" priority2="1000" priority3="100" priority4="10" refreshPriority="1" ></CrawlSchedulerConfig>
  • CustomCrawlConfig com.exalead.mercury.mami.crawl.v21.CustomCrawlConfig
    Custom processors specification.
    Parent objects
    • Crawler
    • FeedFetcher
    • ICrawler
    Attributes
    • preProcessorClassId string
    • fetcherClassId string
    • processorClassId string
    • htmlProcessorClassId string
    • linksFilterClassId string
    • postProcessorClassId string
    • crawlerTemplate string
    Name Type Default value Description
    preProcessorClassId string - Custom PreProcessor.
    Called at the end of the preprocess pipe.
    fetcherClassId string - Custom Fetcher.
    processorClassId string - Custom Processor.
    Called at the end of the process pipe. Catches all mime types.
    htmlProcessorClassId string - Custom HTML Processor.
    Called at the end of the HTML process pipe. Catches only HTML documents.
    linksFilterClassId string - Custom LinksFilter.
    Called at the end of the links filter list. Can decide whether to crawl an outgoing link.
    postProcessorClassId string - Custom PostProcessor.
    Called at the end of the postprocess pipe.
    crawlerTemplate string - Alternatively, specify the url of an XML file describing the whole crawler.

    XML representation
    <CustomCrawlConfig preProcessorClassId="" fetcherClassId="" processorClassId="" htmlProcessorClassId="" linksFilterClassId="" postProcessorClassId="" crawlerTemplate="" ></CustomCrawlConfig>
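
    As a sketch, plugging in only a custom links filter could look like the fragment below. The class id is a hypothetical placeholder, and the other processor attributes are left out on the assumption that unset processors are simply not used.

    ```xml
    <!-- The class id is a hypothetical placeholder for a custom LinksFilter implementation -->
    <CustomCrawlConfig linksFilterClassId="com.example.crawl.MyLinksFilter" ></CustomCrawlConfig>
    ```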
  • Rules com.exalead.mercury.mami.crawl.v21.Rules
    A rule set identified by a key.
    Parent objects
    • Crawler
    • FeedFetcher
    • ICrawler
    Attributes
    • key string
    • group string
    Name Type Default value Description
    key string - How to interpret these rules.
    - auto, adminui: place index/follow/accept rules where relevant, automatically. May break complex rules that depend on other key-values.
    - pre, post, link, papi: place in the preprocessor, postprocessor, links filter, or PAPI filter only.
    group string default Key used to group rules and root urls.

    Nested elements
    • Rule*
    Type Wrapper Description
    Rule* -
    XML representation
    <Rules key="" group="default" ></Rules>
  • Rule com.exalead.mercury.mami.crawl.v21.Rule
    No documentation for this element.
    Parent objects
    • Rules
    Attributes
    • message string
    Name Type Default value Description
    message string -

    Nested elements
    • Action*
    • Pattern*
    • PostFilter*
    Type Wrapper Description
    Action* -
    Pattern* -
    PostFilter* -
    XML representation
    <Rule message="" ></Rule>
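
    The fragment below sketches how a rule set, a rule, a pattern, and actions might nest, based on the nested-element lists above: a Rule holds Pattern and Action elements, and the Path, Index, and Follow elements described later in this reference are used here as one pattern and two actions. The path value and message text are hypothetical.

    ```xml
    <Rules key="auto" group="default" >
      <!-- Index and follow urls whose path starts with /docs (hypothetical path) -->
      <Rule message="documentation pages" >
        <Path val="/docs" ></Path>
        <Index ></Index>
        <Follow ></Follow>
      </Rule>
    </Rules>
    ```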
  • ActionSetKV com.exalead.actionrules.v21.ActionSetKV
    No documentation for this element.
    Parent objects
    • Rule
    Attributes
    • key string
    • value string
    Name Type Default value Description
    key string -
    value string -

    XML representation
    <ActionSetKV key="" value="" ></ActionSetKV>
  • ActionAppendKV com.exalead.actionrules.v21.ActionAppendKV
    No documentation for this element.
    Parent objects
    • Rule
    Attributes
    • key string
    • value string
    Name Type Default value Description
    key string -
    value string -

    XML representation
    <ActionAppendKV key="" value="" ></ActionAppendKV>
  • ActionSetNoIndex com.exalead.actionrules.v21.ActionSetNoIndex
    No documentation for this element.
    XML representation
    <ActionSetNoIndex ></ActionSetNoIndex>
  • ActionSetNoFollow com.exalead.actionrules.v21.ActionSetNoFollow
    No documentation for this element.
    XML representation
    <ActionSetNoFollow ></ActionSetNoFollow>
  • ActionSetIgnore com.exalead.actionrules.v21.ActionSetIgnore
    No documentation for this element.
    XML representation
    <ActionSetIgnore ></ActionSetIgnore>
  • ActionUrlDeleteQueryArg com.exalead.actionrules.v21.ActionUrlDeleteQueryArg
    No documentation for this element.
    Parent objects
    • Rule
    Attributes
    • token string
    • caseSensitive boolean
    Name Type Default value Description
    token string -
    caseSensitive boolean True

    XML representation
    <ActionUrlDeleteQueryArg token="" caseSensitive="True" ></ActionUrlDeleteQueryArg>
  • ActionUrlDeleteQuerySessionId com.exalead.actionrules.v21.ActionUrlDeleteQuerySessionId
    No documentation for this element.
    Parent objects
    • Rule
    Attributes
    • token string
    • caseSensitive boolean
    • strict boolean
    Name Type Default value Description
    token string -
    caseSensitive boolean True
    strict boolean False

    XML representation
    <ActionUrlDeleteQuerySessionId token="" caseSensitive="True" strict="False" ></ActionUrlDeleteQuerySessionId>
  • ActionUrlDeletePathToken com.exalead.actionrules.v21.ActionUrlDeletePathToken
    No documentation for this element.
    Parent objects
    • Rule
    Attributes
    • token string
    • caseSensitive boolean
    Name Type Default value Description
    token string -
    caseSensitive boolean True

    XML representation
    <ActionUrlDeletePathToken token="" caseSensitive="True" ></ActionUrlDeletePathToken>
  • ActionUrlDeletePathSessionId com.exalead.actionrules.v21.ActionUrlDeletePathSessionId
    No documentation for this element.
    Parent objects
    • Rule
    Attributes
    • token string
    • caseSensitive boolean
    Name Type Default value Description
    token string -
    caseSensitive boolean True

    XML representation
    <ActionUrlDeletePathSessionId token="" caseSensitive="True" ></ActionUrlDeletePathSessionId>
  • ActionUrlRegexReplace com.exalead.actionrules.v21.ActionUrlRegexReplace
    No documentation for this element.
    Parent objects
    • Rule
    Attributes
    • input string
    • output string
    • field string
    Name Type Default value Description
    input string -
    output string -
    field string -

    XML representation
    <ActionUrlRegexReplace input="" output="" field="" ></ActionUrlRegexReplace>
  • ActionUrlAddQueryArg com.exalead.actionrules.v21.ActionUrlAddQueryArg
    No documentation for this element.
    Parent objects
    • Rule
    Attributes
    • token string
    • value string
    Name Type Default value Description
    token string -
    value string -

    XML representation
    <ActionUrlAddQueryArg token="" value="" ></ActionUrlAddQueryArg>
  • ActionSetRepetitiveTokens com.exalead.actionrules.v21.ActionSetRepetitiveTokens
    No documentation for this element.
    Parent objects
    • Rule
    Attributes
    • keyName string
    • numerical boolean
    Name Type Default value Description
    keyName string crawlUrl.repetitive
    numerical boolean True

    XML representation
    <ActionSetRepetitiveTokens keyName="crawlUrl.repetitive" numerical="True" ></ActionSetRepetitiveTokens>
  • ActionUrlCapture com.exalead.actionrules.v21.ActionUrlCapture
    No documentation for this element.
    Parent objects
    • Rule
    Attributes
    • token string
    • field string
    • keyNamePrefix string
    Name Type Default value Description
    token string -
    field string -
    keyNamePrefix string crawlUrl.capture

    XML representation
    <ActionUrlCapture token="" field="" keyNamePrefix="crawlUrl.capture" ></ActionUrlCapture>
  • Accept com.exalead.mercury.mami.crawl.v21.Accept
    Actions on urls.
    XML representation
    <Accept ></Accept>
  • Ignore com.exalead.mercury.mami.crawl.v21.Ignore
    No documentation for this element.
    XML representation
    <Ignore ></Ignore>
  • Index com.exalead.mercury.mami.crawl.v21.Index
    No documentation for this element.
    XML representation
    <Index ></Index>
  • NoIndex com.exalead.mercury.mami.crawl.v21.NoIndex
    No documentation for this element.
    XML representation
    <NoIndex ></NoIndex>
  • Follow com.exalead.mercury.mami.crawl.v21.Follow
    No documentation for this element.
    XML representation
    <Follow ></Follow>
  • NoFollow com.exalead.mercury.mami.crawl.v21.NoFollow
    No documentation for this element.
    XML representation
    <NoFollow ></NoFollow>
  • AddMeta com.exalead.mercury.mami.crawl.v21.AddMeta
    Add a meta on a url that will be pushed to the PAPI.
    Parent objects
    • Rule
    Attributes
    • name string
    • value string
    Name Type Default value Description
    name string -
    value string -

    XML representation
    <AddMeta name="" value="" ></AddMeta>
  • Source com.exalead.mercury.mami.crawl.v21.Source
    Index matching urls in a different source.
    Parent objects
    • Rule
    Attributes
    • name string
    Name Type Default value Description
    name string -

    XML representation
    <Source name="" ></Source>
  • DataModelClass com.exalead.mercury.mami.crawl.v21.DataModelClass
    Change the datamodel class of matching urls.
    Parent objects
    • Rule
    Attributes
    • name string
    Name Type Default value Description
    name string -

    XML representation
    <DataModelClass name="" ></DataModelClass>
  • Priority com.exalead.mercury.mami.crawl.v21.Priority
    Shift the priority of urls.
    Parent objects
    • Rule
    Attributes
    • shift int
    Name Type Default value Description
    shift int 0 Set a negative number to crawl faster, or a positive number to crawl slower. For example, shift = -1 moves the url to the next higher priority FIFO.

    XML representation
    <Priority shift="0" ></Priority>
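
    As an illustration of the rule-level elements above, the following sketch adds a meta to matching urls and moves them one FIFO up in priority. The path, meta name, and meta value are hypothetical, and the rule is assumed to sit inside a Rules element.

    ```xml
    <Rule message="news pages" >
      <!-- Hypothetical path prefix -->
      <Path val="/news" ></Path>
      <!-- Meta pushed to the PAPI with each matching url (hypothetical name and value) -->
      <AddMeta name="section" value="news" ></AddMeta>
      <!-- A negative shift crawls faster: next higher priority FIFO -->
      <Priority shift="-1" ></Priority>
    </Rule>
    ```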
  • Or com.exalead.actionrules.v21.Or
    No documentation for this element.
    Nested elements
    • Pattern*
    Type Wrapper Description
    Pattern* -
    XML representation
    <Or ></Or>
  • And com.exalead.actionrules.v21.And
    No documentation for this element.
    Nested elements
    • Pattern*
    Type Wrapper Description
    Pattern* -
    XML representation
    <And ></And>
  • Not com.exalead.actionrules.v21.Not
    No documentation for this element.
    Nested elements
    • Pattern
    Type Wrapper Description
    Pattern -
    XML representation
    <Not ></Not>
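
    The Or, And, and Not elements are undocumented above, so the following is only a sketch of how they presumably compose patterns, using the Path and Ext shortcuts described later in this reference. It is meant to match urls under a hypothetical /archive path that do not end in .zip, and to index them.

    ```xml
    <Rule message="archive pages except zip files" >
      <And >
        <Path val="/archive" ></Path>
        <!-- Not wraps a single pattern -->
        <Not >
          <Ext val=".zip" ></Ext>
        </Not>
      </And>
      <Index ></Index>
    </Rule>
    ```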
  • Atom com.exalead.actionrules.v21.Atom
    Raw patterns
    Applies on the main url unless specified otherwise.
    Parent objects
    • And
    • Config
    • Not
    • Or
    • Rule
    Attributes
    • field string
    • kind string
    • norm string
    • value string
    • matchedUrl string
    • litteral boolean
    Name Type Default value Description
    field string - The field on which the pattern is applied.

    A field may be the whole url or a part of it (url, scheme, host, port, path, query).
    kind string - Specifies the semantics of the value attribute:
    - "length": specify the length of a field ([:10], [11:12], [30:])
    - "exact", "prefix", "suffix", "inside": specify a regexp and its anchoring
    Warning: for readability, the regexp escaping policy is reversed.
    Special characters must be backslash-escaped to keep their regexp meaning: write "www.\.\*.tv" instead of "www\..*\.tv". The backslash must itself be escaped in C-style strings; elsewhere, in XML for example, a single backslash is enough.
    norm string none Specifies the normalization level (default is case-insensitive match). Values: norm, lower, or none.
    value string - value (regexp)
    matchedUrl string - If not empty, this rule applies on the url provided with the name matchedUrl, instead of the main url.
    litteral boolean True

    XML representation
    <Atom field="" kind="" norm="none" value="" matchedUrl="" litteral="True" ></Atom>
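
    The reversed escaping policy is easier to see in a concrete fragment. The sketch below reuses the pattern from the kind description for a host match, and adds a length atom using the range notation shown there. The litteral and matchedUrl attributes are left out because their interaction with regexp values is not documented here.

    ```xml
    <!-- Reversed escaping: a plain "." is a literal dot, while "\." and "\*" carry
         their regexp meaning. In standard notation this would read www\..*\.tv -->
    <Atom field="host" kind="exact" norm="none" value="www.\.\*.tv" ></Atom>

    <!-- Length test on the path, using the range notation from the kind description -->
    <Atom field="path" kind="length" value="[30:]" ></Atom>
    ```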
  • Url com.exalead.actionrules.v21.Url
    shortcut for url-exact
    Parent objects
    • And
    • Config
    • Not
    • Or
    • Rule
    Attributes
    • val string
    • norm string
    • matchedUrl string
    • litteral boolean
    Name Type Default value Description
    val string -
    norm string norm
    matchedUrl string -
    litteral boolean True

    XML representation
    <Url val="" norm="norm" matchedUrl="" litteral="True" ></Url>
  • Scheme com.exalead.actionrules.v21.Scheme
    shortcut for scheme-exact
    Parent objects
    • And
    • Config
    • Not
    • Or
    • Rule
    Attributes
    • val string
    • norm string
    • matchedUrl string
    • litteral boolean
    Name Type Default value Description
    val string -
    norm string norm
    matchedUrl string -
    litteral boolean True

    XML representation
    <Scheme val="" norm="norm" matchedUrl="" litteral="True" ></Scheme>
  • Host com.exalead.actionrules.v21.Host
    No documentation for this element.
    Parent objects
    • And
    • Config
    • Not
    • Or
    • Rule
    Attributes
    • val string
    • norm string
    • matchedUrl string
    • litteral boolean
    Name Type Default value Description
    val string -
    norm string norm
    matchedUrl string -
    litteral boolean True

    XML representation
    <Host val="" norm="norm" matchedUrl="" litteral="True" ></Host>
  • Domain com.exalead.actionrules.v21.Domain
    No documentation for this element.
    Parent objects
    • And
    • Config
    • Not
    • Or
    • Rule
    Attributes
    • val string
    • norm string
    • matchedUrl string
    • litteral boolean
    Name Type Default value Description
    val string -
    norm string norm
    matchedUrl string -
    litteral boolean True

    XML representation
    <Domain val="" norm="norm" matchedUrl="" litteral="True" ></Domain>
  • Port com.exalead.actionrules.v21.Port
    shortcut for port-exact
    Parent objects
    • And
    • Config
    • Not
    • Or
    • Rule
    Attributes
    • val string
    • norm string
    • matchedUrl string
    • litteral boolean
    Name Type Default value Description
    val string -
    norm string norm
    matchedUrl string -
    litteral boolean True

    XML representation
    <Port val="" norm="norm" matchedUrl="" litteral="True" ></Port>
  • Path com.exalead.actionrules.v21.Path
    shortcut for path-prefix
    Parent objects
    • And
    • Config
    • Not
    • Or
    • Rule
    Attributes
    • val string
    • norm string
    • matchedUrl string
    • litteral boolean
    Name Type Default value Description
    val string -
    norm string norm
    matchedUrl string -
    litteral boolean True

    XML representation
    <Path val="" norm="norm" matchedUrl="" litteral="True" ></Path>
  • Ext com.exalead.actionrules.v21.Ext
    shortcut for path-suffix
    Parent objects
    • And
    • Config
    • Not
    • Or
    • Rule
    Attributes
    • val string
    • norm string
    • matchedUrl string
    • litteral boolean
    Name Type Default value Description
    val string -
    norm string norm
    matchedUrl string -
    litteral boolean True

    XML representation
    <Ext val="" norm="norm" matchedUrl="" litteral="True" ></Ext>
  • Query com.exalead.actionrules.v21.Query
    shortcut for query-exact
    Parent objects
    • And
    • Config
    • Not
    • Or
    • Rule
    Attributes
    • val string
    • norm string
    • matchedUrl string
    • litteral boolean
    Name Type Default value Description
    val string -
    norm string norm
    matchedUrl string -
    litteral boolean True

    XML representation
    <Query val="" norm="norm" matchedUrl="" litteral="True" ></Query>
  • QueryArg com.exalead.actionrules.v21.QueryArg
    shortcut for query ?tok= or &tok=
    Parent objects
    • And
    • Config
    • Not
    • Or
    • Rule
    Attributes
    • val string
    • norm string
    • matchedUrl string
    • litteral boolean
    Name Type Default value Description
    val string -
    norm string norm
    matchedUrl string -
    litteral boolean True

    XML representation
    <QueryArg val="" norm="norm" matchedUrl="" litteral="True" ></QueryArg>
  • InQuery com.exalead.actionrules.v21.InQuery
    shortcut for query-inside
    Parent objects
    • And
    • Config
    • Not
    • Or
    • Rule
    Attributes
    • val string
    • norm string
    • matchedUrl string
    • litteral boolean
    Name Type Default value Description
    val string -
    norm string norm
    matchedUrl string -
    litteral boolean True

    XML representation
    <InQuery val="" norm="norm" matchedUrl="" litteral="True" ></InQuery>
  • InPath com.exalead.actionrules.v21.InPath
    shortcut for path-inside
    Parent objects
    • And
    • Config
    • Not
    • Or
    • Rule
    Attributes
    • val string
    • norm string
    • matchedUrl string
    • litteral boolean
    Name Type Default value Description
    val string -
    norm string norm
    matchedUrl string -
    litteral boolean True

    XML representation
    <InPath val="" norm="norm" matchedUrl="" litteral="True" ></InPath>
  • Length com.exalead.actionrules.v21.Length
    shortcut for field-length
    Parent objects
    • And
    • Config
    • Not
    • Or
    • Rule
    Attributes
    • val string
    • norm string
    • matchedUrl string
    • litteral boolean
    • field string
    Name Type Default value Description
    val string -
    norm string norm
    matchedUrl string -
    litteral boolean True
    field string -

    XML representation
    <Length val="" norm="norm" matchedUrl="" litteral="True" field="" ></Length>
  • ExternKey com.exalead.actionrules.v21.ExternKey
    Match on an extern key, not the default url
    Parent objects
    • And
    • Config
    • Not
    • Or
    • Rule
    Attributes
    • val string
    • norm string
    • matchedUrl string
    • litteral boolean
    • key string
    Name Type Default value Description
    val string -
    norm string norm
    matchedUrl string -
    litteral boolean True
    key string -

    XML representation
    <ExternKey val="" norm="norm" matchedUrl="" litteral="True" key="" ></ExternKey>
  • Num com.exalead.actionrules.v21.Num
    Test the numerical value of an extern key. Supported operators: <, <=, =, >, >=.
    Parent objects
    • And
    • Config
    • Not
    • Or
    • Rule
    Attributes
    • key string
    • val string
    • norm string
    • matchedUrl string
    • litteral boolean
    Name Type Default value Description
    key string -
    val string -
    norm string norm
    matchedUrl string -
    litteral boolean True

    XML representation
    <Num key="" val="" norm="norm" matchedUrl="" litteral="True" ></Num>
  • ExternKeyPrefix com.exalead.actionrules.v21.ExternKeyPrefix
    Match the prefix of an extern key
    Parent objects
    • And
    • Config
    • Not
    • Or
    • Rule
    Attributes
    • val string
    • norm string
    • matchedUrl string
    • litteral boolean
    • key string
    Name Type Default value Description
    val string -
    norm string norm
    matchedUrl string -
    litteral boolean True
    key string -

    XML representation
    <ExternKeyPrefix val="" norm="norm" matchedUrl="" litteral="True" key="" ></ExternKeyPrefix>
  • ExternKeyInside com.exalead.actionrules.v21.ExternKeyInside
    Match inside a list of extern keys
    Parent objects
    • And
    • Config
    • Not
    • Or
    • Rule
    Attributes
    • val string
    • norm string
    • matchedUrl string
    • litteral boolean
    • key string
    Name Type Default value Description
    val string -
    norm string norm
    matchedUrl string -
    litteral boolean True
    key string -

    XML representation
    <ExternKeyInside val="" norm="norm" matchedUrl="" litteral="True" key="" ></ExternKeyInside>
  • PostFilterProba com.exalead.actionrules.v21.PostFilterProba
    Randomly returns true or false. The value attribute is the probability of returning true.
    Parent objects
    • Rule
    Attributes
    • value float
    Name Type Default value Description
    value float -

    XML representation
    <PostFilterProba value="" ></PostFilterProba>
  • PostFilterRepetitiveTokens com.exalead.actionrules.v21.PostFilterRepetitiveTokens
    Filter tested after the pattern has matched.
    A postfilter applies filtering that cannot be expressed by rules.
    Parent objects
    • Rule
    Attributes
    • crossLevel string
    • numerical boolean
    Name Type Default value Description
    crossLevel string -
    numerical boolean True

    XML representation
    <PostFilterRepetitiveTokens crossLevel="" numerical="True" ></PostFilterRepetitiveTokens>
  • InferredDatePostFilter com.exalead.actionrules.v21.InferredDatePostFilter
    Filter tested after the pattern has matched.
    A postfilter applies filtering that cannot be expressed by rules.
    Parent objects
    • Rule
    Attributes
    • searchDateInPath boolean
    • searchDateInQuery boolean
    • maxYearsBefore int
    • maxYearsAfter int
    • maxMonthsBefore int
    • maxMonthsAfter int
    • maxDaysBefore int
    • maxDaysAfter int
    • searchDateFormats string
    • matchIfOutsideRange boolean
    • matchIfNoDate boolean
    Name Type Default value Description
    searchDateInPath boolean True
    searchDateInQuery boolean True
    maxYearsBefore int -1
    maxYearsAfter int -1
    maxMonthsBefore int -1
    maxMonthsAfter int -1
    maxDaysBefore int -1
    maxDaysAfter int -1
    searchDateFormats string -
    matchIfOutsideRange boolean False
    matchIfNoDate boolean False

    XML representation
    <InferredDatePostFilter searchDateInPath="True" searchDateInQuery="True" maxYearsBefore="-1" maxYearsAfter="-1" maxMonthsBefore="-1" maxMonthsAfter="-1" maxDaysBefore="-1" maxDaysAfter="-1" searchDateFormats="" matchIfOutsideRange="False" matchIfNoDate="False" ></InferredDatePostFilter>
  • CustomPostFilter com.exalead.actionrules.v21.CustomPostFilter
    Filter tested after the pattern has matched.
    A postfilter applies filtering that cannot be expressed by rules.
    Parent objects
    • Rule
    Attributes
    • classId string
    Name Type Default value Description
    classId string - The specified class must implement the com.exalead.actionrules.CustomPostFilter Exascript interface.

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* -
    XML representation
    <CustomPostFilter classId="" ></CustomPostFilter>
  • UrlTesterData com.exalead.mercury.mami.crawl.v21.UrlTesterData
    No documentation for this element.
    Nested elements
    • urls UrlTestConfig*
    Type Wrapper Description
    UrlTestConfig* urls
    XML representation
    <UrlTesterData ></UrlTesterData>
  • UrlTestConfig com.exalead.mercury.mami.crawl.v21.UrlTestConfig
    No documentation for this element.
    Parent objects
    • urls UrlTesterData
    Attributes
    • url string
    • group string
    • enableAdvancedMode boolean
    • accept boolean
    • index boolean
    • follow boolean
    Name Type Default value Description
    url string -
    group string default
    enableAdvancedMode boolean False
    accept boolean False
    index boolean False
    follow boolean False

    XML representation
    <UrlTestConfig url="" group="default" enableAdvancedMode="false" accept="false" index="false" follow="false" ></UrlTestConfig>
  • Root com.exalead.mercury.mami.crawl.v21.Root
    A crawl root.
    Note: there is a 4KB limit on the whole url + metas storage.
    Attributes
    • url string
    • site boolean
    • priority int
    • group string
    • kvs string
    Name Type Default value Description
    url string - The root url.
    site boolean True Enable site-mode: only crawl urls that belong to this 'site'.
    priority int 0 Priority shift. Increase or decrease priority. 0 means normal, -1 is higher priority, +1 lower.
    group string default Key used to group rules and root urls.
    kvs string - A semicolon-separated list of key-value pairs. Example: "key1=value1;key2=value2"

    XML representation
    <Root url="" site="true" priority="0" group="default" kvs="" ></Root>
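    As an illustrative sketch only, a root that keeps the crawl within one site, raises its priority, and carries two key-values through kvs could be written as follows. The URL, group name, and key-value pairs are hypothetical placeholders, not values taken from this reference.

    ```xml
    <!-- Hypothetical values: the URL, group and key-values are placeholders. -->
    <!-- priority="-1" raises the priority, as described in the table above. -->
    <Root url="http://www.example.com/docs/" site="true" priority="-1" group="docs" kvs="source=website;team=docs" ></Root>
    ```

    Keep the url and kvs values short: the 4KB limit noted above applies to the url and metas storage as a whole.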
  • RootSet com.exalead.mercury.mami.crawl.v21.RootSet
    A file from which to load a set of urls or sites to crawl.
    Parent objects
    • rootsets Crawler
    • rootsets FeedFetcher
    Attributes
    • url string
    • group string
    Name Type Default value Description
    url string -
    group string default Key used to group rules and root urls.

    XML representation
    <RootSet url="" group="default" ></RootSet>
  • FeedFetcher com.exalead.mercury.mami.crawl.v21.FeedFetcher
    A feedfetcher configuration.
    Parent objects
    • CrawlConfig
    Attributes
    • name string
    • documentsType string
    • fetcher string
    • crawlerServer string
    • connectorServer string
    • buildGroup string
    • dataModel string
    • storeTextOnly boolean
    • nthreads int
    • aggressive boolean
    • throttleTimeMS int
    • ignoreRobotsTxt boolean
    • enableConvertProcessor boolean
    • nearDuplicateDetector boolean
    • patternsDetector boolean
    • crawlSitemaps boolean
    • disableConditionalGet boolean
    • defaultAccept boolean
    • defaultIndex boolean
    • defaultFollow boolean
    • defaultFollowRoots boolean
    • enableSimpleSiteCollapsing boolean
    • simpleSiteCollapsingDepth int
    • mimeTypesMode string
    • smartRefresh boolean
    • smartRefreshMinAgeS int
    • smartRefreshMaxAgeS int
    • archiveDocuments boolean
    • enableConsolidation boolean
    • refreshDelayS int
    Name Type Default value Description
    name string - The crawler name. It must be unique across all crawlers.
    documentsType string - The type of documents pushed by this connector.
    The type of documents must match one of the types declared in your CloudView license file.
    fetcher string - Which fetcher to use.
    crawlerServer string - Crawler server hosting this crawler. See Deployment configuration.
    connectorServer string - Connector server hosting the indexing part of this crawler. See Deployment configuration.
    buildGroup string - Target build group.
    dataModel string - The default data model for documents indexed by this crawler.
    storeTextOnly boolean True Whether to store original binary documents, or only converted text.
    nthreads int 1 The number of crawl threads. Must be strictly positive.
    aggressive boolean False Whether to enable aggressive crawl, which never sleeps between two requests to the same host.
    throttleTimeMS int 2500 In the case of non-aggressive crawl, this defines the sleep interval between requests to the same host.
    ignoreRobotsTxt boolean False Whether to ignore robots.txt rules. Not recommended.
    enableConvertProcessor boolean True Whether to enable the remoteconvert-based processor for extracting links from binary documents.
    nearDuplicateDetector boolean True Whether to enable the near-duplicate content detector.
    patternsDetector boolean True Whether to enable patterns detection in pages.
    crawlSitemaps boolean True Whether to crawl sitemaps.
    disableConditionalGet boolean False Whether to always fetch documents, even if the server reports that they have not changed.
    defaultAccept boolean False Whether to crawl a url by default when it matches no other accept rule.
    defaultIndex boolean False Whether to index by default when a url matches no index rule.
    defaultFollow boolean False Whether to follow by default when a url matches no follow rule.
    defaultFollowRoots boolean True Whether to automatically follow root urls.
    enableSimpleSiteCollapsing boolean True Whether to generate a site ID suitable for document collapsing.
    simpleSiteCollapsingDepth int 0 How many path segments to use to generate the site collapsing ID.
    mimeTypesMode string exclude Whether the mimeTypes list acts as a white list or a black list.
    smartRefresh boolean True Whether to crawl a fraction of refreshed urls.
    smartRefreshMinAgeS int 3600 Age in seconds at which we may refresh old urls.
    smartRefreshMaxAgeS int 604800 Age in seconds at which we force the refresh of old urls.
    archiveDocuments boolean False When enabled, deleted documents are not deleted, but kept with their deletion date.
    enableConsolidation boolean True Define if we use a standard PAPI or a consolidation PAPI.
    refreshDelayS int 60 Minimum delay, in seconds, before refreshing any url. Default is 1 minute.

    Nested elements
    • mimeTypes StringConstantValue*
    • sessionIdBlacklist StringConstantValue*
    • PushAPIFilter KeyValue*
    • feeds Feed*
    • rootsets RootSet*
    • CrawlSchedulerConfig
    • CustomCrawlConfig
    • Rules*
    • UrlTesterData
    Type Wrapper Description
    StringConstantValue* mimeTypes
    StringConstantValue* sessionIdBlacklist SessionId blacklist. These parameters are removed from URLs with a path or query part containing them.
    KeyValue* PushAPIFilter
    Feed* feeds A list of feeds.
    RootSet* rootsets A list of files to load urls/sites from.
    CrawlSchedulerConfig -
    CustomCrawlConfig -
    Rules* -
    UrlTesterData -
    XML representation
    <FeedFetcher name="" documentsType="" fetcher="" crawlerServer="" connectorServer="" buildGroup="" dataModel="" storeTextOnly="true" nthreads="1" aggressive="false" throttleTimeMS="2500" ignoreRobotsTxt="false" enableConvertProcessor="true" nearDuplicateDetector="true" patternsDetector="true" crawlSitemaps="true" disableConditionalGet="false" defaultAccept="false" defaultIndex="false" defaultFollow="false" defaultFollowRoots="true" enableSimpleSiteCollapsing="true" simpleSiteCollapsingDepth="0" mimeTypesMode="exclude" smartRefresh="true" smartRefreshMinAgeS="3600" smartRefreshMaxAgeS="604800" archiveDocuments="false" enableConsolidation="true" refreshDelayS="60" ></FeedFetcher>
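    The following sketch shows how a polite (non-aggressive) feedfetcher with one nested feed might be declared. It assumes that the "feeds" wrapper listed above is written as an enclosing element. The name, URL, and numeric values are hypothetical, and attributes not shown are assumed to keep their defaults.

    ```xml
    <!-- Sketch only: the name, URL and numeric values are placeholders. -->
    <FeedFetcher name="news_feeds" nthreads="2" aggressive="false" throttleTimeMS="5000" ignoreRobotsTxt="false" smartRefresh="true" smartRefreshMinAgeS="3600" smartRefreshMaxAgeS="86400" refreshDelayS="60">
      <!-- Assumption: the "feeds" wrapper is an enclosing element. -->
      <feeds>
        <Feed url="http://www.example.com/rss.xml" refreshPeriodS="900" ></Feed>
      </feeds>
    </FeedFetcher>
    ```

    Here throttleTimeMS="5000" asks for a 5-second sleep between requests to the same host, and smartRefreshMaxAgeS="86400" forces old urls to be refreshed once they are one day old.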
  • Feed com.exalead.mercury.mami.crawl.v21.Feed
    A feed.
    Contains KeyValue* that are mapped to metas on all documents crawled from this root.
    Beware: there is a 4KB limit on the whole url + metas storage.
    Attributes
    • url string
    • site boolean
    • priority int
    • group string
    • kvs string
    • refreshPeriodS int
    • indexFeedItems boolean
    • indexItemDocuments boolean
    • findFeeds boolean
    • forceFeedMimeType boolean
    • findMediaLinks boolean
    Name Type Default value Description
    url string - The root url.
    site boolean True Enable site-mode: only crawl urls that belong to this 'site'.
    priority int 0 Priority shift. Increase or decrease priority. 0 means normal, -1 is higher priority, +1 lower.
    group string default Key used to group rules and root urls.
    kvs string - A semicolon-separated list of key-value pairs. Example: "key1=value1;key2=value2"
    refreshPeriodS int 600 How often to refresh this feed, in seconds. Default is 10 minutes.
    indexFeedItems boolean True Whether to index all items found in the feed with their metas, before crawling them.
    indexItemDocuments boolean True Whether to crawl the items and index the full item pages.
    findFeeds boolean False Whether to crawl feeds found in HTML headers: <link href="" rel="alternate" />.
    forceFeedMimeType boolean True Forces processing of the url as an XML feed (for servers returning buggy content types). Does not work with findFeeds enabled.
    findMediaLinks boolean True Finds <img src="" /> and YouTube/Dailymotion links in item text and pushes them as metas.

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* -
    XML representation
    <Feed url="" site="true" priority="0" group="default" kvs="" refreshPeriodS="600" indexFeedItems="true" indexItemDocuments="true" findFeeds="false" forceFeedMimeType="true" findMediaLinks="true" ></Feed>
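    Because forceFeedMimeType does not work with findFeeds enabled, a feed that relies on feed discovery would turn the former off explicitly. A minimal sketch, in which the URL and the key-value are hypothetical placeholders:

    ```xml
    <!-- Sketch only: the URL and kvs value are placeholders. -->
    <!-- findFeeds is enabled, so forceFeedMimeType is switched off. -->
    <Feed url="http://www.example.com/blog/" refreshPeriodS="1800" indexFeedItems="true" indexItemDocuments="false" findFeeds="true" forceFeedMimeType="false" kvs="source=blog" ></Feed>
    ```

    With refreshPeriodS="1800" the feed is refreshed every 30 minutes. With indexItemDocuments="false" only the feed items are indexed, not the full item pages.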
  • FetchConfig com.exalead.mercury.mami.fetch.v21.FetchConfig
    No documentation for this element.
    Attributes
    • version long
    • defaultFetcher string
    • dnsServer string
    • defaultMaxSizeKB int
    • defaultTruncate boolean
    • sessionLifetime int
    • fullDocumentMaxSizeKB int
    • crawlCacheProxyAddress string
    • crawlCacheProxyUsername string
    • crawlCacheProxyPassword string
    • crawlCacheRequestTimeoutMS long
    • globalProxyHost string
    • globalProxyPort int
    • globalProxyUsername string
    • globalProxyPassword string
    • globalProxyDomain string
    • nonProxyHosts string
    Name Type Default value Description
    version long - -
    defaultFetcher string -
    dnsServer string -
    defaultMaxSizeKB int -
    defaultTruncate boolean False
    sessionLifetime int 600 Cookie session lifetime in seconds
    fullDocumentMaxSizeKB int 32768
    crawlCacheProxyAddress string - Crawl through multibox (WebExperiencePlatform mode).
    crawlCacheProxyUsername string -
    crawlCacheProxyPassword string -
    crawlCacheRequestTimeoutMS long 10000
    globalProxyHost string -
    globalProxyPort int -
    globalProxyUsername string -
    globalProxyPassword string -
    globalProxyDomain string -
    nonProxyHosts string -

    Nested elements
    • mimes MimeConfig*
    • Fetcher*
    Type Wrapper Description
    MimeConfig* mimes
    Fetcher* -
    XML representation
    <FetchConfig version="" defaultFetcher="" dnsServer="" defaultMaxSizeKB="" defaultTruncate="false" sessionLifetime="600" fullDocumentMaxSizeKB="32768" crawlCacheProxyAddress="" crawlCacheProxyUsername="" crawlCacheProxyPassword="" crawlCacheRequestTimeoutMS="10000" globalProxyHost="" globalProxyPort="" globalProxyUsername="" globalProxyPassword="" globalProxyDomain="" nonProxyHosts="" ></FetchConfig>
  • Fetcher com.exalead.mercury.mami.fetch.v21.Fetcher
    No documentation for this element.
    Parent objects
    • FetchConfig
    Attributes
    • name string
    • type string
    • classId string
    • readTimeoutS int
    • writeTimeoutS int
    • connectTimeoutS int
    • maxDownloadTimeS int
    • userAgent string
    • from string
    • cookies boolean
    • proxyAddr string
    • proxyUsername string
    • proxyPassword string
    • proxyDomain string
    • useConnectForHttpsOverProxy boolean
    • useCrawlCache boolean
    • maxAgeS long
    Name Type Default value Description
    name string -
    type string -
    classId string -
    readTimeoutS int 15
    writeTimeoutS int 15
    connectTimeoutS int 30
    maxDownloadTimeS int 600
    userAgent string -
    from string -
    cookies boolean False
    proxyAddr string -
    proxyUsername string -
    proxyPassword string -
    proxyDomain string -
    useConnectForHttpsOverProxy boolean True
    useCrawlCache boolean False Crawl through crawl cache proxy. (WebExperiencePlatform mode).
    maxAgeS long 2592000 Maximum allowed age, in seconds, of documents fetched from the cache. Older documents are recrawled. Default is 30 days.

    Nested elements
    • headers Header*
    • parameters KeyValue*
    • configRules Config*
    Type Wrapper Description
    Header* headers
    KeyValue* parameters
    Config* configRules
    XML representation
    <Fetcher name="" type="" classId="" readTimeoutS="15" writeTimeoutS="15" connectTimeoutS="30" maxDownloadTimeS="600" userAgent="" from="" cookies="false" proxyAddr="" proxyUsername="" proxyPassword="" proxyDomain="" useConnectForHttpsOverProxy="true" useCrawlCache="false" maxAgeS="2592000" ></Fetcher>
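    As a hedged illustration, a fetcher with tightened timeouts, an identifying user agent, and one extra request header might look like the fragment below. The fetcher name, user agent, e-mail address, and header are hypothetical. The type and classId attributes are left out because this reference does not document their possible values, and the "headers" wrapper is assumed to be an enclosing element.

    ```xml
    <!-- Sketch only: the name, userAgent, from and header values are placeholders. -->
    <Fetcher name="intranet" readTimeoutS="30" writeTimeoutS="15" connectTimeoutS="10" maxDownloadTimeS="300" userAgent="ExampleBot/1.0" from="crawler-admin@example.com" cookies="true">
      <!-- Assumption: the "headers" wrapper is an enclosing element. -->
      <headers>
        <Header name="Accept-Language" value="en" ></Header>
      </headers>
    </Fetcher>
    ```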
  • Header com.exalead.mercury.mami.fetch.v21.Header
    No documentation for this element.
    Parent objects
    • headers AddHeaders
    • headers Fetcher
    Attributes
    • name string
    • value string
    Name Type Default value Description
    name string -
    value string -

    XML representation
    <Header name="" value="" ></Header>
  • Config com.exalead.mercury.mami.fetch.v21.Config
    No documentation for this element.
    Parent objects
    • configRules Fetcher
    Attributes
    • name string
    Name Type Default value Description
    name string -

    Nested elements
    • Pattern*
    • RulesConfig
    Type Wrapper Description
    Pattern* -
    RulesConfig -
    XML representation
    <Config name="" ></Config>
  • Cookies com.exalead.mercury.mami.fetch.v21.Cookies
    No documentation for this element.
    Parent objects
    • Config
    Attributes
    • enable boolean
    Name Type Default value Description
    enable boolean False

    XML representation
    <Cookies enable="false" ></Cookies>
  • Proxy com.exalead.mercury.mami.fetch.v21.Proxy
    No documentation for this element.
    Parent objects
    • Config
    Attributes
    • addr string
    • username string
    • password string
    • domain string
    Name Type Default value Description
    addr string -
    username string -
    password string -
    domain string -

    XML representation
    <Proxy addr="" username="" password="" domain="" ></Proxy>
  • AddHeaders com.exalead.mercury.mami.fetch.v21.AddHeaders
    No documentation for this element.
    Nested elements
    • headers Header*
    Type Wrapper Description
    Header* headers
    XML representation
    <AddHeaders ></AddHeaders>
  • Auth com.exalead.mercury.mami.fetch.v21.Auth
    No documentation for this element.
    Parent objects
    • Config
    Attributes
    • type string
    • username string
    • password string
    • realm string
    • domain string
    • host string
    Name Type Default value Description
    type string -
    username string -
    password string -
    realm string -
    domain string -
    host string -

    Nested elements
    • condition Cond
    • Post
    Type Wrapper Description
    Cond condition
    Post -
    XML representation
    <Auth type="" username="" password="" realm="" domain="" host="" ></Auth>
  • Post com.exalead.mercury.mami.fetch.v21.Post
    No documentation for this element.
    Parent objects
    • Auth
    Attributes
    • gatewayUrl string
    • formId string
    • formClass string
    • formName string
    • method string
    • action string
    • autoSubmittedForms int
    • additionalRequest string
    Name Type Default value Description
    gatewayUrl string -
    formId string - If there is more than one form on the gateway URL, a formName, a formId, and/or a formClass can be specified to find the right one.
    formClass string -
    formName string -
    method string - When method or action is not null, it overrides the one found in the form. When gatewayUrl is null, only method and action are used. gatewayUrl, method, and action must not all be null.
    action string -
    autoSubmittedForms int 0 How many times to try to find and submit a form after the login procedure.
    This is needed to get cookies for other domains in some SSOs.
    additionalRequest string - An additional URL to fetch after the HTML form authentication procedure, following all redirections to allow new cookies.
    Useful for some SSO systems, including Google sites.

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* -
    XML representation
    <Post formId="" formClass="" formName="" method="" action="" autoSubmittedForms="0" additionalRequest="" ></Post>
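    The sketch below shows one way the form attributes described above could be combined for an HTML form login behind an SSO. All URLs and the form identifier are hypothetical. Note that gatewayUrl is listed among the attributes although the generated XML representation above omits it, so its use here is an assumption based on the attribute table.

    ```xml
    <!-- Sketch only: the URLs and form id are placeholders. -->
    <!-- formId selects the right form when the gateway page contains several. -->
    <Post gatewayUrl="https://sso.example.com/login" formId="login-form" autoSubmittedForms="1" additionalRequest="https://app.example.com/home" ></Post>
    ```

    In this sketch method and action are left unset, so the values found in the form on the gateway page are used.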
  • Status com.exalead.mercury.mami.fetch.v21.Status
    No documentation for this element.
    Parent objects
    • And
    • Not
    • Or
    • condition Auth
    Attributes
    • success boolean
    • code int
    Name Type Default value Description
    success boolean True
    code int -

    XML representation
    <Status success="true" code="" ></Status>
  • Redirect com.exalead.mercury.mami.fetch.v21.Redirect
    No documentation for this element.
    Parent objects
    • And
    • Not
    • Or
    • condition Auth
    Attributes
    • success boolean
    • matches string
    Name Type Default value Description
    success boolean True
    matches string - Only match redirections to a url containing this string. If unset, all redirections match.

    XML representation
    <Redirect success="true" matches="" ></Redirect>
  • ResponseHeader com.exalead.mercury.mami.fetch.v21.ResponseHeader
    No documentation for this element.
    Parent objects
    • And
    • Not
    • Or
    • condition Auth
    Attributes
    • success boolean
    • name string
    • value string
    Name Type Default value Description
    success boolean True
    name string -
    value string -

    XML representation
    <ResponseHeader success="true" name="" value="" ></ResponseHeader>
  • InBody com.exalead.mercury.mami.fetch.v21.InBody
    No documentation for this element.
    Parent objects
    • And
    • Not
    • Or
    • condition Auth
    Attributes
    • success boolean
    • text string
    Name Type Default value Description
    success boolean True
    text string -

    XML representation
    <InBody success="true" text="" ></InBody>
  • And com.exalead.mercury.mami.fetch.v21.And
    No documentation for this element.
    Parent objects
    • And
    • Not
    • Or
    • condition Auth
    Attributes
    • success boolean
    Name Type Default value Description
    success boolean True

    Nested elements
    • Cond*
    Type Wrapper Description
    Cond* -
    XML representation
    <And success="true" ></And>
  • Or com.exalead.mercury.mami.fetch.v21.Or
    No documentation for this element.
    Parent objects
    • And
    • Not
    • Or
    • condition Auth
    Attributes
    • success boolean
    Name Type Default value Description
    success boolean True

    Nested elements
    • Cond*
    Type Wrapper Description
    Cond* -
    XML representation
    <Or success="true" ></Or>
  • Not com.exalead.mercury.mami.fetch.v21.Not
    No documentation for this element.
    Parent objects
    • And
    • Not
    • Or
    • condition Auth
    Attributes
    • success boolean
    Name Type Default value Description
    success boolean True

    Nested elements
    • Cond
    Type Wrapper Description
    Cond -
    XML representation
    <Not success="true" ></Not>
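    The condition elements above (Status, Redirect, ResponseHeader, InBody, And, Or, Not) can be nested under the "condition" wrapper of Auth. The following is a sketch under stated assumptions: the wrapper is written as an enclosing element, the status code and body text are hypothetical, and the success attribute is left at its default because its exact semantics are not documented here.

    ```xml
    <!-- Sketch only: the status code and body text are placeholders. -->
    <!-- Assumption: the "condition" wrapper is an enclosing element. -->
    <condition>
      <And>
        <Status code="200" ></Status>
        <Not>
          <InBody text="Invalid password" ></InBody>
        </Not>
      </And>
    </condition>
    ```

    And accepts any number of nested conditions (Cond*), whereas Not accepts a single one (Cond), as listed in their nested elements.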
  • AddParameters com.exalead.mercury.mami.fetch.v21.AddParameters
    No documentation for this element.
    Nested elements
    • parameters KeyValue*
    Type Wrapper Description
    KeyValue* parameters
    XML representation
    <AddParameters ></AddParameters>
  • MimeConfig com.exalead.mercury.mami.fetch.v21.MimeConfig
    No documentation for this element.
    Parent objects
    • mimes FetchConfig
    Attributes
    • mime string
    • maxSizeKB int
    • truncate boolean
    Name Type Default value Description
    mime string -
    maxSizeKB int -
    truncate boolean True When truncate is set to false, the file is not crawled if its size exceeds maxSizeKB. This is useful for binary files, such as PDF, because truncated binary files cannot be processed.

    XML representation
    <MimeConfig mime="" maxSizeKB="" truncate="true" ></MimeConfig>
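    To illustrate the truncate behavior described above, the sketch below skips oversized PDF files entirely while truncating large HTML pages. The size limits are hypothetical, and the "mimes" wrapper of FetchConfig is assumed to be an enclosing element.

    ```xml
    <!-- Sketch only: the size limits are placeholders. -->
    <!-- Assumption: the "mimes" wrapper is an enclosing element. -->
    <mimes>
      <!-- PDF files larger than 20480 KB (20 MB) are not crawled at all. -->
      <MimeConfig mime="application/pdf" maxSizeKB="20480" truncate="false" ></MimeConfig>
      <!-- HTML pages larger than 1024 KB (1 MB) are truncated. -->
      <MimeConfig mime="text/html" maxSizeKB="1024" truncate="true" ></MimeConfig>
    </mimes>
    ```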
  • ConvertConfig com.exalead.mercury.mami.convert.v10.ConvertConfig
    Global configuration for document conversion.
    This configuration impacts how binary files (e.g. images, office documents, ...) are handled during indexing and analysis (when using the ConvertDocumentProcessor) and during document preview.
    Attributes
    • version long
    Name Type Default value Description
    version long - -

    Nested elements
    • ConvertDocumentInputSettings
    • ConvertDocumentOutputSettings
    • ConvertDocumentProcessingSettings
    • ConvertInternalSettings
    • ConvertJavaPluginsSettings
    Type Wrapper Description
    ConvertDocumentInputSettings -
    ConvertDocumentOutputSettings -
    ConvertDocumentProcessingSettings -
    ConvertInternalSettings -
    ConvertJavaPluginsSettings -
    XML representation
    <ConvertConfig version="" ></ConvertConfig>
  • ConvertDocumentInputSettings com.exalead.mercury.mami.convert.v10.ConvertDocumentInputSettings
    Configuration of convert inputs handling.
    Parameters to accept inputs for conversion.
    Parent objects
    • ConvertConfig
    Attributes
    • minSizeKB int
    • maxSizeKB int
    • maxSizeForTextDocumentsKB int
    Name Type Default value Description
    minSizeKB int -1 Default minimum size in kilobytes for a document to be converted.

    The default is -1 which means no limit.

    Note: This setting defines the process default value, which can be overridden for each conversion command.
    maxSizeKB int -1 Default maximum size in kilobytes for a document to be converted.

    The default is -1 which means use program defaults (250MB).

    Note: This setting defines the process default value, which can be overridden for each conversion command.
    maxSizeForTextDocumentsKB int -1 Default maximum size in kilobytes for textual (html, xml, text) documents to be converted.

    The default is -1 which means use program defaults (250MB).

    Note: This setting defines the process default value, which can be overridden for each conversion command.

    XML representation
    <ConvertDocumentInputSettings minSizeKB="-1" maxSizeKB="-1" maxSizeForTextDocumentsKB="-1" ></ConvertDocumentInputSettings>
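    As a minimal sketch, the fragment below lowers the input size limits from the program defaults. The limits are hypothetical values chosen for illustration.

    ```xml
    <!-- Sketch only: the limits are placeholders. -->
    <!-- minSizeKB="-1" keeps "no limit". The other two values are 50 MB and 10 MB in kilobytes. -->
    <ConvertDocumentInputSettings minSizeKB="-1" maxSizeKB="51200" maxSizeForTextDocumentsKB="10240" ></ConvertDocumentInputSettings>
    ```

    These are process defaults only. As noted above, each conversion command can override them.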
  • ConvertDocumentOutputSettings com.exalead.mercury.mami.convert.v10.ConvertDocumentOutputSettings
    Configuration of convert outputs handling.
    Parameters to tune conversion outputs.
    Parent objects
    • ConvertConfig
    Attributes
    • maxSizeKB int
    • maxConvertedPagesForXmlOutput int
    • maxConvertedPagesForHtmlOutput int
    Name Type Default value Description
    maxSizeKB int -1 Default maximum size in kilobytes for converted documents.

    The default is -1 which means no limit.

    Note: This setting defines the process default value, which can be overridden for each conversion command.
    maxConvertedPagesForXmlOutput int -1 Default maximum number of document pages to be converted into xml.

    The default is -1 which means no limit.

    The definition of a page depends on the document type (PDF page, DOC page, etc.).

    Note: This setting defines the process default value, which can be overridden for each conversion command.
    maxConvertedPagesForHtmlOutput int -1 Default maximum number of document pages to be converted into html.

    The default is -1 which means no limit.

    The definition of a page depends on the document type (PDF page, DOC page, etc.).

    Note: This setting defines the process default value, which can be overridden for each conversion command.

    XML representation
    <ConvertDocumentOutputSettings maxSizeKB="-1" maxConvertedPagesForXmlOutput="-1" maxConvertedPagesForHtmlOutput="-1" ></ConvertDocumentOutputSettings>
  • ConvertDocumentProcessingSettings com.exalead.mercury.mami.convert.v10.ConvertDocumentProcessingSettings
    Configuration of conversion settings in processing stage.
    Parent objects
    • ConvertConfig
    Attributes
    • conversionTimeoutS int
    • conversionTimeoutPerMegabyteS int
    • conversionGraceTimeoutS int
    • enableImageResizing boolean
    • netcamMode string
    • netcamTimeoutS int
    • netcamAsyncTimeoutS int
    • netcamJobsPerMinion int
    • netcamMaxJobsPerMinion int
    • netcamJobsRetries int
    • netcamProxy string
    • ttfDir string
    • indexSingleContainersAsOneDocument boolean
    • singleContainersMaxRecursionDepth int
    • singleContainersMaxRecursionDocuments int
    • singleContainersMaxRecursionDocumentsTotal int
    • allowUnicode32 boolean
    • allowDocumentChars boolean
    • metaSeparator string
    • iFilterExtensions string
    • excelDateFormat int
    Name Type Default value Description
    conversionTimeoutS int -1 Default timeout in seconds for conversion.

    The default is -1 which means use program defaults (30 seconds).

    The conversion is considered failed if it takes longer than conversionTimeoutS.

    Note: This setting defines the process default value, which can be overridden for each conversion command.
    conversionTimeoutPerMegabyteS int -1 Default timeout in seconds per megabyte for conversion.

    The conversion fails if it takes more than conversionTimeoutPerMegabyteS * (size of the document in megabytes).

    The default is -1 which means "undefined".

    Note: This setting defines the process default value, which can be overridden for each conversion command.
    conversionGraceTimeoutS int -1 Global conversion grace timeout value in seconds.

    The convert process will kill a non-responding minion thread after this timeout.

    The default is -1 which means use program defaults (30 seconds).
    enableImageResizing boolean True Enables commands related to images resizing (used in thumbnails computation).
    netcamMode string optional Enables the Netcam feature. Possible values: "disabled" (disables the feature), "enabled" (enables the feature), "optional" (enables the feature if available).
    netcamTimeoutS int -1 Default Netcam conversion timeout in seconds.

    The default is -1 which means use program defaults (30 seconds).

    Note: This setting defines the process default value, which can be overridden for each conversion command.
    netcamAsyncTimeoutS int -1 Netcam asynchronous command timeout in seconds.

    It should be set to a low value, as it is applied to async commands.

    The default is -1 which means use program defaults (10 seconds).
    netcamJobsPerMinion int -1 The number of Netcam jobs per minion thread.

    The default is -1 which means use program defaults (4).
    netcamMaxJobsPerMinion int -1 Total number of jobs a minion can process before being recycled.

    The default is -1 which means use program defaults (128).
    netcamJobsRetries int -1 Number of Netcam jobs retries if a remote exception occurs.

    The default is -1 which means use program defaults (4).
    netcamProxy string - Optional proxy, or null if undefined.
    ttfDir string - The font path (necessary on UNIX). Used for thumbnail generation.
    indexSingleContainersAsOneDocument boolean False Default indexing mode for containers (ZIP, TAR, PST, ...) to be converted as a single document, instead of indexing the directory only.

    By default, it indexes only the container directory.

    Note: This function has no impact on container commands (i.e., opening, listing, ...).
    Note: This setting defines the process default value, which can be overridden for each conversion command.
    singleContainersMaxRecursionDepth int 1 Default maximum recursion depth (for containers, or containers inside containers).

    Only taken into account if indexSingleContainersAsOneDocument = true.

    Note: This setting defines the process default value, which can be overridden for each conversion command.
    singleContainersMaxRecursionDocuments int 2147483647 Default maximum number of documents that can be indexed in a container.

    Only taken into account if indexSingleContainersAsOneDocument = true.

    Note: This setting defines the process default value, which can be overridden for each conversion command.
    singleContainersMaxRecursionDocumentsTotal int 2147483647 Default maximum number of documents that can be indexed in a container and all its children (for containers inside containers).

    Only taken into account if indexSingleContainersAsOneDocument = true.

    Note: This setting defines the process default value, which can be overridden for each conversion command.
    allowUnicode32 boolean True Allows the use of 32-bit unicode points when processing documents.

    This allows producing Unicode code points above 65535.
    allowDocumentChars boolean True Allows the use of Unicode private range characters (E0XX) for separators (keyword, sentence, paragraph separators, ...)
    metaSeparator string : Character separator for meta-data namespaces.

    Note: MUST be a printable ASCII character (Unicode code point must be higher than 32 and strictly lower than 128).
    iFilterExtensions string - Comma-separated list of extensions to be processed through the Windows IFilter interface.

    Only available on Windows.
    excelDateFormat int 0 Default date format used to interpret date cells in Excel: 0: MM/DD/YYYY, 1: YYYY/MM/DD, 2: DD/MM/YYYY

    XML representation
    <ConvertDocumentProcessingSettings conversionTimeoutS="-1" conversionTimeoutPerMegabyteS="-1" conversionGraceTimeoutS="-1" enableImageResizing="true" netcamMode="optional" netcamTimeoutS="-1" netcamAsyncTimeoutS="-1" netcamJobsPerMinion="-1" netcamMaxJobsPerMinion="-1" netcamJobsRetries="-1" netcamProxy="" ttfDir="" indexSingleContainersAsOneDocument="false" singleContainersMaxRecursionDepth="1" singleContainersMaxRecursionDocuments="2147483647" singleContainersMaxRecursionDocumentsTotal="2147483647" allowUnicode32="true" allowDocumentChars="true" metaSeparator=":" iFilterExtensions="" excelDateFormat="0" ></ConvertDocumentProcessingSettings>
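    As an illustration of the container settings above, the following fragment is a minimal sketch that indexes each container as a single document, raises the recursion depth from 1 to 2, caps the number of indexed documents, and interprets Excel dates as DD/MM/YYYY. The numeric limits are arbitrary example values, and the fragment assumes that omitted attributes keep the defaults listed in the table.

    ```xml
    <!-- Sketch only: example values, not recommendations. -->
    <ConvertDocumentProcessingSettings
        indexSingleContainersAsOneDocument="true"
        singleContainersMaxRecursionDepth="2"
        singleContainersMaxRecursionDocuments="1000"
        singleContainersMaxRecursionDocumentsTotal="5000"
        excelDateFormat="2" >
    </ConvertDocumentProcessingSettings>
    ```

    Because these attributes only define process defaults, an individual conversion command may still override them.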
  • ConvertInternalSettings com.exalead.mercury.mami.convert.v10.ConvertInternalSettings
    Configuration of convert system settings.

    The StringValue list can be used to add new supported options, in raw format (i.e., the leading -- must be present).

    The legacy KeyValue list can be used to add new supported options, using short format for the key (without the leading -- ; such as "enable-foo").

    The value should be set to "true" if no value is to be set on the commandline side.
    Parent objects
    • ConvertConfig
    Attributes
    • retryOnMMAPFailed boolean
    • disableSharedMemory boolean
    • loggingLevel string
    • tmpDir string
    • selftestOnStartup boolean
    • restrictUserId boolean
    Name Type Default value Description
    retryOnMMAPFailed boolean False By default, retries the conversion with regular I/O if mmap fails when the convert is responsible for fetching the bytes of a document.

    This is useful for file systems mounted in direct I/O mode.

    Note: This setting defines the process default value, which can be overridden for each conversion command.
    disableSharedMemory boolean False Disables shared memory support.
    loggingLevel string - Sets the logging verbosity of the convert:
    • "verbose": Logs converted URLs.
    • "quiet": Logs errors only.
    tmpDir string - The temporary path to override the system temporary path.
    selftestOnStartup boolean False The converter tests itself on startup.
    restrictUserId boolean False Restricts connections to the user running the server.

    Supported only on Windows and Linux platforms.

    Nested elements
    • ConvertInternalCacheSettings
    • ConvertInternalChildrenSettings
    • ConvertInternalPOSIXSettings
    • KeyValue*
    • StringValue*
    Type Wrapper Description
    ConvertInternalCacheSettings -
    ConvertInternalChildrenSettings -
    ConvertInternalPOSIXSettings -
    KeyValue* -
    StringValue* -
    XML representation
    <ConvertInternalSettings retryOnMMAPFailed="false" disableSharedMemory="false" loggingLevel="" tmpDir="" selftestOnStartup="false" restrictUserId="false" ></ConvertInternalSettings>
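    The two ways of passing extra options described above can be sketched as follows. This is an assumption-laden illustration: the attribute names `value` (on StringValue) and `key` / `value` (on KeyValue) are not documented in this section and are assumed here, "enable-foo" is the placeholder option name used in the description, and the tmpDir path is hypothetical. In practice, a given option would be declared with one of the two forms, not both.

    ```xml
    <!-- Sketch only: StringValue/KeyValue attribute names are assumptions. -->
    <ConvertInternalSettings loggingLevel="quiet" tmpDir="/data/tmp/convert" retryOnMMAPFailed="true" >
      <!-- Raw format: the leading double hyphen is part of the value. -->
      <StringValue value="--enable-foo" />
      <!-- Legacy short format: no leading double hyphen; "true" means no value on the command line. -->
      <KeyValue key="enable-foo" value="true" />
    </ConvertInternalSettings>
    ```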
  • ConvertInternalCacheSettings com.exalead.mercury.mami.convert.v10.ConvertInternalCacheSettings
    Cache settings.
    The cache is mainly used to store preview files: CSS, JavaScript, and images.
    Parent objects
    • ConvertInternalSettings
    Attributes
    • minAgeS long
    • maxAgeS long
    • maxSizeMB int
    • maxSizePerFileKB int
    Name Type Default value Description
    minAgeS long -1 Minimum age (in seconds) for an item in the cache.

    This is useful for big files that we don't want to frequently update in the cache, for performance reasons.

    The default is -1 which means no limit.
    maxAgeS long -1 Maximum age (in seconds) for an item in the cache.

    The default is -1 which means no limit.
    maxSizeMB int -1 Maximum size (in megabytes) for the cache.

    The default is -1 which means no limit.
    maxSizePerFileKB int -1 Maximum size (in kilobytes) for an item to be cached.

    The default is -1 which means no limit.

    XML representation
    <ConvertInternalCacheSettings minAgeS="-1" maxAgeS="-1" maxSizeMB="-1" maxSizePerFileKB="-1" ></ConvertInternalCacheSettings>
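    For instance, a bounded cache could be sketched as below. The values are arbitrary examples: items stay at least 600 seconds (10 minutes) and at most 86400 seconds (24 hours), the whole cache is capped at 512 MB, and items larger than 10240 KB (10 MB) are not cached.

    ```xml
    <!-- Sketch only: example limits, to be tuned per deployment. -->
    <ConvertInternalCacheSettings minAgeS="600" maxAgeS="86400" maxSizeMB="512" maxSizePerFileKB="10240" >
    </ConvertInternalCacheSettings>
    ```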
  • ConvertInternalChildrenSettings com.exalead.mercury.mami.convert.v10.ConvertInternalChildrenSettings
    Configuration of convert children.

    The convert forks child processes to run conversions in parallel and to make the service more robust to crashes.
    Parent objects
    • ConvertInternalSettings
    Attributes
    • maxChildren int
    • maxIdleChildren int
    • maxIdleChildrenPerGroup int
    • childSpawnTimeoutS int
    • ChildConvertInitTimeoutS int
    • exec32Mode string
    Name Type Default value Description
    maxChildren int -1 Maximum authorized number of children.

    The default is -1 which means use program defaults (128).
    maxIdleChildren int -1 Children that were created can be reused for future conversion.

    This parameter specifies the max number of idle children for future use.

    After long inactivity, all children will die and be re-forked if necessary.

    The default is -1 which means use program defaults (32).
    maxIdleChildrenPerGroup int -1 Children that were created for a specific group can be reused for future conversion.

    This parameter specifies the max number of idle children for each group for future use.

    After long inactivity, all children will die and be re-forked if necessary.

    The default is -1 which means use program defaults (automatic).
    childSpawnTimeoutS int -1 Children spawn timeout in seconds.

    The default is -1 which means use program defaults (20 seconds).
    ChildConvertInitTimeoutS int -1 Minion convert libraries and plugins initialization timeout in seconds.

    The default is -1 which means use program defaults (30 seconds).
    exec32Mode string disabled Selects 32-bit support:
    • "disabled": Disables the feature.
    • "enabled": Enables the feature.
    • "optional": Enables the feature if available.

    XML representation
    <ConvertInternalChildrenSettings maxChildren="-1" maxIdleChildren="-1" maxIdleChildrenPerGroup="-1" childSpawnTimeoutS="-1" ChildConvertInitTimeoutS="-1" exec32Mode="disabled" ></ConvertInternalChildrenSettings>
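    A hedged example of overriding the program defaults: the fragment below limits the convert to 16 children, keeps at most 4 idle children for reuse, and lengthens both timeouts. All values are illustrative only; attributes left out (here maxIdleChildrenPerGroup) are assumed to keep their -1 default.

    ```xml
    <!-- Sketch only: note the capital C in ChildConvertInitTimeoutS, as documented. -->
    <ConvertInternalChildrenSettings maxChildren="16" maxIdleChildren="4" childSpawnTimeoutS="60" ChildConvertInitTimeoutS="90" exec32Mode="optional" >
    </ConvertInternalChildrenSettings>
    ```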
  • ConvertInternalPOSIXSettings com.exalead.mercury.mami.convert.v10.ConvertInternalPOSIXSettings
    Convert process parameters, valid for a main or child process.

    Valid only for POSIX systems.
    Parent objects
    • ConvertInternalSettings
    Attributes
    • maxProcessMemorySizeMB int
    • maxCoreFileSizeMB int
    • maxCreatedFileSizeKB int
    • maxNumberOfOpenedFiles int
    • maxResidentMemorySizeMB int
    Name Type Default value Description
    maxProcessMemorySizeMB int -1 Maximum allowed memory for a convert process (main or child) (posix: maxas).

    The default is -1 which means no limit.
    maxCoreFileSizeMB int -1 Maximum allowed size for core files (posix: maxcore).

    The default is -1 which means no limit.
    maxCreatedFileSizeKB int -1 Maximum allowed size for created files such as temporary files etc (posix: maxfsize).

    The default is -1 which means no limit.
    maxNumberOfOpenedFiles int -1 Maximum allowed number of opened files (posix: maxnofile).

    The default is -1 which means no limit.
    maxResidentMemorySizeMB int -1 Maximum allowed size for resident memory, in megabytes as the attribute name indicates (posix: maxrss).

    The default is -1 which means no limit.

    XML representation
    <ConvertInternalPOSIXSettings maxProcessMemorySizeMB="-1" maxCoreFileSizeMB="-1" maxCreatedFileSizeKB="-1" maxNumberOfOpenedFiles="-1" maxResidentMemorySizeMB="-1" ></ConvertInternalPOSIXSettings>
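    As a sketch of how these limits might be combined on a POSIX system, the fragment below caps the memory of each convert process, the size of core files, and the number of open files. The figures are arbitrary examples, and omitted attributes are assumed to keep the -1 (no limit) default.

    ```xml
    <!-- Sketch only: example limits for a main or child convert process. -->
    <ConvertInternalPOSIXSettings maxProcessMemorySizeMB="4096" maxCoreFileSizeMB="100" maxNumberOfOpenedFiles="1024" >
    </ConvertInternalPOSIXSettings>
    ```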
  • ConvertJavaPluginsSettings com.exalead.mercury.mami.convert.v10.ConvertJavaPluginsSettings
    Java global "format plugins" settings.
    Nested elements
    • ConvertJavaPluginSetting*
    Type Wrapper Description
    ConvertJavaPluginSetting* -
    XML representation
    <ConvertJavaPluginsSettings ></ConvertJavaPluginsSettings>
  • ConvertJavaPluginSetting com.exalead.mercury.mami.convert.v10.ConvertJavaPluginSetting
    Java global "format plugin" setting.
    Parent objects
    • ConvertJavaPluginsSettings
    Attributes
    • classId string
    • disabled boolean
    Name Type Default value Description
    classId string - Java class name.
    disabled boolean False Is the plugin disabled?

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* -
    XML representation
    <ConvertJavaPluginSetting classId="" disabled="false" ></ConvertJavaPluginSetting>
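    For example, disabling one Java format plugin could look like the sketch below. The class name com.example.convert.MyFormatPlugin is hypothetical and only stands in for a real plugin class.

    ```xml
    <!-- Sketch only: the classId value is a made-up class name. -->
    <ConvertJavaPluginsSettings>
      <ConvertJavaPluginSetting classId="com.example.convert.MyFormatPlugin" disabled="true" >
      </ConvertJavaPluginSetting>
    </ConvertJavaPluginsSettings>
    ```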
  • AnalysisConfig com.exalead.indexing.analysis.v10.AnalysisConfig
    AnalysisConfig represents a self-contained module for Document Analysis.

    AnalysisConfig is referenced by a BuildGroup.

    An analysis module defines a set of pipelines that are applied in this module.
    Attributes
    • name string
    • linguistic boolean
    Name Type Default value Description
    name string - Name of the analysis module.

    Must be unique.
    linguistic boolean True Extracts linguistic data for the dictionary, such as word counts.

    This impacts the ability to compute related terms and use word counts for ranking.

    Nested elements
    • AnalysisPipeline*
    Type Wrapper Description
    AnalysisPipeline* -
    XML representation
    <AnalysisConfig name="" linguistic="true" ></AnalysisConfig>
  • AnalysisPipeline com.exalead.indexing.analysis.v10.AnalysisPipeline
    A document analysis pipeline.

    Each pipeline has an associated accept condition. This condition is tested for each input document.

    If a document matches the condition, it is processed by this pipeline.

    If not, the condition is tested for the next pipeline in the list of pipelines defined in a DocumentAnalysis object.

    A document refused by all pipelines is neither processed nor indexed.


    Pipeline processing is made of several stages:
    • Document Processing Stage - is performed by a list of DocumentProcessor which process each Document sequentially.
      Document Processors manipulate the 'DocumentParts' (binary data pushed through the PAPI) and the 'DocumentChunks' (textual data obtained from PAPI metas, by processing a DocumentPart, or by processing pre-existing DocumentChunks).

      Each DocumentChunk has a textual content, a ContextName, a language, and a score, and may belong to a DocumentPart.

      A DocumentChunk belonging to no DocumentPart is called a root DocumentChunk.
    • Semantic Processing Stage - involves a list of SemanticProcessor which process each Document Chunk of each Document sequentially (except those for which Semantic Processing is disabled in the mapping).

      Semantic Processing segments text into 'tokens' and then processes text as a flow of tokens. SemanticAnnotations are produced on each token.
    • Mapping - involves mapping DocumentChunk and Semantic Annotations to index fields.
    Parent objects
    • AnalysisConfig
    Attributes
    • name string
    • errorAction string
    • reportDocumentErrors boolean
    • globalLogDocumentErrors boolean
    • autoBlacklistDocuments boolean
    • tokenizationConfig string
    • autoconfigureFromDataModel boolean
    • documentProcessorsProfiling boolean
    • semanticPipeTimeout int
    • slowDocumentWarningTimeUS long
    • semanticProcessorsProfiling boolean
    Name Type Default value Description
    name string -
    errorAction string continue Specifies the action to launch if there is a document error during processing:
    • "discard": Discards the document from the job. If the document was already in the index, it is not removed.
    • "delete": Discards the document from the job and deletes it from the index.
    • "continue": Keeps processing the document. The document will probably be incomplete in the index.
    reportDocumentErrors boolean True Reports the document errors in the global reporting store, for further analysis.
    globalLogDocumentErrors boolean False Logs errors and exceptions reported by the processors in the global log (without stack trace).
    autoBlacklistDocuments boolean True Automatically tries to add documents that trigger serious failures to a block list.

    This option helps prevent failure loops, that is, cases where the same documents keep triggering the same analysis process failures.
    tokenizationConfig string - Reference to the TokenizationConfig object to use for tokenization during Semantic Processing Stage.
    autoconfigureFromDataModel boolean True
    documentProcessorsProfiling boolean False Logs the CPU time spent for each document processor and for the main indexing phase.

    The total time spent for each processor is dumped in the analyzer log at the end of the job.
    semanticPipeTimeout int 0 CPU-time limit for the processing of a text chunk by the semantic pipe, in seconds.
    slowDocumentWarningTimeUS long 5000000 If the processing of a document is longer than this time, a message will be printed in the analyzer log.

    A value of 0 disables the warning feature.
    semanticProcessorsProfiling boolean False Logs the CPU time spent for each semantic processor.

    The total time spent for each processor is dumped in the analyzer log at the end of the job.

    Warning: This feature strongly impacts performance, only enable it if required.

    Nested elements
    • AcceptCondition
    • DocumentProcessor*
    • FilteringConfiguration
    • LanguageConfiguration*
    • MappingConfiguration
    • SemanticProcessor*
    Type Wrapper Description
    AcceptCondition -
    DocumentProcessor* -
    FilteringConfiguration -
    LanguageConfiguration* -
    MappingConfiguration -
    SemanticProcessor* -
    XML representation
    <AnalysisPipeline name="" errorAction="continue" reportDocumentErrors="true" globalLogDocumentErrors="false" autoBlacklistDocuments="true" tokenizationConfig="tok0" autoconfigureFromDataModel="true" documentProcessorsProfiling="false" semanticPipeTimeout="0" slowDocumentWarningTimeUS="5000000" semanticProcessorsProfiling="false" ></AnalysisPipeline>
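    The pipeline selection described above (each document goes to the first pipeline whose accept condition matches) can be sketched as follows. Several points are assumptions: the accept condition is written as a direct child element of the pipeline, the pipeline and source names are hypothetical, and other nested elements (DocumentProcessor, MappingConfiguration, ...) are omitted for brevity.

    ```xml
    <!-- Sketch only: names are made up; nesting of the accept condition is assumed. -->
    <AnalysisConfig name="analysis0" linguistic="true" >
      <!-- Tested first: accepts only documents whose source is "crm". -->
      <AnalysisPipeline name="crm_pipeline" errorAction="discard" tokenizationConfig="tok0" >
        <SourceCondition source="crm" />
      </AnalysisPipeline>
      <!-- Tested next, for documents refused by the first pipeline. -->
      <AnalysisPipeline name="default_pipeline" errorAction="continue" tokenizationConfig="tok0" slowDocumentWarningTimeUS="10000000" >
        <NotCondition>
          <SourceCondition source="crm" />
        </NotCondition>
      </AnalysisPipeline>
    </AnalysisConfig>
    ```

    With this layout, every document is accepted by exactly one of the two pipelines, so none is left unprocessed.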
  • AndCondition com.exalead.indexing.analysis.v10.AndCondition
    AndCondition matches if all children AcceptCondition match.
    Nested elements
    • AcceptCondition*
    Type Wrapper Description
    AcceptCondition* -
    XML representation
    <AndCondition ></AndCondition>
  • OrCondition com.exalead.indexing.analysis.v10.OrCondition
    OrCondition matches if at least one child AcceptCondition matches.
    Nested elements
    • AcceptCondition*
    Type Wrapper Description
    AcceptCondition* -
    XML representation
    <OrCondition ></OrCondition>
  • NotCondition com.exalead.indexing.analysis.v10.NotCondition
    Matches if the child condition does not match.

    If there is no child condition (null), this condition never matches.
    Nested elements
    • AcceptCondition
    Type Wrapper Description
    AcceptCondition -
    XML representation
    <NotCondition ></NotCondition>
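    The three logical conditions can be combined. The sketch below matches documents coming from either of two sources, unless the current build group is a given one. It assumes that child conditions are nested directly as child elements; the source and build group names are hypothetical.

    ```xml
    <!-- Sketch only: (source is "intranet" OR "wiki") AND NOT (build group is "bg_archive"). -->
    <AndCondition>
      <OrCondition>
        <SourceCondition source="intranet" />
        <SourceCondition source="wiki" />
      </OrCondition>
      <NotCondition>
        <BuildGroupCondition name="bg_archive" />
      </NotCondition>
    </AndCondition>
    ```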
  • SourceCondition com.exalead.indexing.analysis.v10.SourceCondition
    SourceCondition matches if the source of the document matches 'source'.
    Parent objects
    • AnalysisPipeline
    • AndCondition
    • CGRDocumentProcessor
    • ConcatValues
    • ContentCleanup
    • ConvertTextExtractor
    • CoordinatesFormatter
    • CopyContext
    • CustomDocumentProcessor
    • DataModelClassResolver
    • DateFormatter
    • DebugCrashProcessor
    • DebugProcessor
    • DiscardDocument
    • DocumentProcessor
    • DocumentProcessorGroup
    • DoubleToLong
    • FixedRangeNumericalPartitioning
    • ForcedRangeNumericalPartitioning
    • FormatCheckerDate
    • GeoBBoxProcessor
    • GeoCategorizer
    • HTMLCSSExtractor
    • HTMLCSSSelector
    • HTMLRelevantContentExtractor
    • HTMLTableExtractor
    • InferFileExtension
    • InsertCurrentDate
    • JavaDocumentProcessor
    • JavaProcessor
    • JavaScriptProcessor
    • LanguageDetector
    • LanguageSetter
    • MIMEDetector
    • MathDocumentProcessor
    • MetaFinder
    • MimeTypeSetter
    • MultiContextCSVEncoder
    • MultiContextDocumentProcessor
    • NativeTextExtractor
    • NewChunk
    • NotCondition
    • NumericalFormatter
    • OrCondition
    • PLMExpandDocumentProcessor
    • PrecomputedThumbnailsDocumentProcessor
    • PrintfValues
    • PublicUrlProcessor
    • RealTimeAlerting
    • RemoteHTTPTransformer
    • RemoteMOTAPIDocumentProcessor
    • RemoveContexts
    • RenameContext
    • RenameUnmappedContexts
    • ReplaceContextNames
    • ReplaceRegexp
    • ReplaceValues
    • SemanticPipeDocumentProcessor
    • SetDefaultValue
    • SimilarStringToPart
    • SingleContextDocumentProcessor
    • SplitValues
    • StandardPartsMerger
    • StorageServiceDocumentProcessor
    • StringHash
    • StringHash32
    • StringHash64
    • StringTransform
    • TextToNum
    • URLCodec
    • URLTransformer
    • UTF8Checker
    • UniformRandomContextGenerator
    • UnitsOfMeasurementNormalizer
    • ValueSelector
    • WildcardIndexing
    • XpathExtractor
    • XpathFragmentExtractor
    • ZipfRandomContextGenerator
    Attributes
    • source string
    Name Type Default value Description
    source string - Value of the 'source' for the document against which to check.

    XML representation
    <SourceCondition source="" ></SourceCondition>
  • BuildGroupCondition com.exalead.indexing.analysis.v10.BuildGroupCondition
    BuildGroupCondition matches if the current buildgroup matches 'name'.
    Parent objects
    • AnalysisPipeline
    • AndCondition
    • CGRDocumentProcessor
    • ConcatValues
    • ContentCleanup
    • ConvertTextExtractor
    • CoordinatesFormatter
    • CopyContext
    • CustomDocumentProcessor
    • DataModelClassResolver
    • DateFormatter
    • DebugCrashProcessor
    • DebugProcessor
    • DiscardDocument
    • DocumentProcessor
    • DocumentProcessorGroup
    • DoubleToLong
    • FixedRangeNumericalPartitioning
    • ForcedRangeNumericalPartitioning
    • FormatCheckerDate
    • GeoBBoxProcessor
    • GeoCategorizer
    • HTMLCSSExtractor
    • HTMLCSSSelector
    • HTMLRelevantContentExtractor
    • HTMLTableExtractor
    • InferFileExtension
    • InsertCurrentDate
    • JavaDocumentProcessor
    • JavaProcessor
    • JavaScriptProcessor
    • LanguageDetector
    • LanguageSetter
    • MIMEDetector
    • MathDocumentProcessor
    • MetaFinder
    • MimeTypeSetter
    • MultiContextCSVEncoder
    • MultiContextDocumentProcessor
    • NativeTextExtractor
    • NewChunk
    • NotCondition
    • NumericalFormatter
    • OrCondition
    • PLMExpandDocumentProcessor
    • PrecomputedThumbnailsDocumentProcessor
    • PrintfValues
    • PublicUrlProcessor
    • RealTimeAlerting
    • RemoteHTTPTransformer
    • RemoteMOTAPIDocumentProcessor
    • RemoveContexts
    • RenameContext
    • RenameUnmappedContexts
    • ReplaceContextNames
    • ReplaceRegexp
    • ReplaceValues
    • SemanticPipeDocumentProcessor
    • SetDefaultValue
    • SimilarStringToPart
    • SingleContextDocumentProcessor
    • SplitValues
    • StandardPartsMerger
    • StorageServiceDocumentProcessor
    • StringHash
    • StringHash32
    • StringHash64
    • StringTransform
    • TextToNum
    • URLCodec
    • URLTransformer
    • UTF8Checker
    • UniformRandomContextGenerator
    • UnitsOfMeasurementNormalizer
    • ValueSelector
    • WildcardIndexing
    • XpathExtractor
    • XpathFragmentExtractor
    • ZipfRandomContextGenerator
    Attributes
    • name string
    Name Type Default value Description
    name string - Value of the "buildgroup" for the document against which to check.

    XML representation
    <BuildGroupCondition name="" ></BuildGroupCondition>
  • MetaCondition com.exalead.indexing.analysis.v10.MetaCondition
    MetaCondition matches if the Document contains a DocumentChunk whose meta name and value match the specified condition.
    Parent objects
    • AnalysisPipeline
    • AndCondition
    • CGRDocumentProcessor
    • ConcatValues
    • ContentCleanup
    • ConvertTextExtractor
    • CoordinatesFormatter
    • CopyContext
    • CustomDocumentProcessor
    • DataModelClassResolver
    • DateFormatter
    • DebugCrashProcessor
    • DebugProcessor
    • DiscardDocument
    • DocumentProcessor
    • DocumentProcessorGroup
    • DoubleToLong
    • FixedRangeNumericalPartitioning
    • ForcedRangeNumericalPartitioning
    • FormatCheckerDate
    • GeoBBoxProcessor
    • GeoCategorizer
    • HTMLCSSExtractor
    • HTMLCSSSelector
    • HTMLRelevantContentExtractor
    • HTMLTableExtractor
    • InferFileExtension
    • InsertCurrentDate
    • JavaDocumentProcessor
    • JavaProcessor
    • JavaScriptProcessor
    • LanguageDetector
    • LanguageSetter
    • MIMEDetector
    • MathDocumentProcessor
    • MetaFinder
    • MimeTypeSetter
    • MultiContextCSVEncoder
    • MultiContextDocumentProcessor
    • NativeTextExtractor
    • NewChunk
    • NotCondition
    • NumericalFormatter
    • OrCondition
    • PLMExpandDocumentProcessor
    • PrecomputedThumbnailsDocumentProcessor
    • PrintfValues
    • PublicUrlProcessor
    • RealTimeAlerting
    • RemoteHTTPTransformer
    • RemoteMOTAPIDocumentProcessor
    • RemoveContexts
    • RenameContext
    • RenameUnmappedContexts
    • ReplaceContextNames
    • ReplaceRegexp
    • ReplaceValues
    • SemanticPipeDocumentProcessor
    • SetDefaultValue
    • SimilarStringToPart
    • SingleContextDocumentProcessor
    • SplitValues
    • StandardPartsMerger
    • StorageServiceDocumentProcessor
    • StringHash
    • StringHash32
    • StringHash64
    • StringTransform
    • TextToNum
    • URLCodec
    • URLTransformer
    • UTF8Checker
    • UniformRandomContextGenerator
    • UnitsOfMeasurementNormalizer
    • ValueSelector
    • WildcardIndexing
    • XpathExtractor
    • XpathFragmentExtractor
    • ZipfRandomContextGenerator
    Attributes
    • name string
    • nameMode enum(equals, matches)
    • valueMode enum(equals, contains, exists, matches)
    • value string
    Name Type Default value Description
    name string - Name of the meta against which to check.
    nameMode enum(equals, matches) equals Meta name test mode:
    • "equals": Evaluates the DocumentChunk with a name equal to the specified one.
    • "matches": Evaluates the DocumentChunk with a name matching the specified regular expression. The match is case insensitive.
    valueMode enum(equals, contains, exists, matches) exists Value test mode:
    • "exists": Matches if a DocumentChunk passes the name condition.
    • "equals": Matches if a DocumentChunk passes the name condition and the textual content is equal to the 'value' attribute.
    • "contains": Matches if a DocumentChunk passes the name condition and the textual content contains 'value' (pure string matching is performed, without tokenization).
    • "matches": Matches if a DocumentChunk passes the name condition and the textual content matches the regular expression specified by the 'value' attribute. The match is case insensitive.
    value string - The string to check against the value of DocumentChunks.

    XML representation
    <MetaCondition name="" nameMode="equals" valueMode="exists" value="" ></MetaCondition>
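    To make the name and value modes concrete, here is a minimal sketch with two MetaCondition elements wrapped in an OrCondition. The meta names and the value are hypothetical examples.

    ```xml
    <!-- Sketch only: meta names and values are made up. -->
    <OrCondition>
      <!-- A chunk named exactly "author" whose text contains the string "Smith". -->
      <MetaCondition name="author" nameMode="equals" valueMode="contains" value="Smith" />
      <!-- Any chunk whose name matches the regular expression dc_.* exists. -->
      <MetaCondition name="dc_.*" nameMode="matches" valueMode="exists" value="" />
    </OrCondition>
    ```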
  • MimeCondition com.exalead.indexing.analysis.v10.MimeCondition
    A condition that matches if the MIME type of the FIRST document part is in the list.

    Note: Conditions work on documents, but MIME types are set per document part.

    The MimeCondition only tests the MIME type of the first part, if present.
    Nested elements
    • StringValue* (wrapper: mimes)
    Type Wrapper Description
    StringValue* mimes
    XML representation
    <MimeCondition ></MimeCondition>
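    A possible shape for a populated MimeCondition is sketched below. It assumes that the "mimes" wrapper listed in the table is written as an enclosing element and that each StringValue carries its text in a `value` attribute; neither detail is spelled out in this section.

    ```xml
    <!-- Sketch only: wrapper element and "value" attribute are assumptions. -->
    <MimeCondition>
      <mimes>
        <StringValue value="application/pdf" />
        <StringValue value="application/msword" />
      </mimes>
    </MimeCondition>
    ```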
  • URLMatchCondition com.exalead.indexing.analysis.v10.URLMatchCondition
    A condition that matches if the URI matches the regexp.
    Parent objects
    • AnalysisPipeline
    • AndCondition
    • CGRDocumentProcessor
    • ConcatValues
    • ContentCleanup
    • ConvertTextExtractor
    • CoordinatesFormatter
    • CopyContext
    • CustomDocumentProcessor
    • DataModelClassResolver
    • DateFormatter
    • DebugCrashProcessor
    • DebugProcessor
    • DiscardDocument
    • DocumentProcessor
    • DocumentProcessorGroup
    • DoubleToLong
    • FixedRangeNumericalPartitioning
    • ForcedRangeNumericalPartitioning
    • FormatCheckerDate
    • GeoBBoxProcessor
    • GeoCategorizer
    • HTMLCSSExtractor
    • HTMLCSSSelector
    • HTMLRelevantContentExtractor
    • HTMLTableExtractor
    • InferFileExtension
    • InsertCurrentDate
    • JavaDocumentProcessor
    • JavaProcessor
    • JavaScriptProcessor
    • LanguageDetector
    • LanguageSetter
    • MIMEDetector
    • MathDocumentProcessor
    • MetaFinder
    • MimeTypeSetter
    • MultiContextCSVEncoder
    • MultiContextDocumentProcessor
    • NativeTextExtractor
    • NewChunk
    • NotCondition
    • NumericalFormatter
    • OrCondition
    • PLMExpandDocumentProcessor
    • PrecomputedThumbnailsDocumentProcessor
    • PrintfValues
    • PublicUrlProcessor
    • RealTimeAlerting
    • RemoteHTTPTransformer
    • RemoteMOTAPIDocumentProcessor
    • RemoveContexts
    • RenameContext
    • RenameUnmappedContexts
    • ReplaceContextNames
    • ReplaceRegexp
    • ReplaceValues
    • SemanticPipeDocumentProcessor
    • SetDefaultValue
    • SimilarStringToPart
    • SingleContextDocumentProcessor
    • SplitValues
    • StandardPartsMerger
    • StorageServiceDocumentProcessor
    • StringHash
    • StringHash32
    • StringHash64
    • StringTransform
    • TextToNum
    • URLCodec
    • URLTransformer
    • UTF8Checker
    • UniformRandomContextGenerator
    • UnitsOfMeasurementNormalizer
    • ValueSelector
    • WildcardIndexing
    • XpathExtractor
    • XpathFragmentExtractor
    • ZipfRandomContextGenerator
    Attributes
    • regexp string
    Name Type Default value Description
    regexp string - The regexp.

    Note: It is not anchored by default; for example, use '.*\.asp' to match .asp URIs.

    XML representation
    <URLMatchCondition regexp="" ></URLMatchCondition>
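    Reusing the pattern from the note above, the following sketch excludes .asp URIs by wrapping the condition in a NotCondition. Direct nesting of the child condition is assumed.

    ```xml
    <!-- Sketch only: matches documents whose URI does NOT match .*\.asp -->
    <NotCondition>
      <URLMatchCondition regexp=".*\.asp" />
    </NotCondition>
    ```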
  • FilenameMatchCondition com.exalead.indexing.analysis.v10.FilenameMatchCondition
    A condition that matches if the filename of the FIRST document part matches the regexp.

    Note: Conditions work on documents, but filenames are set per document part.

    FilenameMatchCondition only tests the filename of the first part, if present.
    Parent objects
    • AnalysisPipeline
    • AndCondition
    • CGRDocumentProcessor
    • ConcatValues
    • ContentCleanup
    • ConvertTextExtractor
    • CoordinatesFormatter
    • CopyContext
    • CustomDocumentProcessor
    • DataModelClassResolver
    • DateFormatter
    • DebugCrashProcessor
    • DebugProcessor
    • DiscardDocument
    • DocumentProcessor
    • DocumentProcessorGroup
    • DoubleToLong
    • FixedRangeNumericalPartitioning
    • ForcedRangeNumericalPartitioning
    • FormatCheckerDate
    • GeoBBoxProcessor
    • GeoCategorizer
    • HTMLCSSExtractor
    • HTMLCSSSelector
    • HTMLRelevantContentExtractor
    • HTMLTableExtractor
    • InferFileExtension
    • InsertCurrentDate
    • JavaDocumentProcessor
    • JavaProcessor
    • JavaScriptProcessor
    • LanguageDetector
    • LanguageSetter
    • MIMEDetector
    • MathDocumentProcessor
    • MetaFinder
    • MimeTypeSetter
    • MultiContextCSVEncoder
    • MultiContextDocumentProcessor
    • NativeTextExtractor
    • NewChunk
    • NotCondition
    • NumericalFormatter
    • OrCondition
    • PLMExpandDocumentProcessor
    • PrecomputedThumbnailsDocumentProcessor
    • PrintfValues
    • PublicUrlProcessor
    • RealTimeAlerting
    • RemoteHTTPTransformer
    • RemoteMOTAPIDocumentProcessor
    • RemoveContexts
    • RenameContext
    • RenameUnmappedContexts
    • ReplaceContextNames
    • ReplaceRegexp
    • ReplaceValues
    • SemanticPipeDocumentProcessor
    • SetDefaultValue
    • SimilarStringToPart
    • SingleContextDocumentProcessor
    • SplitValues
    • StandardPartsMerger
    • StorageServiceDocumentProcessor
    • StringHash
    • StringHash32
    • StringHash64
    • StringTransform
    • TextToNum
    • URLCodec
    • URLTransformer
    • UTF8Checker
    • UniformRandomContextGenerator
    • UnitsOfMeasurementNormalizer
    • ValueSelector
    • WildcardIndexing
    • XpathExtractor
    • XpathFragmentExtractor
    • ZipfRandomContextGenerator
    Attributes
    • regexp string
    Name Type Default value Description
    regexp string - The regexp.
    Note: It is not anchored by default; for example, use '.*\.doc' to match .doc files.

    XML representation
    <FilenameMatchCondition regexp="" ></FilenameMatchCondition>
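    As a short sketch, two filename patterns can be combined with an OrCondition. The first pattern is the one from the note above; the second is an additional example.

    ```xml
    <!-- Sketch only: accepts documents whose first part is named like *.doc or *.pdf -->
    <OrCondition>
      <FilenameMatchCondition regexp=".*\.doc" />
      <FilenameMatchCondition regexp=".*\.pdf" />
    </OrCondition>
    ```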
  • BinaryContentCondition com.exalead.indexing.analysis.v10.BinaryContentCondition
    A condition that matches if the binary content of the FIRST document part matches the binary string.

    Note: Conditions work on documents, but content is set per document part.

    BinaryContentCondition only tests the binary content of the first part, if present.
    Parent objects
    • AnalysisPipeline
    • AndCondition
    • CGRDocumentProcessor
    • ConcatValues
    • ContentCleanup
    • ConvertTextExtractor
    • CoordinatesFormatter
    • CopyContext
    • CustomDocumentProcessor
    • DataModelClassResolver
    • DateFormatter
    • DebugCrashProcessor
    • DebugProcessor
    • DiscardDocument
    • DocumentProcessor
    • DocumentProcessorGroup
    • DoubleToLong
    • FixedRangeNumericalPartitioning
    • ForcedRangeNumericalPartitioning
    • FormatCheckerDate
    • GeoBBoxProcessor
    • GeoCategorizer
    • HTMLCSSExtractor
    • HTMLCSSSelector
    • HTMLRelevantContentExtractor
    • HTMLTableExtractor
    • InferFileExtension
    • InsertCurrentDate
    • JavaDocumentProcessor
    • JavaProcessor
    • JavaScriptProcessor
    • LanguageDetector
    • LanguageSetter
    • MIMEDetector
    • MathDocumentProcessor
    • MetaFinder
    • MimeTypeSetter
    • MultiContextCSVEncoder
    • MultiContextDocumentProcessor
    • NativeTextExtractor
    • NewChunk
    • NotCondition
    • NumericalFormatter
    • OrCondition
    • PLMExpandDocumentProcessor
    • PrecomputedThumbnailsDocumentProcessor
    • PrintfValues
    • PublicUrlProcessor
    • RealTimeAlerting
    • RemoteHTTPTransformer
    • RemoteMOTAPIDocumentProcessor
    • RemoveContexts
    • RenameContext
    • RenameUnmappedContexts
    • ReplaceContextNames
    • ReplaceRegexp
    • ReplaceValues
    • SemanticPipeDocumentProcessor
    • SetDefaultValue
    • SimilarStringToPart
    • SingleContextDocumentProcessor
    • SplitValues
    • StandardPartsMerger
    • StorageServiceDocumentProcessor
    • StringHash
    • StringHash32
    • StringHash64
    • StringTransform
    • TextToNum
    • URLCodec
    • URLTransformer
    • UTF8Checker
    • UniformRandomContextGenerator
    • UnitsOfMeasurementNormalizer
    • ValueSelector
    • WildcardIndexing
    • XpathExtractor
    • XpathFragmentExtractor
    • ZipfRandomContextGenerator
    Attributes
    • offset int
    • match string
    Name Type Default value Description
    offset int - Offset, in bytes, of the binary data to be compared (0 for the beginning of the file).
    Negative values are taken as offset from the end of the file (-1 for the last byte).
    match string - Binary string to be compared.

    The string may contain any ASCII (7-bit) character, or the following '\' escape sequences:
    • \xNN A hexadecimal-encoded character (each N in '0'..'9' or 'A'..'F')
    • \NNN An octal-encoded character (each N in '0'..'9')
    • \n Character 10
    • \r Character 13
    • \\ Character '\'
    • \" Character '"'
    • \? Any character

    XML representation
    <BinaryContentCondition offset="" match="" ></BinaryContentCondition>
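    The offset and escape rules above can be illustrated with a small Python sketch. It is not the CloudView implementation: the function names parse_match and binary_content_matches are hypothetical, and the sketch assumes that \xNN takes exactly two hexadecimal digits and \NNN exactly three octal digits.

```python
def parse_match(pattern: str) -> list:
    """Turn a 'match' string into byte values; None stands for 'any byte'."""
    simple = {"n": 10, "r": 13, "\\": ord("\\"), '"': ord('"')}
    out, i = [], 0
    while i < len(pattern):
        ch = pattern[i]
        if ch != "\\":
            out.append(ord(ch))                        # plain 7-bit ASCII character
            i += 1
            continue
        nxt = pattern[i + 1]
        if nxt == "x":                                 # \xNN: two hexadecimal digits (assumed)
            out.append(int(pattern[i + 2:i + 4], 16))
            i += 4
        elif nxt in "01234567":                        # \NNN: three octal digits (assumed)
            out.append(int(pattern[i + 1:i + 4], 8))
            i += 4
        elif nxt == "?":                               # \?: any character
            out.append(None)
            i += 2
        else:                                          # \n, \r, \\ or \"
            out.append(simple[nxt])
            i += 2
    return out


def binary_content_matches(data: bytes, offset: int, pattern: str) -> bool:
    """Compare 'pattern' with the bytes of 'data' found at 'offset'."""
    expected = parse_match(pattern)
    # Negative offsets count from the end of the file (-1 is the last byte).
    start = offset if offset >= 0 else len(data) + offset
    window = data[start:start + len(expected)]
    if start < 0 or len(window) != len(expected):
        return False
    return all(e is None or e == b for e, b in zip(expected, window))
```

    For instance, under these assumptions, binary_content_matches(b'%PDF-1.7', 0, r'%PDF-\?.\?') accepts any PDF header regardless of the version digits.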
  • DataModelClassCondition com.exalead.indexing.analysis.v10.DataModelClassCondition
    A condition that matches if the document belongs to the specified DataModel class.
    Parent objects
    • AnalysisPipeline
    • AndCondition
    • CGRDocumentProcessor
    • ConcatValues
    • ContentCleanup
    • ConvertTextExtractor
    • CoordinatesFormatter
    • CopyContext
    • CustomDocumentProcessor
    • DataModelClassResolver
    • DateFormatter
    • DebugCrashProcessor
    • DebugProcessor
    • DiscardDocument
    • DocumentProcessor
    • DocumentProcessorGroup
    • DoubleToLong
    • FixedRangeNumericalPartitioning
    • ForcedRangeNumericalPartitioning
    • FormatCheckerDate
    • GeoBBoxProcessor
    • GeoCategorizer
    • HTMLCSSExtractor
    • HTMLCSSSelector
    • HTMLRelevantContentExtractor
    • HTMLTableExtractor
    • InferFileExtension
    • InsertCurrentDate
    • JavaDocumentProcessor
    • JavaProcessor
    • JavaScriptProcessor
    • LanguageDetector
    • LanguageSetter
    • MIMEDetector
    • MathDocumentProcessor
    • MetaFinder
    • MimeTypeSetter
    • MultiContextCSVEncoder
    • MultiContextDocumentProcessor
    • NativeTextExtractor
    • NewChunk
    • NotCondition
    • NumericalFormatter
    • OrCondition
    • PLMExpandDocumentProcessor
    • PrecomputedThumbnailsDocumentProcessor
    • PrintfValues
    • PublicUrlProcessor
    • RealTimeAlerting
    • RemoteHTTPTransformer
    • RemoteMOTAPIDocumentProcessor
    • RemoveContexts
    • RenameContext
    • RenameUnmappedContexts
    • ReplaceContextNames
    • ReplaceRegexp
    • ReplaceValues
    • SemanticPipeDocumentProcessor
    • SetDefaultValue
    • SimilarStringToPart
    • SingleContextDocumentProcessor
    • SplitValues
    • StandardPartsMerger
    • StorageServiceDocumentProcessor
    • StringHash
    • StringHash32
    • StringHash64
    • StringTransform
    • TextToNum
    • URLCodec
    • URLTransformer
    • UTF8Checker
    • UniformRandomContextGenerator
    • UnitsOfMeasurementNormalizer
    • ValueSelector
    • WildcardIndexing
    • XpathExtractor
    • XpathFragmentExtractor
    • ZipfRandomContextGenerator
    Attributes
    • className string
    Name Type Default value Description
    className string - The DataModel class that the document must have for the condition to match.

    XML representation
    <DataModelClassCondition className="" ></DataModelClassCondition>
  • CustomDirectiveCondition com.exalead.indexing.analysis.v10.CustomDirectiveCondition
    A condition that matches if the document has the specified directive name, with an optional specific value.
    Parent objects
    • AnalysisPipeline
    • AndCondition
    • CGRDocumentProcessor
    • ConcatValues
    • ContentCleanup
    • ConvertTextExtractor
    • CoordinatesFormatter
    • CopyContext
    • CustomDocumentProcessor
    • DataModelClassResolver
    • DateFormatter
    • DebugCrashProcessor
    • DebugProcessor
    • DiscardDocument
    • DocumentProcessor
    • DocumentProcessorGroup
    • DoubleToLong
    • FixedRangeNumericalPartitioning
    • ForcedRangeNumericalPartitioning
    • FormatCheckerDate
    • GeoBBoxProcessor
    • GeoCategorizer
    • HTMLCSSExtractor
    • HTMLCSSSelector
    • HTMLRelevantContentExtractor
    • HTMLTableExtractor
    • InferFileExtension
    • InsertCurrentDate
    • JavaDocumentProcessor
    • JavaProcessor
    • JavaScriptProcessor
    • LanguageDetector
    • LanguageSetter
    • MIMEDetector
    • MathDocumentProcessor
    • MetaFinder
    • MimeTypeSetter
    • MultiContextCSVEncoder
    • MultiContextDocumentProcessor
    • NativeTextExtractor
    • NewChunk
    • NotCondition
    • NumericalFormatter
    • OrCondition
    • PLMExpandDocumentProcessor
    • PrecomputedThumbnailsDocumentProcessor
    • PrintfValues
    • PublicUrlProcessor
    • RealTimeAlerting
    • RemoteHTTPTransformer
    • RemoteMOTAPIDocumentProcessor
    • RemoveContexts
    • RenameContext
    • RenameUnmappedContexts
    • ReplaceContextNames
    • ReplaceRegexp
    • ReplaceValues
    • SemanticPipeDocumentProcessor
    • SetDefaultValue
    • SimilarStringToPart
    • SingleContextDocumentProcessor
    • SplitValues
    • StandardPartsMerger
    • StorageServiceDocumentProcessor
    • StringHash
    • StringHash32
    • StringHash64
    • StringTransform
    • TextToNum
    • URLCodec
    • URLTransformer
    • UTF8Checker
    • UniformRandomContextGenerator
    • UnitsOfMeasurementNormalizer
    • ValueSelector
    • WildcardIndexing
    • XpathExtractor
    • XpathFragmentExtractor
    • ZipfRandomContextGenerator
    Attributes
    • directiveName string
    • directiveValue string
    Name Type Default value Description
    directiveName string - The expected directive name
    directiveValue string - An optional expected value for the given directive

    XML representation
    <CustomDirectiveCondition directiveName="" ></CustomDirectiveCondition>
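    As an illustration of the optional value check, here is a minimal Python sketch. It assumes the directives of a document are available as a plain dictionary; the function name custom_directive_matches and that representation are hypothetical, not part of the product.

```python
def custom_directive_matches(directives: dict, directive_name: str,
                             directive_value=None) -> bool:
    """Match when the directive exists and, if a value is expected, when it is equal."""
    if directive_name not in directives:
        return False
    # directiveValue is optional: without it, the presence of the directive is enough.
    return directive_value is None or directives[directive_name] == directive_value
```

    With directives = {"datamodel_class": "invoice"}, the condition matches for directiveName "datamodel_class" alone, and also when directiveValue is "invoice", but not when directiveValue is "order".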
  • LanguageDetector com.exalead.indexing.analysis.v10.LanguageDetector
    Language detection is performed using the text of all the DocumentChunks associated with the specified input ContextNames for which language was not already detected or specified.

    The whole text of all these DocumentChunks is taken into account by a statistical algorithm that detects the language.

    This language is then set as the language for all specified chunks.
    For example, the language attribute of a DocumentChunk is used by semantic processing.

    A language is represented by its ISO 639-1 code, for example fr or en.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • languageContext string
    • languagesToDetect string
    • defaultLanguage string
    • exclude boolean
    • outputContext string
    • minLangPercentage int
    • languagesToKeep int
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    languageContext string - If this is not null and if there is a DocumentChunk with a ContextName matching 'languageContext':
    • no automatic detection will be performed,
    • the language specified will be used as the language of the DocumentChunks associated with the ContextNames specified as input.
    languagesToDetect string - If not null, restrict the language detector to a set of languages.

    If you only have a small set of languages to detect, you can restrict language detector to this set to improve precision.

    The list is comma-separated, for example "en,fr".
    defaultLanguage string - If not null, 'defaultLanguage' will be used as the default language when automatic detection fails.
    exclude boolean False If true, "inputContexts" is an exclude list instead of an include list.

    Language detection is then performed on all DocumentChunks except those whose ContextName appears in 'inputContexts'.
    outputContext string - ContextName of the DocumentChunk to create.

    It will contain the language detected in the processed DocumentChunks as defined in ISO 639-1.
    minLangPercentage int 33 Minimum percentage (0 to 100) that a language must reach to be detected (0 = always keeps a detected language).
    languagesToKeep int 0 Keeps the n most represented languages in the document. A value of 0 lets the minLangPercentage select the languages.

    Nested elements
    • inputContexts StringValue*
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    StringValue* inputContexts The processor will only be applied to DocumentChunks with a ContextName specified in this list.
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <LanguageDetector name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" languageContext="" languagesToDetect="" defaultLanguage="" exclude="false" outputContext="" minLangPercentage="33" languagesToKeep="0" ></LanguageDetector>
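    The interplay between minLangPercentage, languagesToKeep and defaultLanguage can be sketched in Python as follows. This is only one reading of the attribute descriptions above: the function select_languages and its shares input (a mapping from language code to percentage of the text) are hypothetical, and the use of "greater than or equal" for the threshold is an assumption.

```python
def select_languages(shares: dict, min_lang_percentage: int = 33,
                     languages_to_keep: int = 0, default_language=None) -> list:
    """Pick the languages to keep from per-language percentages."""
    ranked = sorted(shares.items(), key=lambda kv: kv[1], reverse=True)
    if languages_to_keep > 0:
        # Keep the n most represented languages.
        kept = [lang for lang, _ in ranked[:languages_to_keep]]
    else:
        # A value of 0 lets minLangPercentage select the languages.
        kept = [lang for lang, pct in ranked if pct >= min_lang_percentage]
    if not kept and default_language is not None:
        kept = [default_language]      # fallback when detection yields nothing
    return kept
```

    For a document measured as {"en": 60, "fr": 35, "de": 5}, the defaults would keep en and fr, while languages_to_keep=1 would keep en only.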
  • LanguageSetter com.exalead.indexing.analysis.v10.LanguageSetter
    The language is set as the language for all the DocumentChunks associated with the specified input ContextNames.

    For example, the language attribute of a DocumentChunk is used by semantic processing.

    The language is represented by its ISO 639-1 code, for example fr or en.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • language iso code
    • outputContext string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    language iso code - Language specified by ISO 639-1 code.
    outputContext string - ContextName of the DocumentChunk to create.

    It will contain the language code as defined in ISO 639-1.

    Nested elements
    • inputContexts StringValue*
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    StringValue* inputContexts The processor will only be applied to DocumentChunks with a ContextName specified in this list.
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <LanguageSetter name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" language="en" outputContext="" ></LanguageSetter>
  • ContentCleanup com.exalead.indexing.analysis.v10.ContentCleanup
    Analyzes each DocumentChunk and performs whitespace removal, where whitespace is defined by the Unicode specification.

    This includes ' ', '\r', and '\n'.

    Input: All DocumentChunks associated with the specified 'inputContext' ContextNames.

    Output: Same as input
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • duplicateWhitespaces boolean
    • leading boolean
    • trailing boolean
    • spaces boolean
    • stripHTML boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    duplicateWhitespaces boolean False Removes duplicate whitespace ('  ' -> ' ').
    leading boolean False Removes the leading whitespaces
    trailing boolean False Removes the trailing whitespaces
    spaces boolean False Removes *all* whitespaces.
    stripHTML boolean False Strips HTML tags

    Nested elements
    • inputContexts StringValue*
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    StringValue* inputContexts The processor will only be applied to DocumentChunks with a ContextName specified in this list.
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <ContentCleanup name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" duplicateWhitespaces="false" leading="false" trailing="false" spaces="false" stripHTML="false" ></ContentCleanup>
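    The effect of the five cleanup flags can be approximated in Python as below. This is a sketch under assumptions: the function name content_cleanup, the order in which the flags are applied, the naive regular expression used for tag stripping, and the choice to collapse any run of whitespace into a single space are all illustrative. Python's \s on text strings covers Unicode whitespace, which is the only reason it is used here.

```python
import re


def content_cleanup(text: str, duplicate_whitespaces: bool = False,
                    leading: bool = False, trailing: bool = False,
                    spaces: bool = False, strip_html: bool = False) -> str:
    """Apply the enabled cleanup steps to the text of one chunk."""
    if strip_html:
        text = re.sub(r"<[^>]+>", "", text)      # naive tag stripping (assumption)
    if spaces:
        return re.sub(r"\s+", "", text)          # remove *all* whitespace
    if duplicate_whitespaces:
        text = re.sub(r"\s{2,}", " ", text)      # collapse runs into one space
    if leading:
        text = text.lstrip()
    if trailing:
        text = text.rstrip()
    return text
```

    With every flag left at its default of False, the text is returned unchanged, which mirrors the default values listed in the table above.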
  • ValueSelector com.exalead.indexing.analysis.v10.ValueSelector
    Takes the input contexts in the specified order, and as soon as one is found, it copies the content to the output context and stops.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - ContextName to be associated with the DocumentChunk created for each selection.

    Nested elements
    • inputContexts StringValue*
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    StringValue* inputContexts The processor will only be applied to DocumentChunks with a ContextName specified in this list.
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <ValueSelector name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" ></ValueSelector>
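    The "first context found wins" behavior of ValueSelector can be sketched in Python. The representation of a document as a list of (ContextName, value) pairs and the function name value_selector are hypothetical; copying every chunk of the first context found is also an assumption.

```python
def value_selector(chunks: list, input_contexts: list, output_context: str) -> list:
    """Copy the values of the first input context present to output_context."""
    for context in input_contexts:               # contexts are tried in the given order
        found = [value for name, value in chunks if name == context]
        if found:
            # Stop at the first context that exists in the document.
            return chunks + [(output_context, value) for value in found]
    return chunks                                # none found: nothing is created
```

    This pattern is typically useful for fallbacks, for example taking a title when it exists and a subject otherwise.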
  • UTF8Checker com.exalead.indexing.analysis.v10.UTF8Checker
    Checks that the text passing through is valid UTF-8.
    Emits a warning with the document URI and the context name if input is malformed.
    Optionally deletes invalid chunks.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • deleteInvalidChunks boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    deleteInvalidChunks boolean False Removes invalid chunks from documents.

    Nested elements
    • inputContexts StringValue*
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    StringValue* inputContexts The processor will only be applied to DocumentChunks with a ContextName specified in this list.
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <UTF8Checker name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" deleteInvalidChunks="false" ></UTF8Checker>
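    A comparable check can be written in a few lines of Python, shown here as a sketch of the described behavior rather than the actual processor. The function name utf8_check and the (ContextName, raw bytes) chunk representation are hypothetical.

```python
import logging


def utf8_check(uri: str, chunks: list, delete_invalid_chunks: bool = False) -> list:
    """Warn about chunks that are not valid UTF-8 and optionally drop them."""
    kept = []
    for context, raw in chunks:
        try:
            raw.decode("utf-8", errors="strict")
        except UnicodeDecodeError:
            # The warning carries the document URI and the context name.
            logging.warning("Invalid UTF-8 in document %s, context %s", uri, context)
            if delete_invalid_chunks:
                continue                         # drop the malformed chunk
        kept.append((context, raw))
    return kept
```

    With delete_invalid_chunks left at False, malformed chunks are reported but kept, matching the default value of deleteInvalidChunks.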
  • ConcatValues com.exalead.indexing.analysis.v10.ConcatValues
    Concatenates all textual content of DocumentChunks where ContextName matches 'inputContexts', and joins them with the 'join' string.

    A single DocumentChunk with ContextName 'outputContext' is created as an output.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    • join string
    • strict boolean
    • allowDuplicates boolean
    • cartesianProduct boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - ContextName to be associated with the DocumentChunk created for each concatenated value.
    join string - Optional string inserted between concatenated values.
    strict boolean True Requires all the input contexts to be found in order to generate the concatenation.
    allowDuplicates boolean True If true, and if there are multiple DocumentChunks with the same ContextName, it concatenates them all.

    If false, only the first DocumentChunk among all those with the same ContextName is kept.
    cartesianProduct boolean False If there are multiple DocumentChunks with the same ContextName, it generates the cartesian product between all values.

    Nested elements
    • inputContexts StringValue*
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    StringValue* inputContexts The processor will only be applied to DocumentChunks with a ContextName specified in this list.
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <ConcatValues name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" join="" strict="true" allowDuplicates="true" cartesianProduct="false" ></ConcatValues>
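    The following Python sketch shows one possible reading of how join, strict, allowDuplicates and cartesianProduct combine. It is an interpretation, not the product's algorithm: the function concat_values and the (ContextName, value) pair representation are hypothetical, and strict is assumed to mean that a missing input context cancels the output.

```python
from itertools import product


def concat_values(chunks: list, input_contexts: list, join: str = "",
                  strict: bool = True, allow_duplicates: bool = True,
                  cartesian_product: bool = False) -> list:
    """Return the text of each output chunk produced by the concatenation."""
    groups = []
    for context in input_contexts:
        values = [value for name, value in chunks if name == context]
        if not values:
            if strict:
                return []                        # assumed: every input context is required
            continue
        if not allow_duplicates:
            values = values[:1]                  # keep only the first chunk of a context
        groups.append(values)
    if not groups:
        return []
    if cartesian_product:
        # One output per combination of values taken across the contexts.
        return [join.join(combo) for combo in product(*groups)]
    return [join.join(value for values in groups for value in values)]
```

    For a document with chunks first="Ada", last="Lovelace" and last="King" and a join of " ", the default settings give a single value "Ada Lovelace King", whereas cartesian_product=True gives "Ada Lovelace" and "Ada King".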
  • RemoveContexts com.exalead.indexing.analysis.v10.RemoveContexts
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor

    Nested elements
    • inputContexts StringValue*
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    StringValue* inputContexts The processor will only be applied to DocumentChunks with a ContextName specified in this list.
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <RemoveContexts name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></RemoveContexts>
  • MultiContextCSVEncoder com.exalead.indexing.analysis.v10.MultiContextCSVEncoder
    Creates a DocumentChunk containing the ContextName and the textual value of the DocumentChunks matching 'inputContexts'.

    This processor can be used, for instance, to store arbitrary (key,value) pairs into one single index field.

    Note that this storing method is inefficient and should be used with caution.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    • processUnmappedContexts boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - The ContextName used for newly created chunks.
    processUnmappedContexts boolean False All DocumentChunks with an unmapped ContextName in the document will be used for input.

    This can be used to emulate the 'default meta' and 'content' field feature of CloudView 4.6.

    Nested elements
    • inputContexts StringValue*
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    StringValue* inputContexts The processor will only be applied to DocumentChunks with a ContextName specified in this list.
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <MultiContextCSVEncoder name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" processUnmappedContexts="false" ></MultiContextCSVEncoder>
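    To give an idea of what packing (key,value) pairs into a single field looks like, here is a Python sketch based on the standard csv module. The exact encoding used by MultiContextCSVEncoder is not described in this reference, so the one-pair-per-line layout, the function name encode_contexts and the pair representation of chunks are assumptions.

```python
import csv
import io


def encode_contexts(chunks: list, input_contexts: set) -> str:
    """Serialize the selected (ContextName, value) pairs as CSV text."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    for name, value in chunks:
        if name in input_contexts:
            writer.writerow([name, value])       # values with commas are quoted
    return buffer.getvalue()
```

    The encoded text can be read back with csv.reader, but each lookup has to parse the whole field, which illustrates why this storage method is flagged as inefficient.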
  • StringHash com.exalead.indexing.analysis.v10.StringHash
    The StringHash processor computes a signed hash of the textual input value.

    For example, this value can be used in a field used for grouping.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • nbBits int
    • outputContext string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    nbBits int 64 The size of the hash, in bits, including the sign bit.

    The hash values will be in [-2^(nbBits-1); 2^(nbBits-1) - 1].
    outputContext string - The ContextName used for the newly created chunk.

    Nested elements
    • inputContexts StringValue*
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    StringValue* inputContexts The processor will only be applied to DocumentChunks with a ContextName specified in this list.
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <StringHash name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" nbBits="64" outputContext="" ></StringHash>
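    The range given for nbBits can be checked with a short Python sketch. The hash function actually used by StringHash is not documented here, so SHA-256 is used purely as a stand-in, and the function name string_hash is hypothetical. Only the signed nbBits-wide range is meant to be illustrated.

```python
import hashlib


def string_hash(text: str, nb_bits: int = 64) -> int:
    """Return a signed hash of 'text' that fits in nb_bits bits (1 to 64)."""
    digest = hashlib.sha256(text.encode("utf-8")).digest()   # stand-in hash function
    # Keep the top nb_bits bits of the first 64 bits of the digest.
    value = int.from_bytes(digest[:8], "big") >> (64 - nb_bits)
    # Reinterpret the unsigned value as two's complement, sign bit included.
    if value >= 1 << (nb_bits - 1):
        value -= 1 << nb_bits
    return value
```

    With nb_bits=32 the result always lies in [-2^31; 2^31 - 1], and with the default of 64 in [-2^63; 2^63 - 1].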
  • StringHash64 com.exalead.indexing.analysis.v10.StringHash64
    The StringHash64 processor computes a signed 64-bit hash of the textual input value.

    For example, this value can be used in a field used for grouping.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - The ContextName used for the newly created chunk.

    Nested elements
    • inputContexts StringValue*
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    StringValue* inputContexts The processor will only be applied to DocumentChunks with a ContextName specified in this list.
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <StringHash64 name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" ></StringHash64>
  • StringHash32 com.exalead.indexing.analysis.v10.StringHash32
    The StringHash32 processor computes a signed 32-bit hash of the textual input value.

    For example, this value can be used in a field used for grouping.
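    As an illustration of what a signed 32-bit hash value looks like, the sketch below folds an unsigned CRC-32 into the signed 32-bit range. The CRC-32 function and the helper name `signed_hash32` are assumptions made for this example; this reference does not state which hash function the processor actually uses.

```python
import zlib


def signed_hash32(text: str) -> int:
    """Return a signed 32-bit hash of `text`.

    CRC-32 is only a stand-in here (assumption): the hash function used by
    the StringHash32 processor is not documented in this reference.
    """
    unsigned = zlib.crc32(text.encode("utf-8")) & 0xFFFFFFFF
    # Reinterpret the unsigned value as a two's-complement signed integer.
    return unsigned - 0x100000000 if unsigned >= 0x80000000 else unsigned


# Equal strings always produce the same value, which is what makes the
# hash usable as a compact grouping key.
group_key = signed_hash32("http://example.com/page")
```

    The value always falls between -2^31 and 2^31 - 1, so it fits in a signed 32-bit numerical field.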
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - The ContextName used for the newly created chunk.

    Nested elements
    • inputContexts StringValue*
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    StringValue* inputContexts The processor will only be applied to DocumentChunks with a ContextName specified in this list.
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <StringHash32 name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" ></StringHash32>
  • NumericalFormatter com.exalead.indexing.analysis.v10.NumericalFormatter
    The Numerical Formatter processor creates valid numerical chunks from various number formats.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    • precision int
    • round int
    • removeTrailingZeros boolean
    • groupSeparator string
    • decimalSeparator string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - The ContextName used for the newly created chunk.

    If null, it uses the same name as the input.
    precision int 0 Number of digits relevant in the decimal part.
    round int 0 Rounds the integer part with this range.
    removeTrailingZeros boolean True Removes the trailing zeros in the decimal part.
    groupSeparator string - Group separator.
    decimalSeparator string . Decimal separator.
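    The sketch below shows one plausible reading of how these attributes could combine to normalize a number. It is an illustration only: the function name `format_number`, the half-up rounding mode, and the treatment of `precision` as a number of decimal digits to keep are assumptions, and the `round` attribute is left out because its exact semantics are not described here.

```python
from decimal import Decimal, ROUND_HALF_UP


def format_number(raw: str, group_separator: str = "", decimal_separator: str = ".",
                  precision: int = 0, remove_trailing_zeros: bool = True) -> str:
    """Normalize a localized number string (hypothetical helper, not the product code)."""
    text = raw.strip()
    if group_separator:
        # Drop group separators such as the "." in "1.234,50".
        text = text.replace(group_separator, "")
    if decimal_separator != ".":
        # Use "." as the decimal separator so Decimal can parse the value.
        text = text.replace(decimal_separator, ".")
    value = Decimal(text)
    # Keep `precision` digits in the decimal part (rounding mode is assumed).
    quantum = Decimal(1).scaleb(-precision)
    value = value.quantize(quantum, rounding=ROUND_HALF_UP)
    out = format(value, "f")
    if remove_trailing_zeros and "." in out:
        out = out.rstrip("0").rstrip(".")
    return out
```

    Under these assumptions, `format_number("1.234,50", group_separator=".", decimal_separator=",", precision=2)` returns `"1234.5"`, and `"1234.50"` when `remove_trailing_zeros` is false.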

    Nested elements
    • inputContexts StringValue*
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    StringValue* inputContexts The processor will only be applied to DocumentChunks with a ContextName specified in this list.
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <NumericalFormatter name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" precision="0" round="0" removeTrailingZeros="true" ></NumericalFormatter>
  • CoordinatesFormatter com.exalead.indexing.analysis.v10.CoordinatesFormatter
    The Coordinates Formatter processor creates a normalized chunk for the latitude and longitude.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    • latitudeContext string
    • latitudeFormat enum(DMS, Decimal)
    • longitudeContext string
    • longitudeFormat enum(DMS, Decimal)
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - The ContextName used for the newly created chunk.
    latitudeContext string - The ContextName used as input for the latitude
    latitudeFormat enum(DMS, Decimal) The input format for the latitude
    Value can be one of
    • DMS
    • Decimal
    longitudeContext string - The ContextName used as input for the longitude
    longitudeFormat enum(DMS, Decimal) The input format for the longitude
    Value can be one of
    • DMS
    • Decimal
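    To make the difference between the two input formats concrete, the sketch below converts a DMS (degrees, minutes, seconds) value to decimal degrees with the standard formula degrees + minutes/60 + seconds/3600. The accepted DMS spelling (for example `48°51'24"N`) and the helper name `dms_to_decimal` are assumptions; the exact input syntax and the layout of the normalized output chunk are not described in this reference.

```python
import re

# Degrees, minutes, seconds and a hemisphere letter, separated by any
# non-digit characters (assumed input spelling).
_DMS = re.compile(r"^\s*(\d+)\D+(\d+)\D+(\d+(?:\.\d+)?)\D*?([NSEW])\s*$", re.IGNORECASE)


def dms_to_decimal(dms: str) -> float:
    """Convert a DMS coordinate to signed decimal degrees (hypothetical helper)."""
    match = _DMS.match(dms)
    if match is None:
        raise ValueError(f"not a DMS coordinate: {dms!r}")
    degrees, minutes, seconds, hemisphere = match.groups()
    value = int(degrees) + int(minutes) / 60 + float(seconds) / 3600
    # Southern latitudes and western longitudes are negative.
    return -value if hemisphere.upper() in ("S", "W") else value
```

    With this helper, `dms_to_decimal("48°51'24\"N")` is approximately 48.8567.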

    Nested elements
    • inputContexts StringValue*
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    StringValue* inputContexts The processor will only be applied to DocumentChunks with a ContextName specified in this list.
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <CoordinatesFormatter name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" latitudeContext="" latitudeFormat="" longitudeContext="" longitudeFormat="" ></CoordinatesFormatter>
  • DebugProcessor com.exalead.indexing.analysis.v10.DebugProcessor
    Dumps all the DocumentChunks whose ContextName is listed in 'inputContexts' to standard output.

    This provides a log of the 'Analysis' process.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • dump boolean
    • outputContext string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    dump boolean True
    outputContext string - The ContextName used for the newly created chunk.

    Nested elements
    • inputContexts StringValue*
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    StringValue* inputContexts The processor will only be applied to DocumentChunks with a ContextName specified in this list.
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <DebugProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" dump="true" ></DebugProcessor>
  • RemoteMOTAPIDocumentProcessor com.exalead.indexing.analysis.v10.RemoteMOTAPIDocumentProcessor
    The processing of each input context is handled by the targeted remote API.
    • targetBuildGroups: list of build groups that should be used to handle processing.
    • remoteMOTAPIConfigName: name of the RemoteMOTAPIConfig object, as seen in the RemoteMOTAPIConfig.xml high-level configuration file.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • remoteMOTAPIConfigName string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    remoteMOTAPIConfigName string -

    Nested elements
    • inputContexts StringValue*
    • fromDataModel DocumentProcessor
    • targetInstances StringValue*
    • AcceptCondition
    Type Wrapper Description
    StringValue* inputContexts The processor will only be applied to DocumentChunks with a ContextName specified in this list.
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    StringValue* targetInstances
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <RemoteMOTAPIDocumentProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" remoteMOTAPIConfigName="" ></RemoteMOTAPIDocumentProcessor>
  • StringTransform com.exalead.indexing.analysis.v10.StringTransform
    Applies textual transformations on chunks from several contexts:
    • trims blanks at the beginning and end of chunks
    • reduces sequences of blanks to just one
    • changes text to uppercase/lowercase/normalized/capitalized

    Outputs replace inputs.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • spaces string
    • form string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    spaces string - What to do with spaces ("trim" or "normalize-spaces", default set to nothing)
    form string - What transformation to apply ("lowercase", "uppercase", "normalized", "capitalized", default set to nothing)
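    The effect of the `spaces` and `form` attributes can be sketched as follows. This is an illustration, not the product implementation: the function name `string_transform` is hypothetical, "capitalized" is assumed to mean "first letter uppercase, rest lowercase", "normalize-spaces" is assumed to collapse runs of blanks without trimming, and the "normalized" form is left out because this reference does not say what it does.

```python
import re


def string_transform(value: str, spaces: str = "", form: str = "") -> str:
    """Apply the space handling and case transformation to one chunk value."""
    if spaces == "trim":
        # Remove blanks at the beginning and end of the chunk.
        value = value.strip()
    elif spaces == "normalize-spaces":
        # Reduce every sequence of blanks to a single space.
        value = re.sub(r"\s+", " ", value)

    if form == "lowercase":
        value = value.lower()
    elif form == "uppercase":
        value = value.upper()
    elif form == "capitalized":
        value = value.capitalize()  # assumed meaning of "capitalized"
    elif form == "normalized":
        # Semantics not described in this reference, so not sketched here.
        raise NotImplementedError("'normalized' is not covered by this sketch")
    return value
```

    An empty `spaces` or `form` leaves the value untouched, matching the "default set to nothing" behavior described above.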

    Nested elements
    • inputContexts StringValue*
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    StringValue* inputContexts The processor will only be applied to DocumentChunks with a ContextName specified in this list.
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <StringTransform name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" spaces="" form="" ></StringTransform>
  • ReplaceValues com.exalead.indexing.analysis.v10.ReplaceValues
    The ReplaceValues processor compares all DocumentChunks for a given inputContext with the specified KeyValue map.

    When the DocumentChunk value is an exact match, it is replaced by the specified string.

    This processor can be used, for instance, to normalize different spellings of document metadata values.
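    The exact-match replacement described above behaves like a dictionary lookup with a fallback to the original value. The sketch below models chunks as plain strings and the KeyValue map as a Python dict; the function name `replace_values` and the case-sensitive comparison are assumptions.

```python
def replace_values(chunk_values, key_values):
    """Replace each chunk value that exactly matches a key; keep the others as they are."""
    return [key_values.get(value, value) for value in chunk_values]


# Hypothetical mapping that normalizes two spellings of the same country.
country_map = {"USA": "United States", "U.S.A.": "United States"}
normalized = replace_values(["USA", "U.S.A.", "France", "usa"], country_map)
```

    Only full-value matches are replaced: "France" and "usa" are left unchanged because neither is a key of the map.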
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • inputContext string
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    Name Type Default value Description
    inputContext string - The processor will only be applied to DocumentChunks with this ContextName.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    • KeyValue*
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    KeyValue* -
    XML representation
    <ReplaceValues inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></ReplaceValues>
  • PublicUrlProcessor com.exalead.indexing.analysis.v10.PublicUrlProcessor
    For each input DocumentChunk associated with the 'inputContext' ContextName, 4 DocumentChunks are created, each associated with a different ContextName:
    • 'treeOutputContext'
    • 'leafOutputContext'
    • 'urlOutputContext'
    • 'urlPathOutputContext'
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • inputContext string
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • treeOutputContext string
    • leafOutputContext string
    • urlOutputContext string
    • urlPathOutputContext string
    • maxPathDepth int
    Name Type Default value Description
    inputContext string - The processor will only be applied to DocumentChunks with this ContextName.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    treeOutputContext string - The ContextName for the DocumentChunk created from the category path encoding the web site tree.
    leafOutputContext string - The ContextName for the DocumentChunks created from the complete, normalized URL.
    urlOutputContext string - The ContextName for the DocumentChunk created from the complete, normalized URL.
    urlPathOutputContext string - The ContextName for the DocumentChunk created from the normalized URL.
    maxPathDepth int 4 Maximum depth of the URL path.

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <PublicUrlProcessor inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" treeOutputContext="" leafOutputContext="" urlOutputContext="" urlPathOutputContext="" maxPathDepth="4" ></PublicUrlProcessor>
  • DateFormatter com.exalead.indexing.analysis.v10.DateFormatter
    If a document chunk matches either:
    • a custom input format defined with UNIX date syntax (for example, %Y/%m/%d-%H:%M:%S)
    • one of the automatically recognized date formats
    the Date Formatter generates three additional document chunks, each with its own context name, using the following naming convention:
    • $inputContext$dateTimeOutputContext (Default format: %Y/%m/%d-%H:%M:%S)
    • $inputContext$dateOutputContext (Default format: %Y/%m/%d)
    • $inputContext$timeOutputContext (Default format: %H:%M:%S)
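    The naming convention above can be sketched with Python's `datetime` module, whose `strptime` and `strftime` accept the same kind of format directives. This is only an illustration of the explicit-format path: the function name `date_formatter` and the suffix values `_datetime`, `_date` and `_time` are hypothetical, and the automatic date recognition is not modeled.

```python
from datetime import datetime


def date_formatter(input_context: str, value: str,
                   input_format: str = "%Y/%m/%d-%H:%M:%S",
                   suffixes=("_datetime", "_date", "_time")) -> dict:
    """Return the three output chunks as {context name: formatted value}.

    The suffixes play the role of dateTimeOutputContext, dateOutputContext
    and timeOutputContext; their values here are made up for the example.
    """
    parsed = datetime.strptime(value, input_format)
    datetime_suffix, date_suffix, time_suffix = suffixes
    return {
        # Output context name = input context name + suffix.
        input_context + datetime_suffix: parsed.strftime("%Y/%m/%d-%H:%M:%S"),
        input_context + date_suffix: parsed.strftime("%Y/%m/%d"),
        input_context + time_suffix: parsed.strftime("%H:%M:%S"),
    }


chunks = date_formatter("lastmodified", "2014/08/20-17:18:16")
```

    Here `chunks` maps `lastmodified_datetime` to `2014/08/20-17:18:16`, `lastmodified_date` to `2014/08/20`, and `lastmodified_time` to `17:18:16`.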
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • inputContext string
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • dateTimeOutputContext string
    • dateTimeOutputFormat string
    • dateOutputContext string
    • dateOutputFormat string
    • timeOutputContext string
    • timeOutputFormat string
    • inputFormat string
    • removeOriginalChunk boolean
    Name Type Default value Description
    inputContext string - The processor will only be applied to DocumentChunks with this ContextName.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    dateTimeOutputContext string - Suffix for the name of the DocumentChunk containing the date as defined by dateTimeOutputFormat (default YYYY/MM/DD-HH:MM:SS).

    The original ContextName of the input DocumentChunk and this suffix are concatenated ($orig$dateTimeOutputContext) to produce the ContextName actually used.

    This DocumentChunk is usually used for date display.
    dateTimeOutputFormat string - A date and time output format compliant with libc's strftime.
    dateOutputContext string - Suffix for the name of the DocumentChunk containing the date as defined by dateOutputFormat (default YYYY/MM/DD).

    The original name of the input DocumentChunk and this suffix are concatenated ($orig$dateOutputContext) to produce the name actually used.

    This DocumentChunk is usually remapped to a category for navigation.
    dateOutputFormat string - A date output format compliant with libc's strftime.
    timeOutputContext string - Suffix for the name of the DocumentChunk containing the time as defined by timeOutputFormat (default HH:MM:SS).

    The original name of the input DocumentChunk and this suffix are concatenated ($orig$timeOutputContext) to produce the name actually used.
    timeOutputFormat string - A time output format compliant with libc's strftime.
    inputFormat string - An optional date input format, compliant with libc's strptime() format.

    If such a format is provided, the automatic date format heuristic is disabled, and the provided date format is used exclusively.

    Documentation of accepted formats: (days and month literals are only recognized in English)
    • Day
      • %a: weekday abbreviated ("Mon", ...)
      • %A: weekday full ("Monday", ...)
      • %d: day of the month, zero filled [01-31]
      • %e: Equivalent to %d [1-31]
      • %j: day of the year, zero filled [001-366]
      • %u: day of the week as a decimal number [1-7], with 1 representing Monday and 7 representing Sunday
      • %w: day of week as a decimal number [0,6], with 0 representing Sunday
    • Week
      • %U: week number of the year (Sunday as first day of the week) as a decimal number [00,53]
      • %W: week number of the year (Monday as the first day of the week) as a decimal number [00,53]
      • %V: week of the year [01-53]
    • Month
      • %m: the month number [01-12]
      • %b: month locale abbreviated ("Aug", ...)
      • %h: equivalent to %b
      • %B: locale's full month, variable length ("August")
    • Year
      • %y: The year within the century with two-digit dates, for example [69,99] is mapped to [1969,1999] and [00,68] is mapped to [2000,2068]
      • %Y: The year, including the century (for example, 2014)
      • %g: last two digits of year of ISO week number (see %G)
      • %G: year of ISO week number (see %V), for example, 2014; normally useful only with %V
    • Century
      • %C: The century number [00,99]
    • Date
      • %D: Equivalent to mm/dd/yy (08/20/14)
      • %x: locale's date representation (mm/dd/yy), for example, 08/20/14
      • %F: %Y-%m-%d (2014-08-20)
    • Hours
      • %l: hour (12-hour clock), for example, [1-12]
      • %I: hour (12-hour clock) zero filled, [01-12]
      • %k: hour (24 hour), for example, 17
      • %H: hour (24 hour) zero padded, 17
      • %p: locale's upper case AM or PM (blank in many locales), for example, PM
      • %P: locale's lower case am or pm, for example, pm
    • Minutes
      • %M: The minute [00-59]
    • Seconds
      • %s: seconds since 00:00:00 1970-01-01 UTC (Unix epoch), for example, 1345483096
      • %S: seconds [00-60], (The 60 is necessary to accommodate a leap second)
    • Time
      • %r: hours, minutes, seconds (12-hour clock), for example, 05:18:16 PM
      • %R: hours, minutes (24-hour clock), for example, 17:18
      • %T: hours, minutes, seconds (24-hour clock), for example, 17:18:16
      • %X: locale's time representation, for example, 11:07:26 AM
      • %p: AM or PM
    • Date and Time
      • %c: locale's date and time, for example, Sat Nov 04 12:02:33 EST 1989
    • Others
      • %n: Any white space
      • %t: Any white space
      • %%: Replaced by %
    removeOriginalChunk boolean True Removes the original input chunk.

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <DateFormatter inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" dateTimeOutputContext="" dateTimeOutputFormat="" dateOutputContext="" dateOutputFormat="" timeOutputContext="" timeOutputFormat="" inputFormat="" removeOriginalChunk="true" ></DateFormatter>
  • FormatCheckerDate com.exalead.indexing.analysis.v10.FormatCheckerDate
    The FormatCheckerDate processor checks that the chunk matches either:
    • a custom input format defined with UNIX date syntax (for example, %Y/%m/%d-%H:%M:%S)
    • one of the automatically recognized date formats
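    The check against a custom input format can be sketched as a parse attempt that either succeeds or fails. Python's `datetime.strptime` is used here as a stand-in for the strptime-style parsing; the function name `matches_date_format` is hypothetical, and what the processor does with a chunk that fails the check is not described in this reference.

```python
from datetime import datetime


def matches_date_format(value: str, input_format: str) -> bool:
    """Return True when `value` parses completely with `input_format`."""
    try:
        datetime.strptime(value, input_format)
    except ValueError:
        # Raised for a mismatched layout and for out-of-range fields (month 13, ...).
        return False
    return True


fmt = "%Y/%m/%d-%H:%M:%S"
ok = matches_date_format("2014/08/20-17:18:16", fmt)
bad_layout = matches_date_format("20 Aug 2014", fmt)
bad_month = matches_date_format("2014/13/01-00:00:00", fmt)
```

    In this sketch `ok` is true, while `bad_layout` and `bad_month` are both false.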
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • inputContext string
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • inputFormat string
    Name Type Default value Description
    inputContext string - The processor will only be applied to DocumentChunks with this ContextName.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    inputFormat string - An optional date input format, compliant with libc's strptime() format.

    If such a format is provided, the automatic date format heuristic is disabled, and the provided date format is used exclusively.

    Documentation of accepted formats: (days and month literals are only recognized in English)
    • %a: The day of the week ("Monday", ...)
    • %A: Equivalent to %a
    • %b: The month ("January", ...)
    • %B: Equivalent to %b
    • %c: Equivalent to %a %b %e %H:%M:%S %Y
    • %C: The century number [00,99]
    • %d: The day of the month [01,31]
    • %D: Equivalent to %m/%d/%y
    • %e: Equivalent to %d
    • %h: Equivalent to %b
    • %H: The hour (24-hour clock) [00,23]
    • %I: The hour (12-hour clock) [01,12]
    • %j: The day number of the year [001,366]
    • %m: The month number [01,12]
    • %M: The minute [00,59]
    • %n: Any white space
    • %p: AM or PM
    • %r: Equivalent to %I:%M:%S %p
    • %R: Equivalent to %H:%M
    • %S: The seconds [00,60]
    • %t: Any white space
    • %T: Equivalent to %H:%M:%S
    • %U: The week number of the year (Sunday as the first day of the week) as a decimal number [00,53]
    • %w: The weekday as a decimal number [0,6], with 0 representing Sunday
    • %W: The week number of the year (Monday as the first day of the week) as a decimal number [00,53]
    • %x: Equivalent to %m/%d/%y
    • %X: Equivalent to %H:%M:%S
    • %y: The year within century. (for two-digit dates, [69,99] is mapped to [1969,1999] and [00,68] is mapped to [2000,2068])
    • %Y: The year, including the century (for example, 1988)
    • %%: Replaced by %

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <FormatCheckerDate inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" inputFormat="" ></FormatCheckerDate>
  • SplitValues com.exalead.indexing.analysis.v10.SplitValues
    Splits the content of all DocumentChunks associated with the ContextName 'inputContext' using 'separator' as a separator regular expression.

    A new DocumentChunk is created for each segment, with 'outputContext' as the ContextName.
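
    The splitting behavior described above can be sketched in Python as follows. This is only an approximation: Python's re module stands in for the ASTL library, the split_values helper is hypothetical, and dropping empty segments is an assumption that this reference does not confirm.

```python
import re


def split_values(value: str, separator: str) -> list[str]:
    """Split value around every match of the separator regular expression."""
    # Empty segments are dropped here; whether CloudView keeps them is not
    # stated in this reference, so this is an assumption of the sketch.
    return [segment for segment in re.split(separator, value) if segment]


if __name__ == "__main__":
    # Each returned segment would become one chunk in 'outputContext'.
    print(split_values("red; green ;blue", r"\s*;\s*"))
```

    The separator in this example avoids capturing groups, assertions and lazy quantifiers, so it stays within the subset of Perl 5 syntax that the separator attribute supports.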
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • inputContext string
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    • separator string
    Name Type Default value Description
    inputContext string - The processor will only be applied to DocumentChunks with this ContextName.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - ContextName to be associated with the DocumentChunk created for each split segment.
    separator string - Separator around which to split.

    The ASTL library is used to perform regular expression matching.

    The regular expression language supported is Perl 5, WITHOUT support for:
    • assertions like \b, \B, ?=, ?!, ?<=, ?<!
    • backreferences \1, \2, ...
    • UNICODE escaping like \u0020 or \p{name}
    • non-greedy (lazy) repeat operators like ??, *?, +?

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <SplitValues inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" separator="" ></SplitValues>
  • RenameContext com.exalead.indexing.analysis.v10.RenameContext
    Each DocumentChunk with ContextName matching 'inputContext' is renamed with a ContextName 'outputContext'.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • inputContext string
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    Name Type Default value Description
    inputContext string - The processor will only be applied to DocumentChunks with this ContextName.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - The new ContextName for DocumentChunks with ContextName matching 'inputContext'.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <RenameContext inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" ></RenameContext>
  • CopyContext com.exalead.indexing.analysis.v10.CopyContext
    Copies all DocumentChunks with 'inputContext' as ContextName, and creates new DocumentChunks with the same score, language and part but with 'outputContext' as ContextName.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • inputContext string
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    • requiredAnnotation string
    • restrictValues string
    Name Type Default value Description
    inputContext string - The processor will only be applied to DocumentChunks with this ContextName.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - The ContextName used for newly created chunks.
    requiredAnnotation string - The name of the required annotation the chunk must have to be copied.

    If null, no special handling is done on annotations.
    restrictValues string - A regexp which values of the chunk must match to be copied to the output context.

    Values that don't match the regexp will not be copied.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <CopyContext inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" requiredAnnotation="" restrictValues="" ></CopyContext>
  • FixedRangeNumericalPartitioning com.exalead.indexing.analysis.v10.FixedRangeNumericalPartitioning
    Matches numerical values in a range. It transforms a numerical value into a matching range, based on a fixed range size.

    For example, with rangeSize = 100,
    • 101 -> 100_199
    • 234 -> 200_299

    It also works for negative numbers:
    • -20 -> -100_-1
    • 0 -> 0_99

    This helps to create categories (for navigation) from numerical values.
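
    The mapping in the examples above is consistent with flooring the value to a multiple of rangeSize. The Python sketch below reproduces those four examples; the fixed_range helper is hypothetical and is not taken from the CloudView implementation.

```python
def fixed_range(value: int, range_size: int = 1, separator: str = "_") -> str:
    """Return the label of the fixed-size range that contains value."""
    if range_size < 1:
        raise ValueError("range_size must be a positive integer")
    # Floor division rounds toward negative infinity, so -20 falls in -100..-1.
    low = (value // range_size) * range_size
    high = low + range_size - 1
    return f"{low}{separator}{high}"


if __name__ == "__main__":
    for sample in (101, 234, -20, 0):
        print(sample, "->", fixed_range(sample, range_size=100))
```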
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • inputContext string
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    • separator string
    • rangeSize long
    Name Type Default value Description
    inputContext string - The processor will only be applied to DocumentChunks with this ContextName.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - The ContextName used for newly created chunks.
    separator string _ The range separator.
    rangeSize long 1 The size of the range to consider.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <FixedRangeNumericalPartitioning inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" separator="_" rangeSize="1" ></FixedRangeNumericalPartitioning>
  • ForcedRangeNumericalPartitioning com.exalead.indexing.analysis.v10.ForcedRangeNumericalPartitioning
    Transforms a numerical value into the text value associated with its matching range, taken from a set of predetermined ranges specified in 'NumericalRange'.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • inputContext string
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    • separator string
    Name Type Default value Description
    inputContext string - The processor will only be applied to DocumentChunks with this ContextName.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - The ContextName used for newly created chunks.
    separator string _ The separator between the beginning and the end of the range.

    This parameter is deprecated.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    • NumericalRange*
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    NumericalRange* - The forced ranges.
    XML representation
    <ForcedRangeNumericalPartitioning inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" separator="_" ></ForcedRangeNumericalPartitioning>
  • NumericalRange com.exalead.indexing.analysis.v10.NumericalRange
    Associates text with a numerical range.

    The range includes all values >= beg and <= end (beg <= x <= end).
    A range corresponding to a unique value with beg = end is allowed.
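
    As a rough illustration of how ForcedRangeNumericalPartitioning could look up a value in a list of NumericalRange elements, consider the Python sketch below. The inclusive bounds come from the description above; returning the first matching range and returning None when no range matches are assumptions of this sketch, and the sample ranges are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class NumericalRange:
    beg: int   # lower bound, inclusive
    end: int   # upper bound, inclusive
    text: str  # text associated with the range


def forced_range(value: int, ranges: Sequence[NumericalRange]) -> Optional[str]:
    """Return the text of the first range containing value, or None."""
    for candidate in ranges:
        if candidate.beg <= value <= candidate.end:
            return candidate.text
    # Assumption: a value outside every range produces no output.
    return None


if __name__ == "__main__":
    price_ranges = [
        NumericalRange(0, 0, "free"),        # single-value range (beg = end)
        NumericalRange(1, 99, "cheap"),
        NumericalRange(100, 999, "medium"),
    ]
    for price in (0, 99, 100, 5000):
        print(price, "->", forced_range(price, price_ranges))
```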
    Parent objects
    • ForcedRangeNumericalPartitioning
    Attributes
    • beg long
    • end long
    • text string
    Name Type Default value Description
    beg long - The lower bound.
    end long - The upper bound.
    text string - The associated text.

    XML representation
    <NumericalRange beg="" end="" text="" ></NumericalRange>
  • TextToNum com.exalead.indexing.analysis.v10.TextToNum
    Processor that enables an approximate sort on a text field.

    Implements a surjection from the set of strings to the set of integers [0..N], with N close to, but less than or equal to, 18,446,744,073,709,551,615.

    The user defines an ordered alphabet. A first surjection maps the set of all strings to the set of finite sequences of symbols taken from this alphabet, by stripping from the string every symbol that is not in the alphabet.

    A partial order relation is induced on the latter set by the alphabet (lexicographical order).

    For cardinality reasons (one set is infinite, the other is not), the second surjection cannot preserve this order in general. The idea is to preserve the relation between shorter strings, AND the relation between shorter strings and longer strings, such that:
    • if STRING2ULONG('shortstring1') <= STRING2ULONG('shortstring2') then 'shortstring1' <= 'shortstring2'
    • STRING2ULONG('longstring1') <= STRING2ULONG('longstring2') does NOT ensure 'longstring1' <= 'longstring2'
    • if STRING2ULONG('shortstring1') <= STRING2ULONG('longstring2') then 'shortstring1' <= 'longstring2'

    The size of the prefix obviously depends on the size of the alphabet.
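
    One way such a mapping could be built is sketched below in Python. This is an assumed construction, not the documented CloudView algorithm: symbols outside the alphabet are stripped, the first k symbols are read as digits in base (alphabet size + 1), with digit 0 reserved for "end of string", and k is the largest prefix length whose encoding fits in nbBits bits. The names prefix_length and text_to_num are hypothetical.

```python
DEFAULT_ALPHABET = "0123456789abcdefghijklmnopqrstuvwxyz"


def prefix_length(alphabet_size: int, nb_bits: int) -> int:
    """Largest k such that (alphabet_size + 1) ** k fits in nb_bits bits."""
    base = alphabet_size + 1  # digit 0 is reserved for "end of string"
    k = 0
    while base ** (k + 1) <= 2 ** nb_bits:
        k += 1
    return k


def text_to_num(text: str, alphabet: str = DEFAULT_ALPHABET, nb_bits: int = 63) -> int:
    """Map text to an integer that preserves the order of short strings."""
    rank = {symbol: index + 1 for index, symbol in enumerate(alphabet)}
    # First surjection: strip every symbol that is not in the alphabet.
    symbols = [rank[char] for char in text if char in rank]
    k = prefix_length(len(alphabet), nb_bits)
    # Second surjection: keep the first k symbols, pad with 0 on the right.
    digits = symbols[:k]
    digits += [0] * (k - len(digits))
    number = 0
    for digit in digits:
        number = number * (len(alphabet) + 1) + digit
    return number


if __name__ == "__main__":
    print(prefix_length(len(DEFAULT_ALPHABET), 63))
    print(text_to_num("apple") < text_to_num("banana"))
    print(text_to_num("a") < text_to_num("ab"))
```

    In this sketch, two long strings that share their first k in-alphabet symbols map to the same number, which is why the order between long strings is only approximate.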
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • inputContext string
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • alphabet string
    • outputContext string
    • nbBits int
    Name Type Default value Description
    inputContext string - The processor will only be applied to DocumentChunks with this ContextName.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    alphabet string 0123456789abcdefghijklmnopqrstuvwxyz The ordered alphabet.
    outputContext string - The ContextName used for the newly created chunk.
    nbBits int 63 Number of bits of unsigned field used for sorting.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <TextToNum inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" alphabet="0123456789abcdefghijklmnopqrstuvwxyz" outputContext="" nbBits="63" ></TextToNum>
  • DoubleToLong com.exalead.indexing.analysis.v10.DoubleToLong
    Using this processor you can store floating point values into signed fields that can then be queried with the DoublePrefixHandler.
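
    The conversion amounts to scaling the value by the 'precision' attribute and keeping an integer. A minimal Python sketch follows; the double_to_long helper is hypothetical, and rounding to the nearest integer (rather than truncating) is an assumption, since this reference only states that the value is multiplied.

```python
def double_to_long(value: float, precision: int = 1000) -> int:
    """Scale a floating point value by precision and return an integer."""
    # round() is an assumption: the reference only says the value is multiplied.
    return round(value * precision)


if __name__ == "__main__":
    # With the default precision of 1000, three decimals are preserved.
    print(double_to_long(3.14159))
    print(double_to_long(1.5))
    print(double_to_long(-2.25, precision=100))
```

    Under this reading, a query-side value has to be scaled by the same factor before it is compared with the stored integer.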
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • inputContext string
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • precision int
    • outputContext string
    Name Type Default value Description
    inputContext string - The processor will only be applied to DocumentChunks with this ContextName.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    precision int 1000 The multiplier. Each value will be multiplied by this factor.
    outputContext string - The ContextName used for the newly created chunk.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <DoubleToLong inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" precision="1000" outputContext="" ></DoubleToLong>
  • GeoBBoxProcessor com.exalead.indexing.analysis.v10.GeoBBoxProcessor
    The Geo BBox processor converts the input geometry from WKT to WKB
    and computes its bounding box. Both the WKB and the bounding box are returned as chunks.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • inputContext string
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • precision int
    • bboxMetaName string
    Name Type Default value Description
    inputContext string - The processor will only be applied to DocumentChunks with this ContextName.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    precision int 6 The number of decimals that will be used in geometrical representations and computations.
    bboxMetaName string -

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <GeoBBoxProcessor inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" precision="6" bboxMetaName="" ></GeoBBoxProcessor>
  • JavaProcessor (deprecated) com.exalead.indexing.analysis.v10.JavaProcessor
    Allows documents to be sent to a Java process for analysis.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • inputContext string
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • id string
    • target string
    Name Type Default value Description
    inputContext string - The processor will only be applied to DocumentChunks with this ContextName.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    id string
    target string

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <JavaProcessor inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" id="" target="" ></JavaProcessor>
  • ReplaceRegexp com.exalead.indexing.analysis.v10.ReplaceRegexp
    Substitutes the content substring of all DocumentChunks having the ContextName 'inputContext', using:
    • 'pattern' as the matching substring regular expression
    • and 'value' as the replacement value.

    The replacement value may use the sed output format, with references \0 through \9 to captures.

    A new DocumentChunk is created with the substitutions.
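
    The substitution can be approximated in Python with re.sub, as sketched below. Python's re module stands in for the ASTL library, and the replace_regexp helper is hypothetical. Two differences are worth noting: Python writes the whole match as \g<0> rather than \0, and the reading of replaceAll as "all matches versus first match only" is an assumption of this sketch.

```python
import re


def replace_regexp(value: str, pattern: str, replacement: str,
                   replace_all: bool = True) -> str:
    """Return value with matches of pattern replaced by replacement."""
    # count=0 means "replace every match" in re.sub; count=1 stops after the first.
    return re.sub(pattern, replacement, value, count=0 if replace_all else 1)


if __name__ == "__main__":
    iso_date = r"(\d{4})-(\d{2})-(\d{2})"
    text = "2024-01-31 and 2023-12-25"
    # \1, \2 and \3 refer to the year, month and day captures.
    print(replace_regexp(text, iso_date, r"\3/\2/\1"))
    print(replace_regexp(text, iso_date, r"\3/\2/\1", replace_all=False))
```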
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • inputContext string
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    • pattern string
    • value string
    • replaceAll boolean
    Name Type Default value Description
    inputContext string - The processor will only be applied to DocumentChunks with this ContextName.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - ContextName to be associated with the DocumentChunk created with the substitutions.
    pattern string - Pattern used to match the substrings to replace.

    The ASTL library is used to perform regular expression matching.

    The regular expression language supported is Perl 5, WITHOUT support for:
    • lazy (non-greedy) quantifiers like *?, +?, ??, {n}?, {n,}?, {n,m}?
    • possessive quantifiers like *+, ++, ?+, {n}+, {n,}+, {n,m}+
    • assertions like \b, \B, \A, \z, \Z, \G
    • look-around assertions (?=pattern), (?!pattern), (?<=pattern), (?<!pattern)
    • named captures (?'name'pattern), (?<name>pattern)
    • numeric and named backreferences like \1, \g1, \g{-1}, \g{name}, \k<name>, \k'name'
    • named Unicode character \N{name}
    • all operators related to Perl code inlining like (?{ code })
    • all operators related to backtracking algorithm control like independent subexpression (?>pattern)
    • \C matching a single C char (octet)
    • of the pattern-match modifiers (?pimsx-imsx) only (?i:pattern) and (?i) are supported (no negative form)
    value string - The replacement value (sed-like output format).
    replaceAll boolean True If true, replaces all occurrences of the pattern. If false, only the first occurrence is replaced.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <ReplaceRegexp inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" pattern="" value="" replaceAll="true" ></ReplaceRegexp>
  • URLCodec com.exalead.indexing.analysis.v10.URLCodec
    URL-encodes or URL-decodes chunk values. Only the UTF-8 charset is supported.
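
    The effect of the 'mode' and 'encodeURIComponent' attributes can be sketched in Python with urllib.parse. This is an approximation, not CloudView code: the url_codec helper is hypothetical, and it assumes that the set of always-unescaped punctuation matches JavaScript's encodeURIComponent(). The list of characters controlled by encodeURIComponent is the one given in the attribute description.

```python
from urllib.parse import quote, unquote

# Assumed to stay unescaped in both settings, as in JavaScript's
# encodeURIComponent(); letters, digits and "-_.~" are never escaped by quote().
ALWAYS_SAFE = "!*'()"
# Characters that are escaped only when encodeURIComponent is true.
COMPONENT_ONLY = ",/?:@&=+$#"


def url_codec(value: str, mode: str = "encode",
              encode_uri_component: bool = True) -> str:
    """URL-encode or URL-decode value using UTF-8."""
    if mode == "decode":
        return unquote(value, encoding="utf-8")
    if mode != "encode":
        raise ValueError("mode must be 'encode' or 'decode'")
    safe = ALWAYS_SAFE if encode_uri_component else ALWAYS_SAFE + COMPONENT_ONLY
    return quote(value, safe=safe, encoding="utf-8")


if __name__ == "__main__":
    print(url_codec("a b&c=d/é"))
    print(url_codec("a b&c=d/é", encode_uri_component=False))
    print(url_codec("a%20b%26c", mode="decode"))
```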
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • inputContext string
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    • encodeURIComponent boolean
    • mode enum(encode, decode)
    Name Type Default value Description
    inputContext string - The processor will only be applied to DocumentChunks with this ContextName.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - Stores URL encoded form in outputContext.

    If outputContext = inputContext, it removes the original chunk.
    encodeURIComponent boolean True If true (default), it encodes the following characters: ',' '/' '?' ':' '@' '&' '=' '+' '$' '#'
    mode enum(encode, decode) encode mode = "encode" or "decode"

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <URLCodec inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" encodeURIComponent="true" mode="encode" ></URLCodec>
  • WildcardIndexing com.exalead.indexing.analysis.v10.WildcardIndexing
    Computes the substrings of the input chunk to enable efficient prefix/substring/suffix search.
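
    A possible reading of this expansion is sketched below in Python: every fragment of the value is emitted with the score of the best match type it represents. The wildcard_expansions helper, the rule that the highest score wins when a fragment qualifies several times, and the choice to emit nothing for values longer than maxStringSize are all assumptions of this sketch. The default scores are the attribute defaults of this processor.

```python
def wildcard_expansions(value: str,
                        exact_score: int = 4,
                        prefix_search: bool = True, prefix_score: int = 3,
                        suffix_search: bool = True, suffix_score: int = 2,
                        substring_search: bool = True, substring_score: int = 1,
                        max_string_size: int = 100) -> dict[str, int]:
    """Map each fragment of value to the score of its best match type."""
    if len(value) > max_string_size:
        return {}  # assumption: values that are too long are not expanded
    scores: dict[str, int] = {}

    def keep(fragment: str, score: int) -> None:
        # Assumption: when a fragment qualifies twice, the highest score wins.
        scores[fragment] = max(score, scores.get(fragment, 0))

    size = len(value)
    for start in range(size):
        for end in range(start + 1, size + 1):
            fragment = value[start:end]
            if start == 0 and end == size:
                keep(fragment, exact_score)
            elif start == 0:
                if prefix_search:
                    keep(fragment, prefix_score)
            elif end == size:
                if suffix_search:
                    keep(fragment, suffix_score)
            elif substring_search:
                keep(fragment, substring_score)
    return scores


if __name__ == "__main__":
    print(wildcard_expansions("abc"))
```

    The number of fragments grows quadratically with the length of the value in this sketch, which suggests why a size limit such as maxStringSize is useful.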
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • inputContext string
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    • exactScore int
    • prefixSearch boolean
    • prefixScore int
    • suffixSearch boolean
    • suffixScore int
    • substringSearch boolean
    • substringScore int
    • maxStringSize int
    Name Type Default value Description
    inputContext string - The processor will only be applied to DocumentChunks with this ContextName.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - Stores exact/prefix/substring/suffix in outputContext.

    If outputContext = inputContext, it removes the original chunk.
    exactScore int 4 Specifies the score for an exact match.
    prefixSearch boolean True Enables the prefix search.
    prefixScore int 3 Specifies the score for a prefix match.
    suffixSearch boolean True Enables the suffix search.
    suffixScore int 2 Specifies the score for a suffix match.
    substringSearch boolean True Enables the substring search.
    substringScore int 1 Specifies the score for a substring match.
    maxStringSize int 100 Specifies the max string size for which this processor will be applied.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <WildcardIndexing inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" exactScore="4" prefixSearch="true" prefixScore="3" suffixSearch="true" suffixScore="2" substringSearch="true" substringScore="1" maxStringSize="100" ></WildcardIndexing>
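    A possible configuration is sketched below: it keeps prefix and suffix search with their default scores, turns substring search off, and lowers the size limit. The processor name, the context names, and the value 32 are illustrative assumptions; the data-model attributes are left out on the assumption that they are optional.

    ```xml
    <!-- Sketch only: processor and context names are hypothetical. -->
    <WildcardIndexing name="part-number-wildcards"
                      inputContext="part_number"
                      outputContext="part_number_wildcard"
                      exactScore="4"
                      prefixSearch="true" prefixScore="3"
                      suffixSearch="true" suffixScore="2"
                      substringSearch="false"
                      maxStringSize="32" ></WildcardIndexing>
    ```

    With these values, the processor would not be applied to strings longer than 32 characters.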
  • URLTransformer com.exalead.indexing.analysis.v10.URLTransformer
    Parses a context string as a regular URL (RFC 2396, "Uniform Resource Identifier") and transforms it according to the given URL pattern.

    A new DocumentChunk is created with the substitution.

    Pattern used to transform the URL (in the form <scheme>://<authority><path>?<query>#<fragment>):
    • Characters other than '$' or '\' are kept as-is
    • The '$' character and the '\' character must be escaped with a leading \
    • The ${expression} form lets you compute a string expression based on URL components (see "Expression" below)

    Expression used inside the enclosing ${}:
    • url: Original URL
    • scheme: Scheme name ("http", "https", "file", ...)
    • authority: Authority (host:port or host) (may be empty)
    • host: Hostname part of the authority (may be empty)
    • port: Port number part of the authority (may be empty)
    • userInfo: username:password field of the authority (may be empty)
    • file: File starting with / and query string, if any
    • pathurl: Normalized absolute path starting with /
    • path: Normalized absolute path (may start with C:\ on Windows)
    • query: Normalized query part starting with ? (may be empty)
    • args: Query part without the leading ? (may be empty)
    • fragment: Fragment part starting with # (may be empty)
    • reference: Reference part, i.e., the fragment without the leading # (may be empty)
    • arg:name: Query part argument identified by its name, unescaped (you must re-escape it using "urlencode:" when necessary)
    • str:string: The final argument is not a variable name, but a string (only useful for clarity)
    • tolower:expression: Transform into lowercase (ONLY A-Z)
    • toupper:expression: Transform into uppercase (ONLY a-z)
    • urlencode:expression: URL encoding (%NN or +)
    • urlpathencode:expression: URL encoding that leaves / separators unencoded
    • urldecode:expression: URL decoding
    • pathslash:expression: Convert \ into /
    • pathantislash:expression: Convert / into \

    Notes:
    • Unreserved characters are unescaped during URL processing (i.e., never '%' or '\')
    • The "tolower:" prefix and other similar prefixes can be nested (i.e., the expression "${urlpathencode:pathantislash:toupper:path}" is valid)
    • Both "file://C:\path" and "file:///C:\path" will produce path="/C:\path"

    Examples:
    • With the input context value "http://www.example.com/bar/foo?bar=42"
      • "hello, world" => "hello, world"
      • "the scheme is ${scheme}" => "the scheme is http"
      • "the scheme is \${scheme}" => "the scheme is \${scheme}"
      • "http://myserver${path}${query}" => "http://myserver/bar/foo?bar=42"
      • "http://myserver/applet?f=${urlpathencode:path}&t=${arg:bar}" => "http://myserver/applet?f=/bar/foo&t=42"
      • "http://myserver/applet?f=${urlencode:path}&t=${arg:bar}" => "http://myserver/applet?f=%2Fbar%2Ffoo&t=42"
      • "http://myserver/applet?f=${urlpathencode:pathantislash:toupper:path}" => "http://myserver/applet?f=%5CBAR%5CFOO"
    • With the input context value "file:///C:/My%20Documents/Document.doc"
      • "${pathantislash:urldecode:path}" => "C:\My Documents\Document.doc"
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • inputContext string
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    • urlPattern string
    Name Type Default value Description
    inputContext string - The processor will only be applied to DocumentChunks with this ContextName.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - ContextName to be associated with the DocumentChunk created for each new context.
    urlPattern string - Pattern used to transform the URL.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <URLTransformer inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" urlPattern="" ></URLTransformer>
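    The fragment below sketches how one of the patterns from the examples above could be placed in a processor definition. The processor name and the context names `url` and `public_url` are hypothetical, and the data-model attributes are left out on the assumption that they are optional.

    ```xml
    <!-- Sketch only: processor and context names are hypothetical. -->
    <URLTransformer name="rewrite-host"
                    inputContext="url"
                    outputContext="public_url"
                    urlPattern="http://myserver${path}${query}" ></URLTransformer>
    ```

    For the input context value "http://www.example.com/bar/foo?bar=42", this pattern corresponds to the documented result "http://myserver/bar/foo?bar=42". A pattern that contains '&' must write it as '&amp;' inside the XML attribute.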
  • GeoCategorizer com.exalead.indexing.analysis.v10.GeoCategorizer
    A processor that categorizes geographic points given their inclusion in a GeoDomain.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • inputContext string
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    Name Type Default value Description
    inputContext string - The processor will only be applied to DocumentChunks with this ContextName.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - ContextName of the chunk to create.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    • GeoDomain*
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    GeoDomain* -
    XML representation
    <GeoCategorizer inputContext="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" ></GeoCategorizer>
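    As a sketch, a GeoCategorizer with one nested domain might look like the fragment below. It assumes that a DiskDomain counts as a GeoDomain and is nested directly, since no wrapper is listed for GeoDomain. The names, contexts, and coordinates are illustrative only.

    ```xml
    <!-- Sketch only: names, contexts, and coordinates are illustrative. -->
    <GeoCategorizer name="near-paris"
                    inputContext="location"
                    outputContext="location_zone" >
        <!-- GPS point: x is the latitude, y the longitude; radius is in meters. -->
        <DiskDomain title="Paris area" id="1" radius="20000" x="48.8566" y="2.3522" ></DiskDomain>
    </GeoCategorizer>
    ```

    Because the domain id is non-zero here, the category path would be the id value rather than the set of vertices.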
  • DiskDomain com.exalead.search.v30.DiskDomain
    No documentation for this element.
    Parent objects
    • ExplicitGeoFacet
    • GeoCategorizer
    Attributes
    • title string
    • id int
    • radius double
    • x double
    • y double
    Name Type Default value Description
    title string -
    id int 0 Unique identifier of this domain. If id=0 (its default value) the category path will be the set of vertices. Otherwise, it will be the id value.
    radius
    (UI: Radius)
    double - Disk radius in meters
    x double - First coordinate of the center for the DiskDomain.

    If the point type is XY, it will be interpreted as the X coordinate (integer units).

    For geographic points (GPS), it will be interpreted as the latitude coordinate.
    y double - Second coordinate of the center for the DiskDomain.

    If the point type is XY, it will be interpreted as the Y coordinate (integer units).

    For geographic points (GPS), it will be interpreted as the longitude coordinate.

    XML representation
    <DiskDomain title="" id="0" radius="" x="" y="" ></DiskDomain>
  • PolygonDomain com.exalead.search.v30.PolygonDomain
    No documentation for this element.
    Parent objects
    • ExplicitGeoFacet
    • GeoCategorizer
    Attributes
    • title string
    • id int
    • vertices string
    Name Type Default value Description
    title string -
    id int 0 Unique identifier of this domain. If id=0 (its default value) the category path will be the set of vertices. Otherwise, it will be the id value.
    vertices string - Polygon vertices, as a list of (x,y) coordinates.

    For example: "0.0,0.0;1.1,0.1;1.1,1.1"

    XML representation
    <PolygonDomain title="" id="0" vertices="" ></PolygonDomain>
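    For illustration, the fragment below fills in a PolygonDomain with the sample vertex list given above. The title and id are hypothetical values.

    ```xml
    <!-- Sketch only: title and id are hypothetical; vertices reuse the documented sample. -->
    <PolygonDomain title="Sample triangle" id="2" vertices="0.0,0.0;1.1,0.1;1.1,1.1" ></PolygonDomain>
    ```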
  • KMLDomain com.exalead.search.v30.KMLDomain
    Definition of a geographic domain using a KML or KMZ resource
    Parent objects
    • ExplicitGeoFacet
    • GeoCategorizer
    Attributes
    • title string
    • id int
    • resource string
    • KMZ boolean
    Name Type Default value Description
    title string -
    id int 0 Unique identifier of this domain. If id=0 (its default value) the category path will be the set of vertices. Otherwise, it will be the id value.
    resource string -
    KMZ boolean False Is this resource a KMZ resource?

    XML representation
    <KMLDomain title="" id="0" resource="" KMZ="false" ></KMLDomain>
  • SHPDomain com.exalead.search.v30.SHPDomain
    No documentation for this element.
    Parent objects
    • ExplicitGeoFacet
    • GeoCategorizer
    Attributes
    • title string
    • id int
    • shpResource string
    • shxResource string
    • dbfResource string
    Name Type Default value Description
    title string -
    id int 0 Unique identifier of this domain. If id=0 (its default value) the category path will be the set of vertices. Otherwise, it will be the id value.
    shpResource string -
    shxResource string -
    dbfResource string -

    XML representation
    <SHPDomain title="" id="0" shpResource="" shxResource="" dbfResource="" ></SHPDomain>
  • MimeTypeSetter com.exalead.indexing.analysis.v10.MimeTypeSetter
    Manually sets the mime type
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • value string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    value string - New mime type

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <MimeTypeSetter name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" value="" ></MimeTypeSetter>
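    A minimal sketch of a MimeTypeSetter that forces a PDF mime type is shown below. The processor name is hypothetical, and the data-model attributes are left out on the assumption that they are optional.

    ```xml
    <!-- Sketch only: the processor name is hypothetical. -->
    <MimeTypeSetter name="force-pdf" disabled="false" value="application/pdf" ></MimeTypeSetter>
    ```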
  • MetaFinder com.exalead.indexing.analysis.v10.MetaFinder
    Keeps track of all document metas
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <MetaFinder name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></MetaFinder>
  • JavaDocumentProcessor com.exalead.indexing.analysis.v10.JavaDocumentProcessor
    Takes Java code either inline or from a file, and executes it on-the-fly.

    For production mode, we recommend packaging your custom code as a Java Plugin (CVPlugin) and using the Custom Document Processor to call it.
    Plugins allow better packaging and source code maintenance.
    Note that this processor does not work when the JVM used to run CloudView is a JRE rather than a JDK.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • path string
    • priority int
    • sourceCode string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    path string - User defined path to a Java file containing the processor code
    priority int - Defines which path to use (0: user defined path, 1: resource managed path (inlined Java))
    sourceCode string - Inline Java code

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <JavaDocumentProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" path="" priority="" sourceCode="" ></JavaDocumentProcessor>
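    The fragment below sketches a JavaDocumentProcessor that loads its code from a file rather than inline. The processor name and the file path are placeholders, and sourceCode is omitted on the assumption that it is not needed when priority selects the user defined path.

    ```xml
    <!-- Sketch only: name and path are placeholders. -->
    <!-- priority="0" selects the user defined path. -->
    <JavaDocumentProcessor name="custom-java"
                           disabled="false"
                           path="/path/to/MyProcessor.java"
                           priority="0" ></JavaDocumentProcessor>
    ```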
  • JavaScriptProcessor (deprecated) com.exalead.indexing.analysis.v10.JavaScriptProcessor
    This document processor is deprecated. Use the Java document processor instead.
    The JavaScript Processor takes a JS script and executes it.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • path string
    • priority int
    • script string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    path string - User defined path to a JS file containing the processor code
    priority int - Defines which path to use (0: user defined path, 1: resource managed path (inlined JS))
    script string - Inline script

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <JavaScriptProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" path="" priority="" script="" ></JavaScriptProcessor>
  • StorageServiceDocumentProcessor com.exalead.indexing.analysis.v10.StorageServiceDocumentProcessor
    Queries the storage for any meta to attach to the document.

    Multi-valued pairs are pushed as multi-valued metas.

    For example:
    • The storage key "nb_comment" will be attached as "nb_comment" meta on the document.
    • The storage key "tags[]" will be attached as "tags" multi-valued meta on the document.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • instance string
    • metaIdentifier string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    instance string - Storage service instance
    metaIdentifier string - Defines an optional meta name that will be used as storage Identifier instead of the document Uri.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <StorageServiceDocumentProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" instance="" metaIdentifier="" ></StorageServiceDocumentProcessor>
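    As a sketch, the fragment below looks up storage entries by a meta value instead of the document URI. The processor name, the instance name `storage0`, and the meta name `product_id` are hypothetical.

    ```xml
    <!-- Sketch only: processor, instance, and meta names are hypothetical. -->
    <StorageServiceDocumentProcessor name="attach-storage-metas"
                                     disabled="false"
                                     instance="storage0"
                                     metaIdentifier="product_id" ></StorageServiceDocumentProcessor>
    ```

    Leaving metaIdentifier empty would fall back to using the document URI as the storage identifier.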
  • MathDocumentProcessor com.exalead.indexing.analysis.v10.MathDocumentProcessor
    Performs mathematical operations on a numerical field. Context names referenced in an expression must be prefixed with a $.

    For example, the expression `$ht_price * 1.196` finds the first chunk in the `ht_price` context, and replaces all occurrences of `ht_price` with the mathematical expression.

    The result will be a new text chunk, either in the Output context (if specified), or in the original `ht_price` context.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • expression string
    • outputContext string
    • floatingPoint boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    expression string - Arithmetic expression to evaluate.
    For example: "$file_size + 42"
    outputContext string - ContextName of the chunk to create.
    floatingPoint boolean False Outputs a floating-point number instead of the default integer.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <MathDocumentProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" expression="" outputContext="" floatingPoint="false" ></MathDocumentProcessor>
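    The `$ht_price * 1.196` example could be configured as sketched below. The processor name and the output context `ttc_price` are hypothetical, and the data-model attributes are left out on the assumption that they are optional.

    ```xml
    <!-- Sketch only: processor name and output context are hypothetical. -->
    <MathDocumentProcessor name="price-with-tax"
                           disabled="false"
                           expression="$ht_price * 1.196"
                           outputContext="ttc_price"
                           floatingPoint="true" ></MathDocumentProcessor>
    ```

    For an `ht_price` chunk holding 100, the expression works out arithmetically to 119.6. floatingPoint is set to true here so that the fractional part is not lost to the default integer output.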
  • PrecomputedThumbnailsDocumentProcessor com.exalead.indexing.analysis.v10.PrecomputedThumbnailsDocumentProcessor
    The Precomputed Thumbnails Document Processor precomputes thumbnails of the first DocumentPart.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • convertAddresses string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    convertAddresses string - Semicolon-separated list of convert instance names or URLs to use.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <PrecomputedThumbnailsDocumentProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" convertAddresses="" ></PrecomputedThumbnailsDocumentProcessor>
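    A sketch of a processor that targets two convert instances is shown below. The processor name and the instance names `convert0` and `convert1` are hypothetical.

    ```xml
    <!-- Sketch only: processor and convert instance names are hypothetical. -->
    <PrecomputedThumbnailsDocumentProcessor name="thumbnails"
                                            disabled="false"
                                            convertAddresses="convert0;convert1" ></PrecomputedThumbnailsDocumentProcessor>
    ```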
  • RealTimeAlerting com.exalead.indexing.analysis.v10.RealTimeAlerting
    The Real-time alerting document processor matches queries defined by end-users and alerts them as soon as a new matching document is indexed. To be used only when not in task queue mode.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    @enum{null,auto,customized, error}.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor

    Nested elements
    • fromDataModelDocumentProcessor
    • alertGroupsAlertGroup*
    • customPublishersCustomPublisher*
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AlertGroup* alertGroups List of alert groups handled by this processor. An empty list means all groups.
    CustomPublisher* customPublishers
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <RealTimeAlerting name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></RealTimeAlerting>
  • AlertGroup com.exalead.indexing.analysis.v10.AlertGroup
    No documentation for this element.
    Parent objects
    • alertGroups RealTimeAlerting
    Attributes
    • name string
    Name Type Default value Description
    name string -

    XML representation
    <AlertGroup name="" ></AlertGroup>
  • CustomPublisher com.exalead.indexing.analysis.v10.CustomPublisher
    Custom publisher configuration
    Parent objects
    • customPublishers RealTimeAlerting
    Attributes
    • classId string
    Name Type Default value Description
    classId string - Custom publisher type

    Nested elements
    • config KeyValue*
    Type Wrapper Description
    KeyValue* config
    XML representation
    <CustomPublisher classId="" ></CustomPublisher>
  • MIMEDetector com.exalead.indexing.analysis.v10.MIMEDetector
    The MIME detector operates on each DocumentPart for which a MIME-type is not available.

    The MIME-type can be specified for each DocumentPart in the PAPI.

    For each DocumentPart, the 'bytes' and the 'filename' are used to guess the real MIME-type and charset.

    The guessed MIME-type and the charset are then set as attributes of the DocumentPart.

    Input: The DocumentPart of the document.

    Output: 'mime' and 'encodingToUse' attributes of DocumentParts.

    This document processor does not create any document chunks.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • defaultValue string
    • defaultCharset string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    defaultValue string - Default MIME type to use if none is detected.
    defaultCharset string - On text or HTML files, the MIME detector tries to detect charset encoding automatically.

    If the encoding cannot be detected, this 'defaultCharset' is used.

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <MIMEDetector name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" defaultValue="" defaultCharset="" ></MIMEDetector>
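    The fallback behaviour of 'defaultValue' and 'defaultCharset' can be pictured with the following Python sketch. It is not the CloudView detector: it guesses the MIME type from the filename only (the real processor also uses the bytes), and the function name and the crude UTF-8 check are assumptions made for illustration.

```python
import codecs
import mimetypes


def guess_mime_and_charset(filename, data, default_value, default_charset):
    """Return (mime, encoding_to_use) for one document part (illustrative only)."""
    # Filename-based guess; the real detector also inspects the bytes.
    mime, _ = mimetypes.guess_type(filename)
    if mime is None:
        mime = default_value  # plays the role of 'defaultValue'

    charset = None
    if mime in ("text/plain", "text/html"):
        if data.startswith(codecs.BOM_UTF8):
            charset = "utf-8"
        else:
            try:
                data.decode("utf-8")  # strict decode as a crude detection step
                charset = "utf-8"
            except UnicodeDecodeError:
                charset = default_charset  # plays the role of 'defaultCharset'
    return mime, charset
```

    In this sketch, a part with an unrecognized extension receives the default MIME type and no charset, while a text part that is not valid UTF-8 receives the default charset.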
  • NativeTextExtractor com.exalead.indexing.analysis.v10.NativeTextExtractor
    Extraction is performed for the following data types:
    • text/plain for Text files.
    • text/html for HTML Files.
    • application/x-exalead-document for CloudView 4.6 document format (com.exalead.document)
    • application/x-exalead-ndoc for CloudView 5 internal document format, binary.
    • application/x-exalead-ndoc-v10+xml for CloudView internal document format, XML.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • annotateHTML boolean
    • skipInvisibleHTMLText boolean
    • extractJs boolean
    • extractHTMLTables boolean
    • extractHTMLStyles boolean
    • extractHTMLForms boolean
    • maxHTMLAnnotationDepth int
    • disableAutomaticHTMLDTDFix boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    annotateHTML boolean False Adds style and structure annotations to DocumentChunks (for HTML files only):
    • html:p for DocumentChunks generated from <p>
    • html:row for DocumentChunks generated from <tr>
    • html:column for DocumentChunks generated from <td> or <th>
    • html:table for DocumentChunks generated from <table>
    • html:h1 for DocumentChunks generated from <h1>
    • html:h2 for DocumentChunks generated from <h2>
    • html:h3 for DocumentChunks generated from <h3>
    • html:h4 for DocumentChunks generated from <h4>
    • html:h5 for DocumentChunks generated from <h5>
    • html:h6 for DocumentChunks generated from <h6>
    • html:link for DocumentChunks generated from <a>, <iframe> or <frame>
      • html:link:rel if the link has a "rel" attribute
      • html:link:name if the link has a "name" attribute
    • html:list for DocumentChunks generated from <ul>, <ol> or <dl>
    • html:item for DocumentChunks generated from <li>
    • html:bold for DocumentChunks generated from <b> or <strong>
    • html:italic for DocumentChunks generated from <i> or <em>
    • html:underline for DocumentChunks generated from <u>
    • html:strike for DocumentChunks generated from <s> or <strike>
    • html:pre for DocumentChunks generated from <pre>
    • html:invisible for DocumentChunks containing invisible text (display: none, white on white)
    • html:class for DocumentChunks taken in a CSS class
    • html:id for DocumentChunks taken in a CSS id
    • html:img:src for DocumentChunks created from a <img>

    It also creates specific HTML DocumentChunks with the following contexts:
    • html:lang when parsing a <html> containing the "lang" attribute
    • html:xml:lang when parsing a <html> containing the "xml:lang" attribute
    • html:title when parsing a <title>
    • html:title:other when parsing a second <title>
    • html:base:href when parsing a <base>
    • html:link when parsing a <link> containing the "src" attribute and annotated by:
      • html:link:rel if the link has a "rel" attribute
      • html:link:type if the link has a "type" attribute
    • html:http-equiv:NAME when parsing a http-equiv meta
    • html:meta:NAME when parsing a meta named "NAME"
    skipInvisibleHTMLText boolean False Skips the invisible text. For example, white fonts on white backgrounds (for HTML files only).
    extractJs boolean False Tries to parse JavaScript and then extract links.
    extractHTMLTables boolean False Adds annotations on table, tr, td, th
    extractHTMLStyles boolean False Adds annotations on style attributes.
    extractHTMLForms boolean False Adds annotations on form and select elements.
    maxHTMLAnnotationDepth int 20 Prevents new annotations from being created beyond maxHTMLAnnotationDepth levels of HTML nesting.
    disableAutomaticHTMLDTDFix boolean False Disables automatic DTD fix on HTML documents.

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <NativeTextExtractor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" annotateHTML="false" skipInvisibleHTMLText="false" extractJs="false" extractHTMLTables="false" extractHTMLStyles="false" extractHTMLForms="false" maxHTMLAnnotationDepth="20" disableAutomaticHTMLDTDFix="false" ></NativeTextExtractor>
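    As a rough illustration of what 'annotateHTML' produces, the Python sketch below maps a few of the tags listed above to their annotation names and attaches them to the text found inside those tags. It is a simplification built on Python's standard html.parser, not the CloudView extractor. The interpretation of 'maxHTMLAnnotationDepth' (only the outermost N open tags contribute annotations) is an assumption, and void elements such as <br> are not handled.

```python
from html.parser import HTMLParser

# Subset of the tag-to-annotation mapping documented for annotateHTML.
TAG_TO_ANNOTATION = {
    "p": "html:p", "table": "html:table", "tr": "html:row",
    "td": "html:column", "th": "html:column", "a": "html:link",
    "li": "html:item", "b": "html:bold", "strong": "html:bold",
    "i": "html:italic", "em": "html:italic", "u": "html:underline",
    "pre": "html:pre",
}


class AnnotationSketch(HTMLParser):
    def __init__(self, max_depth=20):
        super().__init__()
        self.max_depth = max_depth  # stands in for maxHTMLAnnotationDepth
        self.stack = []             # currently open tags, outermost first
        self.chunks = []            # (text, [annotation names])

    def handle_starttag(self, tag, attrs):
        self.stack.append(tag)

    def handle_endtag(self, tag):
        # Pop until the matching open tag has been removed.
        if tag in self.stack:
            while self.stack and self.stack.pop() != tag:
                pass

    def handle_data(self, data):
        text = data.strip()
        if not text:
            return
        # Assumption: tags nested deeper than max_depth add no annotation.
        names = [TAG_TO_ANNOTATION[t] for t in self.stack[:self.max_depth]
                 if t in TAG_TO_ANNOTATION]
        self.chunks.append((text, names))


def annotate(html, max_depth=20):
    parser = AnnotationSketch(max_depth)
    parser.feed(html)
    parser.close()
    return parser.chunks
```

    For example, annotate("<table><tr><td><b>Hi</b></td></tr></table>") yields one chunk carrying the table, row, column and bold annotations; lowering max_depth to 2 keeps only the table and row annotations.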
  • ConvertTextExtractor com.exalead.indexing.analysis.v10.ConvertTextExtractor
    This processor performs text content extraction for all MIME-types (300+ file formats are currently handled).

    See the "Supported Formats" technical note for more information.

    Text, HTML, and built-in data types must be processed by the 'NativeTextExtractor' rather than this processor.

    Make sure to have a 'NativeTextExtractor' before the ConvertTextExtractor in your pipeline.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • looseTextDetection boolean
    • forceContent boolean
    • minInputSizeKB long
    • maxInputSizeKB long
    • maxRecursionDepth int
    • maxRecursionDocuments int
    • maxRecursionDocumentsTotal int
    • strictSizeCheck boolean
    • retryIO string
    • filter string
    • timeoutMs long
    • priority string
    • embedded string
    • attachments string
    • styles string
    • forceConversion boolean
    • startPage long
    • maxPages long
    • maxOutputSizeKB long
    • allowUnicode32 boolean
    • allowDocumentChars boolean
    • outsideIn string
    • outsideInFallback string
    • outsideInOnly string
    • outsideInForPreview string
    • outsideInSimpleXHTMLFallback string
    • ocr string
    • ocrFallback string
    • ocrDetect string
    • ocrQuality string
    • ocrLang string
    • ocrTimeoutMs long
    • ocrMaxPages int
    • ocrPriority string
    • httpProxyUrl string
    • disablePlugins boolean
    • overrideAddresses string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    looseTextDetection boolean True Loosens text detection to detect more text files, including suspicious ones (not *.txt or *.html) ("true", "false")
    forceContent boolean False Forces to accept the content, even if the MIME type does not seem to be a known or supported MIME type.
    minInputSizeKB long -1 Minimum document size accepted, in kilobytes.
    maxInputSizeKB long -1 Maximum document size accepted, in kilobytes.
    maxRecursionDepth int -1 Maximum recursion depth.
    maxRecursionDocuments int -1 Maximum number of documents that can be converted in one directory level.
    maxRecursionDocumentsTotal int -1 Maximum number of documents that can be converted over all levels.
    strictSizeCheck boolean False Strict size validation mode (even for partial reads).
    retryIO string - Uses regular I/O when mmap fails. ("true", "false")
    filter string - Native filter identifier list to be used specifically.

    The list is a comma-separated (,) list of filter identifiers with optional ending argument(s) separated by semi-colons (;).

    If the filter identifier is prefixed by '!', the corresponding filter will be explicitly excluded.

    The special filter identifier '*' stands for "all other filters".

    First match wins: "*,!doc" is identical to "*".

    For example: filter="!jpeg,*" will accept all filters but the jpeg filter.
    timeoutMs long -1 Conversion timeout value, in milliseconds.

    If the conversion process takes longer, the remote side attempts to abort the conversion process.
    priority string - Worker thread(s) priority to be used for the processing ("lowest", "very low", "low", "normal", "high", "very high")
    embedded string - Includes embedded images ("true", "false", "optional")
    attachments string - Includes embedded attachments ("true", "false", "optional")
    styles string - Attempts to extract more text styles for HTML conversion ("true", "false", "optional")
    forceConversion boolean False Attempts to generate an empty document upon conversion error (may be ignored)
    startPage long -1 Starts conversion from this page number (page number starts at 1).

    This parameter is only taken into account for image processing and may be ignored.
    maxPages long -1 Maximum number of pages to process for XML conversion (may be ignored).
    maxOutputSizeKB long -1 Maximum output size on the remote side, in kilobytes.

    If the generated output exceeds this value, the document may be truncated or invalid.
    allowUnicode32 boolean False Allows the use of 32-bit Unicode code points.
    allowDocumentChars boolean False Allows the use of Unicode private range characters (E0XX) for separators (keyword, sentence, paragraph separators, ...)
    outsideIn string - This feature is no longer supported. ("true", "false", "optional")
    outsideInFallback string - This feature is no longer supported. ("true", "false", "optional")
    outsideInOnly string - This feature is no longer supported. ("true", "false", "optional")
    outsideInForPreview string - This feature is no longer supported. ("true", "false", "optional")
    outsideInSimpleXHTMLFallback string - This feature is no longer supported. ("true", "false", "optional")
    ocr string - Converts using OCR ("true", "false", "optional")
    ocrFallback string - Falls back to OCR if heuristics deem it necessary ("true", "false", "optional")
    ocrDetect string - Detects documents requiring OCR (and rejects them) ("true", "false")
    ocrQuality string - OCR quality ("fast", "normal", "best")
    ocrLang string - OCR language(s) ("en" for English, "en;fr" for French and English, etc.)
    ocrTimeoutMs long -1 OCR conversion timeout value, in milliseconds.

    If the OCR process takes longer, the remote side attempts to abort the conversion process.

    This value overrides the timeout value if the processing involves an OCR operation.
    ocrMaxPages int -1 Maximum number of pages to process for OCR.
    ocrPriority string - Worker thread(s) priority to be used for the OCR processing ("lowest", "very low", "low", "normal", "high", "very high")
    httpProxyUrl string - Optional HTTP proxy URL.

    The URL can embed credentials if required.
    disablePlugins boolean False Disables external plugins.
    overrideAddresses string -

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    • KeyValue*
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    KeyValue* -
    XML representation
    <ConvertTextExtractor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" looseTextDetection="true" forceContent="false" minInputSizeKB="-1" maxInputSizeKB="-1" maxRecursionDepth="-1" maxRecursionDocuments="-1" maxRecursionDocumentsTotal="-1" strictSizeCheck="false" retryIO="" filter="" timeoutMs="-1" priority="" embedded="" attachments="" styles="" forceConversion="false" startPage="-1" maxPages="-1" maxOutputSizeKB="-1" allowUnicode32="false" allowDocumentChars="false" outsideIn="" outsideInFallback="" outsideInOnly="" outsideInForPreview="" outsideInSimpleXHTMLFallback="" ocr="" ocrFallback="" ocrDetect="" ocrQuality="" ocrLang="" ocrTimeoutMs="-1" ocrMaxPages="-1" ocrPriority="" httpProxyUrl="" disablePlugins="false" overrideAddresses="" ></ConvertTextExtractor>
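    The "first match wins" rule of the 'filter' attribute can be sketched in Python as follows. This only illustrates the documented matching order. The function name is hypothetical, and treating a filter that matches no entry as rejected is an assumption.

```python
def filter_accepts(spec, filter_id):
    """Return True if filter_id is selected by a 'filter' specification."""
    for entry in spec.split(","):
        entry = entry.strip()
        if not entry:
            continue
        # Optional arguments follow the identifier, separated by ';'.
        name = entry.split(";", 1)[0].strip()
        excluded = name.startswith("!")
        if excluded:
            name = name[1:]
        # '*' stands for "all other filters"; the first matching entry decides.
        if name == "*" or name == filter_id:
            return not excluded
    return False  # assumption: no matching entry means the filter is not used
```

    With this sketch, "*,!doc" accepts the doc filter because '*' matches first, whereas "!jpeg,*" rejects jpeg and accepts everything else.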
  • RemoteHTTPTransformer com.exalead.indexing.analysis.v10.RemoteHTTPTransformer
    The processor posts part bytes to the remote HTTP service, and gets the typed resource as a result.

    The remote service may return a Document.MIME_V10 document, or any other document that can later be processed in the pipeline.

    If the remote service returns an HTTP status other than "OK" (any status code other than 200), the corresponding error is passed as a regular error.

    The service may also advertise a filename, using the standard Content-Disposition's 'filename' attribute.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • remoteUrl string
    • timeoutMs int
    • httpIdleTimeoutMs int
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    remoteUrl string - Remote URL
    timeoutMs int - Remote processor timeout, in milliseconds.
    httpIdleTimeoutMs int - Cached HTTP connection idle timeout. This is an advanced setting.

    For efficiency, the RemoteHTTPTransformer maintains a pool of opened connections to the remote HTTP service.

    This defines the timeout for connections which are no longer used.

    Default is 10,000 (milliseconds).

    Nested elements
    • fromDataModel DocumentProcessor
    • argMapping RemoteHTTPTransformerRemoteArgMapping*
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    RemoteHTTPTransformerRemoteArgMapping* argMapping Argument(s) mapping, if any. See RemoteHTTPTransformerRemoteArgMapping.
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <RemoteHTTPTransformer name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" remoteUrl="" timeoutMs="" httpIdleTimeoutMs="" ></RemoteHTTPTransformer>
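    The response contract described for this processor (status 200 expected, typed resource returned, optional filename advertised through Content-Disposition) can be sketched in Python as below. The function and exception names are hypothetical, and the sketch only interprets a response that has already been received; it does not reproduce the processor's connection pooling or timeouts.

```python
from email.message import Message


class RemoteServiceError(Exception):
    """Raised when the remote service does not answer with HTTP 200."""


def interpret_response(status, headers, body):
    """Turn an HTTP response into a typed resource description."""
    if status != 200:
        raise RemoteServiceError(f"remote service returned HTTP {status}")

    # Reuse the standard library header parser for Content-Type and
    # Content-Disposition.
    msg = Message()
    for name, value in headers.items():
        msg[name] = value

    return {
        "mime": msg.get_content_type() if "Content-Type" in msg else None,
        "filename": msg.get_filename(),  # None when no filename is advertised
        "bytes": body,
    }
```

    A remote service written against this contract would set Content-Type to the MIME type of the returned resource and, optionally, Content-Disposition with a 'filename' parameter.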
  • RemoteHTTPTransformerRemoteArgMapping com.exalead.indexing.analysis.v10.RemoteHTTPTransformerRemoteArgMapping
    Transformation
    RemoteHTTPTransformer argument mapping.
    Parent objects
    • argMapping RemoteHTTPTransformer
    Attributes
    • key string
    • value string
    • defaultValue string
    Name Type Default value Description
    key string - URL key to map.
    This key name will be used as remote HTTP argument name.
    value string - Value to use. If null, the defaultValue value is used.

    The following value names are reserved:
    • $docname: the document name or URI
    • $msg.uri: see com.exalead.mercury.papi.PAPIMessage
    • $msg.source: see com.exalead.mercury.papi.PAPIMessage
    • $part.name: see com.exalead.indexing.DocPart
    • $part.filename: see com.exalead.indexing.DocPart
    • $part.encoding: see com.exalead.indexing.DocPart
    • $part.forcedMime: see com.exalead.indexing.DocPart
    • $part.mimeHint: see com.exalead.indexing.DocPart
    • $part.mime: see com.exalead.indexing.DocPart
    • $part.encodingToUse: see com.exalead.indexing.DocPart
    • $part.bytes.length: see com.exalead.indexing.DocPart
    • $part.customDirectives.*: see com.exalead.indexing.DocPart
    • $$$foo: escaping for $foo
    defaultValue string - Value to use if value is null.

    If this value is null, the empty string is used.

    XML representation
    <RemoteHTTPTransformerRemoteArgMapping key="" value="" defaultValue="" ></RemoteHTTPTransformerRemoteArgMapping>
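    The resolution order of 'value' and 'defaultValue' can be sketched in Python as follows. Several points are assumptions rather than documented behaviour: reserved names are looked up in a flat dictionary, a reserved name that resolves to nothing falls back to 'defaultValue', a leading "$$" is treated as the escape prefix, and the mapped arguments are sent as URL query parameters.

```python
from urllib.parse import urlencode


def resolve_arg(value, default_value, context):
    """Resolve one argument mapping to the string sent to the remote service."""
    if value is not None and value.startswith("$$"):
        return value[2:]  # escape: "$$$foo" becomes the literal "$foo"
    if value is not None and value.startswith("$"):
        value = context.get(value[1:])  # reserved name such as "$docname"
    if value is None:
        value = default_value
    return "" if value is None else value


def build_query(mappings, context):
    """mappings: list of (key, value, default_value) tuples."""
    pairs = [(key, resolve_arg(value, default, context))
             for key, value, default in mappings]
    return urlencode(pairs)
```

    For instance, a mapping with key "name" and value "$docname" sends the document name under the argument "name", while a mapping whose value is null sends its default value, or the empty string if no default is set.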
  • StandardPartsMerger com.exalead.indexing.analysis.v10.StandardPartsMerger
    This processor does nothing if there are no DocumentParts (only root DocumentChunks).

    This processor needs one DocumentPart called the 'Master Part'.

    If there is only one part, this part is the 'Master Part'.

    If there are multiple parts, the part named after the 'masterPart' attribute is the 'Master Part'.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • masterPart string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    masterPart string - Name of the master part.
    This name should be "master" to follow the convention used by connectors that send documents composed of multiple parts (e.g. mails with attachments).

    Nested elements
    • fromDataModel DocumentProcessor
    • partSpecificContexts StringValue*
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    StringValue* partSpecificContexts The ContextNames of the DocumentChunk from the non-master part that should be copied to the root document.
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <StandardPartsMerger name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" masterPart="" ></StandardPartsMerger>
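    The master part selection rules described above can be summarized by this short Python sketch. The function name is hypothetical, and returning None when several parts exist but none matches 'masterPart' is an assumption, since the reference does not describe that case.

```python
def select_master_part(part_names, master_part):
    """Return the name of the 'Master Part', or None if there is none."""
    if not part_names:
        return None               # no DocumentParts: the processor does nothing
    if len(part_names) == 1:
        return part_names[0]      # a single part is always the master
    if master_part in part_names:
        return master_part        # several parts: use the configured name
    return None                   # assumption: behaviour not documented
```

    A mail with attachments pushed as parts "master", "att1" and "att2" would therefore be merged around "master" when masterPart="master".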
  • SemanticPipeDocumentProcessor com.exalead.indexing.analysis.v10.SemanticPipeDocumentProcessor
    Instantiates a semantic pipe and creates chunks out of resulting annotations.

    It can be used to instantiate classification processors, and perform document level operations from their output.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • pipeline string
    • annotations string
    • topLevelAnnotationsOnly boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    pipeline string - Analysis pipeline on which semantic processors will be used.
    annotations string - Comma-separated list of annotation names. A chunk is created for each annotation whose name is in the list.
    topLevelAnnotationsOnly boolean False Considers top level annotations only. For example, results from the QueryMatcher or Fast Rules.

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    • SemanticProcessor*
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    SemanticProcessor* - List of semantic processors to use
    XML representation
    <SemanticPipeDocumentProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" annotations="" topLevelAnnotationsOnly="false" ></SemanticPipeDocumentProcessor>
  • Anchorer com.exalead.indexing.analysis.v10.Anchorer
    Adds an annotation on the first and last tokens of either a processed sequence (first/last) or a range defined by an annotation a (first_a/last_a)
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • tagsToAnchor string
    • finalAnnotationOnNextToken boolean
    • finalCannotBeSepSpace boolean
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    Possible values: null, auto, customized, error.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    tagsToAnchor string - List of comma-separated tags on which to work
    finalAnnotationOnNextToken boolean False If true, sets final annotation on the token after the last token of annotation a
    finalCannotBeSepSpace boolean False If true, the final annotation cannot be placed on a space separator; the "last" annotation may instead be set on the next non-blank token.

    Nested elements
    • fromDataModel SemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <Anchorer name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" tagsToAnchor="" finalAnnotationOnNextToken="false" finalCannotBeSepSpace="false" ></Anchorer>
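    The anchoring behaviour can be pictured with the Python sketch below. It is a loose interpretation of the description: tokens are plain strings, each tag covers one inclusive token range, and the whole sequence is anchored only when no tag range is given. All names are hypothetical and the handling of 'finalCannotBeSepSpace' is left out.

```python
from collections import defaultdict


def anchor(tokens, tag_ranges=None, final_on_next_token=False):
    """Return {token index: [annotation names]} for the anchored tokens.

    tag_ranges maps a tag name to an inclusive (start, end) token range.
    """
    marks = defaultdict(list)
    if not tokens:
        return {}
    if not tag_ranges:
        # Anchor the whole processed sequence.
        marks[0].append("first")
        marks[len(tokens) - 1].append("last")
        return dict(marks)
    for tag, (start, end) in tag_ranges.items():
        marks[start].append(f"first_{tag}")
        last = end
        # finalAnnotationOnNextToken: move the end anchor one token further.
        if final_on_next_token and end + 1 < len(tokens):
            last = end + 1
        marks[last].append(f"last_{tag}")
    return dict(marks)
```

    With tokens ["New", "York", "City", "rocks"] and a "place" range covering the first three tokens, the sketch sets first_place on "New" and last_place on "City", or on "rocks" when final_on_next_token is true.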
  • CompoundWordSplitter com.exalead.indexing.analysis.v10.CompoundWordSplitter
    Annotates compound words that use CamelCase (like SearchServer) or underscores (like my_variable) to separate the root words. This allows users to search for the root words individually.
    Annotations generated:
    • "compound": for example, compound="search server"
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • tokenizeAnnotations boolean
    • doCamelCase boolean
    • doUnderscore boolean
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    Possible values: null, auto, customized, error.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    tokenizeAnnotations boolean True Subtokenizes "SearchServer" into "Search" "Server" automatically, and keeps the original annotations.
    doCamelCase boolean True Separates compound words before each capital letter. For example, the annotation for "CamelCase" is compound="camel case".
    doUnderscore boolean True Separates multi-word strings wherever there is an underscore. For example, the annotation for "under_score" is compound="under score".

    Nested elements
    • fromDataModel SemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <CompoundWordSplitter name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" tokenizeAnnotations="true" doCamelCase="true" doUnderscore="true" ></CompoundWordSplitter>
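    As a rough illustration of the doCamelCase and doUnderscore rules described above, the following Python sketch reproduces the two documented examples. The function name split_compound and the regular expression are assumptions made for illustration only; they are not the product's implementation.

```python
import re


def split_compound(token, do_camel_case=True, do_underscore=True):
    """Return a lowercased, space-separated compound form of `token`.

    Hypothetical helper mirroring the doCamelCase / doUnderscore examples
    from the attribute table; not the actual CloudView code.
    """
    parts = [token]
    if do_underscore:
        # Split wherever there is an underscore, dropping empty pieces.
        parts = [p for part in parts for p in part.split("_") if p]
    if do_camel_case:
        # Insert a break before each capital letter that is not at the start.
        parts = [
            p
            for part in parts
            for p in re.sub(r"(?<!^)(?=[A-Z])", " ", part).split()
        ]
    return " ".join(p.lower() for p in parts)


print(split_compound("CamelCase"))    # camel case
print(split_compound("under_score"))  # under score
```

    Because the sketch breaks before every capital letter, a token such as "HTTPServer" would be split letter by letter; how the real processor handles such runs is not stated in this reference.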
  • OntologyMatcher com.exalead.indexing.analysis.v10.OntologyMatcher
    An OntologyMatcher detects concepts defined in an ontology in the textual content of the Document Chunks.

    Typically, an ontology contains a list of business terms to be detected. Resulting Annotations are mapped to enable navigation by business concepts.
    Annotations generated:
    • Depends on the resource (See Pkg).
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • enableApproxMatching boolean
    • minWordSizeForDist1 int
    • minWordSizeForDist2 int
    • resourceDir string
    • restrictLanguage boolean
    • keepLongestMatch boolean
    • keepLongestMatchInterTag boolean
    • tokenizeAnnotations boolean
    • annotationsToIgnore string
    • ignoreSpaces boolean
    • annotationPrefix string
    • trustLevelBasedDedup boolean
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    @enum{null,auto,customized, error}.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    enableApproxMatching boolean False Enables approximate matching in the ontology. Approximate matching uses the Damerau-Levenshtein edit distance.
    minWordSizeForDist1 int 3 Minimum number of characters in a token to allow a Damerau-Levenshtein distance of 1.
    minWordSizeForDist2 int 8 Minimum number of characters in a token to allow a Damerau-Levenshtein distance of 2.
    resourceDir string - URL for the directory containing the ontology (data://, file:// or resource://).
    restrictLanguage boolean True Keeps only the expression added with language == Language.XX or with the document language.

    For example, if the Ontology contains an expression added with language=En, it will be extracted only for an English document if restrictLanguage is set to true.
    keepLongestMatch boolean True Keeps only the longest match.

    For example, if you have 5 tokens ('a', 'b', 'c', 'd', 'e') and 4 annotations 'a', 'a-c', 'b-c-d' and 'd-e', this option will only keep 'b-c-d' and remove all other annotations.
    keepLongestMatchInterTag boolean False Keeps only the longest match (tag independent).

    For example, if you have 5 tokens ('a', 'b', 'c', 'd', 'e') and 4 annotations 'a', 'a-c', 'b-c-d' and 'd-e', this option will only keep 'b-c-d' and remove all other annotations.
    tokenizeAnnotations boolean False If you have multi-token annotations (such as a "super market" annotation on the token "supermarket"), this option automatically subtokenizes "supermarket" into "super" and "market" and keeps the original annotations.

    If you enable this option, keepLongestMatch and keepLongestMatchInterTag will be set to true.
    annotationsToIgnore string - Sets the list of annotations to be ignored (comma-separated).

    This feature allows you to define a list of words/expressions to ignore in the recognition of this ontology.

    For example, if you add:
    • the expressions "of" and "the" with the tag "toIgnore" in ontology A,
    • and the expression "website embassy" in ontology B with annotationsToIgnore="toIgnore",
    ... you will be able to match "website of the embassy", "website of embassy" and "website embassy".
    ignoreSpaces boolean False If your ontology was compiled with matchOnSeparators=false, this allows 'lemonde' to retrieve 'le monde' or 'le monde' to retrieve 'lemonde'.

    If your ontology was compiled with matchOnSeparators=true, this allows 'le monde' to retrieve 'le monde'.
    annotationPrefix string - A prefix to add to each annotation tag.

    For example, if the package of the entry matched in the ontology is "exalead.location.country" and the annotationPrefix is "myOntology_", an annotation will be added with the tag "myOntology_exalead.location.country".
    trustLevelBasedDedup boolean False Keeps only the annotation with the highest trust level when several entries from a package match the same text chunk.

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <OntologyMatcher name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" enableApproxMatching="false" minWordSizeForDist1="3" minWordSizeForDist2="8" resourceDir="" restrictLanguage="true" keepLongestMatch="true" keepLongestMatchInterTag="false" tokenizeAnnotations="false" annotationsToIgnore="" ignoreSpaces="false" annotationPrefix="" trustLevelBasedDedup="false" ></OntologyMatcher>
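    To make the interplay of enableApproxMatching, minWordSizeForDist1 and minWordSizeForDist2 more concrete, here is a minimal Python sketch. It assumes the thresholds apply to the length of the input token and uses the optimal string alignment variant of the Damerau-Levenshtein distance; the variant actually used by the product is not stated here, and all function names are hypothetical.

```python
def osa_distance(a, b):
    """Optimal string alignment distance (restricted Damerau-Levenshtein)."""
    rows, cols = len(a) + 1, len(b) + 1
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i
    for j in range(cols):
        d[0][j] = j
    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # substitution
            )
            if i > 1 and j > 1 and a[i - 1] == b[j - 2] and a[i - 2] == b[j - 1]:
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)  # transposition
    return d[-1][-1]


def max_allowed_distance(token, min_size_dist1=3, min_size_dist2=8):
    """Assumed reading of minWordSizeForDist1 / minWordSizeForDist2."""
    if len(token) >= min_size_dist2:
        return 2
    if len(token) >= min_size_dist1:
        return 1
    return 0


def approx_match(token, entry, enable_approx=True,
                 min_size_dist1=3, min_size_dist2=8):
    """Return True if `token` matches the ontology `entry` under the settings."""
    if not enable_approx:
        return token == entry
    allowed = max_allowed_distance(token, min_size_dist1, min_size_dist2)
    return osa_distance(token, entry) <= allowed
```

    With the default thresholds, a 7-character token such as "embasys" may differ from "embassy" by one edit (here, a transposition), while a 2-character token must match exactly.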
  • NamedEntitiesMatcher com.exalead.indexing.analysis.v10.NamedEntitiesMatcher
    The Named Entities Matcher detects named entities such as people, organizations, or places, in the textual content of the document.

    It generates annotations like NE.person or NE.organization, using ontology-based matching and/or rule-based matching.
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • resourceDir string
    • rules string
    • prefix string
    • language string
    • partOfSpeechFiltering boolean
    • useKnownWordsForDisambiguisation boolean
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    @enum{null,auto,customized, error}.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    resourceDir string - URL for the resource (data://, file:// or resource://).
    rules string ne Defines which entities will be extracted:
    • The default value, ne, triggers the extraction of people, organizations, locations and events.
    • The value ne-all triggers the extraction of all types of entities.
    prefix string NE Prefix to add in front of each annotation generated by the named entity matcher.
    language string - Languages for which the processor is activated; if no language is specified, the processor is activated for all languages.
    partOfSpeechFiltering boolean True Discards annotations on text spans made of a name followed by a verb or an adverb whose first letter is uppercase.

    This filter is useful if your documents contain a lot of titles with several capitalized words (what is called 'Title Case').

    It applies to NE.person, NE.place and NE.organization.
    useKnownWordsForDisambiguisation boolean True Uses a resource of known words to disambiguate named entity candidates. It works only for English and French.

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <NamedEntitiesMatcher name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="" prefix="NE" language="en" partOfSpeechFiltering="true" useKnownWordsForDisambiguisation="true" ></NamedEntitiesMatcher>
  • Classifier com.exalead.indexing.analysis.v10.Classifier
    A Classifier classifies a whole document according to the existing annotations on selected Document Chunks.

    The annotations are matched against a learning resource.
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • resourceDir string
    • annotationName string
    • language iso code
    • excludedLanguages string
    • addAnnotationsOnKeywords boolean
    • maxAnnotations int
    • minTrustLevel int
    • maxKeywords int
    • minKeywords int
    • collapseToken boolean
    • extraPrefixAnnotations string
    • extraAnnotationsMinTrustLevel int
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    @enum{null,auto,customized, error}.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    resourceDir string - URL for the vocabulary resource (data://, file:// or resource://).
    annotationName string - Name of the annotation to add.
    language iso code - Language for which the vocabulary classifier is activated.
    excludedLanguages string - Comma-separated list of languages for which the vocabulary classifier is deactivated (works only if language=xx).
    addAnnotationsOnKeywords boolean False If true, it adds annotations to all matching tokens.
    maxAnnotations int -1 Maximum number of annotations per document.
    minTrustLevel int 0 The minimum trust level of categories to keep.
    maxKeywords int -1 The maximum number of keywords to keep.
    minKeywords int 1 The minimum number of keywords per class.
    collapseToken boolean False If true, all identical tokens are collapsed.
    extraPrefixAnnotations string - The optional list of prefix annotations to keep (comma-separated).
    extraAnnotationsMinTrustLevel int 100 The minimum trust level to keep an extra annotation.

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <Classifier name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="" annotationName="" language="en" excludedLanguages="" addAnnotationsOnKeywords="false" maxAnnotations="-1" minTrustLevel="0" maxKeywords="-1" minKeywords="1" collapseToken="false" extraPrefixAnnotations="" extraAnnotationsMinTrustLevel="100" ></Classifier>
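    One possible reading of how minTrustLevel, minKeywords and maxAnnotations combine is sketched below in Python. This is an assumption-laden illustration: the candidate tuple layout, the ordering by trust level, and the interpretation of -1 as "no limit" are not confirmed by this reference, and select_categories is a hypothetical name.

```python
def select_categories(candidates, min_trust_level=0, min_keywords=1,
                      max_annotations=-1):
    """Filter candidate categories for one document.

    `candidates` is a list of (category, trust_level, keyword_count) tuples.
    Hypothetical sketch only; the real Classifier may behave differently.
    """
    # Keep categories that reach both the trust and the keyword thresholds.
    kept = [
        c for c in candidates
        if c[1] >= min_trust_level and c[2] >= min_keywords
    ]
    # Assumption: the most trusted categories are preferred when capping.
    kept.sort(key=lambda c: c[1], reverse=True)
    # Assumption: a negative maxAnnotations means "no limit".
    if max_annotations >= 0:
        kept = kept[:max_annotations]
    return [c[0] for c in kept]


candidates = [("sports", 80, 4), ("finance", 40, 2), ("travel", 90, 0)]
print(select_categories(candidates, min_trust_level=50))  # ['sports']
```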
  • HierarchicalClassifier com.exalead.indexing.analysis.v10.HierarchicalClassifier
    A Classifier classifies a whole document according to the existing annotations on selected Document Chunks.

    The annotations are matched against a learning resource.
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • resourceDir string
    • annotationName string
    • language iso code
    • excludedLanguages string
    • addAnnotationsOnKeywords boolean
    • maxAnnotations int
    • minTrustLevel int
    • maxKeywords int
    • minKeywords int
    • collapseToken boolean
    • extraPrefixAnnotations string
    • extraAnnotationsMinTrustLevel int
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    Name Type Default value Description
    resourceDir string - URL for the vocabulary resource (data://, file:// or resource://).
    annotationName string - Name of the annotation to add.
    language iso code - Language for which the vocabulary classifier is activated.
    excludedLanguages string - Comma-separated list of languages for which the vocabulary classifier is deactivated (works only if language=xx).
    addAnnotationsOnKeywords boolean False If true, it adds annotations to all matching tokens.
    maxAnnotations int -1 Maximum number of annotations per document.
    minTrustLevel int 0 The minimum trust level of categories to keep.
    maxKeywords int -1 The maximum number of keywords to keep.
    minKeywords int 1 The minimum number of keywords per class.
    collapseToken boolean False If true, all identical tokens are collapsed.
    extraPrefixAnnotations string - The optional list of prefix annotations to keep (comma-separated).
    extraAnnotationsMinTrustLevel int 100 The minimum trust level to keep an extra annotation.
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    @enum{null,auto,customized, error}.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <HierarchicalClassifier resourceDir="" annotationName="" language="en" excludedLanguages="" addAnnotationsOnKeywords="false" maxAnnotations="-1" minTrustLevel="0" maxKeywords="-1" minKeywords="1" collapseToken="false" extraPrefixAnnotations="" extraAnnotationsMinTrustLevel="100" name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></HierarchicalClassifier>
  • RulesMatcher com.exalead.indexing.analysis.v10.RulesMatcher
    A RulesMatcher applies a rule engine to the textual content of the DocumentChunks.

    The rules are defined in a separate XML 'resourceFile' and combine regular expressions, word matching and boolean operators over content.
    Annotations generated:
    • The matching rule defined in the XML specifies the annotation to generate
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • resourceFile string
    • language iso code
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    @enum{null,auto,customized, error}.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    resourceFile string - URL for the resource (data://, file:// or resource://).
    language iso code - Language for which this processor is activated.

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <RulesMatcher name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceFile="" language="en" ></RulesMatcher>
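    The contexts attribute, shared by the semantic processors in this reference, is a comma-separated list where an empty value means "all Document Chunks". A minimal Python sketch of that selection rule follows; the function name applies_to_chunk and the trimming of whitespace around names are assumptions for illustration.

```python
def applies_to_chunk(contexts, chunk_context):
    """Return True if a processor with this `contexts` value handles the chunk.

    Hypothetical helper: an empty list means every chunk is processed.
    """
    # Assumption: surrounding whitespace in the list is not significant.
    names = [name.strip() for name in contexts.split(",") if name.strip()]
    return not names or chunk_context in names


print(applies_to_chunk("", "title"))             # True
print(applies_to_chunk("title, text", "text"))   # True
print(applies_to_chunk("title", "text"))         # False
```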
  • RelatedTerms com.exalead.indexing.analysis.v10.RelatedTerms
    Extracts all possible related terms.

    Only one instance of this processor may exist per input context.
    Annotations generated:
    • "relatedTerm": RelatedTerm identifier (stored in the dictionary and in the index)
    • "relatedTermDisplay": display form of the RelatedTerm (stored in the dictionary)
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • relatedTermsMinSpan int
    • relatedTermsMaxSpan int
    • maxRelatedTermsPerDoc int
    • keepLongestMatch boolean
    • dictionaryName string
    • preprocResourceDir string
    • whitelistResource string
    • blacklistResource string
    • withPartOfSpeech boolean
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    @enum{null,auto,customized, error}.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    relatedTermsMinSpan int 3 Minimum number of words (excluding stop words) in an automatically extracted term (does not apply to terms from the allow list).
    relatedTermsMaxSpan int 6 Maximum number of words (excluding stop words) in an automatically extracted term (does not apply to terms from the allow list).
    maxRelatedTermsPerDoc int 64 The maximum number of related terms per document.
    keepLongestMatch boolean True Keeps only the longest term when several overlap.

    For example, if you have 5 tokens ('a', 'b', 'c', 'd', 'e') and 4 related terms 'a', 'a-c', 'b-c-d' and 'd-e', this option will only keep 'b-c-d' and remove all other related terms.
    dictionaryName string - Name of the dictionary populated by terms extracted by this processor.
    If null, use the default dictionary.
    preprocResourceDir string - URL for the resource of the related terms preprocessor (data://, file:// or resource://).

    If null, the product's standard preprocessor is used.
    whitelistResource string - Path to a related terms allow list resource.
    blacklistResource string - Path to a related terms blocklist resource.
    withPartOfSpeech boolean True Adds a PartOfSpeechTagger to the list of processors automatically. Improves quality of automatically extracted terms.

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <RelatedTerms name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" relatedTermsMinSpan="3" relatedTermsMaxSpan="6" maxRelatedTermsPerDoc="64" keepLongestMatch="true" preprocResourceDir="" withPartOfSpeech="true" ></RelatedTerms>
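    The span limits can be pictured with the short Python sketch below, which counts only non-stop words when checking relatedTermsMinSpan and relatedTermsMaxSpan, then caps the result at maxRelatedTermsPerDoc. The stop-word list, the function names, and the choice to keep the first terms when capping are all assumptions; the product's own resources and selection order are not described here.

```python
# Hypothetical stop-word list, for illustration only.
STOP_WORDS = {"a", "an", "and", "in", "of", "the"}


def span(term):
    """Number of words in `term`, excluding stop words."""
    return sum(1 for word in term.lower().split() if word not in STOP_WORDS)


def filter_terms(candidates, min_span=3, max_span=6, max_per_doc=64):
    """Keep candidate terms whose span lies within [min_span, max_span]."""
    kept = [term for term in candidates if min_span <= span(term) <= max_span]
    # Assumption: when there are too many terms, the first ones are kept.
    return kept[:max_per_doc]


terms = ["bank of england", "bank of england governor", "data model"]
print(filter_terms(terms))  # ['bank of england governor']
```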
  • PartOfSpeechTagger com.exalead.indexing.analysis.v10.PartOfSpeechTagger
    A PartOfSpeechTagger detects the part of speech for each word in the text of Document Chunks.

    It improves the quality of other processors, such as the named entity detector or the sentiment analyzer.
    Annotations generated:
    • "tagger"
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • resourceDir string
    • language string
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    @enum{null,auto,customized, error}.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    resourceDir string - URL for the resource (data://, file:// or resource://).
    language string - Languages for which the processor is activated; if no language is specified, the processor is activated for all languages.

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <PartOfSpeechTagger name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="" language="en" ></PartOfSpeechTagger>
  • Phonetizer com.exalead.indexing.analysis.v10.Phonetizer
    Creates a phonetic form for each word.

    This processor is used:
    • as a helper for other processors (like Ontology Matcher, or Semantic Extractor), which need to perform phonetic matches.
    • to perform search-time phonetic analysis using the Phonetic expansion module (this creates the dictionary of phonetic forms that will be used by the expansion module at search-time).
    • to greatly improve the quality of spell checking.

    Annotations generated:
    • "phonetic"
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • resourceFile string
    • language string
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    @enum{null,auto,customized, error}.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    resourceFile string - URL for the resource (data://, file:// or resource://).
    language string - Languages for which the processor is activated; if no language is specified, the processor is activated for all languages.

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <Phonetizer name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceFile="" language="en" ></Phonetizer>
  • NGramsExtractor com.exalead.indexing.analysis.v10.NGramsExtractor
    Extracts normalized word-grams. N-grams are useful for spell checking and statistical processing.
    Annotations generated:
    • "ngram"
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • min int
    • max int
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    @enum{null,auto,customized, error}.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    min int 2 Minimum n-gram size, in words.
    max int 3 Maximum n-gram size, in words.

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <NGramsExtractor name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" min="2" max="3" ></NGramsExtractor>
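    As an illustration of what the min and max attributes control, the following Python sketch lists every word n-gram whose size lies between the two bounds. The function name word_ngrams is hypothetical, and lowercasing stands in for the unspecified normalization step.

```python
def word_ngrams(tokens, min_size=2, max_size=3):
    """Return all word n-grams of `tokens` with min_size <= n <= max_size."""
    # Assumption: normalization is approximated here by lowercasing.
    tokens = [token.lower() for token in tokens]
    grams = []
    for n in range(min_size, max_size + 1):
        for start in range(len(tokens) - n + 1):
            grams.append(" ".join(tokens[start:start + n]))
    return grams


print(word_ngrams(["The", "quick", "brown", "fox"]))
# ['the quick', 'quick brown', 'brown fox', 'the quick brown', 'quick brown fox']
```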
  • Lemmatizer com.exalead.indexing.analysis.v10.Lemmatizer
    Creates a lemmatized form for each word (nouns and adjectives only).

    This processor is mostly used as a helper for other processors (like Ontology Matcher, or Semantic Extractor), which need to perform lemmatized matches.

    Annotations generated:
    • "lemma": normalized lemmatized form of the word (singular/masculine)
    • "lemma_lowercase": lemmatized form of the word (singular/masculine)
    • "fsingular": normalized singular form of the word
    • "fsingular_lowercase": singular form of the word
    • "masculine": if the token is a masculine word
    • "feminine": if the token is a feminine word
    • "neuter": if the token is neuter
    • "singular": if the word is singular
    • "plural": if the word is plural
    • "unnumbered": if the word is unnumbered
    • "pos": the static Part of Speech
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • resourceDir string
    • language string
    • lemmatizeNormalizedAnnotations boolean
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    Possible values: null, auto, customized, error.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    resourceDir string - URL for the resource (data://, file:// or resource://).
    language string - Languages for which the processor is activated; if no language is specified, the processor is activated for all languages.
    lemmatizeNormalizedAnnotations boolean False

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <Lemmatizer name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="" language="en" lemmatizeNormalizedAnnotations="false" ></Lemmatizer>
  • AcronymDetector com.exalead.indexing.analysis.v10.AcronymDetector
    Detects acronyms like 'o.n.u' and extracts 'onu'.
    '.', '-' and ' ' are the standard acronym separators.

    Custom alphanumeric separators can be added with the "separators" attribute.
    Annotations generated:
    • "acronym"
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • addNormalizerAnnotation boolean
    • separators string
    • language string
    • strict boolean
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    Possible values: null, auto, customized, error.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    addNormalizerAnnotation boolean False
    separators string - Comma-separated list of additional allowed separators (alphanumeric only; for example, 'and' to handle '1 and 1').
    language string - Languages for which the processor is activated; if no language is specified, the processor is activated for all languages.
    strict boolean True In strict mode, the dot is the only accepted separator.

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <AcronymDetector name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" addNormalizerAnnotation="false" separators="" language="en" strict="true" ></AcronymDetector>
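    As an illustration of the attributes above, the following sketch configures an AcronymDetector with non-strict matching and one custom separator. The name value is hypothetical. It is also an assumption that custom separators are only taken into account when strict is false, since strict mode is described as accepting the dot only.

    ```xml
    <!-- Hypothetical example: non-strict acronym detection for English. -->
    <!-- separators="and" follows the '1 and 1' case from the attribute description. -->
    <!-- Assumption: strict="false" is needed for custom separators to apply. -->
    <AcronymDetector name="acronyms_en" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" addNormalizerAnnotation="false" separators="and" language="en" strict="false" ></AcronymDetector>
    ```

    All other attributes keep the values shown in the XML representation.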
  • Normalizer com.exalead.indexing.analysis.v10.Normalizer
    Normalizes all tags listed in the inputTags attribute.
    Annotations generated:
    • "NORMALIZE"
    • "LOWERCASE"
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • inputTags string
    • trustLevel int
    • transliteration boolean
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    Possible values: null, auto, customized, error.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    inputTags string - Normalize all tags of "inputTags" (comma-separated list of tags).
    trustLevel int 100
    transliteration boolean True When normalizing, converts some characters to their Latin equivalent.

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <Normalizer name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" inputTags="" trustLevel="100" transliteration="true" ></Normalizer>
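    A minimal sketch of a Normalizer restricted to one input tag, with transliteration turned off. The name and the inputTags value are hypothetical; "acronym" is used only because it is an annotation name documented elsewhere in this reference, and whether it is a sensible tag to normalize depends on your pipeline.

    ```xml
    <!-- Hypothetical example: normalize a single tag and keep non-Latin characters as they are. -->
    <!-- inputTags takes a comma-separated list; only one tag is given here. -->
    <Normalizer name="normalize_acronyms" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" inputTags="acronym" trustLevel="100" transliteration="false" ></Normalizer>
    ```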
  • FarTextAnnotator com.exalead.indexing.analysis.v10.FarTextAnnotator
    A FarTextAnnotator annotates alphanumeric tokens with the value of 'annotation' when they are located beyond 'startOffset'.
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • startOffset int
    • annotation string
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    Possible values: null, auto, customized, error.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    startOffset int 8192
    annotation string fartext

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <FarTextAnnotator name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" startOffset="8192" annotation="fartext" ></FarTextAnnotator>
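    For illustration, the sketch below lowers the offset threshold and keeps the default annotation name. The name value and the threshold of 2048 are hypothetical. The unit of startOffset is not stated in this reference, so treat the number as an example only.

    ```xml
    <!-- Hypothetical example: annotate tokens located beyond offset 2048 with "fartext". -->
    <!-- The default startOffset is 8192; the unit is not documented here. -->
    <FarTextAnnotator name="far_text" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" startOffset="2048" annotation="fartext" ></FarTextAnnotator>
    ```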
  • Chunker com.exalead.indexing.analysis.v10.Chunker
    A chunker detects word groups (noun groups and the other group types listed below).
    Annotations generated:
    • "gadv": adverbial group
    • "gadj": adjectival group
    • "gnoun": noun group
    • "gverb": verbal group
    • "gprep": prepositional group
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • resourceDir string
    • language string
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    Possible values: null, auto, customized, error.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    resourceDir string - URL for the resource (data://, file:// or resource://).
    language string - Languages for which the processor is activated; if no language is specified, the processor is activated for all languages.

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <Chunker name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="" language="en" ></Chunker>
  • SentimentAnalyzer com.exalead.indexing.analysis.v10.SentimentAnalyzer
    Analyzes the nouns and adjectives present in the text. It detects topics and annotates the document with:
    • a global rating of good, bad or neutral
    • a rating per topic
    • the adjective(s) used in the document
    Required processors: Tokenizer, Lemmatizer, PartOfSpeechTagger, RelatedTermsPreprocessor, RelatedTermsExtractor, NamedEntitiesMatcher, Chunker.
    Annotations generated:
    • "sentiment": annotation on nouns with a modulated ("really", "quite", "not") appreciation
    • "document_sentiment": annotation on the document with either "good", "bad" or "neutral" and a confidence ratio
    Attribute defaults:
    • resourceDir: resource://sentiment/sentiment.bin
    • language: all supported languages
    • summarize: false
    • annotateGlobally: false
    • showPackage: false
    • packageCount: false
    • nounPackage (DEPRECATED): true
    • ignorePartOfSpeech: false
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • resourceDir string
    • language iso code
    • annotateGlobally boolean
    • annotatePronouns boolean
    • ignorePartOfSpeech boolean
    • ignoreRelatedTerms boolean
    • legacyAnnotations boolean
    • notApplicableAnnotations boolean
    • normalizeTrustLevels boolean
    • nounPackage boolean
    • packageCount boolean
    • showPackage boolean
    • suggest boolean
    • summarize boolean
    • suggestOutput string
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    Possible values: null, auto, customized, error.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    resourceDir string - URL for the resource (data://, file:// or resource://).
    language iso code -
    annotateGlobally boolean False
    annotatePronouns boolean False
    ignorePartOfSpeech boolean False
    ignoreRelatedTerms boolean False
    legacyAnnotations boolean False
    notApplicableAnnotations boolean True
    normalizeTrustLevels boolean True
    nounPackage boolean True
    packageCount boolean False
    showPackage boolean False
    suggest boolean False
    summarize boolean False
    suggestOutput string -

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <SentimentAnalyzer name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="" language="en" annotateGlobally="false" annotatePronouns="false" ignorePartOfSpeech="false" ignoreRelatedTerms="false" legacyAnnotations="false" notApplicableAnnotations="true" normalizeTrustLevels="true" nounPackage="true" packageCount="false" showPackage="false" suggest="false" summarize="false" suggestOutput="" ></SentimentAnalyzer>
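    A minimal sketch of a SentimentAnalyzer entry for English. The name value is hypothetical. The resourceDir value is the default resource mentioned in the description of this processor. The processors it requires (Tokenizer, Lemmatizer, PartOfSpeechTagger, RelatedTermsPreprocessor, RelatedTermsExtractor, NamedEntitiesMatcher, Chunker) are assumed to be declared earlier in the same pipeline and are not shown.

    ```xml
    <!-- Hypothetical example: sentiment analysis for English with the default resource. -->
    <!-- Assumption: the required processors already appear earlier in the pipeline. -->
    <SentimentAnalyzer name="sentiment_en" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="resource://sentiment/sentiment.bin" language="en" annotateGlobally="false" annotatePronouns="false" ignorePartOfSpeech="false" ignoreRelatedTerms="false" legacyAnnotations="false" notApplicableAnnotations="true" normalizeTrustLevels="true" nounPackage="true" packageCount="false" showPackage="false" suggest="false" summarize="false" suggestOutput="" ></SentimentAnalyzer>
    ```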
  • FastRulesMatcher com.exalead.indexing.analysis.v10.FastRulesMatcher
    Annotates a document using a set of XML rules, compiled for efficiency.

    The rules are described with the query language using the AND, OR and NOT operators, as well as 'context' matching operators.

    The rules can also match whole chunks (and not just words) using regular expressions.
    Annotations generated:
    • Depending on the resources (See FastRulesDefinition)
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • resourceDir string
    • allowsExprStartingBySeparators boolean
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    Possible values: null, auto, customized, error.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    resourceDir string - Directory containing the matcher resources. Must not be empty.
    allowsExprStartingBySeparators boolean False If you have expressions starting with a separator (",", ";", "&", ...), then you must set this option to true.

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <FastRulesMatcher name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="" allowsExprStartingBySeparators="false" ></FastRulesMatcher>
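    The sketch below shows a FastRulesMatcher whose rules include expressions starting with a separator, which requires allowsExprStartingBySeparators to be true. The name and the resourceDir value are hypothetical; replace the latter with the location of your own matcher resources, since resourceDir must not be empty.

    ```xml
    <!-- Hypothetical example: rules that may start with a separator such as "&" or ",". -->
    <!-- resourceDir is an illustrative location, not a resource shipped with the product. -->
    <FastRulesMatcher name="my_rules" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="resource://my_rules" allowsExprStartingBySeparators="true" ></FastRulesMatcher>
    ```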
  • SnowballStemmer com.exalead.indexing.analysis.v10.SnowballStemmer
    Creates the stemmed form of each word. This uses the Snowball stemming algorithms.

    This processor is mostly used as a helper for other processors (like Ontology Matcher, or Semantic Extractor), which need to perform stemmed matches.

    Annotations generated:
    • "stem"
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    Possible values: null, auto, customized, error.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <SnowballStemmer name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></SnowballStemmer>
  • DebugSemanticProcessor com.exalead.indexing.analysis.v10.DebugSemanticProcessor
    Dumps all annotated tokens in the specified format to standard output (the log of the 'Analysis' process), or to outputFile.
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputFile string
    • format enum(html, xml)
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    Possible values: null, auto, customized, error.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    outputFile string -
    format enum(html, xml) html Output format.

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <DebugSemanticProcessor name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputFile="" format="html" ></DebugSemanticProcessor>
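    As a sketch, the following entry dumps annotated tokens as XML into a file instead of standard output. The name and the outputFile path are hypothetical, and it is an assumption that outputFile accepts a plain file system path, since its expected form is not described here.

    ```xml
    <!-- Hypothetical example: write the annotated tokens to a file in XML format. -->
    <!-- Assumption: outputFile takes a file system path; the path is illustrative. -->
    <DebugSemanticProcessor name="debug_dump" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputFile="/tmp/semantic_debug.xml" format="xml" ></DebugSemanticProcessor>
    ```

    Setting disabled to true is a simple way to keep the entry in the configuration without producing output.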
  • SQI (deprecated) com.exalead.indexing.analysis.v10.SQI
    A SemanticProcessor applies semantic processing on the textual content of the DocumentChunks.

    A Semantic Processor creates SemanticAnnotations on tokens.

    These SemanticAnnotations can then be used in the Mapping.
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • resourceDir string
    • breakOnSentence boolean
    • breakOnParagraph boolean
    • breakOnLine boolean
    • matchAllRules boolean
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    Possible values: null, auto, customized, error.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    resourceDir string - URL for the resource (data://, file:// or resource://)
    breakOnSentence boolean False If true, there is at most one match per sentence, and no match spans several sentences.

    This option will add the SentenceFinder automatically.
    breakOnParagraph boolean True If true, there is at most one match per paragraph, and no match spans several paragraphs.
    breakOnLine boolean False If true, there is at most one match per line, and no match spans several lines.
    matchAllRules boolean True If true, it returns the full list of matched rules.

    If false, it returns the first matched rule only.

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <SQI name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="" breakOnSentence="false" breakOnParagraph="true" breakOnLine="false" matchAllRules="true" ></SQI>
  • SemanticExtractor com.exalead.indexing.analysis.v10.SemanticExtractor
    The resource describes the features to extract, with their term, type and range for numerical values according to a set of rules.
    Annotations generated:
    • Depending on the resource (See SemanticExtractorConfig)
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • resourceDir string
    • prefix string
    • breakOnSentence boolean
    • breakOnParagraph boolean
    • breakOnLine boolean
    • matchAllRules boolean
    • language iso code
    • annotateUnusedTokensWith string
    • overlappingMatches boolean
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    Possible values: null, auto, customized, error.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    resourceDir string - URL of the compiled semantic extractor file. Use the format data://, file:// or resource://.
    prefix string - Output annotations prefix
    breakOnSentence boolean False If true, there is at most one match per sentence, and no match spans several sentences.

    This option will add the SentenceFinder automatically.
    breakOnParagraph boolean True If true, there is at most one match per paragraph, and no match spans several paragraphs.
    breakOnLine boolean False If true, there is at most one match per line, and no match spans several lines.
    matchAllRules boolean True If true, it returns the full list of matched rules.

    If false, it returns only the first matched rule.
    language iso code - Language for which the extractor is activated.
    If null, all languages are activated.
    annotateUnusedTokensWith string - Used in the context of query rewriting by the Semantic Query Analyzer.
    overlappingMatches boolean True If true, reports all matches even if their locations overlap.

    Only makes sense when matchAllRules is true.

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <SemanticExtractor name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="" prefix="" breakOnSentence="false" breakOnParagraph="true" breakOnLine="false" matchAllRules="true" language="en" overlappingMatches="true" ></SemanticExtractor>
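    To illustrate how the matching options combine, the sketch below restricts matches to single sentences while still reporting every matched rule, including overlapping ones. The name, resourceDir and prefix values are hypothetical; resourceDir must point to your own compiled semantic extractor file.

    ```xml
    <!-- Hypothetical example: sentence-scoped extraction that reports all matched rules. -->
    <!-- breakOnSentence="true" also adds the SentenceFinder automatically. -->
    <!-- overlappingMatches="true" is only meaningful because matchAllRules="true". -->
    <SemanticExtractor name="product_features" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceDir="data://my_extractor/compiled.bin" prefix="feature_" breakOnSentence="true" breakOnParagraph="true" breakOnLine="false" matchAllRules="true" language="en" overlappingMatches="true" ></SemanticExtractor>
    ```

    Setting matchAllRules to false would return only the first matched rule, in which case overlappingMatches has no practical effect.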
  • ProximityProcessor com.exalead.indexing.analysis.v10.ProximityProcessor
    A proximity processor detects and annotates pieces of text where several annotations occur within given distance constraints.
    Possible constraints (not mutually exclusive):
    • token window size
    • distance between annotations
    • sentence/paragraph scope

    Annotations generated:
    • Depending on the resource (See Proximity)
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • resourceFile string
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
    Possible values: null, auto, customized, error.
    If null, this semantic processor is not related to the data model.

    If "auto", this semantic processor is auto-generated by the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    resourceFile string - URL for the resource (data://, file:// or resource://)

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <ProximityProcessor name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceFile="" ></ProximityProcessor>
  • AnnotationManager com.exalead.indexing.analysis.v10.AnnotationManager
    An annotation manager implements basic operations on annotations: copy/removal/selection according to a number of conditions like:
    • Removal of overlapping annotations
    • Selection of the most frequent annotations
    • Copy of an annotation unless it is blocklisted
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • resourceFile string
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
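    Possible values: null, auto, customized, error.
    • If null, this semantic processor is not related to the data model.
    • If "auto", this semantic processor is auto-generated by the data model.
    • If "customized", this semantic processor was auto-generated by the data model and then customized.
    • If "error", there is a conflict between this semantic processor and the data model.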
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    resourceFile string - URL for the resource (data://, file:// or resource://)

    Nested elements
    • fromDataModelSemanticProcessor
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <AnnotationManager name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" resourceFile="" ></AnnotationManager>
  • CustomSemanticProcessor com.exalead.indexing.analysis.v10.CustomSemanticProcessor
    A custom semantic processor allows you to plug in custom code in the semantic pipeline.
    Parent objects
    • AnalysisPipeline
    • SemanticPipeDocumentProcessor
    • SemanticProcessorModule
    • SemanticQueryAnalysisConfig
    Attributes
    • name string
    • contexts string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • classId string
    Name Type Default value Description
    name string - Name of the Semantic Processor.

    This name is only used for tracing and debugging purposes.
    contexts string - Comma-separated list of the ContextNames of the Document Chunks on which this processor should be applied.

    If this list is empty, all DocumentChunks are processed.
    dataModelState string - Is this semantic processor managed by a data model?
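    Possible values: null, auto, customized, error.
    • If null, this semantic processor is not related to the data model.
    • If "auto", this semantic processor is auto-generated by the data model.
    • If "customized", this semantic processor was auto-generated by the data model and then customized.
    • If "error", there is a conflict between this semantic processor and the data model.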
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor
    classId string - The specified class must implement the com.exalead.indexing.analysis.semantic.CustomSemanticProcessorInterface Exascript interface.

    Nested elements
    • fromDataModelSemanticProcessor
    • KeyValue*
    Type Wrapper Description
    SemanticProcessor fromDataModel If dataModelState is "customized", you will find here the original semantic processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    KeyValue* -
    XML representation
    <CustomSemanticProcessor name="" contexts="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" classId="" ></CustomSemanticProcessor>
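    As an illustration, a custom processor restricted to two contexts might be declared as follows. This is a minimal sketch: the class name com.example.MySemanticProcessor and the context names title and text are hypothetical, and it assumes that omitted attributes keep their default values.

    ```xml
    <!-- Hypothetical example: classId and contexts are placeholders, not shipped components. -->
    <CustomSemanticProcessor name="my_custom_processor" contexts="title,text" disabled="false" classId="com.example.MySemanticProcessor" ></CustomSemanticProcessor>
    ```

    The class named in classId must implement the interface given in the attribute description. KeyValue nested elements are left out because their XML form is not shown in this section.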
  • PrintfValues com.exalead.indexing.analysis.v10.PrintfValues
    Prints textual content of DocumentChunks according to a formatting string.

    This string contains variables in one of the 3 following formats:
    1. $(name), the name of a context: output is the textual content of this context.
    2. $/name:regexp/, the name of a context whose chunks must match the regexp: output is the piece of text that has matched.
    3. $/name:regexp:format/, the name of a context whose chunks must match the regexp: output is defined by a sed-like format referencing the regexp subexpressions.

    Warning: In the regexp and format parts, colons and slashes must be escaped with a backslash.

    For example: "$(firstname) $(lastname) : $/age:[0-9]+/ $/date:([0-9]{2})([0-9]{2})([0-9]{4}):day=\\1 month=\\2 year=\\3/"

    Warning: The context used in this method cannot be produced by another processor. It should come from the connector.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • formattingString string
    • outputContext string
    • strict boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    formattingString string - This string contains variables in one of the 3 following formats:
    1. $(name), the name of a context: output is the textual content of this context.
    2. $/name:regexp/, the name of a context whose chunks must match the regexp: output is the piece of text that has matched.
    3. $/name:regexp:format/, the name of a context whose chunks must match the regexp: output is defined by a sed-like format referencing the regexp subexpressions.

    Warning: Colons and slashes must be escaped with a backslash.

    For example: "$(firstname) $(lastname) : $/age:[0-9]+/ $/date:([0-9]{2})([0-9]{2})([0-9]{4}):day=\\1 month=\\2 year=\\3/"
    outputContext string - ContextName to be associated with the DocumentChunk created for each generated value.
    strict boolean True If true, all the contexts referenced in the formatting string must be found for the value to be generated.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <PrintfValues name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" formattingString="" outputContext="" strict="true" ></PrintfValues>
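    For instance, the first two variable formats could be combined to build a display value. This is a minimal sketch, assuming the connector pushes contexts named firstname, lastname and age (hypothetical names) and that fullname_age is an otherwise unused output context.

    ```xml
    <!-- Hypothetical context names; the formatting string reuses the start of the documented example. -->
    <PrintfValues name="build_fullname" disabled="false" formattingString="$(firstname) $(lastname) : $/age:[0-9]+/" outputContext="fullname_age" strict="true" ></PrintfValues>
    ```

    Under these assumptions, each generated chunk would carry the first name, the last name, and the digits matched in the age context. The third format is not used here, which sidesteps the question of how backslashes must be written in the XML attribute.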
  • RenameUnmappedContexts com.exalead.indexing.analysis.v10.RenameUnmappedContexts
    This Document Processor changes the ContextName for all DocumentChunks associated with a ContextName that does not have a Mapping Configuration.

    This avoids extensive renaming using RenameContext.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - The new ContextName for DocumentChunks with an unmapped ContextName.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <RenameUnmappedContexts name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" ></RenameUnmappedContexts>
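    A possible declaration, given as a sketch only: the output context name unmapped_meta is hypothetical, and omitted attributes are assumed to keep their default values.

    ```xml
    <!-- Hypothetical example: every chunk whose ContextName has no mapping is renamed to unmapped_meta. -->
    <RenameUnmappedContexts name="collect_unmapped" disabled="false" outputContext="unmapped_meta" ></RenameUnmappedContexts>
    ```

    A single mapping on unmapped_meta could then presumably handle all such chunks, instead of one RenameContext per context.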
  • NewChunk com.exalead.indexing.analysis.v10.NewChunk
    Creates a new DocumentChunk with 'outputContext' as ContextName, and textual content specified in 'value'.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    • value string
    • partName string
    • language iso code
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - The ContextName used for newly created chunks.
    value string - The value used for newly created chunks.
    partName string - The part to which the chunk should belong.

    If nothing is specified here, the chunk will be handled as a global chunk.
    language iso code - Language of the chunk, as an ISO 639 code.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <NewChunk name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" value="" partName="" language="en" ></NewChunk>
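    As a sketch of typical use, the following declaration would tag every document with a constant value. The context name source and the value intranet are hypothetical. partName is left out, so the chunk is handled as a global chunk as described above.

    ```xml
    <!-- Hypothetical example: adds a global chunk "intranet" in context "source". -->
    <NewChunk name="tag_source" disabled="false" outputContext="source" value="intranet" language="en" ></NewChunk>
    ```

    Restricting the processor to some documents only would rely on the AcceptCondition nested element, whose XML form is not shown in this section.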
  • UniformRandomContextGenerator com.exalead.indexing.analysis.v10.UniformRandomContextGenerator
    Adds a new DocumentChunk for one document out of 'modulo' documents processed.

    The textual content of the DocumentChunk is picked out of the list specified in 'values', with a uniform distribution.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    • modulo int
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - The ContextName used for newly created chunks.
    modulo int - Inverse probability of adding the new chunk. Must be a strictly positive integer.

    Nested elements
    • fromDataModelDocumentProcessor
    • valuesStringValue*
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    StringValue* values List of possible values.
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <UniformRandomContextGenerator name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" modulo="" ></UniformRandomContextGenerator>
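    To make the role of 'modulo' concrete, here is a sketch in which roughly one document in ten would receive the new chunk (inverse probability 10). The context name sample_group is hypothetical. The 'values' list is a nested element whose XML form is not shown in this section, so it is left out of the fragment.

    ```xml
    <!-- Hypothetical example: modulo="10" means a probability of 1/10 of adding the chunk. -->
    <UniformRandomContextGenerator name="random_sample" disabled="false" outputContext="sample_group" modulo="10" ></UniformRandomContextGenerator>
    ```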
  • ZipfRandomContextGenerator com.exalead.indexing.analysis.v10.ZipfRandomContextGenerator
    Adds a new DocumentChunk for one document out of 'modulo' documents processed.

    The textual content of the DocumentChunk is picked out of the list specified in 'values', with a non-uniform discrete Zipf distribution.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • outputContext string
    • modulo int
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • parameter double
    Name Type Default value Description
    outputContext string - The ContextName used for newly created chunks.
    modulo int - Inverse probability of adding the new chunk. Must be a strictly positive integer.
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    parameter double - The exponent characterizing the distribution.

    Nested elements
    • valuesStringValue*
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    StringValue* values List of possible values.
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <ZipfRandomContextGenerator outputContext="" modulo="" name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" parameter="" ></ZipfRandomContextGenerator>
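    A sketch of a possible declaration follows. The context name category_sample and the exponent value 1.0 are illustrative assumptions; this section does not state which range of 'parameter' is accepted. The nested 'values' list is omitted because its XML form is not shown here.

    ```xml
    <!-- Hypothetical example: about one document in 100 gets a chunk; parameter is the Zipf exponent. -->
    <ZipfRandomContextGenerator outputContext="category_sample" modulo="100" name="zipf_sample" disabled="false" parameter="1.0" ></ZipfRandomContextGenerator>
    ```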
  • HTMLRelevantContentExtractor com.exalead.indexing.analysis.v10.HTMLRelevantContentExtractor
    The HTMLRelevantContentExtractor extracts the most relevant parts of an HTML document.

    Generally, the relevant part of an HTML document is the article in the middle of the page. The header, the footer and the menus are often the same on all pages and should not be indexed.

    The extraction can be tuned using different attributes.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • relevantChunkContext string
    • newContextName string
    • irrelevantChunkContext string
    • retrieveFieldContext string
    • irrelevantChunkAnnotation string
    • minScore int
    • minParagraphWords int
    • minTitleWords int
    • linkAllowedInTitle boolean
    • paragraphBoost int
    • maxWordInLinkRatio int
    • titleBoost int
    • classBoost int
    • keepOnlyBestChunk boolean
    • skipBlockquotes boolean
    • skipPre boolean
    • keepImages boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    relevantChunkContext string relevantcontent Relevant text chunks will be copied in this context.
    newContextName string relevantcontent Deprecated, use 'relevantChunkContext'.
    irrelevantChunkContext string excludedcontent Irrelevant text chunks will be copied in this context.
    retrieveFieldContext string htmlcontent Original text chunks will be moved in this context.
    irrelevantChunkAnnotation string - If set, the HTMLRelevantContentExtractor will annotate each irrelevant chunk with an annotation.
    minScore int 15 Internally, the HTMLRelevantContentExtractor assigns a score to each chunk of its input. Use 'minScore' to keep only chunks with a score greater than this value.
    minParagraphWords int 7 The minimum number of words a <p> chunk must have to be considered a paragraph and be boosted.
    minTitleWords int 3 The minimum number of words a title must have to be boosted.
    linkAllowedInTitle boolean True By default, links contained in a page title incur a score penalty; this can be disabled.
    paragraphBoost int 10 Each time a paragraph is detected, the score is increased by this value.
    maxWordInLinkRatio int 2 The maximum allowed ratio of words contained in links in a chunk of text.
    titleBoost int 5 Each time a title is detected, the score is increased by this value.
    classBoost int 10 Each time a CSS class included in 'idsAndClassesToKeep' is detected, the score is increased by this value.
    keepOnlyBestChunk boolean False If true, the 'relevantcontent' is composed only of the main article of the page.
    skipBlockquotes boolean False If true, HTML blockquote tags are skipped.
    skipPre boolean False If true, HTML pre tags are skipped.
    keepImages boolean False If true, the HTML image annotations will be kept in the new context.

    Nested elements
    • fromDataModelDocumentProcessor
    • idsAndClassesToIgnoreStringValue*
    • idsAndClassesToKeepStringValue*
    • annotationsToCopyStringValue*
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    StringValue* idsAndClassesToIgnore The list of CSS classes and HTML ids to ignore.
    StringValue* idsAndClassesToKeep The list of CSS classes and HTML ids to boost.
    StringValue* annotationsToCopy The list of annotations to keep in the new context.
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <HTMLRelevantContentExtractor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" relevantChunkContext="relevantcontent" newContextName="relevantcontent" irrelevantChunkContext="excludedcontent" retrieveFieldContext="htmlcontent" irrelevantChunkAnnotation="" minScore="15" minParagraphWords="7" minTitleWords="3" linkAllowedInTitle="true" paragraphBoost="10" maxWordInLinkRatio="2" titleBoost="5" classBoost="10" keepOnlyBestChunk="false" skipBlockquotes="false" skipPre="false" keepImages="false" ></HTMLRelevantContentExtractor>
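    As an example of tuning, a stricter extraction that keeps only the main article might look like the sketch below. The chosen values (minScore raised to 20, keepOnlyBestChunk and skipPre enabled) are illustrative, not recommendations, and the fragment assumes that omitted attributes keep the defaults listed above.

    ```xml
    <!-- Hypothetical tuning: higher score threshold, main article only, pre tags skipped. -->
    <HTMLRelevantContentExtractor name="strict_relevant_content" disabled="false" relevantChunkContext="relevantcontent" irrelevantChunkContext="excludedcontent" retrieveFieldContext="htmlcontent" minScore="20" keepOnlyBestChunk="true" skipPre="true" ></HTMLRelevantContentExtractor>
    ```

    The deprecated 'newContextName' attribute is deliberately not set; 'relevantChunkContext' replaces it.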
  • HTMLTableExtractor com.exalead.indexing.analysis.v10.HTMLTableExtractor
    Extracts all HTML tables having minColumnsRequired < number of columns < maxColumnsRequired and duplicates them in the context named by 'newContextName'.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • newContextName string
    • minColumnsRequired int
    • maxColumnsRequired int
    • concatenateRows boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    newContextName string webtable The ContextName used for newly created chunks.
    minColumnsRequired int 2 The minimum number of columns a table must have to be extracted.
    maxColumnsRequired int 2147483647 The maximum number of columns a table may have to be extracted.
    concatenateRows boolean False Concatenates all rows.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <HTMLTableExtractor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" newContextName="webtable" minColumnsRequired="2" maxColumnsRequired="2147483647" concatenateRows="false" ></HTMLTableExtractor>
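    A sketch of a narrower configuration, for illustration only: the bounds 3 and 10 are arbitrary values, and whether a table with exactly 3 or 10 columns is extracted depends on how the bounds are compared, which this section does not settle.

    ```xml
    <!-- Hypothetical example: restricts extraction to mid-sized tables and concatenates their rows. -->
    <HTMLTableExtractor name="mid_size_tables" disabled="false" newContextName="webtable" minColumnsRequired="3" maxColumnsRequired="10" concatenateRows="true" ></HTMLTableExtractor>
    ```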
  • DiscardDocument com.exalead.indexing.analysis.v10.DiscardDocument
    DEPRECATED. Discards documents from the pipeline. It does not stop the processing of the document. To do so, add a custom document processor with the following code:
    document.setProcessingFlag(Operation.DISCARD_AND_DELETE);
    ((AnalysisDocumentProcessingContext) context).stopProcessingAfterCurrentProcessor();
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • logDiscardedDocuments boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    logDiscardedDocuments boolean False If true, the URI of each discarded document is logged in each analysis process log file.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <DiscardDocument name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" logDiscardedDocuments="false" ></DiscardDocument>
  • ReplaceContextNames com.exalead.indexing.analysis.v10.ReplaceContextNames
    Replaces the first matching substring of context names with the given replacement.

    For example, inputSubstring="abc" and outputReplacement="bar" will rename context abcdef to bardef and somethingabcstuff to somethingbarstuff.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • inputSubstring string
    • outputReplacement string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    inputSubstring string - The piece of string to be replaced.
    outputReplacement string - The replacement string.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <ReplaceContextNames name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" inputSubstring="" outputReplacement="" ></ReplaceContextNames>
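    The example from the description above, written as a configuration fragment. Only the processor name is an added, hypothetical value; omitted attributes are assumed to keep their defaults.

    ```xml
    <!-- Renames context abcdef to bardef and somethingabcstuff to somethingbarstuff. -->
    <ReplaceContextNames name="abc_to_bar" disabled="false" inputSubstring="abc" outputReplacement="bar" ></ReplaceContextNames>
    ```

    Since only the first matching substring is replaced, a context name containing "abc" twice would keep its second occurrence.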
  • HTMLCSSSelector com.exalead.indexing.analysis.v10.HTMLCSSSelector
    Deletes all text chunks that are not annotated with a class or an id specified in 'classes' or 'ids'.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor

    Nested elements
    • fromDataModelDocumentProcessor
    • classesStringValue*
    • idsStringValue*
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    StringValue* classes
    StringValue* ids
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <HTMLCSSSelector name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></HTMLCSSSelector>
  • HTMLCSSExtractor com.exalead.indexing.analysis.v10.HTMLCSSExtractor
    Extracts all text chunks annotated with a class or an id specified in 'classes' or 'ids', and duplicates them in the context specified by 'outputContext'.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - ContextName to be associated with the DocumentChunk created for each new context.

    Nested elements
    • fromDataModel DocumentProcessor
    • classes StringValue*
    • ids StringValue*
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert from the "customized" state to "auto".
    StringValue* classes List of classes used to determine whether a chunk must be duplicated.
    StringValue* ids List of ids used to determine whether a chunk must be duplicated.
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <HTMLCSSExtractor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" ></HTMLCSSExtractor>
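    The following is a minimal sketch of a configured HTMLCSSExtractor, for illustration only. The processor name, class name, id, and context name are hypothetical. The `value` attribute of StringValue is an assumption (it is not shown in this section), and the data model attributes are omitted on the assumption that they are optional.

    ```xml
    <!-- Hypothetical example: duplicates chunks annotated with class "article-content"
         or id "main" into the "article_body" context. -->
    <HTMLCSSExtractor name="extract-article-body" disabled="false" outputContext="article_body">
      <classes>
        <!-- Assumed StringValue syntax: one element per class name -->
        <StringValue value="article-content" />
      </classes>
      <ids>
        <!-- Assumed StringValue syntax: one element per id -->
        <StringValue value="main" />
      </ids>
    </HTMLCSSExtractor>
    ```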
  • DataModelClassResolver com.exalead.indexing.analysis.v10.DataModelClassResolver
    This processor takes the value of the "datamodel_class" papi directive to determine the DataModelClass of the document.

    If this directive is not found, the document is assumed to belong to the default class.

    If this is not the default class, all metas corresponding to an existing DataModelProperty are prefixed with the type of the class declaring the property (it may be a superclass of the class).

    For the processors following this processor in the pipeline, you must refer to the Data Model property by prefixing it with its class name. For processors preceding this processor in the pipeline, use the meta name only (without prefix).
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert from the "customized" state to "auto".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <DataModelClassResolver name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></DataModelClassResolver>
  • SetDefaultValue com.exalead.indexing.analysis.v10.SetDefaultValue
    This processor looks for the specified contexts. If they are not present in the document, they are created with a configured value.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    • KeyValue*
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert from the "customized" state to "auto".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    KeyValue* -
    XML representation
    <SetDefaultValue name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></SetDefaultValue>
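    As a rough sketch, a SetDefaultValue processor could be configured as shown below. This section does not document the KeyValue element, so the `key` and `value` attributes, and the reading of `key` as the context name and `value` as its default, are assumptions. The processor name and the context name are hypothetical.

    ```xml
    <!-- Hypothetical example: if the "language" context is absent from the document,
         create it with the value "en". -->
    <SetDefaultValue name="default-language" disabled="false">
      <!-- Assumed KeyValue syntax: key = context name, value = default value -->
      <KeyValue key="language" value="en" />
    </SetDefaultValue>
    ```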
  • CustomDocumentProcessor com.exalead.indexing.analysis.v10.CustomDocumentProcessor
    A Custom document processor allows you to plug in custom code packaged as a CVPlugin into the document processing pipeline.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • classId string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    classId string - Class identifier.
    The specified class must implement the com.exalead.pdoc.analysis.CustomDocumentProcessor Java Interface.

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    • KeyValue*
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert from the "customized" state to "auto".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    KeyValue* -
    XML representation
    <CustomDocumentProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" classId="" ></CustomDocumentProcessor>
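    A configuration for a custom processor might look like the sketch below. The class identifier `com.example.MyProcessor` is hypothetical and stands for a class in your own CVPlugin that implements the com.exalead.pdoc.analysis.CustomDocumentProcessor interface. The KeyValue attributes (`key`, `value`) and their use as plugin parameters are assumptions, since KeyValue is not documented in this section.

    ```xml
    <!-- Hypothetical example: plugs a custom class into the analysis pipeline. -->
    <CustomDocumentProcessor name="my-custom-processor" disabled="false" classId="com.example.MyProcessor">
      <!-- Assumed KeyValue syntax: a parameter passed to the custom code -->
      <KeyValue key="threshold" value="10" />
    </CustomDocumentProcessor>
    ```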
  • InferFileExtension com.exalead.indexing.analysis.v10.InferFileExtension
    When the file_extension meta is not present, finds the file extension based on the file name or the mime meta (if one of these two is present).
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert from the "customized" state to "auto".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <InferFileExtension name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></InferFileExtension>
  • InsertCurrentDate com.exalead.indexing.analysis.v10.InsertCurrentDate
    Adds the current date to an output context.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • outputContext string
    • format string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    outputContext string - The ContextName used for newly created chunks.
    format string - Either "unixts" or a SimpleDateFormat specification

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert from the "customized" state to "auto".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <InsertCurrentDate name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" outputContext="" format="" ></InsertCurrentDate>
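    The two values accepted by 'format' can be illustrated with the sketch below. The processor names and context names are hypothetical, and the data model attributes are omitted on the assumption that they are optional. "yyyy-MM-dd" is a standard SimpleDateFormat pattern (four-digit year, two-digit month, two-digit day).

    ```xml
    <!-- Hypothetical example 1: stores the current date as a Unix timestamp. -->
    <InsertCurrentDate name="indexing-timestamp" disabled="false" outputContext="indexed_at" format="unixts" />

    <!-- Hypothetical example 2: stores the current date using a SimpleDateFormat pattern. -->
    <InsertCurrentDate name="indexing-day" disabled="false" outputContext="indexed_day" format="yyyy-MM-dd" />
    ```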
  • XpathExtractor com.exalead.indexing.analysis.v10.XpathExtractor
    Extraction is performed for the following data types:
    • text/html. HTML Files.
    • application/xml. XML Files.

    Warning: Place this processor before the NativeTextExtractor, because the NativeTextExtractor deletes the 'bytes' of each Document Binary Part.

    Limitations: This extractor handles only XPath expressions that return a node set or a string, not expressions that return a number or a boolean. You can still use number or boolean functions inside an expression: //img[starts-with(@src, "http://")] works because it returns a set of <img> nodes, but count(//img) does not work because it returns a number.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • htmlParserToUse enum(htmlCleaner, tagSoup)
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    htmlParserToUse enum(htmlCleaner, tagSoup) htmlCleaner Preferred HTML parser.

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    • XpathRule*
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert from the "customized" state to "auto".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XpathRule* -
    XML representation
    <XpathExtractor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" htmlParserToUse="htmlCleaner" ></XpathExtractor>
  • XpathRule com.exalead.indexing.analysis.v10.XpathRule
    No documentation for this element.
    Parent objects
    • XpathExtractor
    Attributes
    • metaName string
    • xpath string
    • concatMutiMatch boolean
    Name Type Default value Description
    metaName string -
    xpath string -
    concatMutiMatch boolean True When the XPath expression returns several results, concatenates them all into a single value. Otherwise, each match is added as a separate value of a multivalued meta.

    Deselect this option if you want each node returned by the XPath expression stored as a separate value (for example, a list of items).

    XML representation
    <XpathRule metaName="" xpath="" concatMutiMatch="true" ></XpathRule>
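    To illustrate how XpathRule elements fit inside an XpathExtractor, here is a hedged sketch. The processor name, meta names, and XPath expressions are hypothetical. Both expressions return node sets, which the extractor supports. An expression such as count(//img) is deliberately left out because it returns a number.

    ```xml
    <!-- Hypothetical example: two rules, one concatenating matches and one keeping them separate. -->
    <XpathExtractor name="extract-headings-and-items" disabled="false" htmlParserToUse="htmlCleaner">
      <!-- All matching <h1> nodes are concatenated into a single value of the "page_heading" meta. -->
      <XpathRule metaName="page_heading" xpath="//h1" concatMutiMatch="true" />
      <!-- Each matching <li> node becomes a separate value of the multivalued "list_item" meta. -->
      <XpathRule metaName="list_item" xpath="//ul/li" concatMutiMatch="false" />
    </XpathExtractor>
    ```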
  • XpathFragmentExtractor com.exalead.indexing.analysis.v10.XpathFragmentExtractor
    Input: All DocumentChunks associated with the specified 'inputContext' ContextNames. The input can be an XML or HTML fragment.

    Output: DocumentChunks are created for each Xpath Fragment Rule. Each DocumentChunk is associated with the specified 'Meta name' ContextName.

    Warning: Place this processor before the NativeTextExtractor, because the NativeTextExtractor deletes the 'bytes' of each Document Binary Part.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • inputFragmentMeta string
    • parserToUse enum(htmlCleaner, tagSoup, xmlParser)
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    inputFragmentMeta string -
    parserToUse enum(htmlCleaner, tagSoup, xmlParser) xmlParser Preferred parser.

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    • XpathFragmentRule*
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert from the "customized" state to "auto".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XpathFragmentRule* -
    XML representation
    <XpathFragmentExtractor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" inputFragmentMeta="" parserToUse="xmlParser" ></XpathFragmentExtractor>
  • XpathFragmentRule com.exalead.indexing.analysis.v10.XpathFragmentRule
    No documentation for this element.
    Parent objects
    • XpathFragmentExtractor
    Attributes
    • metaName string
    • xpath string
    Name Type Default value Description
    metaName string -
    xpath string -

    XML representation
    <XpathFragmentRule metaName="" xpath="" ></XpathFragmentRule>
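    A possible XpathFragmentExtractor configuration is sketched below. The 'inputFragmentMeta' attribute is not described in this section, so treating it as the name of the context that holds the XML fragment is an assumption. The processor name, meta names, and XPath expressions are hypothetical.

    ```xml
    <!-- Hypothetical example: reads an XML fragment from "raw_fragment" and creates one
         DocumentChunk per rule, associated with the rule's meta name. -->
    <XpathFragmentExtractor name="extract-from-fragment" disabled="false" inputFragmentMeta="raw_fragment" parserToUse="xmlParser">
      <XpathFragmentRule metaName="product_name" xpath="/product/name" />
      <XpathFragmentRule metaName="product_price" xpath="/product/price" />
    </XpathFragmentExtractor>
    ```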
  • SimilarStringToPart com.exalead.indexing.analysis.v10.SimilarStringToPart
    Converts signatures stored in string format in a meta into a binary part.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • version int
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    version int 1 Specifies the version.

    Nested elements
    • fromDataModel DocumentProcessor
    • values StringValue*
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert from the "customized" state to "auto".
    StringValue* values List of the names of the metas to parse and convert into parts.
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <SimilarStringToPart name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" version="1" ></SimilarStringToPart>
  • DocumentProcessorGroup com.exalead.indexing.analysis.v10.DocumentProcessorGroup
    Contains a list of document processors that are executed only if the condition of this group matches.
    When several processors share the same condition, grouping them avoids duplicating the condition or creating separate pipelines.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    • DocumentProcessor*
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert from the "customized" state to "auto".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    DocumentProcessor* -
    XML representation
    <DocumentProcessorGroup name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" ></DocumentProcessorGroup>
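    The grouping idea can be sketched as follows. The processor names and context names are hypothetical. The AcceptCondition syntax is not covered in this section, so it appears only as a placeholder comment rather than a guessed element.

    ```xml
    <!-- Hypothetical example: two processors that share one condition, declared once on the group. -->
    <DocumentProcessorGroup name="html-only-processing" disabled="false">
      <!-- The AcceptCondition element for the group goes here (syntax not shown in this section). -->
      <HTMLCSSExtractor name="extract-article-body" disabled="false" outputContext="article_body" />
      <InsertCurrentDate name="indexing-timestamp" disabled="false" outputContext="indexed_at" format="unixts" />
    </DocumentProcessorGroup>
    ```

    Without the group, each of the two nested processors would need its own copy of the same condition.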
  • UnitsOfMeasurementNormalizer com.exalead.indexing.analysis.v10.UnitsOfMeasurementNormalizer
    Detects and converts units of measurement.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • indexField string
    • indexFieldUnitSymbol string
    • suffixName string
    • removeContext boolean
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    indexField string - The index field in which the value will be stored.
    indexFieldUnitSymbol string - The output unit symbol
    suffixName string _um Suffix used to name the new meta created as output.
    removeContext boolean False Remove contexts after processing

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert from the "customized" state to "auto".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <UnitsOfMeasurementNormalizer name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" indexField="" indexFieldUnitSymbol="" suffixName="_um" removeContext="false" ></UnitsOfMeasurementNormalizer>
  • DebugCrashProcessor com.exalead.indexing.analysis.v10.DebugCrashProcessor
    Causes crashes for debugging purposes.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • type string
    • delay int
    • count int
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    type string exception The crash type. Possible values: noop, exception, oom, infiniteloop, nullptr, abort, assert, segv, intdiv.
    delay int 0 Trigger delay in seconds.
    count int 3 Trigger document count.

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert from the "customized" state to "auto".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <DebugCrashProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" type="exception" delay="0" count="3" ></DebugCrashProcessor>
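    For a test environment, a non-default crash could be configured along the lines of the sketch below. The processor name is hypothetical. The comments reflect a presumed reading of 'delay' and 'count' based on their one-line descriptions, not confirmed behavior.

    ```xml
    <!-- Hypothetical example: simulates an out-of-memory crash.
         Presumably triggered once 10 documents have been processed, after a 5-second delay. -->
    <DebugCrashProcessor name="simulate-oom" disabled="false" type="oom" delay="5" count="10" />
    ```

    Setting disabled="true" leaves the processor in the pipeline without triggering it, which may be convenient between test runs.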
  • PLMExpandDocumentProcessor com.exalead.indexing.analysis.v10.PLMExpandDocumentProcessor
    Processes PLM metas to generate octrees and matrices for PLMExpand.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • metaMatrix string
    • fieldMatrix string
    • fieldInvMatrix string
    • metaCGR string
    • fieldOctree string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disable the DocumentProcessor
    metaMatrix string matrix Name of the meta containing the matrix data.
    fieldMatrix string matrix Name of the target matrix field.
    fieldInvMatrix string invmatrix Name of the target inverse matrix field.
    metaCGR string cgr Name of the meta containing the CGRs.
    fieldOctree string octree Name of target octree field.

    Nested elements
    • fromDataModel DocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert from the "customized" state to "auto".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <PLMExpandDocumentProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" metaMatrix="matrix" fieldMatrix="matrix" fieldInvMatrix="invmatrix" metaCGR="cgr" fieldOctree="octree" ></PLMExpandDocumentProcessor>
  • CGRDocumentProcessor com.exalead.indexing.analysis.v10.CGRDocumentProcessor
    Calls convert to generate octrees.
    Parent objects
    • AnalysisPipeline
    • DocumentProcessorGroup
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • disabled boolean
    • partCGR string
    • partOctree string
    • docIdentifyer string
    Name Type Default value Description
    name string - Name of this processor.

    The name of a processor is used only for tracing and debugging purposes.
    dataModelState string - Is this document processor managed by a data model?
    Possible values: null, auto, customized, error.
    • If null, this document processor is not related to a data model.
    • If "auto", this document processor is auto-generated by a data model.
    • If "customized", this document processor was auto-generated by a data model and then customized.
    • If "error", there is a conflict between this document processor and the data model.
    dataModelClass string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelClass that generated this DocumentProcessor.
    dataModelProperty string - If dataModelState is either "auto" or "customized", you will find here the name of the DataModelProperty that generated this DocumentProcessor.
    disabled boolean False Disables the DocumentProcessor.
    partCGR string CGR Name of the part containing the CGR data (tessellation).
    partOctree string octree Name of the part used to store the resulting octree.
    docIdentifyer string majorid Name of the meta identifying the document.

    Nested elements
    • fromDataModelDocumentProcessor
    • AcceptCondition
    Type Wrapper Description
    DocumentProcessor fromDataModel If dataModelState is "customized", you will find here the original document processor generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    AcceptCondition - Expresses the enablement condition of this DocumentProcessor.
    XML representation
    <CGRDocumentProcessor name="" dataModelState="" dataModelClass="" dataModelProperty="" disabled="false" partCGR="CGR" partOctree="octree" docIdentifyer="majorid" ></CGRDocumentProcessor>
  • FilteringConfiguration com.exalead.indexing.analysis.v10.FilteringConfiguration
    Filters to apply to the words extracted from the semantic processors.

    Words that do not satisfy these conditions will not be indexed.

    The filter values are expressed as a number of Unicode characters.
    Parent objects
    • AnalysisPipeline
    Attributes
    • wordMaxLength int
    • hexCharMax int
    • hexLengthMin int
    • maxNumChars int
    Name Type Default value Description
    wordMaxLength int 100 Maximum length of a word.
    hexCharMax int 0 Maximum number of hexadecimal characters that can appear in a word.

    This filter applies only to words longer than 'hexLengthMin'.
    0 = no filter (default value)
    hexLengthMin int 0 Minimum number of characters in a word for the hexadecimal filter to apply.
    0 = no filter (default value)
    maxNumChars int 0 Maximum number of characters in a word.
    0 = no filter (default value)

    XML representation
    <FilteringConfiguration wordMaxLength="100" hexCharMax="0" hexLengthMin="0" maxNumChars="0" ></FilteringConfiguration>
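    As an illustration of the hexadecimal filter described above, the following sketch keeps the default word length limit and applies the hexadecimal filter to words longer than 16 characters, allowing at most 8 hexadecimal characters in such words. The values 16 and 8 are arbitrary assumptions chosen for the example, not recommended settings.

    ```xml
    <!-- Hypothetical thresholds: adjust hexLengthMin and hexCharMax to your corpus -->
    <FilteringConfiguration wordMaxLength="100" hexCharMax="8" hexLengthMin="16" maxNumChars="0"></FilteringConfiguration>
    ```

    A configuration of this kind might help keep long hash-like tokens out of the index, since words that do not satisfy the conditions are not indexed.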
  • LanguageConfiguration com.exalead.indexing.analysis.v10.LanguageConfiguration
    Configuration of the linguistic extraction for a given language.
    Parent objects
    • AnalysisPipeline
    Attributes
    • language iso code
    • generateWordDict boolean
    • wordDictModulo int
    • maxWordDictWordsPerDocument long
    • maxExtractedWordLength int
    • spellCheckNGramMaxSize int
    • spellCheckNGramsDictModulo int
    • maxSpellCheckNGramsPerDocument long
    • maxExtractedSpellCheckNGramLength int
    • relatedTermsDictModulo int
    • maxRelatedTermsDictContextsPerDocument long
    Name Type Default value Description
    language iso code - The ISO code of the language.
    generateWordDict boolean False Extracts words for the global dictionary.
    wordDictModulo int 1 Word extraction modulo. By default, all words are extracted.
    maxWordDictWordsPerDocument long -1 Maximum number of words extracted per document.
    maxExtractedWordLength int 64 Maximum length of a word for it to be extracted.
    spellCheckNGramMaxSize int 3 Maximum number of consecutive words for spellchecking.

    If the value is set to '-1', spellcheck data is not generated for this language.
    The values 0 and 1 are illegal. The default is 3.
    spellCheckNGramsDictModulo int 5 N-gram extraction modulo. By default, 1 n-gram out of 5 is extracted.
    maxSpellCheckNGramsPerDocument long -1 Maximum number of n-grams extracted per document.
    maxExtractedSpellCheckNGramLength int 256 Maximum length of an n-gram for it to be extracted.
    relatedTermsDictModulo int 1 Submits 1 out of X documents for related terms generation.

    If the value is set to 0, related terms are not generated for this language.
    maxRelatedTermsDictContextsPerDocument long -1 Maximum number of related terms extracted per document.

    XML representation
    <LanguageConfiguration language="en" generateWordDict="false" wordDictModulo="1" maxWordDictWordsPerDocument="-1" maxExtractedWordLength="64" spellCheckNGramMaxSize="3" spellCheckNGramsDictModulo="5" maxSpellCheckNGramsPerDocument="-1" maxExtractedSpellCheckNGramLength="256" relatedTermsDictModulo="1" maxRelatedTermsDictContextsPerDocument="-1" ></LanguageConfiguration>
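    The special values described above can be combined as in the following sketch, which enables word extraction, turns off spellcheck data generation with spellCheckNGramMaxSize="-1", and submits 1 document out of 10 for related terms generation. The language code "fr" and the modulo value 10 are assumptions made for the example.

    ```xml
    <!-- Sketch only: "fr" and relatedTermsDictModulo="10" are illustrative values -->
    <LanguageConfiguration language="fr" generateWordDict="true" wordDictModulo="1" maxWordDictWordsPerDocument="-1" maxExtractedWordLength="64" spellCheckNGramMaxSize="-1" spellCheckNGramsDictModulo="5" maxSpellCheckNGramsPerDocument="-1" maxExtractedSpellCheckNGramLength="256" relatedTermsDictModulo="10" maxRelatedTermsDictContextsPerDocument="-1"></LanguageConfiguration>
    ```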
  • MappingConfiguration com.exalead.indexing.analysis.v10.MappingConfiguration
    Specifies how DocumentChunks and their SemanticAnnotations populate the index and the dictionary.
    Nested elements
    • AnnotationMapping*
    • ContextMapping*
    • FieldIndexingLimit*
    • FieldRetrievalLimit*
    • GenerateAnnotationsForContext*
    • PartMapping*
    • WordCountMapping*
    Type Wrapper Description
    AnnotationMapping* - List of mappings from annotations to index targets, with associated parameters.
    ContextMapping* - List of mappings from contexts to index targets, with associated parameters.
    FieldIndexingLimit* - Word count limits to apply to texts mapped to index fields for search.
    FieldRetrievalLimit* - Size limits (in bytes) to apply to texts mapped to the index for retrieval.
    GenerateAnnotationsForContext* - List of contexts to process with a semantic pipeline before mapping.
    PartMapping* - List of mappings from parts to index targets, with associated parameters.
    WordCountMapping* - Specifies where to map the word count.
    XML representation
    <MappingConfiguration ></MappingConfiguration>
  • AnnotationMapping com.exalead.indexing.analysis.v10.AnnotationMapping
    Defines how SemanticAnnotations are used to populate index fields.
    Parent objects
    • MappingConfiguration
    Attributes
    • name string
    • context string
    • patternMatch boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    name string - Name of the SemanticAnnotation to map.
    context string - Optional input context that restricts the mapping to annotations coming from a specific context.

    Incompatible with the patternMatch feature.
    patternMatch boolean False Matches all annotations matching this pattern (must be a valid regular expression).
    dataModelState string - Is this annotation mapping managed by a data model? Possible values: null, auto, customized.
    If null, this annotation mapping is not related to a data model.
    If "auto", this annotation mapping is auto-generated by a data model.
    If "customized", this annotation mapping was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this annotation mapping.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this annotation mapping.

    Nested elements
    • fromDataModelAnnotationMapping
    • AnnotationTarget*
    Type Wrapper Description
    AnnotationMapping fromDataModel If dataModelState is "customized", you will find here the original annotation mapping generated by the data model.
    Use this to easily show what reverting to "auto" from "customized" would imply.
    AnnotationTarget* -
    XML representation
    <AnnotationMapping name="" context="" patternMatch="false" dataModelState="" dataModelClass="" dataModelProperty="" ></AnnotationMapping>
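    A minimal sketch of a pattern-based mapping follows. It assumes that annotation targets are nested directly inside the AnnotationMapping element, as the "Nested elements" list suggests. The annotation names in the pattern and the index field "entities" are hypothetical. Because patternMatch is enabled, the context attribute is left empty (the two are documented as incompatible).

    ```xml
    <!-- Hypothetical names: "person", "organization" and "entities" are not part of the reference -->
    <AnnotationMapping name="person|organization" context="" patternMatch="true" dataModelState="" dataModelClass="" dataModelProperty="">
      <StandardAnnotationTarget indexField="entities" rankBoost="0" form="normalized" searchable="true" indexLevel="normalized" retrievable="false"></StandardAnnotationTarget>
    </AnnotationMapping>
    ```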
  • CategoryAnnotationTarget com.exalead.indexing.analysis.v10.CategoryAnnotationTarget
    CategoryAnnotationTarget is used to create a new category path inside an index category field, out of a SemanticAnnotation.

    The category path is built by the concatenation of the 'categoryRoot' and the selected 'form' of the annotation.
    Parent objects
    • AnnotationMapping
    Attributes
    • indexField string
    • forcedRank long
    • rankBoost long
    • form string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • categoryRoot string
    • categoryAppend boolean
    • appendAnnotationNameToRoot boolean
    • retrievable boolean
    • cleanupContent boolean
    • detectTitle boolean
    Name Type Default value Description
    indexField string -
    forcedRank long -
    rankBoost long 0
    form string normalized Form of the SemanticAnnotation value to index. Possible values: exact, normalized.
    dataModelState string - Is this annotation target managed by a data model?
    Possible values: null, auto, customized.

    If null, this annotation target is not related to a data model.

    If "auto", this annotation target is auto-generated by a data model.

    If "customized", this annotation target was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this AnnotationTarget.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this AnnotationTarget.
    categoryRoot string - Prefix used to build the CategoryPath.
    categoryAppend boolean True Builds the category path by concatenating the categoryRoot and the selected 'form' of the annotation.

    If false, only the category root will be used.
    appendAnnotationNameToRoot boolean False Appends the annotation name between the root and the value.
    retrievable boolean False If true, the category path is retrievable and can be used to create facets.

    If false, the category path is only searchable (advanced usage, langdate hacks).
    cleanupContent boolean True Removes trailing and leading spaces.

    Removes category paths that contain no alphanumeric character.
    detectTitle boolean False Detects the words that follow # in the path and uses them as the title.

    Nested elements
    • fromDataModelAnnotationTarget
    Type Wrapper Description
    AnnotationTarget fromDataModel If dataModelState is "customized", you will find here the original annotation target generated by the data model.

    Use this to easily see what reverting to "auto" from "customized" would imply.
    XML representation
    <CategoryAnnotationTarget indexField="" forcedRank="" rankBoost="0" form="normalized" dataModelState="" dataModelClass="" dataModelProperty="" categoryRoot="" categoryAppend="true" appendAnnotationNameToRoot="false" retrievable="false" cleanupContent="true" detectTitle="false" ></CategoryAnnotationTarget>
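    To make the path construction concrete, here is a hedged sketch of a target that builds a retrievable category path from the root, the annotation name, and the exact form of the annotation. The root "Top/entities" and the index field "categories" are assumed values, and the exact separator used when the parts are concatenated is not specified in this reference.

    ```xml
    <!-- Assumed values: categoryRoot and indexField are illustrative.
         With categoryAppend="true" and appendAnnotationNameToRoot="true", the path is built from
         the root, then the annotation name, then the selected form of the annotation. -->
    <CategoryAnnotationTarget indexField="categories" rankBoost="0" form="exact" categoryRoot="Top/entities" categoryAppend="true" appendAnnotationNameToRoot="true" retrievable="true" cleanupContent="true" detectTitle="false"></CategoryAnnotationTarget>
    ```

    Setting retrievable to true is what allows the resulting path to be used for facets; with the default (false), the path is only searchable.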
  • StandardAnnotationTarget com.exalead.indexing.analysis.v10.StandardAnnotationTarget
    StandardAnnotationTarget is used to index the textual content of a SemanticAnnotation.

    The selected 'form' of the SemanticAnnotation is used to populate an index field.
    Parent objects
    • AnnotationMapping
    Attributes
    • indexField string
    • forcedRank long
    • rankBoost long
    • form string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • searchable boolean
    • indexLevel string
    • customIndexKind int
    • retrievable boolean
    • retrieveField string
    Name Type Default value Description
    indexField string -
    forcedRank long -
    rankBoost long 0
    form string normalized Form of the SemanticAnnotation value to index. Possible values: exact, normalized.
    dataModelState string - Is this annotation target managed by a data model?
    Possible values: null, auto, customized.

    If null, this annotation target is not related to a data model.

    If "auto", this annotation target is auto-generated by a data model.

    If "customized", this annotation target was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this AnnotationTarget.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this AnnotationTarget.
    searchable boolean False If true, the SemanticAnnotation can be searched for.
    indexLevel string - If searchable, index kind where data will be indexed.

    Can be "exact", "lowercase", "normalized" or "custom".
    customIndexKind int - If indexLevel = "custom", this index kind will be used.
    retrievable boolean False If true, the SemanticAnnotation can be retrieved.
    retrieveField string - The field where the SemanticAnnotation is stored for retrieval, if 'retrievable' is set to true.
    If null, 'indexField' will be used to store the SemanticAnnotation for retrieval.

    Nested elements
    • fromDataModelAnnotationTarget
    Type Wrapper Description
    AnnotationTarget fromDataModel If dataModelState is "customized", you will find here the original annotation target generated by the data model.

    Use this to easily see what reverting to "auto" from "customized" would imply.
    XML representation
    <StandardAnnotationTarget indexField="" forcedRank="" rankBoost="0" form="normalized" dataModelState="" dataModelClass="" dataModelProperty="" searchable="false" indexLevel="" customIndexKind="" retrievable="false" retrieveField="" ></StandardAnnotationTarget>
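    The following sketch shows one way the searchable and retrievable options might be combined: the annotation is indexed at the "exact" index level in one field and stored for retrieval in a separate field through retrieveField. Both field names are hypothetical.

    ```xml
    <!-- Hypothetical field names: "product_code" (search) and "product_code_display" (retrieval) -->
    <StandardAnnotationTarget indexField="product_code" rankBoost="0" form="exact" searchable="true" indexLevel="exact" retrievable="true" retrieveField="product_code_display"></StandardAnnotationTarget>
    ```

    If retrieveField were left empty, the reference indicates that 'indexField' would be used to store the annotation for retrieval.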
  • EnumFacetAnnotationTarget com.exalead.indexing.analysis.v10.EnumFacetAnnotationTarget
    EnumFacetAnnotationTarget maps the annotations according to the specified EnumFacet.
    Parent objects
    • AnnotationMapping
    Attributes
    • indexField string
    • forcedRank long
    • rankBoost long
    • form string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • enumFacetId string
    Name Type Default value Description
    indexField string -
    forcedRank long -
    rankBoost long 0
    form string normalized Form of the SemanticAnnotation value to index. Possible values: exact, normalized.
    dataModelState string - Is this annotation target managed by a data model?
    Possible values: null, auto, customized.

    If null, this annotation target is not related to a data model.

    If "auto", this annotation target is auto-generated by a data model.

    If "customized", this annotation target was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this AnnotationTarget.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this AnnotationTarget.
    enumFacetId string - The id of the EnumFacet this target refers to.

    Nested elements
    • fromDataModelAnnotationTarget
    Type Wrapper Description
    AnnotationTarget fromDataModel If dataModelState is "customized", you will find here the original annotation target generated by the data model.

    Use this to easily see what reverting to "auto" from "customized" would imply.
    XML representation
    <EnumFacetAnnotationTarget indexField="" forcedRank="" rankBoost="0" form="normalized" dataModelState="" dataModelClass="" dataModelProperty="" enumFacetId="" ></EnumFacetAnnotationTarget>
  • ContextMapping com.exalead.indexing.analysis.v10.ContextMapping
    ContextMapping specifies how DocumentChunks with a given ContextName are remapped to index fields and whether they are used to populate the dictionary.
    Parent objects
    • MappingConfiguration
    Attributes
    • name string
    • prefixMatch boolean
    • unprefix boolean
    • patternMatch boolean
    • semantic boolean
    • resourceFreq int
    • tokenizationConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    name string - ContextName of the DocumentChunks to map.
    prefixMatch boolean False Matches all contexts that start with this prefix.
    unprefix boolean False Removes the prefix that was used to match.
    patternMatch boolean False Matches all contexts matching this pattern (must be a valid regular expression).
    semantic boolean True Performs semantic processing on the DocumentChunks processed by this mapping.

    If false, the textual content of the DocumentChunks will not be tokenized before indexing. This can be used to index 'exact raw values'.
    resourceFreq int 1 When extracting a resource, specifies the frequency to add for each entry.

    For example, if you have a 'firstname lastname' entry, you may want to simulate a frequency of 1000 to avoid spellcheck on this entry.
    tokenizationConfig string -
    dataModelState string - Is this context mapping managed by a data model? Possible values: null, auto, customized.
    If null, this context mapping is not related to a data model.
    If "auto", this context mapping is auto-generated by a data model.
    If "customized", this context mapping was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this context mapping.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this context mapping.

    Nested elements
    • fromDataModelContextMapping
    • Target*
    Type Wrapper Description
    ContextMapping fromDataModel If dataModelState is "customized", you will find here the original context mapping generated by the data model.

    Use this to easily show what reverting to "auto" from "customized" would imply.
    Target* -
    XML representation
    <ContextMapping name="" prefixMatch="false" unprefix="false" patternMatch="false" semantic="true" resourceFreq="1" tokenizationConfig="tok0" dataModelState="" dataModelClass="" dataModelProperty="" ></ContextMapping>
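    As a sketch of the 'exact raw values' use case mentioned for the semantic attribute, the mapping below disables semantic processing for a context so that its content is not tokenized before indexing. The context and field name "reference_code" is hypothetical, the nesting of the target inside ContextMapping is assumed from the "Nested elements" list, and attributes omitted on the target are assumed to take their default values.

    ```xml
    <!-- Hypothetical context/field name: "reference_code" -->
    <ContextMapping name="reference_code" prefixMatch="false" unprefix="false" patternMatch="false" semantic="false" resourceFreq="1" tokenizationConfig="tok0">
      <StandardContentTarget indexField="reference_code" searchable="true" retrievable="true"></StandardContentTarget>
    </ContextMapping>
    ```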
  • CategoryContentTarget com.exalead.indexing.analysis.v10.CategoryContentTarget
    CategoryContentTarget is used to map a DocumentChunk to a category.

    A Category Path is created for each DocumentChunk processed.

    The textual content of the DocumentChunk is used to build a Category Path.
    'indexField' should be a category field (usually called 'categories' or 'security').
    Parent objects
    • ContextMapping
    Attributes
    • indexField string
    • forcedRank long
    • rankBoost long
    • categoryRoot string
    • categoryAppend boolean
    • appendContextNameToRoot boolean
    • form string
    • retrievable boolean
    • cleanupContent boolean
    • detectTitle boolean
    Name Type Default value Description
    indexField string - The indexField to populate with this content.

    If null, the contextName of the DocumentChunk will be used for the index field.
    forcedRank long - Sets the ranking value for chunks in this mapping.
    -1 means that the chunk internal ranking value is kept.
    rankBoost long 0 Offsets the chunk internal ranking value.

    Use it only when forcedRank = -1

    For example, if forcedRank=-1, rankBoost=2, and the chunk internal ranking value is 4, the final rank will be 6.
    categoryRoot string - Prefix used to build the category path.
    categoryAppend boolean True Appends the textual content of the DocumentChunk to the category root.

    If false, only the category root will be used.
    appendContextNameToRoot boolean False Appends the context name between the root and the value.
    form string normalized The form of the word to be used to build the category path. Possible values: exact, normalized.
    retrievable boolean False Stores the category path, which enables display and navigation by category path.

    If false, only the SemanticAnnotation is indexed (advanced usage, langdate hacks).
    cleanupContent boolean True If true:
    • Removes trailing and leading Unicode spaces.
    • Replaces all sequences of Unicode space characters by a single 'space' character.
    • Does not map to the category in append mode if the DocumentChunk does not contain at least one Unicode alphanumeric character.
    detectTitle boolean False Detects the words that follow # in the path and uses them as the title.

    XML representation
    <CategoryContentTarget indexField="" forcedRank="" rankBoost="0" categoryRoot="" categoryAppend="true" appendContextNameToRoot="false" form="normalized" retrievable="false" cleanupContent="true" detectTitle="false" ></CategoryContentTarget>
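    The interaction between forcedRank and rankBoost described above can be sketched with the fragment below. The index field "categories" is the usual name mentioned in this section; the category root "Top/source" is a hypothetical value.

    ```xml
    <!-- forcedRank="-1" keeps the chunk internal ranking value; rankBoost="2" then offsets it,
         so a chunk whose internal ranking value is 4 ends up with a final rank of 6. -->
    <CategoryContentTarget indexField="categories" forcedRank="-1" rankBoost="2" categoryRoot="Top/source" categoryAppend="true" appendContextNameToRoot="false" form="normalized" retrievable="true" cleanupContent="true" detectTitle="false"></CategoryContentTarget>
    ```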
  • DateCategoryContentTarget com.exalead.indexing.analysis.v10.DateCategoryContentTarget
    CategoryContentTarget specific to date.
    Parent objects
    • ContextMapping
    Attributes
    • categoryRoot string
    • categoryAppend boolean
    • appendContextNameToRoot boolean
    • form string
    • retrievable boolean
    • cleanupContent boolean
    • detectTitle boolean
    • indexField string
    • forcedRank long
    • rankBoost long
    • inputFormat string
    Name Type Default value Description
    categoryRoot string - Prefix used to build the category path.
    categoryAppend boolean True Appends the textual content of the DocumentChunk to the category root.

    If false, only the category root will be used.
    appendContextNameToRoot boolean False Appends the context name between the root and the value.
    form string normalized The form of the word to be used to build the category path. Possible values: exact, normalized.
    retrievable boolean False Stores the category path, which enables display and navigation by category path.

    If false, only the SemanticAnnotation is indexed (advanced usage, langdate hacks).
    cleanupContent boolean True If true:
    • Removes trailing and leading Unicode spaces.
    • Replaces all sequences of Unicode space characters by a single 'space' character.
    • Does not map to the category in append mode if the DocumentChunk does not contain at least one Unicode alphanumeric character.
    detectTitle boolean False Detects the words that follow # in the path and uses them as the title.
    indexField string - The indexField to populate with this content.

    If null, the contextName of the DocumentChunk will be used for the index field.
    forcedRank long - Sets the ranking value for chunks in this mapping.
    -1 means that the chunk internal ranking value is kept.
    rankBoost long 0 Offsets the chunk internal ranking value.

    Use it only when forcedRank = -1

    For example, if forcedRank=-1, rankBoost=2, and the chunk internal ranking value is 4, the final rank will be 6.
    inputFormat string - Specifies the input format of the date, in UNIX date format. Set to null for automatic detection of standard formats.

    XML representation
    <DateCategoryContentTarget categoryRoot="" categoryAppend="true" appendContextNameToRoot="false" form="normalized" retrievable="false" cleanupContent="true" detectTitle="false" indexField="" forcedRank="" rankBoost="0" inputFormat="" ></DateCategoryContentTarget>
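    The sketch below sets an explicit input format. It assumes that "UNIX date format" refers to strftime-style patterns, which this reference does not confirm; the pattern "%Y-%m-%d", the category root "Top/date" and the index field "categories" are all illustrative assumptions.

    ```xml
    <!-- Assumption: inputFormat accepts a strftime-style pattern such as %Y-%m-%d (e.g. 2014-03-27).
         Leave inputFormat empty to rely on automatic detection of standard formats. -->
    <DateCategoryContentTarget categoryRoot="Top/date" categoryAppend="true" appendContextNameToRoot="false" form="normalized" retrievable="true" cleanupContent="true" detectTitle="false" indexField="categories" forcedRank="-1" rankBoost="0" inputFormat="%Y-%m-%d"></DateCategoryContentTarget>
    ```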
  • StandardContentTarget com.exalead.indexing.analysis.v10.StandardContentTarget
    A StandardContentTarget is used to populate a textual, numerical or date index field, with the content of a DocumentChunk.
    Parent objects
    • ContextMapping
    Attributes
    • indexField string
    • forcedRank long
    • rankBoost long
    • prefixWithContext boolean
    • addStartEnd boolean
    • indexPrefixes boolean
    • prefixesScore int
    • maxPrefixLength int
    • indexSuffixes boolean
    • suffixesScore int
    • maxSuffixLength int
    • indexSubstrings boolean
    • substringsScore int
    • searchable boolean
    • retrievable boolean
    • retrieveField string
    • indexNormalized boolean
    • indexLowercase boolean
    • indexExact boolean
    • indexSeparators boolean
    • addBreakBetweenChunks boolean
    Name Type Default value Description
    indexField string - The indexField to populate with this content.

    If null, the contextName of the DocumentChunk will be used for the index field.
    forcedRank long - Sets the ranking value for chunks in this mapping.
    -1 means that the chunk internal ranking value is kept.
    rankBoost long 0 Offsets the chunk internal ranking value.

    Use it only when forcedRank = -1

    For example, if forcedRank=-1, rankBoost=2, and the chunk internal ranking value is 4, the final rank will be 6.
    prefixWithContext boolean False Enables prefixing of all words in inverted lists by 'contextName#'.
    addStartEnd boolean False Enables the introduction of a word __start__ before chunk content and a word __end__ after chunk content.

    Only valid if Chunk is mapped with semantic=true.

    This option is compatible with prefixWithContext, in which case it produces contextName#__start__ and contextName#__end__.
    indexPrefixes boolean False Enables the indexing of all prefixes for each word with a score = prefixesScore.

    The prefix can be mapped to a specific kind if you add 'prefix' in formIndexingConfig.
    prefixesScore int 1 Score given to words' prefixes. The document relevance is determined by its score.

    The text matching score basically represents the "distance" between a search query and a document.
    maxPrefixLength int 0 Maximum length of the extracted prefixes.
    indexSuffixes boolean False Enables the indexing of all suffixes for each word with a score = suffixesScore.

    The suffix can be mapped to a specific kind if you add 'suffix' in formIndexingConfig.
    suffixesScore int 1 Score given to words' suffixes. The document relevance is determined by its score.

    The text matching score basically represents the "distance" between a search query and a document.
    maxSuffixLength int 0 Maximum length of the extracted suffixes.
    indexSubstrings boolean False Enables the indexing of all substrings for each word with a score = substringsScore.

    The substring can be mapped to a specific kind if you add 'substring' in formIndexingConfig.
    substringsScore int 1 Score given to extracted substrings. Document relevance is determined by its score.

    The text matching score basically represents the "distance" between a search query and a document.
    searchable boolean True Marks the content of the DocumentChunk as indexed and searchable.
    retrievable boolean True Enables the content of the DocumentChunk to be directly stored in the index, so that it can be retrieved.

    For numerical values, retrievability allows you to sort results by field.
    retrieveField string - The index field in which the content will be stored.

    If null, the content will be put in 'indexField'.
    indexNormalized boolean True Enables the indexing of the normalized form of the word.
    indexLowercase boolean False Enables the indexing of the lowercase (non-normalized) form of each token.
    indexExact boolean False Enables the indexing of the exact (non-normalized) form of each token.
    indexSeparators boolean False Enables the indexing of the index standard separators.

    Indexed standard separators are: paragraph, sentence and page.

    Standard separators indexing is required for the SPLIT operator to work with these separators.
    addBreakBetweenChunks boolean True Enables the introduction of a break between document chunks by the indexer.

    This forbids phrase matching across these chunks and has an impact on search when using double-quotes expressions or the 'NEXT' operator.

    For example, suppose a document has a "title" chunk containing "foo" and a "text" chunk containing "bar", both remapped to the text field.
    • If addBreakBetweenChunks is false, the document matches the queries "foo bar" and foo NEXT bar.
    • If addBreakBetweenChunks is true, the document matches neither "foo bar" nor foo NEXT bar, but it matches foo AND bar.

    Nested elements
    • DecreaseRankOnAnnotation*
    • IncreaseRankOnAnnotation*
    • RankOnAnnotation*
    Type Wrapper Description
    DecreaseRankOnAnnotation* - List of DecreaseRankOnAnnotation
    IncreaseRankOnAnnotation* - List of IncreaseRankOnAnnotation
    RankOnAnnotation* - List of RankOnAnnotation
    XML representation
    <StandardContentTarget indexField="" forcedRank="" rankBoost="0" prefixWithContext="false" addStartEnd="false" indexPrefixes="false" prefixesScore="1" maxPrefixLength="0" indexSuffixes="false" suffixesScore="1" maxSuffixLength="0" indexSubstrings="false" substringsScore="1" searchable="true" retrievable="true" retrieveField="" indexNormalized="true" indexLowercase="false" indexExact="false" indexSeparators="false" addBreakBetweenChunks="true" ></StandardContentTarget>
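    The "title" and "text" scenario used to explain addBreakBetweenChunks could be configured along the following lines. This is a sketch: the nesting is assumed from the "Nested elements" lists of MappingConfiguration and ContextMapping, and attributes that are omitted are assumed to take their default values.

    ```xml
    <!-- Both contexts are remapped to the "text" field. With addBreakBetweenChunks="true",
         a phrase query cannot match across the title chunk and the text chunk. -->
    <MappingConfiguration>
      <ContextMapping name="title" semantic="true">
        <StandardContentTarget indexField="text" searchable="true" retrievable="true" addBreakBetweenChunks="true"></StandardContentTarget>
      </ContextMapping>
      <ContextMapping name="text" semantic="true">
        <StandardContentTarget indexField="text" searchable="true" retrievable="true" addBreakBetweenChunks="true"></StandardContentTarget>
      </ContextMapping>
    </MappingConfiguration>
    ```

    Setting addBreakBetweenChunks to false on these targets would instead let "foo bar" and foo NEXT bar match across the two chunks, as described above.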
  • DateContentTarget com.exalead.indexing.analysis.v10.DateContentTarget
    DateContentTarget defines how a date is indexed.
    Parent objects
    • ContextMapping
    Attributes
    • prefixWithContext boolean
    • addStartEnd boolean
    • indexPrefixes boolean
    • prefixesScore int
    • maxPrefixLength int
    • indexSuffixes boolean
    • suffixesScore int
    • maxSuffixLength int
    • indexSubstrings boolean
    • substringsScore int
    • searchable boolean
    • retrievable boolean
    • retrieveField string
    • indexNormalized boolean
    • indexLowercase boolean
    • indexExact boolean
    • indexSeparators boolean
    • addBreakBetweenChunks boolean
    • indexField string
    • forcedRank long
    • rankBoost long
    • inputFormat string
    Name Type Default value Description
    prefixWithContext boolean False Enables prefixing of all words in inverted lists by 'contextName#'.
    addStartEnd boolean False Enables the introduction of a word __start__ before chunk content and a word __end__ after chunk content.

    Only valid if Chunk is mapped with semantic=true.

    This option is compatible with prefixWithContext, in which case it produces contextName#__start__ and contextName#__end__.
    indexPrefixes boolean False Enables the indexing of all prefixes for each word with a score = prefixesScore.

    The prefix can be mapped to a specific kind if you add 'prefix' in formIndexingConfig.
    prefixesScore int 1 Score given to words' prefixes. The document relevance is determined by its score.

    The text matching score basically represents the "distance" between a search query and a document.
    maxPrefixLength int 0 Maximum length of the extracted prefixes.
    indexSuffixes boolean False Enables the indexing of all suffixes for each word with a score = suffixesScore.

    The suffix can be mapped to a specific kind if you add 'suffix' in formIndexingConfig.
    suffixesScore int 1 Score given to words' suffixes. The document relevance is determined by its score.

    The text matching score basically represents the "distance" between a search query and a document.
    maxSuffixLength int 0 Maximum length of the extracted suffixes.
    indexSubstrings boolean False Enables the indexing of all substrings for each word with a score = substringsScore.

    The substring can be mapped to a specific kind if you add 'substring' in formIndexingConfig.
    substringsScore int 1 Score given to extracted substrings. Document relevance is determined by its score.

    The text matching score basically represents the "distance" between a search query and a document.
    searchable boolean True Marks the content of the DocumentChunk as indexed and searchable.
    retrievable boolean True Enables the content of the DocumentChunk to be directly stored in the index, so that it can be retrieved.

    For numerical values, retrievability allows you to sort results by field.
    retrieveField string - The index field in which the content will be stored.

    If null, the content will be put in 'indexField'.
    indexNormalized boolean True Enables the indexing of the normalized form of the word.
    indexLowercase boolean False Enables the indexing of the lowercase (non-normalized) form of each token.
    indexExact boolean False Enables the indexing of the exact (non-normalized) form of each token.
    indexSeparators boolean False Enables the indexing of standard separators.

    Indexed standard separators are: paragraph, sentence and page.

    Standard separators indexing is required for the SPLIT operator to work with these separators.
    addBreakBetweenChunks boolean True Enables the introduction of a break between document chunks by the indexer.

    This forbids phrase matching across these chunks and has an impact on search when using double-quotes expressions or the 'NEXT' operator.

    For example, if a document has a "title" chunk containing "foo" and a "text" chunk containing "bar", and they are both remapped to the text field:
    • If addBreakBetweenChunks is false, the document matches the queries "foo bar" and foo NEXT bar.
    • If addBreakBetweenChunks is true, the document matches neither "foo bar" nor foo NEXT bar, but it does match foo AND bar.
    indexField string - The indexField to populate with this content.

    If null, the contextName of the DocumentChunk will be used for the index field.
    forcedRank long - Sets the ranking value for chunks in this mapping.
    -1 means that the chunk internal ranking value is kept.
    rankBoost long 0 Offsets the chunk internal ranking value.

    Use it only when forcedRank = -1

    For example, if forcedRank=-1, rankBoost=2, and the chunk internal ranking value is 4, the final rank will be 6.
    inputFormat string - Specifies the input format of the date, in UNIX date format. Set it to null for automatic detection of standard formats.

    Nested elements
    • DecreaseRankOnAnnotation*
    • IncreaseRankOnAnnotation*
    • RankOnAnnotation*
    Type Wrapper Description
    DecreaseRankOnAnnotation* - List of DecreaseRankOnAnnotation
    IncreaseRankOnAnnotation* - List of IncreaseRankOnAnnotation
    RankOnAnnotation* - List of RankOnAnnotation
    XML representation
    <DateContentTarget prefixWithContext="false" addStartEnd="false" indexPrefixes="false" prefixesScore="1" maxPrefixLength="0" indexSuffixes="false" suffixesScore="1" maxSuffixLength="0" indexSubstrings="false" substringsScore="1" searchable="true" retrievable="true" retrieveField="" indexNormalized="true" indexLowercase="false" indexExact="false" indexSeparators="false" addBreakBetweenChunks="true" indexField="" forcedRank="" rankBoost="0" inputFormat="" ></DateContentTarget>
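    As an illustration of the ranking attributes above, the following sketch keeps the chunk internal ranking value and offsets it by 2. The index field name publication_date is hypothetical, attributes not shown are assumed to keep the defaults listed in the table, and inputFormat is left out on the assumption that omitting it leaves the value null (automatic detection).

    ```xml
    <!-- Hypothetical field name. forcedRank="-1" keeps the chunk internal ranking value. -->
    <!-- With rankBoost="2", a chunk whose internal ranking value is 4 ends up with a final rank of 6. -->
    <DateContentTarget indexField="publication_date" forcedRank="-1" rankBoost="2" searchable="true" retrievable="true" ></DateContentTarget>
    ```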
  • DecreaseRankOnAnnotation com.exalead.indexing.analysis.v10.DecreaseRankOnAnnotation
    Allows you to decrease the ranking when some words are flagged by an annotation (part of speech, ontology, ...).
    Parent objects
    • DateContentTarget
    • StandardContentTarget
    Attributes
    • annotationName string
    • annotationValue string
    • value int
    Name Type Default value Description
    annotationName string - Name of the targeted annotation.
    annotationValue string - Value of the annotation that will trigger the decrease in ranking.
    value int - Amount subtracted from the ranking when triggered.

    XML representation
    <DecreaseRankOnAnnotation annotationName="" annotationValue="" value="" ></DecreaseRankOnAnnotation>
  • IncreaseRankOnAnnotation com.exalead.indexing.analysis.v10.IncreaseRankOnAnnotation
    Allows you to increase the ranking when some words are flagged by an annotation (part of speech, ontology, ...).
    Parent objects
    • DateContentTarget
    • StandardContentTarget
    Attributes
    • annotationName string
    • annotationValue string
    • value int
    Name Type Default value Description
    annotationName string - Name of the targeted annotation.
    annotationValue string - Value of the annotation that will trigger the increase in ranking.
    value int - Amount added to the ranking when triggered.

    XML representation
    <IncreaseRankOnAnnotation annotationName="" annotationValue="" value="" ></IncreaseRankOnAnnotation>
  • RankOnAnnotation com.exalead.indexing.analysis.v10.RankOnAnnotation
    Modifies ranking when some words are flagged by a given annotation.
    Parent objects
    • DateContentTarget
    • StandardContentTarget
    Attributes
    • annotationName string
    • annotationValue string
    • forcedRank int
    Name Type Default value Description
    annotationName string - The annotation that triggers the ranking modification.
    annotationValue string - The annotation value required to trigger the ranking modification.
    forcedRank int - The new ranking.

    XML representation
    <RankOnAnnotation annotationName="" annotationValue="" forcedRank="" ></RankOnAnnotation>
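    The three annotation-based ranking elements can be nested in the same content target. The sketch below is only an illustration: the annotation names and values (pos, noun, stopword, brand, acme) are hypothetical, and the indexField attribute on StandardContentTarget is assumed to behave as it does on the other targets.

    ```xml
    <StandardContentTarget indexField="text">
      <!-- Lowers the ranking by 1 when a word carries the hypothetical annotation pos=stopword. -->
      <DecreaseRankOnAnnotation annotationName="pos" annotationValue="stopword" value="1" ></DecreaseRankOnAnnotation>
      <!-- Raises the ranking by 2 when a word carries the hypothetical annotation pos=noun. -->
      <IncreaseRankOnAnnotation annotationName="pos" annotationValue="noun" value="2" ></IncreaseRankOnAnnotation>
      <!-- Sets the ranking to 10 when a word carries the hypothetical annotation brand=acme. -->
      <RankOnAnnotation annotationName="brand" annotationValue="acme" forcedRank="10" ></RankOnAnnotation>
    </StandardContentTarget>
    ```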
  • CustomContentTarget com.exalead.indexing.analysis.v10.CustomContentTarget
    CustomContentTarget defines indexing by a custom 'Index Kind'.
    Parent objects
    • ContextMapping
    Attributes
    • indexField string
    • forcedRank long
    • rankBoost long
    • searchable boolean
    • retrieveField string
    • retrievable boolean
    • indexKind int
    • addBreakBetweenChunks boolean
    Name Type Default value Description
    indexField string - The indexField to populate with this content.

    If null, the contextName of the DocumentChunk will be used for the index field.
    forcedRank long - Sets the ranking value for chunks in this mapping.
    -1 means that the chunk internal ranking value is kept.
    rankBoost long 0 Offsets the chunk internal ranking value.

    Use it only when forcedRank = -1

    For example, if forcedRank=-1, rankBoost=2, and the chunk internal ranking value is 4, the final rank will be 6.
    searchable boolean True If true, the content of the DocumentChunk will be indexed and searchable.
    retrieveField string - The index field in which the content will be stored.

    If null, the content will be put in 'indexField'.
    retrievable boolean True Stores the content of the DocumentChunk directly in the index, so that it can be retrieved.

    For numerical values, retrievability allows you to sort results by field.
    indexKind int - Index 'Kind' to use for indexing content.
    addBreakBetweenChunks boolean True If true, the indexer introduces a break between document chunks.

    This forbids phrase matching across these chunks and has an impact on search when using double-quotes expressions or the 'NEXT' operator.
    For example, if a document has a "title" chunk containing "foo" and a "text" chunk containing "bar", and they are both remapped to the text field:
    • If addBreakBetweenChunks is false, the document matches the queries "foo bar" and foo NEXT bar.
    • If addBreakBetweenChunks is true, the document matches neither "foo bar" nor foo NEXT bar, but it does match foo AND bar.

    XML representation
    <CustomContentTarget indexField="" forcedRank="" rankBoost="0" searchable="true" retrieveField="" retrievable="true" indexKind="" addBreakBetweenChunks="true" ></CustomContentTarget>
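    The "foo"/"bar" case described for addBreakBetweenChunks could be configured roughly as follows. This is a sketch under assumptions: each target is assumed to sit under the ContextMapping of its chunk (the enclosing elements are omitted), and indexKind="1" is a hypothetical value.

    ```xml
    <!-- Target used for the "title" chunk, which contains "foo". -->
    <CustomContentTarget indexField="text" indexKind="1" addBreakBetweenChunks="false" ></CustomContentTarget>
    <!-- Target used for the "text" chunk, which contains "bar". -->
    <CustomContentTarget indexField="text" indexKind="1" addBreakBetweenChunks="false" ></CustomContentTarget>
    <!-- No break is introduced, so "foo bar" and foo NEXT bar match the document. -->
    <!-- With addBreakBetweenChunks="true", only foo AND bar would still match. -->
    ```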
  • EnumFacetContentTarget com.exalead.indexing.analysis.v10.EnumFacetContentTarget
    EnumFacetContentTarget maps the content according to the specified EnumFacet.
    Parent objects
    • ContextMapping
    Attributes
    • indexField string
    • forcedRank long
    • rankBoost long
    • enumFacetId string
    • form string
    Name Type Default value Description
    indexField string - The indexField to populate with this content.

    If null, the contextName of the DocumentChunk will be used for the index field.
    forcedRank long - Sets the ranking value for chunks in this mapping.
    -1 means that the chunk internal ranking value is kept.
    rankBoost long 0 Offsets the chunk internal ranking value.

    Use it only when forcedRank = -1

    For example, if forcedRank=-1, rankBoost=2, and the chunk internal ranking value is 4, the final rank will be 6.
    enumFacetId string - The id of the EnumFacet this target refers to.
    form string normalized The form of the values for the facet string values. Possible values: exact, normalized.

    XML representation
    <EnumFacetContentTarget indexField="" forcedRank="" rankBoost="0" enumFacetId="" form="normalized" ></EnumFacetContentTarget>
  • DictionaryTarget com.exalead.indexing.analysis.v10.DictionaryTarget
    A DictionaryTarget specifies how a DocumentChunk or semantic annotation is processed to populate the dictionary.
    Parent objects
    • ContextMapping
    Attributes
    • dictionaryName string
    • words boolean
    • ngrams boolean
    • rt boolean
    • phonemes boolean
    Name Type Default value Description
    dictionaryName string -
    words boolean True
    ngrams boolean False
    rt boolean False
    phonemes boolean False

    XML representation
    <DictionaryTarget dictionaryName="" words="true" ngrams="false" rt="false" phonemes="false" ></DictionaryTarget>
  • PartTarget com.exalead.indexing.analysis.v10.PartTarget
    A PartTarget specifies how a Part is processed to populate the index.
    Parent objects
    • ContextMapping
    Attributes
    • indexField string
    Name Type Default value Description
    indexField string - The index field in which the content will be stored.

    XML representation
    <PartTarget indexField="" ></PartTarget>
  • FieldIndexingLimit com.exalead.indexing.analysis.v10.FieldIndexingLimit
    Limits the number of words that can be retrieved from a given field.
    Parent objects
    • MappingConfiguration
    Attributes
    • fieldName string
    • maxNbWords int
    Name Type Default value Description
    fieldName string - Field to limit.
    maxNbWords int - Maximum number of words for this field.

    XML representation
    <FieldIndexingLimit fieldName="" maxNbWords="" ></FieldIndexingLimit>
  • FieldRetrievalLimit com.exalead.indexing.analysis.v10.FieldRetrievalLimit
    Limits the size of text that can be retrieved from a given field.

    In some standard configurations, a FieldRetrievalLimit with maxLength="4096" is set on the 'text' field.

    This limits the size of the index on disk.
    Parent objects
    • MappingConfiguration
    Attributes
    • retrieveField string
    • maxLength int
    Name Type Default value Description
    retrieveField string - Field to limit.
    maxLength int - Max text size in bytes.

    The text will be clipped to the nearest word.

    Text is stored in UTF-8.

    XML representation
    <FieldRetrievalLimit retrieveField="" maxLength="" ></FieldRetrievalLimit>
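    The standard 'text' limit mentioned above would look like this when written out:

    ```xml
    <!-- Clips the retrievable content of the 'text' field to at most 4096 bytes, at the nearest word. -->
    <FieldRetrievalLimit retrieveField="text" maxLength="4096" ></FieldRetrievalLimit>
    ```

    Because the limit is expressed in bytes and the text is stored in UTF-8, non-ASCII characters take 2 to 4 bytes each, so the clipped text can hold fewer than 4096 characters.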
  • GenerateAnnotationsForContext com.exalead.indexing.analysis.v10.GenerateAnnotationsForContext
    Forces a context to be processed by the SemanticProcessor pipeline and to process semantic annotations.
    Parent objects
    • MappingConfiguration
    Attributes
    • name string
    • prefixMatch boolean
    • patternMatch boolean
    • tokenizationConfig string
    Name Type Default value Description
    name string - ContextName of the DocumentChunks to map.
    prefixMatch boolean False Matches any context starting with this prefix.
    patternMatch boolean False Matches any context matching this regular expression.
    tokenizationConfig string - If set, it forces the tokenization configuration to use.

    XML representation
    <GenerateAnnotationsForContext name="" prefixMatch="false" patternMatch="false" tokenizationConfig="tok0" ></GenerateAnnotationsForContext>
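    The two matching modes can be sketched as follows. The context names (meta_, meta_title, meta_summary) are hypothetical, and tokenizationConfig is omitted so that no tokenization configuration is forced.

    ```xml
    <!-- Prefix match: any context whose name starts with "meta_" goes through the semantic pipeline. -->
    <GenerateAnnotationsForContext name="meta_" prefixMatch="true" patternMatch="false" ></GenerateAnnotationsForContext>
    <!-- Pattern match: only contexts matching the regular expression, here meta_title or meta_summary. -->
    <GenerateAnnotationsForContext name="meta_(title|summary)" prefixMatch="false" patternMatch="true" ></GenerateAnnotationsForContext>
    ```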
  • PartMapping com.exalead.indexing.analysis.v10.PartMapping
    PartMapping specifies how parts are remapped to index fields.
    Parent objects
    • MappingConfiguration
    Attributes
    • name string
    • prefixMatch boolean
    • patternMatch boolean
    Name Type Default value Description
    name string - Name of the Part to map.
    prefixMatch boolean False Matches all parts that start with this prefix.
    patternMatch boolean False Matches all parts matching this pattern (must be a valid regular expression).

    Nested elements
    • PartTarget*
    Type Wrapper Description
    PartTarget* -
    XML representation
    <PartMapping name="" prefixMatch="false" patternMatch="false" ></PartMapping>
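    A PartMapping is typically completed by one or more nested PartTarget elements. The sketch below assumes direct nesting (no wrapper element) and uses hypothetical part and field names.

    ```xml
    <!-- Maps the part named "master" (hypothetical) to the index field "master_part" (hypothetical). -->
    <PartMapping name="master" prefixMatch="false" patternMatch="false">
      <PartTarget indexField="master_part" ></PartTarget>
    </PartMapping>
    ```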
  • WordCountMapping com.exalead.indexing.analysis.v10.WordCountMapping
    Specifies where to map the word count.
    Parent objects
    • MappingConfiguration
    Attributes
    • fromName string
    • toName string
    Name Type Default value Description
    fromName string - Compute the word count of this field.
    toName string - Store the word count to this field.

    XML representation
    <WordCountMapping fromName="" toName="" ></WordCountMapping>
  • IndexSchema com.exalead.mercury.mami.indexing.v10.IndexSchema
    Configuration for an index schema.
    This defines the fields actually stored in an index.

    Most commonly, only one index schema is defined, and used by all build groups (for all slices).

    This configuration is referenced in the BuildGroup element in 'Deployment'.
    Attributes
    • name string
    • allowIntensiveDiskAccess boolean
    Name Type Default value Description
    name string -
    allowIntensiveDiskAccess boolean False Allows intensive operations like sorting or faceting to be performed on disk (SSD should be preferred).

    Nested elements
    • AttributeGroupStore*
    • FieldConfig*
    Type Wrapper Description
    AttributeGroupStore* -
    FieldConfig* -
    XML representation
    <IndexSchema name="" allowIntensiveDiskAccess="false" ></IndexSchema>
  • AttributeGroupStore com.exalead.mercury.mami.indexing.v10.AttributeGroupStore
    Configuration of an attribute group. An attribute group defines how attributes should be persisted on disk.
    Parent objects
    • IndexSchema
    Attributes
    • id int
    • label string
    • format enum(SimpleRowOrientedStore, ItemOrientedStore)
    • retrievableRoles string
    • leafSize int
    Name Type Default value Description
    id int - A unique identifier for this attribute group.
    label string - A human readable name for this attribute group.
    format enum(SimpleRowOrientedStore, ItemOrientedStore) ItemOrientedStore Specifies how to persist the data on disk for this attribute group.
    retrievableRoles string - Specifies a comma-separated list of annotations to be handled in this attribute group store. Ex: @Facetable,@Sortable,@Display
    leafSize int 30720 If the format is SimpleRowOrientedStore, configures the leaf size (i.e., maximum IO size read per DID).

    XML representation
    <AttributeGroupStore id="" label="" format="ItemOrientedStore" retrievableRoles="" leafSize="30720" ></AttributeGroupStore>
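    As a sketch of how attribute groups fit into an index schema, the fragment below declares two groups that split the annotations from the retrievableRoles example. The schema name, ids, labels and the way roles are split are all hypothetical. Whether such a split suits a given workload is not something this reference states.

    ```xml
    <IndexSchema name="default_schema" allowIntensiveDiskAccess="false">
      <!-- Row-oriented group: leafSize caps the IO size read per DID. -->
      <AttributeGroupStore id="1" label="facets_and_sorting" format="SimpleRowOrientedStore" retrievableRoles="@Facetable,@Sortable" leafSize="30720" ></AttributeGroupStore>
      <!-- Item-oriented group (the default format) for display attributes. -->
      <AttributeGroupStore id="2" label="display" format="ItemOrientedStore" retrievableRoles="@Display" ></AttributeGroupStore>
    </IndexSchema>
    ```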
  • AlphanumFieldConfig com.exalead.mercury.mami.indexing.v10.AlphanumFieldConfig
    This field stores alphanumeric values (e.g., 'text', 'title').
    Parent objects
    • IndexSchema
    Attributes
    • ramBased boolean
    • multiContext boolean
    • fieldName string
    • searchable boolean
    • retrievable boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • multivalued boolean
    • version int
    • maxStoredWordPosition int
    • maxInlineWordPositions int
    • useVariablePositionsEncoding boolean
    • storeTf boolean
    • bloomFilter boolean
    • gzip boolean
    • implementation enum(strbtree, trie, fsm)
    • nbWordsPerLeaf int
    • optimizePatternSearch boolean
    • patternSearchOptimizationType enum(ngramv1, ngramv2)
    Name Type Default value Description
    ramBased boolean False A value field must be RAM-based to perform synthesis efficiently.
    multiContext boolean False
    fieldName string - The name of the field.
    The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
    searchable boolean False Allows users to query on this field (using a prefix handler).
    retrievable boolean False Allows the content of this field to be retrieved at query time and displayed in the search results.
    dataModelState string - Is this index field config managed by a data model? @enum{null,auto,customized}.
    If null, this is not related to a data model.
    If "auto", this is auto-generated by a data model.
    If "customized", this was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", this contains the name of the DataModelClass that generated this field config.
    dataModelProperty string - If dataModelState is "auto" or "customized", this contains the name of the DataModelProperty that generated this field config.
    multivalued boolean False
    version int 0
    maxStoredWordPosition int - Number of words, starting from the beginning of the document, for which word positions will be stored in the index.

    This enables proximity ranking and position searching (NEAR, NEXT, ...) up to this number of words in the document.
    '0' should be used to disable position storing.
    maxInlineWordPositions int 2 Advanced setting controlling how many positions are inlined in the main data file for each word of each document.
    useVariablePositionsEncoding boolean False Advanced setting to choose which positions encoding algorithm should be used.

    Variable position encoding should be used to reduce index size when indexing big documents.
    storeTf boolean False Stores the number of terms of each document.

    This information may be used by the ranking algorithm to normalize term frequencies (as "nbTerms").

    This costs a few bytes of RAM per document.
    bloomFilter boolean False Activates a Bloom filter per slot. This speeds up requests containing words that are not present in the field on a given slot.

    Disable this option if all words of the request for this field are always matching, and if you compact into big slots regularly.

    Enable this option if there are either many misses (e.g. on the "text" field) or small updates (e.g. with real-time indexing).
    gzip boolean True Activates content compression.
    implementation enum(strbtree, trie, fsm) fsm Advanced configuration. Internal structure used to store the field dictionary.
    nbWordsPerLeaf int 1000 Advanced configuration. If using the strbtree structure, it configures the number of words per leaf.
    optimizePatternSearch boolean True Adds extra information to the index for pattern search optimization. If false, optimizes data structures for size.
    patternSearchOptimizationType enum(ngramv1, ngramv2) ngramv1 Selects the algorithm, based on either dictionary ngrams or indexed ngrams (untokenized text only).

    Nested elements
    • fromDataModelFieldConfig
    • ListsEncoderConfig
    Type Wrapper Description
    FieldConfig fromDataModel If dataModelState is "customized", you will find here the original object generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    ListsEncoderConfig - Configuration of the inverted lists encoder.

    If no configuration is specified, a Rice encoder is used.
    XML representation
    <AlphanumFieldConfig ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" maxStoredWordPosition="" maxInlineWordPositions="2" useVariablePositionsEncoding="false" storeTf="false" bloomFilter="false" gzip="true" implementation="fsm" nbWordsPerLeaf="1000" optimizePatternSearch="true" patternSearchOptimizationType="ngramv1" ></AlphanumFieldConfig>
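    For illustration, a full-text field tuned along the lines described above might look like the following. The value 10000 for maxStoredWordPosition is a hypothetical choice, and attributes not shown are assumed to keep their defaults.

    ```xml
    <!-- Searchable 'text' field: positions stored for the first 10000 words (NEAR, NEXT, proximity ranking), -->
    <!-- term counts stored for ranking, and a Bloom filter enabled because misses are frequent on this field. -->
    <AlphanumFieldConfig fieldName="text" searchable="true" retrievable="false" maxStoredWordPosition="10000" storeTf="true" bloomFilter="true" ></AlphanumFieldConfig>
    ```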
  • RiceEncoderConfig com.exalead.mercury.mami.indexing.v10.RiceEncoderConfig
    No documentation for this element.
    Parent objects
    • AlphanumFieldConfig
    • BinaryFieldConfig
    • CategoryFieldConfig
    • DateFieldConfig
    • DoubleFieldConfig
    • FieldConfig
    • GeoFieldConfig
    • HierarchyFieldConfig
    • LegacySignedFieldConfig
    • LegacyUnsignedFieldConfig
    • NumericalFieldConfig
    • PointFieldConfig
    • ReferenceFieldConfig
    • SignedFieldConfig
    • SortableFieldConfig
    • StandardFieldConfig
    • TextFieldConfig
    • TimeFieldConfig
    • UidFieldConfig
    • UnsignedFieldConfig
    • ValueFieldConfig
    Attributes
    • bytesPerBlock int
    • positionsRiceCodingParam int
    • dataFilesPrefetchPages int
    • extFilesPrefetchPages int
    Name Type Default value Description
    bytesPerBlock int 1024
    positionsRiceCodingParam int 1024
    dataFilesPrefetchPages int 2
    extFilesPrefetchPages int 2

    XML representation
    <RiceEncoderConfig bytesPerBlock="1024" positionsRiceCodingParam="1024" dataFilesPrefetchPages="2" extFilesPrefetchPages="2" ></RiceEncoderConfig>
  • VarIntEncoderConfig com.exalead.mercury.mami.indexing.v10.VarIntEncoderConfig
    Stores each integer in varint encoding
    XML representation
    <VarIntEncoderConfig ></VarIntEncoderConfig>
  • Apollo11EncoderConfig com.exalead.mercury.mami.indexing.v10.Apollo11EncoderConfig
    Stores each integer in Apollo11 encoding
    XML representation
    <Apollo11EncoderConfig ></Apollo11EncoderConfig>
  • NoOpEncoderConfig com.exalead.mercury.mami.indexing.v10.NoOpEncoderConfig
    Trivial encoder. For debugging purposes only
    XML representation
    <NoOpEncoderConfig ></NoOpEncoderConfig>
  • FastNoPosEncoderConfig com.exalead.mercury.mami.indexing.v10.FastNoPosEncoderConfig
    An encoder that stores only docids, without ranks or positions.
    Parent objects
    • AlphanumFieldConfig
    • BinaryFieldConfig
    • CategoryFieldConfig
    • DateFieldConfig
    • DoubleFieldConfig
    • FieldConfig
    • GeoFieldConfig
    • HierarchyFieldConfig
    • LegacySignedFieldConfig
    • LegacyUnsignedFieldConfig
    • NumericalFieldConfig
    • PointFieldConfig
    • ReferenceFieldConfig
    • SignedFieldConfig
    • SortableFieldConfig
    • StandardFieldConfig
    • TextFieldConfig
    • TimeFieldConfig
    • UidFieldConfig
    • UnsignedFieldConfig
    • ValueFieldConfig
    Attributes
    • didsPerBlock int
    Name Type Default value Description
    didsPerBlock int 256

    XML representation
    <FastNoPosEncoderConfig didsPerBlock="256" ></FastNoPosEncoderConfig>
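    An encoder configuration replaces the default Rice encoder when it is nested in a field config. The sketch below assumes direct nesting (no wrapper element) and uses a hypothetical field name. Position storing is disabled on the field, since this encoder does not keep positions.

    ```xml
    <!-- Hypothetical tag-like field where only document matches matter, not ranks or positions. -->
    <AlphanumFieldConfig fieldName="doc_tag" searchable="true" maxStoredWordPosition="0">
      <FastNoPosEncoderConfig didsPerBlock="256" ></FastNoPosEncoderConfig>
    </AlphanumFieldConfig>
    ```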
  • LegacyUnsignedFieldConfig com.exalead.mercury.mami.indexing.v10.LegacyUnsignedFieldConfig
    No documentation for this element.
    Parent objects
    • IndexSchema
    Attributes
    • ramBased boolean
    • multiContext boolean
    • fieldName string
    • searchable boolean
    • retrievable boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • multivalued boolean
    • version int
    • bitsForValue int
    Name Type Default value Description
    ramBased boolean False A value field must be RAM-based to perform synthesis efficiently.
    multiContext boolean False
    fieldName string - The name of the field.
    The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
    searchable boolean False Allows users to query on this field (using a prefix handler).
    retrievable boolean False Allows the content of this field to be retrieved at query time and displayed in the search results.
    dataModelState string - Is this index field config managed by a data model? @enum{null,auto,customized}.
    If null, this is not related to a data model.
    If "auto", this is auto-generated by a data model.
    If "customized", this was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", this contains the name of the DataModelClass that generated this field config.
    dataModelProperty string - If dataModelState is "auto" or "customized", this contains the name of the DataModelProperty that generated this field config.
    multivalued boolean False
    version int 0
    bitsForValue int 32 Number of bits used to store numerical values.

    Nested elements
    • fromDataModelFieldConfig
    • ListsEncoderConfig
    Type Wrapper Description
    FieldConfig fromDataModel If dataModelState is "customized", you will find here the original object generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    ListsEncoderConfig - Configuration of the inverted lists encoder.

    If no configuration is specified, a Rice encoder is used.
    XML representation
    <LegacyUnsignedFieldConfig ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" bitsForValue="32" ></LegacyUnsignedFieldConfig>
  • LegacySignedFieldConfig com.exalead.mercury.mami.indexing.v10.LegacySignedFieldConfig
    No documentation for this element.
    Parent objects
    • IndexSchema
    Attributes
    • ramBased boolean
    • multiContext boolean
    • fieldName string
    • searchable boolean
    • retrievable boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • multivalued boolean
    • version int
    Name Type Default value Description
    ramBased boolean False A value field must be RAM-based to perform synthesis efficiently.
    multiContext boolean False
    fieldName string - The name of the field.
    The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
    searchable boolean False Allows users to query on this field (using a prefix handler).
    retrievable boolean False Allows the content of this field to be retrieved at query time and displayed in the search results.
    dataModelState string - Is this index field config managed by a data model? @enum{null,auto,customized}.
    If null, this is not related to a data model.
    If "auto", this is auto-generated by a data model.
    If "customized", this was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", this contains the name of the DataModelClass that generated this field config.
    dataModelProperty string - If dataModelState is "auto" or "customized", this contains the name of the DataModelProperty that generated this field config.
    multivalued boolean False
    version int 0

    Nested elements
    • fromDataModelFieldConfig
    • ListsEncoderConfig
    Type Wrapper Description
    FieldConfig fromDataModel If dataModelState is "customized", you will find here the original object generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    ListsEncoderConfig - Configuration of the inverted lists encoder.

    If no configuration is specified, a Rice encoder is used.
    XML representation
    <LegacySignedFieldConfig ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" ></LegacySignedFieldConfig>
  • PointFieldConfig com.exalead.mercury.mami.indexing.v10.PointFieldConfig
    This type of field is used to store geographical points using either GPS coordinates (WGS84) or planar X,Y coordinates (Meter).
    Parent objects
    • IndexSchema
    Attributes
    • ramBased boolean
    • multiContext boolean
    • fieldName string
    • searchable boolean
    • retrievable boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • multivalued boolean
    • version int
    • geoType enum(WGS84, Meter)
    • blockSize int
    • exact boolean
    Name Type Default value Description
    ramBased boolean False A value field must be RAM-based to perform synthesis efficiently.
    multiContext boolean False
    fieldName string - The name of the field.
    The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
    searchable boolean False Allows users to query on this field (using a prefix handler).
    retrievable boolean False Allows the content of this field to be retrieved at query time and displayed in the search results.
    dataModelState string - Is this index field config managed by a data model? @enum{null,auto,customized}.
    If null, this is not related to a data model.
    If "auto", this is auto-generated by a data model.
    If "customized", this was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", this contains the name of the DataModelClass that generated this field config.
    dataModelProperty string - If dataModelState is "auto" or "customized", this contains the name of the DataModelProperty that generated this field config.
    multivalued boolean False
    version int 0
    geoType enum(WGS84, Meter) WGS84 Value can be one of
    • WGS84
    • Meter
    blockSize int 8192
    exact boolean True

    Nested elements
    • fromDataModelFieldConfig
    • ListsEncoderConfig
    Type Wrapper Description
    FieldConfig fromDataModel If dataModelState is "customized", you will find here the original object generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    ListsEncoderConfig - Configuration of the inverted lists encoder.

    If no configuration is specified, a Rice encoder is used.
    XML representation
    <PointFieldConfig ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" geoType="WGS84" blockSize="8192" exact="true" ></PointFieldConfig>
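    The two coordinate systems can be sketched side by side. Both field names are hypothetical, and attributes not shown are assumed to keep their defaults.

    ```xml
    <!-- GPS coordinates (WGS84), the default geoType. -->
    <PointFieldConfig fieldName="store_location" searchable="true" retrievable="true" geoType="WGS84" ></PointFieldConfig>
    <!-- Planar X,Y coordinates expressed in meters. -->
    <PointFieldConfig fieldName="plant_position" searchable="true" retrievable="true" geoType="Meter" ></PointFieldConfig>
    ```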
  • GeoFieldConfig com.exalead.mercury.mami.indexing.v10.GeoFieldConfig
    This type of field is used to store 2D geometries using planar X,Y coordinates (Meter).
    Parent objects
    • IndexSchema
    Attributes
    • ramBased boolean
    • multiContext boolean
    • fieldName string
    • searchable boolean
    • retrievable boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • multivalued boolean
    • version int
    • geoType enum(Meter)
    • maxBlockSize int
    • precision int
    • bboxFieldName string
    Name Type Default value Description
    ramBased boolean False A value field must be RAM-based to perform synthesis efficiently.
    multiContext boolean False
    fieldName string - The name of the field.
    The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
    searchable boolean False Allows users to query on this field (using a prefix handler).
    retrievable boolean False Allows the content of this field to be retrieved at query time and displayed in the search results.
    dataModelState string - Is this index field config managed by a data model? @enum{null,auto,customized}.
    If null, this is not related to a data model.
    If "auto", this is auto-generated by a data model.
    If "customized", this was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", this contains the name of the DataModelClass that generated this field config.
    dataModelProperty string - If dataModelState is "auto" or "customized", this contains the name of the DataModelProperty that generated this field config.
    multivalued boolean False
    version int 0
    geoType enum(Meter) Meter Value can be one of
    • Meter
    maxBlockSize int 24
    precision int 6
    bboxFieldName string -

    Nested elements
    • fromDataModelFieldConfig
    • ListsEncoderConfig
    Type Wrapper Description
    FieldConfig fromDataModel If dataModelState is "customized", you will find here the original object generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    ListsEncoderConfig - Configuration of the inverted lists encoder.

    If no configuration is specified, a Rice encoder is used.
    XML representation
    <GeoFieldConfig ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" geoType="Meter" maxBlockSize="24" precision="6" bboxFieldName="" ></GeoFieldConfig>
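    The fieldName constraint quoted above ([a-z0-9_]+) can be checked before a configuration is submitted. The following is a minimal Python sketch; the helper name is_valid_field_name is hypothetical and is not part of CloudView.

```python
import re

# Pattern quoted in the fieldName description: [a-z0-9_]+
FIELD_NAME_PATTERN = re.compile(r"[a-z0-9_]+")


def is_valid_field_name(name: str) -> bool:
    """Return True if the whole name matches [a-z0-9_]+ (hypothetical helper)."""
    # fullmatch requires the entire string to match, so "" and "My-Field" fail.
    return FIELD_NAME_PATTERN.fullmatch(name) is not None
```

    For example, is_valid_field_name("geo_location_2") is accepted, while "GeoLocation" and "my-field" are rejected.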
  • UidFieldConfig com.exalead.mercury.mami.indexing.v10.UidFieldConfig
    This field stores a unique value in order to facilitate search.
    Parent objects
    • IndexSchema
    Attributes
    • ramBased boolean
    • multiContext boolean
    • fieldName string
    • searchable boolean
    • retrievable boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • multivalued boolean
    • version int
    • dictStorage enum(strbtree, trie, fsm)
    • bitsetThreshold int
    Name Type Default value Description
    ramBased boolean False A value field must be RAM-based to perform synthesis efficiently.
    multiContext boolean False
    fieldName string - The name of the field.
    The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
    searchable boolean False Allows users to query on this field (using a prefix handler).
    retrievable boolean False Allows the content of this field to be retrieved at query time and displayed in the search results.
    dataModelState string - Indicates whether this index field config is managed by a data model. Possible values: null, "auto", "customized".
    If null, this is not related to a data model.
    If "auto", this is auto-generated by a data model.
    If "customized", this was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this field config.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this field config.
    multivalued boolean False
    version int 0
    dictStorage enum(strbtree, trie, fsm) fsm Associative array implementation.
    bitsetThreshold int 10000 Number of requested documents before switching from a dynamic array to a bitset representation.

    Nested elements
    • fromDataModelFieldConfig
    • ListsEncoderConfig
    Type Wrapper Description
    FieldConfig fromDataModel If dataModelState is "customized", you will find here the original object generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    ListsEncoderConfig - Configuration of the inverted lists encoder.

    If no configuration is specified, a Rice encoder is used.
    XML representation
    <UidFieldConfig ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" dictStorage="fsm" bitsetThreshold="10000" ></UidFieldConfig>
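    The bitsetThreshold attribute describes a switch from a dynamic array to a bitset once enough documents are involved. The sketch below illustrates that general technique in Python. The DocIdSet class is hypothetical, and the exact point at which the engine switches is an assumption (here: when the number of stored IDs reaches the threshold).

```python
class DocIdSet:
    """Hypothetical container: dynamic array first, bitset past a threshold."""

    def __init__(self, threshold=10000):
        self.threshold = threshold
        self.array = []   # dynamic-array representation (None after the switch)
        self.bits = 0     # bitset representation, held in a Python int

    def add(self, doc_id):
        if self.array is not None:
            self.array.append(doc_id)
            if len(self.array) >= self.threshold:
                # Assumed switch rule: fold every stored ID into the bitset.
                for stored in self.array:
                    self.bits |= 1 << stored
                self.array = None
        else:
            self.bits |= 1 << doc_id

    def uses_bitset(self):
        return self.array is None

    def __contains__(self, doc_id):
        if self.array is not None:
            return doc_id in self.array
        return (self.bits >> doc_id) & 1 == 1
```

    A small array is cheaper for a handful of IDs, while a bitset has a fixed cost per possible document and becomes more compact once many documents are present.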
  • ValueFieldConfig com.exalead.mercury.mami.indexing.v10.ValueFieldConfig
    Stores alphanumerical content with an internal ordinal mapping, which makes it suitable for efficient faceting. Each term is limited to 1024 bytes.
    Parent objects
    • IndexSchema
    Attributes
    • deltaRefEncodeMultivaluedValues boolean
    • sortMultivaluedValues boolean
    • ramBased boolean
    • multiContext boolean
    • fieldName string
    • searchable boolean
    • retrievable boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • multivalued boolean
    • version int
    • ignorePresentBit boolean
    • minMemberNbBits int
    • bloomFilter boolean
    • hashThreshold int
    • implementation enum(strbtree, fsm)
    • optimizeListsForPatternSearch boolean
    Name Type Default value Description
    deltaRefEncodeMultivaluedValues boolean True Delta ref encode multivalued values.
    sortMultivaluedValues boolean True Storing multivalued RAM-based values in an increasing order consumes less RAM.

    This must be disabled to use some advanced multivalued virtual functions.
    ramBased boolean False A value field must be RAM-based to perform synthesis efficiently.
    multiContext boolean False
    fieldName string - The name of the field.
    The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
    searchable boolean False Allows users to query on this field (using a prefix handler).
    retrievable boolean False Allows the content of this field to be retrieved at query time and displayed in the search results.
    dataModelState string - Indicates whether this index field config is managed by a data model. Possible values: null, "auto", "customized".
    If null, this is not related to a data model.
    If "auto", this is auto-generated by a data model.
    If "customized", this was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this field config.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this field config.
    multivalued boolean False
    version int 0
    ignorePresentBit boolean False Uses and loads the present bit.
    minMemberNbBits int 5 Minimum number of bits for the attr part of a value field.
    bloomFilter boolean False Activates a Bloom filter per slot. This speeds up requests containing words that are not present in the field on a given slot.

    Disable this option if all words of the request for this field always match, and if you compact into big slots regularly.

    Enable this option if there are either many misses (e.g. on the "text" field) or small updates (e.g. with real-time indexing).
    hashThreshold int 128 Stores a hash value in field dictionary instead of the original data if value length is greater than this threshold.
    implementation enum(strbtree, fsm) fsm Advanced configuration. Internal structure used to store the field dictionary.
    optimizeListsForPatternSearch boolean False Speeds up pattern search by reducing the number of opened inverted lists, at the expense of indexing time and disk space.

    Nested elements
    • fromDataModelFieldConfig
    • ListsEncoderConfig
    Type Wrapper Description
    FieldConfig fromDataModel If dataModelState is "customized", you will find here the original object generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    ListsEncoderConfig - Configuration of the inverted lists encoder.

    If no configuration is specified, a Rice encoder is used.
    XML representation
    <ValueFieldConfig deltaRefEncodeMultivaluedValues="true" sortMultivaluedValues="true" ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" ignorePresentBit="false" minMemberNbBits="5" bloomFilter="false" hashThreshold="128" implementation="fsm" optimizeListsForPatternSearch="false" ></ValueFieldConfig>
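    The bloomFilter attribute relies on the general Bloom filter technique: a compact bit array that can answer "definitely absent" without opening the underlying structure. The following Python sketch shows the technique only. The class, its sizes and its hash functions are assumptions and do not describe the engine's per-slot implementation.

```python
import hashlib


class BloomFilter:
    """Minimal Bloom filter: no false negatives, occasional false positives."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, word):
        # Derive num_hashes bit positions from salted SHA-256 digests.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{word}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, word):
        for pos in self._positions(word):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, word):
        # False means the word was never added; True means "possibly present".
        return all(self.bits[pos // 8] & (1 << (pos % 8))
                   for pos in self._positions(word))
```

    This matches the guidance above: the filter pays off when many queried words are absent from a slot, and brings little when every queried word matches.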
  • TextFieldConfig com.exalead.mercury.mami.indexing.v10.TextFieldConfig
    Stores alphanumerical content with an internal ordinal mapping, which makes it suitable for efficient faceting. Each term is limited to 1024 bytes.
    Parent objects
    • IndexSchema
    Attributes
    • ramBased boolean
    • multiContext boolean
    • retrievable boolean
    • ignorePresentBit boolean
    • minMemberNbBits int
    • bloomFilter boolean
    • hashThreshold int
    • implementation enum(strbtree, fsm)
    • optimizeListsForPatternSearch boolean
    • deltaRefEncodeMultivaluedValues boolean
    • sortMultivaluedValues boolean
    • fieldName string
    • searchable boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • multivalued boolean
    • version int
    • storePositions boolean
    Name Type Default value Description
    ramBased boolean True A value field must be RAM-based to perform synthesis efficiently.
    multiContext boolean False
    retrievable boolean True
    ignorePresentBit boolean False Uses and loads the present bit.
    minMemberNbBits int 5 Minimum number of bits for the attr part of a value field.
    bloomFilter boolean False Activates a Bloom filter per slot. This speeds up requests containing words that are not present in the field on a given slot.

    Disable this option if all words of the request for this field always match, and if you compact into big slots regularly.

    Enable this option if there are either many misses (e.g. on the "text" field) or small updates (e.g. with real-time indexing).
    hashThreshold int 128 Stores a hash value in field dictionary instead of the original data if value length is greater than this threshold.
    implementation enum(strbtree, fsm) fsm Advanced configuration. Internal structure used to store the field dictionary.
    optimizeListsForPatternSearch boolean False Speeds up pattern search by reducing the number of opened inverted lists, at the expense of indexing time and disk space.
    deltaRefEncodeMultivaluedValues boolean True Delta ref encode multivalued values.
    sortMultivaluedValues boolean True Storing multivalued RAM-based values in an increasing order consumes less RAM.

    This must be disabled to use some advanced multivalued virtual functions.
    fieldName string - The name of the field.
    The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
    searchable boolean False Allows users to query on this field (using a prefix handler).
    dataModelState string - Indicates whether this index field config is managed by a data model. Possible values: null, "auto", "customized".
    If null, this is not related to a data model.
    If "auto", this is auto-generated by a data model.
    If "customized", this was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this field config.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this field config.
    multivalued boolean False
    version int 0
    storePositions boolean True Store positions for seq nodes and proximity scoring.

    Nested elements
    • fromDataModelFieldConfig
    • ListsEncoderConfig
    Type Wrapper Description
    FieldConfig fromDataModel If dataModelState is "customized", you will find here the original object generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    ListsEncoderConfig - Configuration of the inverted lists encoder.

    If no configuration is specified, a Rice encoder is used.
    XML representation
    <TextFieldConfig ramBased="true" multiContext="false" retrievable="true" ignorePresentBit="false" minMemberNbBits="5" bloomFilter="false" hashThreshold="128" implementation="fsm" optimizeListsForPatternSearch="false" deltaRefEncodeMultivaluedValues="true" sortMultivaluedValues="true" fieldName="" searchable="false" dataModelState="" multivalued="false" version="0" storePositions="true" ></TextFieldConfig>
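    The hashThreshold attribute replaces long values by a hash in the field dictionary. A minimal Python sketch of that rule follows. The function name dictionary_key and the choice of SHA-256 are assumptions, since the hash function actually used is not documented here.

```python
import hashlib


def dictionary_key(value: bytes, hash_threshold: int = 128) -> bytes:
    """Return the value itself, or a digest when it exceeds the threshold."""
    if len(value) > hash_threshold:
        # Assumed hash function; the reference does not name the real one.
        return hashlib.sha256(value).digest()
    return value
```

    With the default of 128, a 128-byte value is kept as-is and a 129-byte value is replaced by a fixed-size digest, which bounds the dictionary entry size for long values.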
  • ReferenceFieldConfig com.exalead.mercury.mami.indexing.v10.ReferenceFieldConfig
    Stores alphanumerical content with an internal ordinal mapping, which makes it suitable for efficient faceting. Each term is limited to 1024 bytes.
    Parent objects
    • IndexSchema
    Attributes
    • ramBased boolean
    • multiContext boolean
    • retrievable boolean
    • ignorePresentBit boolean
    • minMemberNbBits int
    • bloomFilter boolean
    • hashThreshold int
    • implementation enum(strbtree, fsm)
    • optimizeListsForPatternSearch boolean
    • deltaRefEncodeMultivaluedValues boolean
    • sortMultivaluedValues boolean
    • fieldName string
    • searchable boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • multivalued boolean
    • version int
    Name Type Default value Description
    ramBased boolean True A value field must be RAM-based to perform synthesis efficiently.
    multiContext boolean False
    retrievable boolean True
    ignorePresentBit boolean False Uses and loads the present bit.
    minMemberNbBits int 5 Minimum number of bits for the attr part of a value field.
    bloomFilter boolean False Activates a Bloom filter per slot. This speeds up requests containing words that are not present in the field on a given slot.

    Disable this option if all words of the request for this field always match, and if you compact into big slots regularly.

    Enable this option if there are either many misses (e.g. on the "text" field) or small updates (e.g. with real-time indexing).
    hashThreshold int 128 Stores a hash value in field dictionary instead of the original data if value length is greater than this threshold.
    implementation enum(strbtree, fsm) fsm Advanced configuration. Internal structure used to store the field dictionary.
    optimizeListsForPatternSearch boolean False Speeds up pattern search by reducing the number of opened inverted lists, at the expense of indexing time and disk space.
    deltaRefEncodeMultivaluedValues boolean True Delta ref encode multivalued values.
    sortMultivaluedValues boolean True Storing multivalued RAM-based values in an increasing order consumes less RAM.

    This must be disabled to use some advanced multivalued virtual functions.
    fieldName string - The name of the field.
    The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
    searchable boolean False Allows users to query on this field (using a prefix handler).
    dataModelState string - Indicates whether this index field config is managed by a data model. Possible values: null, "auto", "customized".
    If null, this is not related to a data model.
    If "auto", this is auto-generated by a data model.
    If "customized", this was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this field config.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this field config.
    multivalued boolean False
    version int 0

    Nested elements
    • fromDataModelFieldConfig
    • ListsEncoderConfig
    Type Wrapper Description
    FieldConfig fromDataModel If dataModelState is "customized", you will find here the original object generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    ListsEncoderConfig - Configuration of the inverted lists encoder.

    If no configuration is specified, a Rice encoder is used.
    XML representation
    <ReferenceFieldConfig ramBased="true" multiContext="false" retrievable="true" ignorePresentBit="false" minMemberNbBits="5" bloomFilter="false" hashThreshold="128" implementation="fsm" optimizeListsForPatternSearch="false" deltaRefEncodeMultivaluedValues="true" sortMultivaluedValues="true" fieldName="" searchable="false" dataModelState="" multivalued="false" version="0" ></ReferenceFieldConfig>
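    The sortMultivaluedValues and deltaRefEncodeMultivaluedValues descriptions suggest why sorted values cost less RAM: consecutive differences become small. The Python sketch below uses plain delta encoding as a stand-in, because the exact "delta ref" scheme is not specified in this reference. All function names are hypothetical, and the bit count is only a rough cost proxy.

```python
def delta_encode(values):
    """Encode a list as its first value followed by successive differences."""
    out, prev = [], 0
    for v in values:
        out.append(v - prev)
        prev = v
    return out


def delta_decode(deltas):
    """Rebuild the original list by accumulating the differences."""
    out, total = [], 0
    for d in deltas:
        total += d
        out.append(total)
    return out


def rough_bit_cost(deltas):
    """Sum of magnitude bit lengths: a crude proxy for variable-length cost."""
    return sum(max(1, abs(d).bit_length()) for d in deltas)
```

    For the values [5, 900, 7, 902], encoding the sorted list gives the deltas [5, 2, 893, 2], which need fewer bits under this proxy than the deltas of the unsorted list.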
  • UnsignedFieldConfig com.exalead.mercury.mami.indexing.v10.UnsignedFieldConfig
    No documentation for this element.
    Parent objects
    • IndexSchema
    Attributes
    • bitsForValue int
    • blockSize int
    • deltaRefEncodeMultivaluedValues boolean
    • sortMultivaluedValues boolean
    • ramBased boolean
    • multiContext boolean
    • fieldName string
    • searchable boolean
    • retrievable boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • multivalued boolean
    • version int
    Name Type Default value Description
    bitsForValue int 63 Number of bits used to store numerical values.

    For unsigned numerical fields, the possible values are [0, 2^N - 1], where N is bitsForValue, and the field values are stored on N bits.

    For signed fields (signed integer and double), the possible values are [-2^N, 2^N - 1], and the field values are stored on (N+1) bits.
    blockSize int 8192
    deltaRefEncodeMultivaluedValues boolean True Delta ref encode multivalued values.
    sortMultivaluedValues boolean True Storing multivalued RAM-based values in an increasing order consumes less RAM.

    This must be disabled to use some advanced multivalued virtual functions.
    ramBased boolean False A value field must be RAM-based to perform synthesis efficiently.
    multiContext boolean False
    fieldName string - The name of the field.
    The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
    searchable boolean False Allows users to query on this field (using a prefix handler).
    retrievable boolean False Allows the content of this field to be retrieved at query time and displayed in the search results.
    dataModelState string - Indicates whether this index field config is managed by a data model. Possible values: null, "auto", "customized".
    If null, this is not related to a data model.
    If "auto", this is auto-generated by a data model.
    If "customized", this was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this field config.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this field config.
    multivalued boolean False
    version int 0

    Nested elements
    • fromDataModelFieldConfig
    • ListsEncoderConfig
    Type Wrapper Description
    FieldConfig fromDataModel If dataModelState is "customized", you will find here the original object generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    ListsEncoderConfig - Configuration of the inverted lists encoder.

    If no configuration is specified, a Rice encoder is used.
    XML representation
    <UnsignedFieldConfig bitsForValue="63" blockSize="8192" deltaRefEncodeMultivaluedValues="true" sortMultivaluedValues="true" ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" ></UnsignedFieldConfig>
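    The bitsForValue description gives the value range as a function of N. The Python sketch below simply evaluates those two formulas; the helper name value_range is hypothetical.

```python
def value_range(bits_for_value: int, signed: bool):
    """Return (min, max, stored_bits) following the bitsForValue description."""
    n = bits_for_value
    if signed:
        # Signed fields: [-2^N, 2^N - 1], stored on N + 1 bits.
        return (-(2 ** n), 2 ** n - 1, n + 1)
    # Unsigned fields: [0, 2^N - 1], stored on N bits.
    return (0, 2 ** n - 1, n)
```

    For instance, value_range(8, signed=False) yields (0, 255, 8), so a field whose values never exceed 255 could use bitsForValue="8" instead of the default 63.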
  • SignedFieldConfig com.exalead.mercury.mami.indexing.v10.SignedFieldConfig
    No documentation for this element.
    Parent objects
    • IndexSchema
    Attributes
    • bitsForValue int
    • blockSize int
    • deltaRefEncodeMultivaluedValues boolean
    • sortMultivaluedValues boolean
    • ramBased boolean
    • multiContext boolean
    • fieldName string
    • searchable boolean
    • retrievable boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • multivalued boolean
    • version int
    Name Type Default value Description
    bitsForValue int 63 Number of bits used to store numerical values.

    For unsigned numerical fields, the possible values are [0, 2^N - 1], where N is bitsForValue, and the field values are stored on N bits.

    For signed fields (signed integer and double), the possible values are [-2^N, 2^N - 1], and the field values are stored on (N+1) bits.
    blockSize int 8192
    deltaRefEncodeMultivaluedValues boolean True Delta ref encode multivalued values.
    sortMultivaluedValues boolean True Storing multivalued RAM-based values in an increasing order consumes less RAM.

    This must be disabled to use some advanced multivalued virtual functions.
    ramBased boolean False A value field must be RAM-based to perform synthesis efficiently.
    multiContext boolean False
    fieldName string - The name of the field.
    The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
    searchable boolean False Allows users to query on this field (using a prefix handler).
    retrievable boolean False Allows the content of this field to be retrieved at query time and displayed in the search results.
    dataModelState string - Indicates whether this index field config is managed by a data model. Possible values: null, "auto", "customized".
    If null, this is not related to a data model.
    If "auto", this is auto-generated by a data model.
    If "customized", this was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this field config.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this field config.
    multivalued boolean False
    version int 0

    Nested elements
    • fromDataModelFieldConfig
    • ListsEncoderConfig
    Type Wrapper Description
    FieldConfig fromDataModel If dataModelState is "customized", you will find here the original object generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    ListsEncoderConfig - Configuration of the inverted lists encoder.

    If no configuration is specified, a Rice encoder is used.
    XML representation
    <SignedFieldConfig bitsForValue="63" blockSize="8192" deltaRefEncodeMultivaluedValues="true" sortMultivaluedValues="true" ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" ></SignedFieldConfig>
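    The XML representation above can be read with any XML parser. As an illustration, this Python sketch parses it with the standard library and converts the attributes to the types listed in the attribute table. The helper parse_field_config and the two attribute sets are hypothetical conveniences, not a CloudView API.

```python
import xml.etree.ElementTree as ET

# XML representation copied from the SignedFieldConfig entry above.
SIGNED_FIELD_XML = (
    '<SignedFieldConfig bitsForValue="63" blockSize="8192" '
    'deltaRefEncodeMultivaluedValues="true" sortMultivaluedValues="true" '
    'ramBased="false" multiContext="false" fieldName="" searchable="false" '
    'retrievable="false" dataModelState="" multivalued="false" version="0" >'
    '</SignedFieldConfig>'
)

INT_ATTRS = {"bitsForValue", "blockSize", "version"}
STRING_ATTRS = {"fieldName", "dataModelState"}


def parse_field_config(xml_text):
    """Return (element name, typed attribute dict); hypothetical helper."""
    elem = ET.fromstring(xml_text)
    typed = {}
    for key, raw in elem.attrib.items():
        if key in INT_ATTRS:
            typed[key] = int(raw)
        elif key in STRING_ATTRS:
            typed[key] = raw
        else:
            # Every remaining attribute of this element is a boolean.
            typed[key] = (raw == "true")
    return elem.tag, typed
```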
  • DoubleFieldConfig com.exalead.mercury.mami.indexing.v10.DoubleFieldConfig
    Configuration of a double precision floating point number field.
    Parent objects
    • IndexSchema
    Attributes
    • bitsForValue int
    • blockSize int
    • deltaRefEncodeMultivaluedValues boolean
    • sortMultivaluedValues boolean
    • ramBased boolean
    • multiContext boolean
    • fieldName string
    • searchable boolean
    • retrievable boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • multivalued boolean
    • version int
    • precision int
    Name Type Default value Description
    bitsForValue int 63 Number of bits used to store numerical values.

    For unsigned numerical fields, the possible values are [0, 2^N - 1], where N is bitsForValue, and the field values are stored on N bits.

    For signed fields (signed integer and double), the possible values are [-2^N, 2^N - 1], and the field values are stored on (N+1) bits.
    blockSize int 8192
    deltaRefEncodeMultivaluedValues boolean True Delta ref encode multivalued values.
    sortMultivaluedValues boolean True Storing multivalued RAM-based values in an increasing order consumes less RAM.

    This must be disabled to use some advanced multivalued virtual functions.
    ramBased boolean False A value field must be RAM-based to perform synthesis efficiently.
    multiContext boolean False
    fieldName string - The name of the field.
    The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
    searchable boolean False Allows users to query on this field (using a prefix handler).
    retrievable boolean False Allows the content of this field to be retrieved at query time and displayed in the search results.
    dataModelState string - Indicates whether this index field config is managed by a data model. Possible values: null, "auto", "customized".
    If null, this is not related to a data model.
    If "auto", this is auto-generated by a data model.
    If "customized", this was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this field config.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this field config.
    multivalued boolean False
    version int 0
    precision int 4 Number of relevant digits in the decimal part.

    Nested elements
    • fromDataModelFieldConfig
    • ListsEncoderConfig
    Type Wrapper Description
    FieldConfig fromDataModel If dataModelState is "customized", you will find here the original object generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    ListsEncoderConfig - Configuration of the inverted lists encoder.

    If no configuration is specified, a Rice encoder is used.
    XML representation
    <DoubleFieldConfig bitsForValue="63" blockSize="8192" deltaRefEncodeMultivaluedValues="true" sortMultivaluedValues="true" ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" precision="4" ></DoubleFieldConfig>
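    The precision attribute is described as the number of relevant digits in the decimal part. The Python sketch below shows what keeping 4 decimal digits looks like. Whether the engine rounds or truncates is not stated in this reference, so the rounding mode used here is an assumption, and keep_decimal_digits is a hypothetical helper.

```python
from decimal import Decimal, ROUND_HALF_EVEN


def keep_decimal_digits(value: str, precision: int = 4) -> Decimal:
    """Keep `precision` digits after the decimal point (assumed rounding mode)."""
    # Decimal(1).scaleb(-4) is 1E-4, i.e. a quantum of 0.0001.
    quantum = Decimal(1).scaleb(-precision)
    return Decimal(value).quantize(quantum, rounding=ROUND_HALF_EVEN)
```

    With the default precision, "3.14159265" becomes 3.1416, so digits beyond the fourth decimal place should not be relied on at query time.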
  • TimeFieldConfig com.exalead.mercury.mami.indexing.v10.TimeFieldConfig
    No documentation for this element.
    Parent objects
    • IndexSchema
    Attributes
    • deltaRefEncodeMultivaluedValues boolean
    • sortMultivaluedValues boolean
    • ramBased boolean
    • multiContext boolean
    • fieldName string
    • searchable boolean
    • retrievable boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • multivalued boolean
    • version int
    Name Type Default value Description
    deltaRefEncodeMultivaluedValues boolean True Delta ref encode multivalued values.
    sortMultivaluedValues boolean True Storing multivalued RAM-based values in an increasing order consumes less RAM.

    This must be disabled to use some advanced multivalued virtual functions.
    ramBased boolean False A value field must be RAM-based to perform synthesis efficiently.
    multiContext boolean False
    fieldName string - The name of the field.
    The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
    searchable boolean False Allows users to query on this field (using a prefix handler).
    retrievable boolean False Allows the content of this field to be retrieved at query time and displayed in the search results.
    dataModelState string - Indicates whether this index field config is managed by a data model. Possible values: null, "auto", "customized".
    If null, this is not related to a data model.
    If "auto", this is auto-generated by a data model.
    If "customized", this was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this field config.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this field config.
    multivalued boolean False
    version int 0

    Nested elements
    • fromDataModelFieldConfig
    • ListsEncoderConfig
    Type Wrapper Description
    FieldConfig fromDataModel If dataModelState is "customized", you will find here the original object generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    ListsEncoderConfig - Configuration of the inverted lists encoder.

    If no configuration is specified, a Rice encoder is used.
    XML representation
    <TimeFieldConfig deltaRefEncodeMultivaluedValues="true" sortMultivaluedValues="true" ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" ></TimeFieldConfig>
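    The Nested elements table above states that a Rice encoder is used when no ListsEncoderConfig is given. For readers unfamiliar with the technique, here is a textbook Rice coding sketch in Python, operating on bit strings for readability. It illustrates the general algorithm only. The parameter k, the bit order and the function names are assumptions and do not reflect the engine's on-disk format.

```python
def rice_encode(n: int, k: int) -> str:
    """Rice-code a non-negative integer: unary quotient, then k-bit remainder."""
    quotient = n >> k
    remainder = n & ((1 << k) - 1)
    remainder_bits = format(remainder, "b").zfill(k) if k > 0 else ""
    # Unary part: `quotient` ones terminated by a single zero.
    return "1" * quotient + "0" + remainder_bits


def rice_decode(bits: str, k: int) -> int:
    """Inverse of rice_encode for a single code word."""
    quotient = bits.index("0")  # count of leading ones
    remainder = int(bits[quotient + 1:quotient + 1 + k], 2) if k > 0 else 0
    return (quotient << k) | remainder
```

    Small integers get short codes, which suits inverted lists stored as gaps between consecutive document IDs.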
  • DateFieldConfig com.exalead.mercury.mami.indexing.v10.DateFieldConfig
    No documentation for this element.
    Parent objects
    • IndexSchema
    Attributes
    • deltaRefEncodeMultivaluedValues boolean
    • sortMultivaluedValues boolean
    • ramBased boolean
    • multiContext boolean
    • fieldName string
    • searchable boolean
    • retrievable boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • multivalued boolean
    • version int
    Name Type Default value Description
    deltaRefEncodeMultivaluedValues boolean True Delta ref encode multivalued values.
    sortMultivaluedValues boolean True Storing multivalued RAM-based values in an increasing order consumes less RAM.

    This must be disabled to use some advanced multivalued virtual functions.
    ramBased boolean False A value field must be RAM-based to perform synthesis efficiently.
    multiContext boolean False
    fieldName string - The name of the field.
    The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
    searchable boolean False Allows users to query on this field (using a prefix handler).
    retrievable boolean False Allows the content of this field to be retrieved at query time and displayed in the search results.
    dataModelState string - Indicates whether this index field config is managed by a data model. Possible values: null, "auto", "customized".
    If null, this is not related to a data model.
    If "auto", this is auto-generated by a data model.
    If "customized", this was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this field config.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this field config.
    multivalued boolean False
    version int 0

    Nested elements
    • fromDataModelFieldConfig
    • ListsEncoderConfig
    Type Wrapper Description
    FieldConfig fromDataModel If dataModelState is "customized", you will find here the original object generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    ListsEncoderConfig - Configuration of the inverted lists encoder.

    If no configuration is specified, a Rice encoder is used.
    XML representation
    <DateFieldConfig deltaRefEncodeMultivaluedValues="true" sortMultivaluedValues="true" ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" ></DateFieldConfig>
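    The fromDataModel wrapper described above keeps the original generated object so that a "customized" field can be reverted to "auto". The Python sketch below models that idea with plain dictionaries. The dictionary layout and the revert_to_auto helper are hypothetical and only mirror the attribute names from this reference.

```python
def revert_to_auto(field_config: dict) -> dict:
    """Return the data-model-generated config for a customized field."""
    if field_config.get("dataModelState") != "customized":
        # Nothing to revert: the field is unmanaged or already "auto".
        return field_config
    # Copy the preserved original and mark it as auto-generated again.
    original = dict(field_config["fromDataModel"])
    original["dataModelState"] = "auto"
    return original
```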
  • BinaryFieldConfig com.exalead.mercury.mami.indexing.v10.BinaryFieldConfig
    No documentation for this element.
    Parent objects
    • IndexSchema
    Attributes
    • ramBased boolean
    • multiContext boolean
    • fieldName string
    • searchable boolean
    • retrievable boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • multivalued boolean
    • version int
    • gzip boolean
    Name Type Default value Description
    ramBased boolean False A value field must be RAM-based to perform synthesis efficiently.
    multiContext boolean False
    fieldName string - The name of the field.
    The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
    searchable boolean False Allows users to query on this field (using a prefix handler).
    retrievable boolean False Allows the content of this field to be retrieved at query time and displayed in the search results.
    dataModelState string - Indicates whether this index field config is managed by a data model. Possible values: null, "auto", "customized".
    If null, this is not related to a data model.
    If "auto", this is auto-generated by a data model.
    If "customized", this was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this field config.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this field config.
    multivalued boolean False
    version int 0
    gzip boolean False Activates content compression.

    Nested elements
    • fromDataModelFieldConfig
    • ListsEncoderConfig
    Type Wrapper Description
    FieldConfig fromDataModel If dataModelState is "customized", you will find here the original object generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    ListsEncoderConfig - Configuration of the inverted lists encoder.

    If no configuration is specified, a Rice encoder is used.
    XML representation
    <BinaryFieldConfig ramBased="false" multiContext="false" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" gzip="false" ></BinaryFieldConfig>
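    The fieldName constraint quoted above ([a-z0-9_]+) can be checked before a field config is written. The following is a minimal sketch, not part of CloudView; the helper name is_valid_field_name is hypothetical and only the pattern itself comes from this reference.

```python
import re

# Pattern quoted in the reference: lower-case letters, digits and underscore.
FIELD_NAME_PATTERN = re.compile(r"[a-z0-9_]+")


def is_valid_field_name(field_name: str) -> bool:
    """Return True when the whole name matches [a-z0-9_]+ (hypothetical helper)."""
    return FIELD_NAME_PATTERN.fullmatch(field_name) is not None


if __name__ == "__main__":
    for candidate in ("myclass_myproperty", "MyField", "price-eur", ""):
        print(repr(candidate), is_valid_field_name(candidate))
```

    fullmatch is used rather than match so that a name with a valid prefix but an invalid suffix (for example "price-eur") is rejected.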
  • CategoryFieldConfig com.exalead.mercury.mami.indexing.v10.CategoryFieldConfig
    Stores hierarchy content. Each term is limited to 1024 bytes.
    Parent objects
    • IndexSchema
    Attributes
    • fieldName string
    • searchable boolean
    • retrievable boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • multivalued boolean
    • version int
    • ramBased boolean
    • implementation enum(strbtree, fsm)
    Name Type Default value Description
    fieldName string - The name of the field.
    The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
    searchable boolean False Allows users to query on this field (using a prefix handler).
    retrievable boolean False Allows the content of this field to be retrieved at query time and displayed in the search results.
    dataModelState string - Is this index field config managed by a data model? @enum{null,auto,customized}.
    If null, this is not related to a data model.
    If "auto", this is auto-generated by a data model.
    If "customized", this was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this field config.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this field config.
    multivalued boolean False
    version int 0
    ramBased boolean True A value field must be RAM-based to perform synthesis efficiently.
    implementation enum(strbtree, fsm) strbtree Advanced configuration. Internal structure used to store the field dictionary.

    Nested elements
    • fromDataModelFieldConfig
    • ListsEncoderConfig
    Type Wrapper Description
    FieldConfig fromDataModel If dataModelState is "customized", you will find here the original object generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    ListsEncoderConfig - Configuration of the inverted lists encoder.

    If no configuration is specified, a Rice encoder is used.
    XML representation
    <CategoryFieldConfig fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" ramBased="true" implementation="strbtree" ></CategoryFieldConfig>
  • HierarchyFieldConfig com.exalead.mercury.mami.indexing.v10.HierarchyFieldConfig
    Stores hierarchy content. Each term is limited to 1024 bytes.
    Parent objects
    • IndexSchema
    Attributes
    • ramBased boolean
    • implementation enum(strbtree, fsm)
    • fieldName string
    • searchable boolean
    • retrievable boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • multivalued boolean
    • version int
    Name Type Default value Description
    ramBased boolean True A value field must be RAM-based to perform synthesis efficiently.
    implementation enum(strbtree, fsm) strbtree Advanced configuration. Internal structure used to store the field dictionary.
    fieldName string - The name of the field.
    The name of a field can only contain lower-case characters, numbers and underscore. [a-z0-9_]+
    searchable boolean False Allows users to query on this field (using a prefix handler).
    retrievable boolean False Allows the content of this field to be retrieved at query time and displayed in the search results.
    dataModelState string - Is this index field config managed by a data model? @enum{null,auto,customized}.
    If null, this is not related to a data model.
    If "auto", this is auto-generated by a data model.
    If "customized", this was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this field config.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this field config.
    multivalued boolean False
    version int 0

    Nested elements
    • fromDataModelFieldConfig
    • ListsEncoderConfig
    Type Wrapper Description
    FieldConfig fromDataModel If dataModelState is "customized", you will find here the original object generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    ListsEncoderConfig - Configuration of the inverted lists encoder.

    If no configuration is specified, a Rice encoder is used.
    XML representation
    <HierarchyFieldConfig ramBased="true" implementation="strbtree" fieldName="" searchable="false" retrievable="false" dataModelState="" multivalued="false" version="0" ></HierarchyFieldConfig>
  • IndexingConfig com.exalead.mercury.mami.indexing.v10.IndexingConfig
    No documentation for this element.
    Attributes
    • name string
    Name Type Default value Description
    name string -

    Nested elements
    • AnalysisPolicy
    • CommitTriggerCondition*
    • ImportPolicy
    • IndexManagementPolicy
    • WriteAttributeSlotConfig*
    • WriteSlotConfig
    Type Wrapper Description
    AnalysisPolicy -
    CommitTriggerCondition* -
    ImportPolicy -
    IndexManagementPolicy -
    WriteAttributeSlotConfig* -
    WriteSlotConfig -
    XML representation
    <IndexingConfig name="" ></IndexingConfig>
  • FixedThreadsAnalysisPolicy com.exalead.mercury.mami.indexing.v10.FixedThreadsAnalysisPolicy
    Instantiates a fixed number of analysis threads.

    Dispatches documents according to their DIDs (Document IDs) and slice.
    Parent objects
    • IndexingConfig
    Attributes
    • maxRAMConsumptionThreshold enum(disabled, enabled, auto)
    • maxRAMConsumptionMB int
    • nbThreads int
    Name Type Default value Description
    maxRAMConsumptionThreshold enum(disabled, enabled, auto) enabled When reaching the RAM value specified, analysis is stopped and analyzed documents are imported to the index. Then analysis starts again.
    • Enabled: Commits when the RAM size reaches the Threshold value specified (by default, 2048 MB).
    • Auto: Commits when the RAM size reaches 2048 MB.
    maxRAMConsumptionMB int 2048 The maximum amount of non-Java RAM the analyzer can allocate. Reaching this limit triggers a commit.
    nbThreads int 4 Number of threads to allocate.

    XML representation
    <FixedThreadsAnalysisPolicy maxRAMConsumptionThreshold="enabled" maxRAMConsumptionMB="2048" nbThreads="4" ></FixedThreadsAnalysisPolicy>
  • PerSliceAnalysisPolicy com.exalead.mercury.mami.indexing.v10.PerSliceAnalysisPolicy
    Instantiates an analysis thread for each slice.

    Dispatches documents according to their slice.

    Consumes less RAM than the 'FixedThreadsAnalysisPolicy'.
    Parent objects
    • IndexingConfig
    Attributes
    • maxRAMConsumptionThreshold enum(disabled, enabled, auto)
    • maxRAMConsumptionMB int
    • nbThreads int
    Name Type Default value Description
    maxRAMConsumptionThreshold enum(disabled, enabled, auto) enabled When reaching the RAM value specified, analysis is stopped and analyzed documents are imported to the index. Then analysis starts again.
    • Enabled: Commits when the RAM size reaches the Threshold value specified (by default, 2048 MB).
    • Auto: Commits when the RAM size reaches 2048 MB.
    maxRAMConsumptionMB int 2048 The maximum amount of non-Java RAM the analyzer can allocate. Reaching this limit triggers a commit.
    nbThreads int 1 Uses N threads per slice.

    XML representation
    <PerSliceAnalysisPolicy maxRAMConsumptionThreshold="enabled" maxRAMConsumptionMB="2048" nbThreads="1" ></PerSliceAnalysisPolicy>
  • SameThreadAnalysisPolicy com.exalead.mercury.mami.indexing.v10.SameThreadAnalysisPolicy
    Instantiates an analysis thread for each incoming PAPI thread.

    Each PAPI thread analyzes its tasks synchronously.
    Parent objects
    • IndexingConfig
    Attributes
    • maxRAMConsumptionThreshold enum(disabled, enabled, auto)
    • maxRAMConsumptionMB int
    Name Type Default value Description
    maxRAMConsumptionThreshold enum(disabled, enabled, auto) enabled When reaching the RAM value specified, analysis is stopped and analyzed documents are imported to the index. Then analysis starts again.
    • Enabled: Commits when the RAM size reaches the Threshold value specified (by default, 2048 MB).
    • Auto: Commits when the RAM size reaches 2048 MB.
    maxRAMConsumptionMB int 2048 The maximum amount of non-Java RAM the analyzer can allocate. Reaching this limit triggers a commit.

    XML representation
    <SameThreadAnalysisPolicy maxRAMConsumptionThreshold="enabled" maxRAMConsumptionMB="2048" ></SameThreadAnalysisPolicy>
  • AutomaticAnalysisPolicy com.exalead.mercury.mami.indexing.v10.AutomaticAnalysisPolicy
    Depending on the number of threads specified, CloudView automatically chooses the most efficient analysis policy.

    Changes made in Analyze require a restart of CloudView, or at least of the indexing server process, to be taken into account.
    Parent objects
    • IndexingConfig
    Attributes
    • maxRAMConsumptionThreshold enum(disabled, enabled, auto)
    • maxRAMConsumptionMB int
    • nbThreads int
    Name Type Default value Description
    maxRAMConsumptionThreshold enum(disabled, enabled, auto) enabled When reaching the RAM value specified, analysis is stopped and analyzed documents are imported to the index. Then analysis starts again.
    • Enabled: Commits when the RAM size reaches the Threshold value specified (by default, 2048 MB).
    • Auto: Commits when the RAM size reaches 2048 MB.
    maxRAMConsumptionMB int 2048 The maximum amount of non-Java RAM the analyzer can allocate. Reaching this limit triggers a commit.
    nbThreads int 0 If not set, or set to a multiple of 'nbSlices', it uses the 'PerSliceAnalysisPolicy'.

    Otherwise, it uses 'FixedThreadsAnalysisPolicy'.

    XML representation
    <AutomaticAnalysisPolicy maxRAMConsumptionThreshold="enabled" maxRAMConsumptionMB="2048" nbThreads="0" ></AutomaticAnalysisPolicy>
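    The selection rule described for nbThreads can be sketched as follows. This is an illustration only, not CloudView code: the function name resolve_analysis_policy is hypothetical, and treating the default value 0 as "not set" is an assumption.

```python
def resolve_analysis_policy(nb_threads: int, nb_slices: int) -> str:
    """Return the name of the policy the documented rule would select."""
    if nb_slices <= 0:
        raise ValueError("nb_slices must be positive")
    # Assumption: the default nbThreads=0 means "not set".
    if nb_threads == 0 or nb_threads % nb_slices == 0:
        return "PerSliceAnalysisPolicy"
    return "FixedThreadsAnalysisPolicy"


if __name__ == "__main__":
    for threads in (0, 4, 6, 8):
        print(threads, resolve_analysis_policy(threads, nb_slices=4))
```

    With 4 slices, nbThreads values of 0, 4 and 8 resolve to the per-slice policy, while 6 resolves to the fixed-threads policy.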
  • NumberOfTasksBasedCommitTriggerCondition com.exalead.mercury.mami.indexing.v10.NumberOfTasksBasedCommitTriggerCondition
    Triggers a commit after the specified number of tasks has been processed.
    To avoid performance penalties, the number of tasks is only evaluated each time a batch of documents is received. Slightly more tasks than the specified number may therefore be analyzed before the commit.
    Parent objects
    • IndexingConfig
    • AggregationTriggerConditions ConsolidationConfig
    • CommitTriggerConditions ConsolidationConfig
    Attributes
    • nbTasks int
    Name Type Default value Description
    nbTasks int - The number of tasks

    XML representation
    <NumberOfTasksBasedCommitTriggerCondition nbTasks="" ></NumberOfTasksBasedCommitTriggerCondition>
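    Because the task count is only evaluated when a batch arrives, a commit can cover more tasks than nbTasks. The sketch below simulates that behavior; the function commit_points is hypothetical, and both the ">=" comparison and the counter reset after a commit are assumptions.

```python
from typing import Iterable, List


def commit_points(batch_sizes: Iterable[int], nb_tasks: int) -> List[int]:
    """Return the number of tasks covered by each simulated commit."""
    commits: List[int] = []
    pending = 0
    for size in batch_sizes:
        pending += size
        # The threshold is checked once per batch, not once per task.
        if pending >= nb_tasks:
            commits.append(pending)
            pending = 0  # assumption: the counter restarts after a commit
    return commits


if __name__ == "__main__":
    # Six batches of 40 tasks with nbTasks=100: each commit covers 120 tasks.
    print(commit_points([40] * 6, nb_tasks=100))
```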
  • SizeBasedCommitTriggerCondition com.exalead.mercury.mami.indexing.v10.SizeBasedCommitTriggerCondition
    Triggers a commit when the Max size (MB) is reached.
    Parent objects
    • IndexingConfig
    • AggregationTriggerConditions ConsolidationConfig
    • CommitTriggerConditions ConsolidationConfig
    Attributes
    • maxSizeMB int
    Name Type Default value Description
    maxSizeMB int - Max size threshold in MB

    XML representation
    <SizeBasedCommitTriggerCondition maxSizeMB="" ></SizeBasedCommitTriggerCondition>
  • RAMUsageCommitTriggerCondition com.exalead.mercury.mami.indexing.v10.RAMUsageCommitTriggerCondition
    Triggers a commit when RAM usage reaches the limit.
    Parent objects
    • IndexingConfig
    • AggregationTriggerConditions ConsolidationConfig
    • CommitTriggerConditions ConsolidationConfig
    Attributes
    • maxRAMUsageInMB int
    Name Type Default value Description
    maxRAMUsageInMB int - Max RAM usage in MB

    XML representation
    <RAMUsageCommitTriggerCondition maxRAMUsageInMB="" ></RAMUsageCommitTriggerCondition>
  • PeriodicCommitTriggerCondition com.exalead.mercury.mami.indexing.v10.PeriodicCommitTriggerCondition
    Commits every N seconds after the first push order done after the last commit.
    Parent objects
    • IndexingConfig
    • AggregationTriggerConditions ConsolidationConfig
    • CommitTriggerConditions ConsolidationConfig
    Attributes
    • delayS long
    Name Type Default value Description
    delayS long - Time in seconds between two commits.

    XML representation
    <PeriodicCommitTriggerCondition delayS="" ></PeriodicCommitTriggerCondition>
  • InactivityCommitTriggerCondition com.exalead.mercury.mami.indexing.v10.InactivityCommitTriggerCondition
    Inactivity-based condition.
    This condition is triggered when:
    • there is no new data for the specified time period
    • AND at least the specified number of tasks has been analyzed.
    Parent objects
    • IndexingConfig
    • AggregationTriggerConditions ConsolidationConfig
    • CommitTriggerConditions ConsolidationConfig
    Attributes
    • numberOfTasks int
    • inactivityTimeS long
    • inactivityTimeMs long
    Name Type Default value Description
    numberOfTasks int - Minimum number of tasks to trigger a commit.
    inactivityTimeS long - Indexing is considered inactive after N seconds without indexing activity.
    inactivityTimeMs long - Indexing is considered inactive after N milliseconds without indexing activity.
    If both inactivityTimeS and inactivityTimeMs are set, the wait is 1000*inactivityTimeS + inactivityTimeMs milliseconds.

    XML representation
    <InactivityCommitTriggerCondition numberOfTasks="" inactivityTimeS="" inactivityTimeMs="" ></InactivityCommitTriggerCondition>
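    The combined delay and the two-part condition can be sketched as below. The function names are hypothetical and the ">=" comparisons are assumptions; only the delay formula and the AND between the two conditions come from this reference.

```python
def inactivity_delay_ms(inactivity_time_s: int = 0, inactivity_time_ms: int = 0) -> int:
    """Total wait in milliseconds when one or both attributes are set."""
    return 1000 * inactivity_time_s + inactivity_time_ms


def inactivity_commit_due(idle_ms: int, analyzed_tasks: int, number_of_tasks: int,
                          inactivity_time_s: int = 0, inactivity_time_ms: int = 0) -> bool:
    """True when no data arrived for the delay AND enough tasks were analyzed."""
    delay = inactivity_delay_ms(inactivity_time_s, inactivity_time_ms)
    return idle_ms >= delay and analyzed_tasks >= number_of_tasks


if __name__ == "__main__":
    print(inactivity_delay_ms(2, 500))                 # 2 s + 500 ms
    print(inactivity_commit_due(3000, 10, 5, 2, 500))  # idle long enough, enough tasks
    print(inactivity_commit_due(3000, 3, 5, 2, 500))   # idle long enough, too few tasks
```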
  • ParallelImportPolicy com.exalead.mercury.mami.indexing.v10.ParallelImportPolicy
    One generation is created for each analysis buffer.

    Analysis buffers are imported in parallel.
    Parent objects
    • IndexingConfig
    Attributes
    • nbThreads int
    Name Type Default value Description
    nbThreads int 8 The number of parallel imports.

    XML representation
    <ParallelImportPolicy nbThreads="8" ></ParallelImportPolicy>
  • MergedImportPolicy com.exalead.mercury.mami.indexing.v10.MergedImportPolicy
    All analysis buffers are merged into a single buffer, which is imported as a single generation.
    XML representation
    <MergedImportPolicy ></MergedImportPolicy>
  • StandardIndexManagementPolicy com.exalead.mercury.mami.indexing.v10.StandardIndexManagementPolicy
    Default index (service + build) runtime configuration
    Parent objects
    • IndexingConfig
    Attributes
    • gcEveryS int
    Name Type Default value Description
    gcEveryS int 120 Trigger a GC every N seconds.

    Nested elements
    • CommitPolicy
    • CompactPolicies
    • UploadPolicy
    Type Wrapper Description
    CommitPolicy - The commit policy used to configure how the index persists its files to disk.
    CompactPolicies - The compact policies used to trigger slots compaction.
    UploadPolicy - The upload policy used to replicate new slots to replicas.
    XML representation
    <StandardIndexManagementPolicy gcEveryS="120" ></StandardIndexManagementPolicy>
  • StandardCommitPolicy com.exalead.mercury.mami.indexing.v10.StandardCommitPolicy
    Default commit policy
    XML representation
    <StandardCommitPolicy ></StandardCommitPolicy>
  • CompactPolicies com.exalead.mercury.mami.indexing.v10.CompactPolicies
    No documentation for this element.
    Parent objects
    • AdvancedConfig
    • StandardIndexManagementPolicy
    Attributes
    • synchronous boolean
    • maxParallelFullCompacts int
    • type enum(mmap, pagecache)
    • maxPageCacheSizeMB int
    • pageCachePageSizeKB int
    • priorityCompactThreshold int
    • lowPriorityCompactNbThreads int
    • highPriorityCompactNbThreads int
    Name Type Default value Description
    synchronous boolean False By default, compaction jobs are asynchronous. If set, compacts will be done synchronously just after imports.
    maxParallelFullCompacts int 0 Limits the number of full compacts running in parallel. This can be useful when available disk space is limited. 0 means no limit.
    type enum(mmap, pagecache) mmap Specifies which I/O mode is used while compacting.
    Value can be null or one of
    • mmap
    • pagecache
    maxPageCacheSizeMB int 32 If the policy uses the PageCache mode, it specifies the max cache size.
    pageCachePageSizeKB int 8 If the policy uses the PageCache mode, it specifies the page size.
    priorityCompactThreshold int 64 When compacting a slot spanning gen0-gen1, the compact is considered a priority compact if gen1-gen0 < priorityCompactThreshold. (0: disabled)
    lowPriorityCompactNbThreads int 2 Number of threads to use for a compact having low priority (0: all available threads).
    highPriorityCompactNbThreads int 0 Number of threads to use for a compact having high priority (0: all available threads).

    Nested elements
    • AutoCompactPolicy*
    Type Wrapper Description
    AutoCompactPolicy* - Specifies the auto-compact policies.
    XML representation
    <CompactPolicies synchronous="false" maxParallelFullCompacts="0" type="mmap" maxPageCacheSizeMB="32" pageCachePageSizeKB="8" priorityCompactThreshold="64" lowPriorityCompactNbThreads="2" highPriorityCompactNbThreads="0" ></CompactPolicies>
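    The interaction between priorityCompactThreshold and the two thread-count attributes can be sketched as follows. This is an interpretation, not CloudView code: the function name is hypothetical, mapping a "priority compact" to highPriorityCompactNbThreads is an assumption, and os.cpu_count() only stands in for "all available threads".

```python
import os


def compact_thread_count(gen0: int, gen1: int,
                         priority_compact_threshold: int = 64,
                         low_priority_nb_threads: int = 2,
                         high_priority_nb_threads: int = 0) -> int:
    """Return the number of threads a compact of slot gen0-gen1 would use."""
    span = gen1 - gen0
    # A threshold of 0 disables priority compacts.
    is_priority = priority_compact_threshold > 0 and span < priority_compact_threshold
    # Assumption: priority compacts use the high-priority thread setting.
    configured = high_priority_nb_threads if is_priority else low_priority_nb_threads
    if configured == 0:
        # Assumption: "all available threads" approximated by the CPU count.
        return os.cpu_count() or 1
    return configured


if __name__ == "__main__":
    print(compact_thread_count(10, 200))                             # wide span, low priority
    print(compact_thread_count(10, 20, high_priority_nb_threads=4))  # narrow span, priority
```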
  • NumberOfSlotsBasedCompactPolicy com.exalead.mercury.mami.indexing.v10.NumberOfSlotsBasedCompactPolicy
    Compaction policy based on a fixed number of slots for a given number of generations.
    Parent objects
    • CompactPolicies
    Attributes
    • component string
    • arity int
    • maxSlotSizeMb long
    Name Type Default value Description
    component string -
    arity int 4 Specifies the number of slots of the same length required to compact.
    maxSlotSizeMb long 5000 If a slot reaches this size, it will never be used by subsequent automatic compaction processes.

    Nested elements
    • FullCompactPolicy
    Type Wrapper Description
    FullCompactPolicy -
    XML representation
    <NumberOfSlotsBasedCompactPolicy arity="4" maxSlotSizeMb="5000" ></NumberOfSlotsBasedCompactPolicy>
  • MaxSizeFullCompactPolicy com.exalead.mercury.mami.indexing.v10.MaxSizeFullCompactPolicy
    A FullCompactPolicy that compacts all slots into one whenever the "tail" of small slots exceeds a certain ratio of the large first slot.

    This policy is appropriate when auto-compacts are restricted to slots under a certain size for performance reasons.

    In this case, a full optimization can occasionally be triggered to purge the deletes.

    If not, the deletes occurring in later slots would never be deleted, incurring performance costs at query-time and extra disk space consumption.
    Parent objects
    • AutoCompactPolicy
    • LowLatencyCompactPolicy
    • NoCompactPolicy
    • NumberOfSlotsBasedCompactPolicy
    • SlotsLogSizeBasedCompactPolicy
    • SlotsSizeBasedCompactPolicy
    Attributes
    • percentage int
    • minSlots int
    Name Type Default value Description
    percentage int 100 Minimum percentage to launch a full compaction.

    Compacts all slots into one whenever the "tail" of small slots exceeds a certain percentage of the large first slot.

    For example, with percentage=100, a full compact is triggered when the cumulative size of all slots except the biggest exceeds the size of the biggest slot.
    minSlots int 2 Minimum number of slots before triggering a full compact.

    XML representation
    <MaxSizeFullCompactPolicy percentage="100" minSlots="2" ></MaxSizeFullCompactPolicy>
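    The trigger described for percentage and minSlots can be sketched as below. The function name needs_full_compact is hypothetical, and reading minSlots as "at least this many slots must exist" is an assumption.

```python
from typing import Sequence


def needs_full_compact(slot_sizes: Sequence[int],
                       percentage: int = 100,
                       min_slots: int = 2) -> bool:
    """True when the tail of smaller slots exceeds `percentage` % of the biggest slot."""
    # Assumption: no full compact below min_slots slots.
    if not slot_sizes or len(slot_sizes) < min_slots:
        return False
    biggest = max(slot_sizes)
    tail = sum(slot_sizes) - biggest  # cumulative size of all other slots
    # Integer comparison equivalent to tail / biggest > percentage / 100.
    return tail * 100 > biggest * percentage


if __name__ == "__main__":
    print(needs_full_compact([1000, 300, 300, 300]))  # tail 900 does not exceed 1000
    print(needs_full_compact([1000, 400, 400, 300]))  # tail 1100 exceeds 1000
```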
  • ArityBasedFullCompactPolicy com.exalead.mercury.mami.indexing.v10.ArityBasedFullCompactPolicy
    A FullCompactPolicy that compacts all slots into one whenever the "tail" of slots with smaller arities together exceeds a certain arity.

    The idea is that the arity-based policy guarantees occasional full-compaction, but the time interval between full-compaction increases exponentially.

    This add-on policy caps the increase at a certain arity, and schedules full-compacts at regular intervals afterwards.

    This policy is appropriate when auto-compacts are managed per generation-arity.

    In this case, a full optimization can occasionally be triggered to purge the deletes.

    If not, the deletes occurring in later slots would never be deleted, incurring performance costs at query-time and extra disk space consumption.
    Parent objects
    • AutoCompactPolicy
    • LowLatencyCompactPolicy
    • NoCompactPolicy
    • NumberOfSlotsBasedCompactPolicy
    • SlotsLogSizeBasedCompactPolicy
    • SlotsSizeBasedCompactPolicy
    Attributes
    • maxArity int
    • minSize long
    Name Type Default value Description
    maxArity int 256 Whenever the long tail total arity reaches maxArity, a full compact is scheduled.

    The "long tail" consists of the slots whose span has an arity lower than this parameter.

    This is generally a multiple of the auto-compact Arity policy arity parameter.
    minSize long 0 Slots below this size are considered negligible.

    XML representation
    <ArityBasedFullCompactPolicy maxArity="256" minSize="0" ></ArityBasedFullCompactPolicy>
  • SlotsSizeBasedCompactPolicy com.exalead.mercury.mami.indexing.v10.SlotsSizeBasedCompactPolicy
    Compaction policy based on size that produces slots with similar size.

    When N consecutive slots have a size below targetSizeForCompactionMB, it performs a compaction if:
    • N is at least minArity AND
      • The N+1 slot makes the size above targetSizeForCompactionMB OR
      • The size is above minSizeForCompactionMB
    Parent objects
    • CompactPolicies
    Attributes
    • component string
    • targetSizeForCompactionMB int
    • minSizeForCompactionMB int
    • minArity int
    Name Type Default value Description
    component string -
    targetSizeForCompactionMB int 200 Targeted size for a compacted slot.
    minSizeForCompactionMB int 50 Minimum size required to compact.
    minArity int 2 Minimum number of slots required to compact.

    Nested elements
    • FullCompactPolicy
    Type Wrapper Description
    FullCompactPolicy -
    XML representation
    <SlotsSizeBasedCompactPolicy targetSizeForCompactionMB="200" minSizeForCompactionMB="50" minArity="2" ></SlotsSizeBasedCompactPolicy>
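    One possible reading of the compaction condition above is sketched below. It is an interpretation, not CloudView code: the function name is hypothetical, and it assumes that "size" refers to the cumulative size of the N consecutive slots.

```python
from typing import Optional, Sequence


def should_compact_run(run_sizes_mb: Sequence[int],
                       next_slot_size_mb: Optional[int],
                       target_size_mb: int = 200,
                       min_size_mb: int = 50,
                       min_arity: int = 2) -> bool:
    """Decide whether a run of N consecutive slots should be compacted."""
    total = sum(run_sizes_mb)  # assumption: cumulative size of the run
    # The rule only applies to runs below the target size with enough slots.
    if len(run_sizes_mb) < min_arity or total >= target_size_mb:
        return False
    # Adding the (N+1)th slot would push the run above the target size.
    would_overflow = (next_slot_size_mb is not None
                      and total + next_slot_size_mb > target_size_mb)
    return would_overflow or total > min_size_mb


if __name__ == "__main__":
    print(should_compact_run([20, 20], 10))   # too small, next slot still fits
    print(should_compact_run([30, 30], 10))   # above minSizeForCompactionMB
    print(should_compact_run([20, 20], 180))  # next slot would exceed the target
```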
  • SlotsLogSizeBasedCompactPolicy com.exalead.mercury.mami.indexing.v10.SlotsLogSizeBasedCompactPolicy
    A CompactPolicy that tries to compact slots into levels of exponentially increasing size, where each level has fewer slots than the value of the compact factor.

    Whenever extra slots (beyond the compact factor upper bound) are encountered, all slots within the level are compacted.
    Parent objects
    • CompactPolicies
    Attributes
    • component string
    • compactFactor int
    • minSize long
    • maxSize long
    Name Type Default value Description
    component string -
    compactFactor int 10 Determines how often slots are compacted.

    With smaller values, less RAM is used while indexing, and searches on unoptimized indices are faster, but indexing speed is slower.

    With larger values, more RAM is used during indexing, and while searches on unoptimized indices are slower, indexing is faster.

    Thus larger values (greater than 10) are best for batch index creation, and smaller values (lower than 10) for indices that are interactively maintained.
    minSize long 1048576 Sets the minimum size for the lowest-level slots.

    Slots below this size are considered to be on the same level (even if they vary drastically in size) and will be merged whenever there are compactFactor of them.

    This effectively truncates the "long tail" of small slots that would otherwise be created into a single level.

    If you set this too large, it can greatly increase the merging cost during indexing (if you flush many small slots).
    maxSize long 9223372036854775807 Sets the size of the largest slot that may be merged with other slots.

    Nested elements
    • FullCompactPolicy
    Type Wrapper Description
    FullCompactPolicy -
    XML representation
    <SlotsLogSizeBasedCompactPolicy compactFactor="10" minSize="1048576" maxSize="9223372036854775807" ></SlotsLogSizeBasedCompactPolicy>
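    The idea of levels of exponentially increasing size can be sketched as below. This is a simplified illustration, not the actual CloudView algorithm: the function names are hypothetical, and both the level boundaries (minSize multiplied by successive powers of compactFactor) and the handling of maxSize are assumptions.

```python
from collections import defaultdict
from typing import Dict, List, Optional, Sequence


def slot_level(size: int, compact_factor: int = 10, min_size: int = 1048576) -> int:
    """Level 0 holds every slot smaller than min_size * compact_factor."""
    if compact_factor < 2:
        raise ValueError("compact_factor must be at least 2")
    level = 0
    upper_bound = max(min_size, 1) * compact_factor
    # Integer arithmetic avoids floating-point rounding in logarithms.
    while size >= upper_bound:
        upper_bound *= compact_factor
        level += 1
    return level


def levels_to_compact(slot_sizes: Sequence[int], compact_factor: int = 10,
                      min_size: int = 1048576,
                      max_size: Optional[int] = None) -> Dict[int, List[int]]:
    """Return the levels that hold at least compact_factor slots."""
    by_level: Dict[int, List[int]] = defaultdict(list)
    for size in slot_sizes:
        if max_size is not None and size > max_size:
            continue  # assumption: slots above maxSize are never compacted
        by_level[slot_level(size, compact_factor, min_size)].append(size)
    return {lvl: sizes for lvl, sizes in by_level.items()
            if len(sizes) >= compact_factor}


if __name__ == "__main__":
    print(slot_level(500), slot_level(10485760), slot_level(104857600))
    print(levels_to_compact([1000, 2000, 3000], compact_factor=3))
```

    In this sketch every slot below minSize lands on level 0 regardless of its exact size, which mirrors the "long tail" truncation described for minSize.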
  • LowLatencyCompactPolicy com.exalead.mercury.mami.indexing.v10.LowLatencyCompactPolicy
    Compacts when the size of all small slots is above the average large slot size, or when the number of slots is above nbLargeSlots + maxNbSmallSlots.
    Parent objects
    • CompactPolicies
    Attributes
    • component string
    • nbLargeSlots int
    • maxNbSmallSlots int
    • gatherSmallsAtTheEnd boolean
    • contiguousCompact boolean
    Name Type Default value Description
    component string -
    nbLargeSlots int 8 The number of large slots to keep.
    maxNbSmallSlots int 8 Maximum number of small slots allowed. As soon as this limit is reached, small slots are compacted together.
    gatherSmallsAtTheEnd boolean True
    contiguousCompact boolean False

    Nested elements
    • FullCompactPolicy
    Type Wrapper Description
    FullCompactPolicy -
    XML representation
    <LowLatencyCompactPolicy nbLargeSlots="8" maxNbSmallSlots="8" gatherSmallsAtTheEnd="true" contiguousCompact="false" ></LowLatencyCompactPolicy>
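    The two triggers named in the description can be sketched as below. This is an interpretation, not CloudView code: the function name is hypothetical, and treating the nbLargeSlots biggest slots as the "large" ones is an assumption.

```python
from typing import Sequence


def low_latency_compact_due(slot_sizes: Sequence[int],
                            nb_large_slots: int = 8,
                            max_nb_small_slots: int = 8) -> bool:
    """True when either documented trigger of the policy is met."""
    ordered = sorted(slot_sizes, reverse=True)
    # Assumption: the nb_large_slots biggest slots are the "large" slots.
    large, small = ordered[:nb_large_slots], ordered[nb_large_slots:]
    # Trigger 1: too many slots overall.
    if len(ordered) > nb_large_slots + max_nb_small_slots:
        return True
    if not large or not small:
        return False
    # Trigger 2: the small slots together outweigh the average large slot.
    average_large = sum(large) / len(large)
    return sum(small) > average_large


if __name__ == "__main__":
    print(low_latency_compact_due([100, 100, 10, 10], 2, 4))  # small slots too light
    print(low_latency_compact_due([100, 100, 60, 60], 2, 4))  # small slots outweigh average
```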
  • NoCompactPolicy com.exalead.mercury.mami.indexing.v10.NoCompactPolicy
    Compact policy that does not perform any compact.
    Parent objects
    • CompactPolicies
    Attributes
    • component string
    Name Type Default value Description
    component string -

    Nested elements
    • FullCompactPolicy
    Type Wrapper Description
    FullCompactPolicy -
    XML representation
    <NoCompactPolicy ></NoCompactPolicy>
  • StandardUploadPolicy com.exalead.mercury.mami.indexing.v10.StandardUploadPolicy
    Default upload policy
    Parent objects
    • StandardIndexManagementPolicy
    Attributes
    • waitBetweenSwitchesS int
    Name Type Default value Description
    waitBetweenSwitchesS int - If strictly positive, all slices switch to a generation sequentially, and we wait this time in seconds between two slices. This spreads the temporary memory consumption to avoid large memory spikes and swap out.

    XML representation
    <StandardUploadPolicy waitBetweenSwitchesS="" ></StandardUploadPolicy>
  • WriteAttributeSlotConfig com.exalead.mercury.mami.indexing.v10.WriteAttributeSlotConfig
    Write attribute slot configuration
    Parent objects
    • IndexingConfig
    Attributes
    • type enum(directio, sequential)
    • groupId int
    Name Type Default value Description
    type enum(directio, sequential) directio Access type for writing the new slots.
    Value can be null or one of
    • directio
    • sequential
    groupId int - Specifies which attribute group store this access configuration applies to.

    XML representation
    <WriteAttributeSlotConfig type="directio" groupId="" ></WriteAttributeSlotConfig>
  • WriteSlotConfig com.exalead.mercury.mami.indexing.v10.WriteSlotConfig
    Write slot configuration
    Parent objects
    • IndexingConfig
    Attributes
    • type enum(directio, sequential)
    Name Type Default value Description
    type enum(directio, sequential) sequential Access type for writing the new slots.
    Value can be null or one of
    • directio
    • sequential

    XML representation
    <WriteSlotConfig type="sequential" ></WriteSlotConfig>
  • IndexRuntimeConfigList com.exalead.mercury.mami.indexing.v10.IndexRuntimeConfigList
    Lists all index runtime configurations.
    Attributes
    • version long
    Name Type Default value Description
    version long - -

    Nested elements
    • CacheConfig*
    • IndexRuntimeConfig*
    Type Wrapper Description
    CacheConfig* - Lists PageCache configurations
    IndexRuntimeConfig* - Lists runtime configurations
    XML representation
    <IndexRuntimeConfigList version="" ></IndexRuntimeConfigList>
  • CacheConfig com.exalead.mercury.mami.indexing.v10.CacheConfig
    PageCache configuration. Warning: The index page cache is limited to 32000 files in the index directory. If you get an error like "FileRAM: too many cached files (c_max_files=32767)", it means that the limit has been crossed and you should set a more aggressive compact policy.
    Parent objects
    • IndexRuntimeConfigList
    Attributes
    • name string
    • cacheSizeMB int
    • pageSizeKB int
    • maxSimultaneousIOOperations int
    Name Type Default value Description
    name string - The cache ID.
    cacheSizeMB int 256 Maximum cache size in MB.
    pageSizeKB int 8 Page size in KB.
    maxSimultaneousIOOperations int 32 Specifies the max number of simultaneous I/O.

    XML representation
    <CacheConfig name="" cacheSizeMB="256" pageSizeKB="8" maxSimultaneousIOOperations="32" ></CacheConfig>
  • IndexRuntimeConfig com.exalead.mercury.mami.indexing.v10.IndexRuntimeConfig
    Index runtime configuration for an instance of an index slice.
    Use key-value arguments to provide custom configuration keys.
    Parent objects
    • IndexRuntimeConfigList
    Attributes
    • name string
    • newGenerationBandwidthLimitKB int
    • compactBandwidthLimitKB int
    • ramBasedAttrGroupLoadPolicy enum(rebuild, copyAndPatch)
    Name Type Default value Description
    name string -
    newGenerationBandwidthLimitKB int 0
    compactBandwidthLimitKB int 0
    ramBasedAttrGroupLoadPolicy enum(rebuild, copyAndPatch) copyAndPatch Value can be one of
    • rebuild
    • copyAndPatch

    Nested elements
    • AttributeGroupAccess*
    • FieldRuntimeConfig*
    • QueryAutocacheConfig
    • ReplicationConfig
    • WarmupConfig
    Type Wrapper Description
    AttributeGroupAccess* -
    FieldRuntimeConfig* -
    QueryAutocacheConfig -
    ReplicationConfig -
    WarmupConfig -
    XML representation
    <IndexRuntimeConfig name="" newGenerationBandwidthLimitKB="0" compactBandwidthLimitKB="0" ramBasedAttrGroupLoadPolicy="copyAndPatch" ></IndexRuntimeConfig>
  • AttributeGroupAccess com.exalead.mercury.mami.indexing.v10.AttributeGroupAccess
    Configuration specifying how to access the attribute group at runtime.
    Parent objects
    • IndexRuntimeConfig
    Attributes
    • groupId string
    • runType enum(mmap, pagecache, direct, RAMRow, RAMColumnDense)
    • preload boolean
    • mlock boolean
    • cacheId string
    Name Type Default value Description
    groupId string - Specifies which attribute group store this access configuration applies to.
    runType enum(mmap, pagecache, direct, RAMRow, RAMColumnDense) mmap Specifies how the attribute group should be accessed at runtime.
    preload boolean False For RAM-based access type, specifies if the attribute group should be loaded in RAM at startup instead of at access time.
    mlock boolean False For RAM-based access type, specifies if the attribute group should be locked in RAM, preventing it from being moved to the swap area.
    cacheId string - For pagecache I/O type, specifies the cache ID.

    XML representation
    <AttributeGroupAccess groupId="" runType="mmap" preload="false" mlock="false" cacheId="" ></AttributeGroupAccess>
  • FieldRuntimeConfig com.exalead.mercury.mami.indexing.v10.FieldRuntimeConfig
    Configuration specifying the index field at runtime.
    Parent objects
    • IndexRuntimeConfig
    Attributes
    • name string
    • dictType enum(mmap, pagecache)
    • type enum(mmap, pagecache)
    • preload boolean
    • mlock boolean
    • cacheId string
    Name Type Default value Description
    name string - The index field name.
    dictType enum(mmap, pagecache) mmap Specifies the I/O mode used to load the dictionary part of an index field.
    Value can be one of
    • mmap
    • pagecache
    type enum(mmap, pagecache) mmap Specifies the I/O mode used to load the component.
    Value can be one of
    • mmap
    • pagecache
    preload boolean False Specifies whether the field should be preloaded, which forces it to be loaded into RAM at startup.
    mlock boolean False Specifies whether the field should be locked in RAM.
    cacheId string - For pagecache I/O mode, specifies the cache ID.

    XML representation
    <FieldRuntimeConfig name="" dictType="mmap" type="mmap" preload="false" mlock="false" cacheId="" ></FieldRuntimeConfig>
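    A minimal sketch of two field runtime configurations, assuming hypothetical field names "text" and "title" and a hypothetical cache ID "cache0". The first field is memory-mapped, preloaded and locked; the second keeps its dictionary memory-mapped while the rest of the field goes through the page cache.

    ```xml
    <IndexRuntimeConfig name="rt0">
      <!-- Field loaded into RAM at startup and locked there -->
      <FieldRuntimeConfig name="text" dictType="mmap" type="mmap" preload="true" mlock="true" />
      <!-- Dictionary via mmap, component via pagecache with an explicit cache ID -->
      <FieldRuntimeConfig name="title" dictType="mmap" type="pagecache" cacheId="cache0" />
    </IndexRuntimeConfig>
    ```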
  • QueryAutocacheConfig com.exalead.mercury.mami.indexing.v10.QueryAutocacheConfig
    Query #autocache configuration.
    Parent objects
    • IndexRuntimeConfig
    Attributes
    • totalCacheSizeMB int
    • queryCacheSizeMB int
    • maxCachedQueries int
    Name Type Default value Description
    totalCacheSizeMB int 20 Maximum total cache size in MB, shared across all cached queries.
    queryCacheSizeMB int 5 Maximum size of a single cached query, in MB.
    maxCachedQueries int 20 Maximum number of cached queries.

    XML representation
    <QueryAutocacheConfig totalCacheSizeMB="20" queryCacheSizeMB="5" maxCachedQueries="20" ></QueryAutocacheConfig>
  • ReplicationConfig com.exalead.mercury.mami.indexing.v10.ReplicationConfig
    Slice replication configuration
    Nested elements
    • AttributeReplicationConfig*
    • FieldReplicationConfig*
    Type Wrapper Description
    AttributeReplicationConfig* - Configures the direct I/O usage in attribute replication.
    FieldReplicationConfig* - Configures the direct I/O usage in field replication.
    XML representation
    <ReplicationConfig ></ReplicationConfig>
  • AttributeReplicationConfig com.exalead.mercury.mami.indexing.v10.AttributeReplicationConfig
    Attribute's replication configuration
    Parent objects
    • ReplicationConfig
    Attributes
    • groupId string
    • type enum(directio, sequential)
    Name Type Default value Description
    groupId string - Group id of the attribute to configure
    type enum(directio, sequential) directio Access type
    Value can be null or one of
    • directio
    • sequential

    XML representation
    <AttributeReplicationConfig type="directio" ></AttributeReplicationConfig>
  • FieldReplicationConfig com.exalead.mercury.mami.indexing.v10.FieldReplicationConfig
    Index field replication configuration
    Parent objects
    • ReplicationConfig
    Attributes
    • name string
    • type enum(directio, sequential)
    • dictType enum(directio, sequential)
    Name Type Default value Description
    name string - Name of the field to configure.
    type enum(directio, sequential) directio Access type
    Value can be null or one of
    • directio
    • sequential
    dictType enum(directio, sequential) directio Access type for the dictionary
    Value can be null or one of
    • directio
    • sequential

    XML representation
    <FieldReplicationConfig type="directio" dictType="directio" ></FieldReplicationConfig>
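    For illustration, a replication configuration could combine both nested elements as sketched below. The group ID "ag_prices" and the field name "text" are hypothetical, and the choice of access types is arbitrary rather than a recommendation.

    ```xml
    <ReplicationConfig>
      <!-- Attribute group replicated with sequential access -->
      <AttributeReplicationConfig groupId="ag_prices" type="sequential" />
      <!-- Field replicated with directio, its dictionary with sequential access -->
      <FieldReplicationConfig name="text" type="directio" dictType="sequential" />
    </ReplicationConfig>
    ```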
  • WarmupConfig com.exalead.mercury.mami.indexing.v10.WarmupConfig
    Index warmup configuration
    Parent objects
    • IndexRuntimeConfig
    Attributes
    • warmupQueryFile string
    • maxWarmupDurationS int
    Name Type Default value Description
    warmupQueryFile string - File containing the list of single queries used for warmup.
    maxWarmupDurationS int 5 Maximum warmup time, in seconds. Once this time has elapsed, the index is opened and a warning is printed indicating which line number of the query file was reached.

    XML representation
    <WarmupConfig warmupQueryFile="" maxWarmupDurationS="5" ></WarmupConfig>
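    A minimal sketch of a warmup configuration that allows more time than the default. The file path is hypothetical, and the assumption that the file holds one query per line is inferred from the line-number warning described above.

    ```xml
    <IndexRuntimeConfig name="rt0">
      <!-- Replay queries from the file for at most 30 seconds before opening the index -->
      <WarmupConfig warmupQueryFile="/path/to/warmup-queries.txt" maxWarmupDurationS="30" />
    </IndexRuntimeConfig>
    ```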
  • BuildGroupConfig com.exalead.mercury.mami.deploy.v10.BuildGroupConfig
    Configuration of a build group.

    A "Build Group" is defined by references to sub-configurations defined in other MAMI:
    • Analysis (how documents are processed).
    • Index Builder (how indexing jobs are scheduled and managed).
    • Index Schema (schema of the index slices being built).
    • Task Queue (how input document processing tasks are queued before jobs).
    • Similar Document (optional)

    Several build groups may share some or all their sub-configuration.
    In most configurations, all build groups share the same index schema configuration.

    When built with the same schema, index slices built by different build groups can be queried together (see the Search MAMI).
    Attributes
    • buildGroup string
    • dataModel string
    • indexingConfig string
    Name Type Default value Description
    buildGroup string - Name of the build group. This name should be unique.
    dataModel string - Name of the data model.
    indexingConfig string - Name of an indexing configuration (IndexingConfig element in Indexing MAMI).

    Nested elements
    • DIHConfig
    • DidAllocationPolicy
    • DocumentCacheConfig
    • PrecomputedThumbnailsConfig
    • ScratchHook*
    • SlicePartioningPolicy
    Type Wrapper Description
    DIHConfig -
    DidAllocationPolicy -
    DocumentCacheConfig -
    PrecomputedThumbnailsConfig -
    ScratchHook* -
    SlicePartioningPolicy -
    XML representation
    <BuildGroupConfig buildGroup="" dataModel="" indexingConfig="" ></BuildGroupConfig>
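    As a sketch, a build group that overrides a few DIH and document cache settings might look like the following. The names "bg0", "dm0" and "ic0" are hypothetical placeholders for an existing build group, data model and indexing configuration; the modes and compression algorithms are picked from the documented enumerations for illustration only.

    ```xml
    <BuildGroupConfig buildGroup="bg0" dataModel="dm0" indexingConfig="ic0">
      <!-- DIH: memory-mapped lookups, sequential reads for enumeration and compact -->
      <DIHConfig compactArity="4" readMode="mmap" enumMode="sequential" compactMode="sequential" />
      <!-- Document cache: lz4 for both on-disk data and temporary files -->
      <DocumentCacheConfig diskCompressionAlgorithm="lz4" temporaryFilesCompressionAlgorithm="lz4" />
    </BuildGroupConfig>
    ```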
  • DIHConfig com.exalead.mercury.mami.deploy.v10.DIHConfig
    A DIHConfig is a set of parameters for a DIH.
    Parent objects
    • BuildGroupConfig
    Attributes
    • compactArity int
    • nbBloomBitsPerElement int
    • nbElementsInLeaf int
    • readMode enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential)
    • enumMode enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential)
    • compactMode enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential)
    Name Type Default value Description
    compactArity int 4 Number of consecutive slots to trigger a compact.
    nbBloomBitsPerElement int 20 Number of bits per element in the DIH's StrBTree's bloom filter.
    nbElementsInLeaf int 100 Number of entries in each of the DIH's StrBTree's leaves.
    readMode enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential) mmap Read mode of the DIH's StrBTree, except for enumeration.
    Value can be null or one of
    • auto
    • direct
    • mmap
    • mmap_mlock
    • mmap_mload
    • pagecache
    • random
    • sequential
    enumMode enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential) mmap Read mode of the DIH's StrBTree, for enumeration.
    Value can be null or one of
    • auto
    • direct
    • mmap
    • mmap_mlock
    • mmap_mload
    • pagecache
    • random
    • sequential
    compactMode enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential) mmap Read mode of the DIH's StrBTree, for compact.
    Value can be null or one of
    • auto
    • direct
    • mmap
    • mmap_mlock
    • mmap_mload
    • pagecache
    • random
    • sequential

    XML representation
    <DIHConfig compactArity="4" nbBloomBitsPerElement="20" nbElementsInLeaf="100" readMode="mmap" enumMode="mmap" compactMode="mmap" ></DIHConfig>
  • ContiguousDidAllocationPolicy com.exalead.mercury.mami.deploy.v10.ContiguousDidAllocationPolicy
    Base-class specifying how DIDs (Document IDs) are assigned to the documents.
    Parent objects
    • BuildGroupConfig
    Attributes
    • increasing boolean
    • startingPoint int
    • endingPoint nullableint
    Name Type Default value Description
    increasing boolean True Assign DIDs in an increasing order.
    startingPoint int 0 Start point of the allocation. By default, the first DID will have value '1'.
    endingPoint nullableint - End point of the allocation. By default, it will be Integer.MAX_VALUE if increasing or 1 if decreasing.

    XML representation
    <ContiguousDidAllocationPolicy increasing="true" startingPoint="0" endingPoint="" ></ContiguousDidAllocationPolicy>
  • DocumentCacheConfig com.exalead.mercury.mami.deploy.v10.DocumentCacheConfig
    Configuration for the document cache.
    Parent objects
    • BuildGroupConfig
    Attributes
    • path string
    • compactArity int
    • nbBloomBitsPerElement int
    • nbElementsInLeaf int
    • readMode enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential)
    • enumMode enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential)
    • compactMode enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential)
    • diskCompressionAlgorithm enum(none, fastlz, gzip, lcs, lz4)
    • temporaryFilesCompressionAlgorithm enum(none, fastlz, gzip, lz4)
    Name Type Default value Description
    path string - Location of the document cache on the filesystem.
    Unless otherwise specified, the document cache is located in the "cache" subdirectory of the build group.
    compactArity int 4 Number of consecutive slots to trigger a compact.
    nbBloomBitsPerElement int 10 Number of bits per element in the document cache StrBTree bloom filter.
    nbElementsInLeaf int 20 Number of entries in each of the document cache StrBTree leaves.
    readMode enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential) auto Read mode of the document cache StrBTree, except for enumeration.
    Value can be null or one of
    • auto
    • direct
    • mmap
    • mmap_mlock
    • mmap_mload
    • pagecache
    • random
    • sequential
    enumMode enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential) auto Read mode of the document cache StrBTree, for enumeration.
    Value can be null or one of
    • auto
    • direct
    • mmap
    • mmap_mlock
    • mmap_mload
    • pagecache
    • random
    • sequential
    compactMode enum(auto, direct, mmap, mmap_mlock, mmap_mload, pagecache, random, sequential) auto Read mode of the document cache StrBTree, for compact.
    Value can be null or one of
    • auto
    • direct
    • mmap
    • mmap_mlock
    • mmap_mload
    • pagecache
    • random
    • sequential
    diskCompressionAlgorithm enum(none, fastlz, gzip, lcs, lz4) fastlz Algorithm to compress the document cache on disk.
    Value can be null or one of
    • none
    • fastlz
    • gzip
    • lcs
    • lz4
    temporaryFilesCompressionAlgorithm enum(none, fastlz, gzip, lz4) fastlz Algorithm to compress the temporary files on disk.
    Value can be null or one of
    • none
    • fastlz
    • gzip
    • lz4

    XML representation
    <DocumentCacheConfig path="" compactArity="4" nbBloomBitsPerElement="10" nbElementsInLeaf="20" readMode="auto" enumMode="auto" compactMode="auto" diskCompressionAlgorithm="fastlz" temporaryFilesCompressionAlgorithm="fastlz" ></DocumentCacheConfig>
  • PrecomputedThumbnailsConfig com.exalead.mercury.mami.deploy.v10.PrecomputedThumbnailsConfig
    No documentation for this element.
    Parent objects
    • BuildGroupConfig
    Attributes
    • computeThreads int
    Name Type Default value Description
    computeThreads int 4

    XML representation
    <PrecomputedThumbnailsConfig computeThreads="4" ></PrecomputedThumbnailsConfig>
  • FSPrecomputedThumbnailsConfig (deprecated) com.exalead.mercury.mami.deploy.v10.FSPrecomputedThumbnailsConfig
    No documentation for this element.
    Parent objects
    • BuildGroupConfig
    Attributes
    • computeThreads int
    Name Type Default value Description
    computeThreads int 4

    XML representation
    <FSPrecomputedThumbnailsConfig computeThreads="4" ></FSPrecomputedThumbnailsConfig>
  • GDSPrecomputedThumbnailsConfig (deprecated) com.exalead.mercury.mami.deploy.v10.GDSPrecomputedThumbnailsConfig
    No documentation for this element.
    Parent objects
    • BuildGroupConfig
    Attributes
    • computeThreads int
    • ramBufferSizeMB long
    • readMode enum(normal, direct)
    Name Type Default value Description
    computeThreads int 4
    ramBufferSizeMB long 16
    readMode enum(normal, direct) direct Value can be null or one of
    • normal
    • direct

    XML representation
    <GDSPrecomputedThumbnailsConfig computeThreads="4" ramBufferSizeMB="16" readMode="direct" ></GDSPrecomputedThumbnailsConfig>
  • ScratchHook com.exalead.mercury.mami.deploy.v10.ScratchHook
    A Hook to plug custom exa code on BuildGroup scratch
    Parent objects
    • BuildGroupConfig
    Attributes
    • classId string
    Name Type Default value Description
    classId string - The specified class must implement the com.exalead.mercury.indexing.CustomScratchHook Exascript interface.

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* -
    XML representation
    <ScratchHook classId="" ></ScratchHook>
  • BasicSlicePartioningPolicy com.exalead.mercury.mami.deploy.v10.BasicSlicePartioningPolicy
    Basic partitioning function based on a URL hash and a '%' (modulo).
    XML representation
    <BasicSlicePartioningPolicy ></BasicSlicePartioningPolicy>
  • SearchLogic com.exalead.mercury.mami.search.v20.SearchLogic
    Controls how searches are processed.
    Attributes
    • name string
    • tokenizationConfig string
    • dataModel string
    • customQueryProcessingFactoryClassId string
    • fetchMasks boolean
    • sliceDownAction enum(error, ignore)
    • sliceDownReportMode enum(quiet, local, global)
    • searchTimeoutAction enum(error, ignore)
    • searchTimeoutReportMode enum(quiet, local, global)
    • searchLimitAction enum(error, ignore)
    • searchLimitReportMode enum(quiet, local, global)
    • retrieveSourceField boolean
    • handleNegativeSecurityTokens boolean
    Name Type Default value Description
    name string - Name of the search logic.
    This name is referred to with the 'logic' parameter in the Search API.
    tokenizationConfig string - Name of the tokenization configuration to use when using this search logic.

    For consistency, this should be the same tokenization configuration as in the dictionary and analysis configurations.
    dataModel string - Name of the data model this search logic targets.

    A Search Logic can only target one data model.
    customQueryProcessingFactoryClassId string - Java class implementing the QueryProcessingRunnerFactory interface to build the thread-specific QueryProcessingRunners (which expand the actual query).
    fetchMasks boolean False Fetches masks in searcher. Required to get the "mask" field in the hits.
    sliceDownAction enum(error, ignore) ignore What to do if one of the slices is down when the search is performed.

    A slice is considered as down if all replicas are down (and if the master is also down in the default search target configuration).
    One of "error" or "ignore". Default is "ignore".
    sliceDownReportMode enum(quiet, local, global) local Specifies how to report a down slice error in 'Troubleshooting > Logs':
    • quiet: No log.
    • local: Error is reported in the Process log.
    • global: Error is reported in both Process and Global logs.
    searchTimeoutAction enum(error, ignore) ignore What to do when slices time out.
    One of "error" or "ignore". Default is "ignore".
    searchTimeoutReportMode enum(quiet, local, global) local Specifies how to report a timeout error in 'Troubleshooting > Logs':
    • quiet: No log.
    • local: Error is reported in the Process log.
    • global: Error is reported in both Process and Global logs.
    searchLimitAction enum(error, ignore) ignore What to do if a search limit is reached.
    One of "error" or "ignore". Default is "ignore".
    searchLimitReportMode enum(quiet, local, global) local Specifies how to report a search limit error in 'Troubleshooting > Logs':
    • quiet: No log.
    • local: Error is reported in the Process log.
    • global: Error is reported in both Process and Global logs.
    retrieveSourceField boolean True Specifies whether the "source" field is automatically added to the retrieved fields.
    handleNegativeSecurityTokens boolean True When enabled, the security restriction treats document security tokens starting with ~ as negative tokens.
    Negative security tokens take priority over standard tokens.
    The query restriction generated for a query with the tokens tok1, tok2, tok3 is ((tok1 OR tok2 OR tok3) AND NOT ~tok1 AND NOT ~tok2 AND NOT ~tok3).
    If the security sources return many security tokens and the connectors do not push negative security tokens, disabling negative security token handling can speed up queries (the security query restriction is then (tok1 OR tok2 OR tok3)).

    Nested elements
    • ContentRestriction
    • FacetsConfig
    • HitConfig
    • LimitsConfig
    • PostFilterConfig
    • QueryRestrictionConfig
    • RelatedTermsSynthesisConfig
    • RelevanceConfig
    • Sort
    • SpellCheckerConfig
    • TimezoneConfig
    • UQLConfig
    • VirtualFieldDefinition*
    Type Wrapper Description
    ContentRestriction - Content restriction configuration for this search logic.
    FacetsConfig - Facets configuration for this search logic.
    HitConfig - Hit content configuration for this search logic.
    LimitsConfig - Limits configuration for this search logic.
    PostFilterConfig - Post filtering configuration for this search logic.
    QueryRestrictionConfig - Query restriction configuration for this search logic.
    RelatedTermsSynthesisConfig - Related terms configuration for this search logic.
    RelevanceConfig - Relevance configuration for this search logic.
    Sort - Sort configuration for this search logic.
    SpellCheckerConfig - Spell checker configuration for this search logic.
    TimezoneConfig -
    UQLConfig - UQL configuration for this search logic.
    VirtualFieldDefinition* - Virtual fields configuration for this search logic.
    XML representation
    <SearchLogic name="" tokenizationConfig="tok0" dataModel="" customQueryProcessingFactoryClassId="" fetchMasks="false" sliceDownAction="ignore" sliceDownReportMode="local" searchTimeoutAction="ignore" searchTimeoutReportMode="local" searchLimitAction="ignore" searchLimitReportMode="local" retrieveSourceField="true" handleNegativeSecurityTokens="true" ></SearchLogic>
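    The following sketch shows a stricter search logic that fails the search when a slice is down or times out, and that disables negative security token handling. The names "sl_strict" and "dm0" are hypothetical; all other attributes are assumed to keep their defaults.

    ```xml
    <SearchLogic name="sl_strict" tokenizationConfig="tok0" dataModel="dm0"
                 sliceDownAction="error" sliceDownReportMode="global"
                 searchTimeoutAction="error" searchTimeoutReportMode="local"
                 handleNegativeSecurityTokens="false">
      <!-- With handleNegativeSecurityTokens="false", the restriction for tokens
           tok1, tok2, tok3 is (tok1 OR tok2 OR tok3), without the
           AND NOT ~tokN clauses. -->
    </SearchLogic>
    ```

    Disabling negative tokens is only appropriate when no connector pushes tokens starting with ~, as described for handleNegativeSecurityTokens.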
  • ContentRestriction com.exalead.mercury.mami.search.v20.ContentRestriction
    No documentation for this element.
    Parent objects
    • SearchLogic
    Attributes
    • queryString string
    Name Type Default value Description
    queryString string - Query string to restrict content.

    Only the documents matching this query string will be taken into consideration.

    The query string is not subject to any kind of linguistic expansion whatsoever.
    This query string is AND'ed with the user's actual query.

    XML representation
    <ContentRestriction queryString="" ></ContentRestriction>
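    A minimal sketch of a content restriction nested in a search logic. The search logic name, the data model name and the query string are hypothetical; in particular, "language:en" only stands in for whatever query syntax and prefixes your configuration accepts.

    ```xml
    <SearchLogic name="sl_english" tokenizationConfig="tok0" dataModel="dm0">
      <!-- Only documents matching this query string are considered;
           it is AND'ed with the user's query and is not linguistically expanded -->
      <ContentRestriction queryString="language:en" />
    </SearchLogic>
    ```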
  • FacetsConfig com.exalead.search.v30.FacetsConfig
    Configuration for all facets calculated at search time.
    Parent objects
    • SearchLogic
    Attributes
    • maxHits int
    • enabled boolean
    • optimizedForExhaustiveRankedSynthesis boolean
    • categoryArrayBufferSize int
    Name Type Default value Description
    maxHits int - Number of hits on which category navigation will be computed.

    0 means exhaustive navigation, within the limits of the matched hits.

    Note: Heapsort executor is used if this setting is defined in ranked mode. If not in ranked mode, this setting is applied per slice.
    enabled boolean True Enables faceting.
    optimizedForExhaustiveRankedSynthesis boolean True When search is running in "heap" mode, it locally sorts the DIDs before calculating facets.

    This significantly decreases last-level cache (LLC) misses.
    categoryArrayBufferSize int 1000 Size of the result blocks sent to the searchserver.

    This is an optimization parameter.

    Lower values slightly decrease memory consumption when there is a large number of slices, but increase global CPU consumption.

    Nested elements
    • Facet*
    Type Wrapper Description
    Facet* - Definition of a facet.
    XML representation
    <FacetsConfig maxHits="" enabled="true" optimizedForExhaustiveRankedSynthesis="true" categoryArrayBufferSize="1000" ></FacetsConfig>
  • CategoryFacet com.exalead.search.v30.CategoryFacet
    Configuration for a facet.
    Parent objects
    • FacetsConfig
    Attributes
    • id string
    • type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
    • refinementPolicy enum(exclusive, disjunctive, norefine)
    • implementation enum(autodetect, favor_cpu, favor_memory)
    • inNavigation boolean
    • inHits boolean
    • maxSliceCategories int
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • root string
    • maxCategoryTreeDepth int
    • maxCategoryTreeDepthFromRoot int
    • categoryField string
    • form enum(exact, lowercase, normalized)
    • hitsMaxCategories int
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    • index int
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    id string - Unique identifier of this facet.

    Used in Mashup-Builder and client applications to identify the facet.
    type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue) Used for convenience in various introspection code
    refinementPolicy
    (UI: Refinement policy)
    enum(exclusive, disjunctive, norefine) exclusive Defines the Refinements panel behavior.
    • Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected.
    • Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected.
    • No refine: Allows you to display all categories without enabling refinements.
    implementation
    (UI: Implementation)
    enum(autodetect, favor_cpu, favor_memory) autodetect Internal algorithm for implementation of this facet.

    Value can be null or one of
    • autodetect
    • favor_cpu
    • favor_memory
    • "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index).
    • "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set).
    • "autodetect" automatically selects a suitable policy based on the result set and facet data.
    inNavigation
    (UI: Use for navigation)
    boolean True Computes navigation on this facet.
    inHits
    (UI: Use for hit content)
    boolean True Computes the value of this facet for each hit.
    maxSliceCategories
    (UI: Max categories per slices)
    int 0 Maximum number of categories returned by each slice for one facet.
    0 means no limit.

    Warning: Results merged by the search server may be incorrect.
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories in decreasing order of the number of documents matching the query and having this facet.
    • Date: Sorts the categories by:
      • Default with decreasing year, increasing month, increasing day
      • Reverse with increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number, it parses the prefix. In case of failure, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a weighting method similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    root
    (UI: Root)
    string - In the categories field, category values are stored in a tree. The root of the tree is called "Top".
    maxCategoryTreeDepth
    (UI: Max. tree depth from current refinement)
    int 0 Maximum depth of the returned category tree.

    Whatever the level of refinement, no more levels than this are ever retrieved (this ensures that maxCategoryTreeDepthFromRoot is not exceeded).
    maxCategoryTreeDepthFromRoot
    (UI: Max. tree depth from root)
    int 0 Maximum depth of the returned category tree, starting from the global root of the category.

    Setting this attribute to N means that no categories will be kept below this level.
    categoryField
    (UI: Index field)
    string - Specifies from which field the categories should be retrieved.
    form
    (UI: Form)
    enum(exact, lowercase, normalized) normalized Normalization form for the category path.
    Value can be one of
    • exact
    • lowercase
    • normalized

    This must match the form with which the category has been mapped at indexing time.
    hitsMaxCategories
    (UI: Max. categories in hits)
    int 100 Maximum number of facet values displayed in hits.
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.
    index int - Internal.
    dataModelState string -
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

    Nested elements
    • fromDataModelFacet
    • AdditionalFacetView*
    • AggregationFunction*
    Type Wrapper Description
    Facet fromDataModel
    AdditionalFacetView* -
    AggregationFunction* - A numerical computation to perform on hits matching each value of the facet.
    XML representation
    <CategoryFacet id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" root="" maxCategoryTreeDepth="0" maxCategoryTreeDepthFromRoot="0" categoryField="" form="normalized" hitsMaxCategories="100" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></CategoryFacet>
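    As an illustration of the refinement and sorting attributes, the sketch below declares two category facets. The facet IDs, the "categories" field name and the root paths under "Top" are hypothetical, and attributes left out (including type) are assumed to keep their default values.

    ```xml
    <FacetsConfig enabled="true">
      <!-- Multi-select facet, sorted lexicographically, at most 20 values per level -->
      <CategoryFacet id="brand" categoryField="categories" root="Top/brand"
                     refinementPolicy="disjunctive" sortFunction="alphanum"
                     maxCategoriesPerLevel="20" />
      <!-- Single-select facet displayed in a fixed, comma-separated order -->
      <CategoryFacet id="priority" categoryField="categories" root="Top/priority"
                     refinementPolicy="exclusive" sortFunction="explicit"
                     explicitSortOrderValues="low,medium,high" />
    </FacetsConfig>
    ```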
  • AdditionalFacetView com.exalead.search.v30.AdditionalFacetView
    No documentation for this element.
    Parent objects
    • AutoTileGeoFacet
    • AutoTileGeoV2Facet
    • CategoryFacet
    • ConcatValueFacet
    • DateFacet
    • DynamicDateFacet
    • DynamicRangesNumericalFacet
    • ExplicitGeoFacet
    • ExplicitRangesNumericalFacet
    • Facet
    • FixedRangesNumericalFacet
    • Hierarchical2DFacet
    • LuaFacet
    • MultiDimensionFacet
    • NumericalFacet
    • OptimizedEnumFacet
    • RelatedTermFacet
    • ValueFacet
    • VirtualFacet
    Attributes
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • id string
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    Name Type Default value Description
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below, lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories in decreasing order of the number of documents matching the query and having this facet.
    • Date: Sorts the categories by:
      • Default with decreasing year, increasing month, increasing day
      • Reverse with increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below, lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number, it parses the prefix. In case of failure, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a weighting method similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    id string - Unique sub-identifier of this view. Will be concatenated to the facet's id.
    Used in Mashup-Builder and client applications to identify the facet.
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.

    XML representation
    <AdditionalFacetView sortFunction="count" aggregateScores="false" id="" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" ></AdditionalFacetView>
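    As an illustration, the two views below combine the attributes documented above. This is a minimal sketch: the id values and the category values listed in explicitSortOrderValues are hypothetical, and the elements are assumed to be nested inside one of the parent facets.

```xml
<!-- Hypothetical view: at most ten categories, sorted lexicographically in reverse order -->
<AdditionalFacetView id="alpha_desc" sortFunction="alphanum" reverse="true" maxCategories="10"></AdditionalFacetView>

<!-- Hypothetical view: explicit order; the backslash escapes the comma inside the last value -->
<AdditionalFacetView id="by_priority" sortFunction="explicit" explicitSortOrderValues="high,medium,low,not set\, unknown"></AdditionalFacetView>
```

    In the second view, the last entry of the list is the single value "not set, unknown", because its comma is escaped.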
  • AdditionalCategoryFacetView com.exalead.search.v30.AdditionalCategoryFacetView
    No documentation for this element.
    Parent objects
    • AutoTileGeoFacet
    • AutoTileGeoV2Facet
    • CategoryFacet
    • ConcatValueFacet
    • DateFacet
    • DynamicDateFacet
    • DynamicRangesNumericalFacet
    • ExplicitGeoFacet
    • ExplicitRangesNumericalFacet
    • Facet
    • FixedRangesNumericalFacet
    • Hierarchical2DFacet
    • LuaFacet
    • MultiDimensionFacet
    • NumericalFacet
    • OptimizedEnumFacet
    • RelatedTermFacet
    • ValueFacet
    • VirtualFacet
    Attributes
    • id string
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • maxCategoryTreeDepth int
    • maxCategoryTreeDepthFromRoot int
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    Name Type Default value Description
    id string - Unique sub-identifier of this view. Will be concatenated to the facet's id.
    Used in Mashup-Builder and client applications to identify the facet.
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories by decreasing number of documents that match the query and have this facet.
    • Date: Sorts the categories by:
      • Default with decreasing year, increasing month, increasing day
      • Reverse with increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer and sorts by that value. If the category is prefixed by a number, the prefix is parsed. If parsing fails, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), sorts the categories by increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The weighting method is similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    maxCategoryTreeDepth
    (UI: Max. tree depth from current refinement)
    int 0 Maximum depth of the category tree displayed in the facet, counted from the current refinement.
    Setting this attribute to N means that no categories will be kept below this level.
    0 means no limit.
    maxCategoryTreeDepthFromRoot
    (UI: Max. tree depth from root)
    int 0 Maximum depth of the category tree displayed in the facet from the Top/ level.
    The displayed tree never goes deeper than this limit, whatever the refinement level.
    0 means no limit.
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.

    XML representation
    <AdditionalCategoryFacetView id="" sortFunction="count" aggregateScores="false" maxCategoryTreeDepth="0" maxCategoryTreeDepthFromRoot="0" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" ></AdditionalCategoryFacetView>
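    The two depth limits can be combined, as in the sketch below. The id value and the numbers are illustrative only, and the exact levels returned depend on the category hierarchy of the parent facet.

```xml
<!-- Hypothetical view: show up to 2 levels below the current refinement,
     never deeper than 4 levels from Top/, with at most 20 values per level -->
<AdditionalCategoryFacetView id="shallow_tree" sortFunction="count" maxCategoryTreeDepth="2" maxCategoryTreeDepthFromRoot="4" maxCategoriesPerLevel="20"></AdditionalCategoryFacetView>
```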
  • AdditionalDateFacetView com.exalead.search.v30.AdditionalDateFacetView
    No documentation for this element.
    Parent objects
    • AutoTileGeoFacet
    • AutoTileGeoV2Facet
    • CategoryFacet
    • ConcatValueFacet
    • DateFacet
    • DynamicDateFacet
    • DynamicRangesNumericalFacet
    • ExplicitGeoFacet
    • ExplicitRangesNumericalFacet
    • Facet
    • FixedRangesNumericalFacet
    • Hierarchical2DFacet
    • LuaFacet
    • MultiDimensionFacet
    • NumericalFacet
    • OptimizedEnumFacet
    • RelatedTermFacet
    • ValueFacet
    • VirtualFacet
    Attributes
    • id string
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • withYear boolean
    • yearDesc boolean
    • withMonth boolean
    • monthDesc boolean
    • withWeek boolean
    • weekDesc boolean
    • withDay boolean
    • dayDesc boolean
    • withHour boolean
    • hourDesc boolean
    • withMinute boolean
    • minuteDesc boolean
    • withSecond boolean
    • secondDesc boolean
    • maxCategoryTreeDepth int
    • maxCategoryTreeDepthFromRoot int
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    Name Type Default value Description
    id string - Unique sub-identifier of this view. Will be concatenated to the facet's id.
    Used in Mashup-Builder and client applications to identify the facet.
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories by decreasing number of documents that match the query and have this facet.
    • Date: Sorts the categories by:
      • Default with decreasing year, increasing month, increasing day
      • Reverse with increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer and sorts by that value. If the category is prefixed by a number, the prefix is parsed. If parsing fails, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), sorts the categories by increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The weighting method is similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    withYear boolean True Creates a virtual category for each year.
    yearDesc boolean True
    withMonth boolean True Creates a virtual category for each month.
    monthDesc boolean False
    withWeek boolean False Creates a virtual category for each week.
    weekDesc boolean False
    withDay boolean True Creates a virtual category for each day.
    dayDesc boolean False
    withHour boolean False Creates a virtual category for each hour.
    hourDesc boolean False
    withMinute boolean False Creates a virtual category for each minute.
    minuteDesc boolean False
    withSecond boolean False Creates a virtual category for each second.
    secondDesc boolean False
    maxCategoryTreeDepth
    (UI: Max. tree depth from current refinement)
    int 0 Maximum depth of the returned category tree.

    Whatever the level of refinement, no more levels than this are retrieved (this ensures that maxCategoryTreeDepthFromRoot is not exceeded).
    maxCategoryTreeDepthFromRoot
    (UI: Max. tree depth from root)
    int 0 Maximum depth of the returned category tree, starting from the global root of the category.

    Setting this attribute to N means that no categories will be kept below this level.
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.

    XML representation
    <AdditionalDateFacetView id="" sortFunction="count" aggregateScores="false" withYear="true" yearDesc="true" withMonth="true" monthDesc="false" withWeek="false" weekDesc="false" withDay="true" dayDesc="false" withHour="false" hourDesc="false" withMinute="false" minuteDesc="false" withSecond="false" secondDesc="false" maxCategoryTreeDepth="0" maxCategoryTreeDepthFromRoot="0" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" ></AdditionalDateFacetView>
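    For example, a view restricted to years and months could be sketched as follows. The id value is hypothetical. The reference does not describe the *Desc attributes, so the comment below assumes that they select descending order for the corresponding level.

```xml
<!-- Hypothetical view: virtual categories for years and months only.
     Assumption: yearDesc/monthDesc set to "true" sort that level in descending order. -->
<AdditionalDateFacetView id="year_month" sortFunction="date" withYear="true" yearDesc="true" withMonth="true" monthDesc="true" withWeek="false" withDay="false" withHour="false" withMinute="false" withSecond="false"></AdditionalDateFacetView>
```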
  • AggregationFunction com.exalead.search.v30.AggregationFunction
    No documentation for this element.
    Parent objects
    • AutoTileGeoFacet
    • AutoTileGeoV2Facet
    • CategoryFacet
    • ConcatValueFacet
    • DateFacet
    • DynamicDateFacet
    • DynamicRangesNumericalFacet
    • ExplicitGeoFacet
    • ExplicitRangesNumericalFacet
    • Facet
    • FixedRangesNumericalFacet
    • Hierarchical2DFacet
    • LuaFacet
    • MultiDimensionFacet
    • NumericalFacet
    • OptimizedEnumFacet
    • RelatedTermFacet
    • ValueFacet
    • VirtualFacet
    Attributes
    • id string
    • expr string
    • functionName string
    Name Type Default value Description
    id string - Unique identifier of this function.
    expr string - The expression on which to perform the synthesis. It can reference fields or virtual fields.
    functionName string - Aggregation function name. We support the following aggregation functions:
    • AVG: average field value
    • MAX: maximum field value
    • MIN: minimum field value
    • SUM: sum of field values
    • COUNT: number of values
    • STDDEV: standard deviation of field values
    • CENTILE(X): X-centile of field values. X is a double between 0.0 and 100.0.
    • MAXDATE(format): maximum field value. The expression must be an index time. The output is formatted according to the specified format.
    • MINDATE(format): minimum field value. The expression must be an index time. The output is formatted according to the specified format.

    XML representation
    <AggregationFunction id="" expr="" functionName="" ></AggregationFunction>
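    The declarations below sketch how the function names listed above might be used. The field names "price" and "rating" and the id values are hypothetical. The elements are assumed to be nested inside one of the parent facets.

```xml
<!-- Average of a hypothetical numerical field "price" for each facet value -->
<AggregationFunction id="avg_price" expr="price" functionName="AVG"></AggregationFunction>

<!-- 90th centile of the same field; X must be a double between 0.0 and 100.0 -->
<AggregationFunction id="p90_price" expr="price" functionName="CENTILE(90.0)"></AggregationFunction>

<!-- Number of values of a hypothetical field "rating" -->
<AggregationFunction id="nb_ratings" expr="rating" functionName="COUNT"></AggregationFunction>
```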
  • ExplicitRangesNumericalFacet com.exalead.search.v30.ExplicitRangesNumericalFacet
    Facet on a numerical expression
    Parent objects
    • FacetsConfig
    Attributes
    • expr string
    • min nullabledouble
    • max nullabledouble
    • lsb int
    • msb int
    • maxValues int
    • defaultPrecision int
    • virtualRoot string
    • id string
    • type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
    • refinementPolicy enum(exclusive, disjunctive, norefine)
    • implementation enum(autodetect, favor_cpu, favor_memory)
    • inNavigation boolean
    • inHits boolean
    • maxSliceCategories int
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    • index int
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    expr
    (UI: Expression)
    string - Virtual field expression for this facet.
    min
    (UI: Exclude values below)
    nullabledouble - Excludes values below this threshold from the facet.
    max
    (UI: Exclude values above)
    nullabledouble - Excludes values above this threshold from the facet.
    lsb
    (UI: Least significant bit)
    int 0 Least significant bit of the numerical value to use.

    Only used for integral types.
    msb
    (UI: Most significant bit)
    int 63 Most significant bit of the numerical value to use.

    Only used for integral types.
    maxValues
    (UI: Max. categories)
    int 100 Maximum number of displayed values.
    defaultPrecision
    (UI: Default precision)
    int 0 Default precision, in number of trailing digits, for float-typed expressions with double precision (e.g. "#random()").

    For example, if defaultPrecision is 2, 0.011 and 0.012 are considered the same value, and categories are displayed with two decimal places (e.g. [0.00;0.99]).
    virtualRoot
    (UI: Virtual root)
    string - Path for the resulting root 'virtual' category.
    When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path.

    For example: Top/VirtualRoot/[0;5]
    id string - Unique identifier of this facet.

    Used in Mashup-Builder and client applications to identify the facet.
    type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue) Used for convenience in various introspection code
    refinementPolicy
    (UI: Refinement policy)
    enum(exclusive, disjunctive, norefine) exclusive Defines the Refinements panel behavior.
    • Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected.
    • Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected.
    • No refine: Allows you to display all categories without enabling refinements.
    implementation
    (UI: Implementation)
    enum(autodetect, favor_cpu, favor_memory) autodetect Internal algorithm for implementation of this facet.

    Value can be null or one of:
    • "autodetect" automatically selects a suitable policy based on the result set and facet data.
    • "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index).
    • "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set).
    inNavigation
    (UI: Use for navigation)
    boolean True Computes navigation on this facet.
    inHits
    (UI: Use for hit content)
    boolean True Computes the value of this facet for each hit.
    maxSliceCategories
    (UI: Max categories per slices)
    int 0 Maximum number of categories returned by each slice for one facet.
    0 means no limit.

    Warning: Results merged by the search server may be incorrect.
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories by decreasing number of documents that match the query and have this facet.
    • Date: Sorts the categories by:
      • Default with decreasing year, increasing month, increasing day
      • Reverse with increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer and sorts by that value. If the category is prefixed by a number, the prefix is parsed. If parsing fails, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), sorts the categories by increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The weighting method is similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.
    index int - Internal.
    dataModelState string -
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

    Nested elements
    • fromDataModelFacet
    • AdditionalFacetView*
    • AggregationFunction*
    • Range*
    Type Wrapper Description
    Facet fromDataModel
    AdditionalFacetView* -
    AggregationFunction* - A numerical computation to perform on hits matching each value of the facet.
    Range* -
    XML representation
    <ExplicitRangesNumericalFacet expr="" min="" max="" lsb="0" msb="63" maxValues="100" defaultPrecision="0" virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></ExplicitRangesNumericalFacet>
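    Sorting by an aggregation value could be configured along the lines of the sketch below. The field name "price" and the id values are hypothetical. The sketch also assumes that sortAggregationFunction takes the id of a nested AggregationFunction, which the reference does not state explicitly. Check the value expected by your version before relying on it.

```xml
<!-- Hypothetical facet on a numerical field "price", sorted by the average price of each range.
     Assumption: sortAggregationFunction refers to the id of the nested AggregationFunction. -->
<ExplicitRangesNumericalFacet id="price_ranges" expr="price" refinementPolicy="disjunctive" sortFunction="aggregation" sortAggregationFunction="avg_price">
  <AggregationFunction id="avg_price" expr="price" functionName="AVG"></AggregationFunction>
  <Range min="0" max="50" title="0 to 50"></Range>
  <Range min="50" max="200" title="50 to 200"></Range>
</ExplicitRangesNumericalFacet>
```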
  • Range com.exalead.search.v30.Range
    No documentation for this element.
    Parent objects
    • ExplicitRangesNumericalFacet
    Attributes
    • min nullabledouble
    • max nullabledouble
    • lsb int
    • msb int
    • title string
    Name Type Default value Description
    min
    (UI: Min)
    nullabledouble - The lower bound of the range (null means minus infinity).
    max
    (UI: Max)
    nullabledouble - The upper bound of the range (null means plus infinity).
    lsb
    (UI: Least significant bit)
    int 0 Least significant bit of the numerical value to use.

    Only used for integral types.
    msb
    (UI: Most significant bit)
    int 63 Most significant bit of the numerical value to use.

    Only used for integral types.
    title string -

    XML representation
    <Range min="" max="" lsb="0" msb="63" title="" ></Range>
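    A set of ranges with open bounds could be declared as in the following sketch. The facet id, the field name "age", and the titles are hypothetical. Leaving out min or max is assumed here to keep the bound null (minus or plus infinity), as described in the attribute table.

```xml
<!-- Hypothetical facet on a numerical field "age" with three explicit ranges -->
<ExplicitRangesNumericalFacet id="age_groups" expr="age">
  <!-- No min: lower bound is null, that is, minus infinity -->
  <Range max="18" title="Under 18"></Range>
  <Range min="18" max="65" title="18 to 65"></Range>
  <!-- No max: upper bound is null, that is, plus infinity -->
  <Range min="65" title="65 and over"></Range>
</ExplicitRangesNumericalFacet>
```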
  • FixedRangesNumericalFacet com.exalead.search.v30.FixedRangesNumericalFacet
    Facet on a numerical expression
    Parent objects
    • FacetsConfig
    Attributes
    • expr string
    • min nullabledouble
    • max nullabledouble
    • lsb int
    • msb int
    • maxValues int
    • defaultPrecision int
    • virtualRoot string
    • id string
    • type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
    • refinementPolicy enum(exclusive, disjunctive, norefine)
    • implementation enum(autodetect, favor_cpu, favor_memory)
    • inNavigation boolean
    • inHits boolean
    • maxSliceCategories int
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • rangeSize double
    • aboveMax boolean
    • belowMin boolean
    • rangeTitleFormat string
    • aboveTitleFormat string
    • belowTitleFormat string
    • singletonTitleFormat string
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    • index int
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    expr
    (UI: Expression)
    string - Virtual field expression for this facet.
    min
    (UI: Exclude values below)
    nullabledouble - Excludes values below this threshold from the facet.
    max
    (UI: Exclude values above)
    nullabledouble - Excludes values above this threshold from the facet.
    lsb
    (UI: Least significant bit)
    int 0 Least significant bit of the numerical value to use.

    Only used for integral types.
    msb
    (UI: Most significant bit)
    int 63 Most significant bit of the numerical value to use.

    Only used for integral types.
    maxValues
    (UI: Max. categories)
    int 100 Maximum number of displayed values.
    defaultPrecision
    (UI: Default precision)
    int 0 Default precision, in number of trailing digits, for float-typed expressions with double precision (e.g. "#random()").

    For example, if defaultPrecision is 2, 0.011 and 0.012 are considered the same value, and categories are displayed with two decimal places (e.g. [0.00;0.99]).
    virtualRoot
    (UI: Virtual root)
    string - Path for the resulting root 'virtual' category.
    When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path.

    For example: Top/VirtualRoot/[0;5]
    id string - Unique identifier of this facet.

    Used in Mashup-Builder and client applications to identify the facet.
    type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue) Used for convenience in various introspection code
    refinementPolicy
    (UI: Refinement policy)
    enum(exclusive, disjunctive, norefine) exclusive Defines the Refinements panel behavior.
    • Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected.
    • Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected.
    • No refine: Allows you to display all categories without enabling refinements.
    implementation
    (UI: Implementation)
    enum(autodetect, favor_cpu, favor_memory) autodetect Internal algorithm for implementation of this facet.

    Value can be null or one of:
    • "autodetect" automatically selects a suitable policy based on the result set and facet data.
    • "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index).
    • "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set).
    inNavigation
    (UI: Use for navigation)
    boolean True Computes navigation on this facet.
    inHits
    (UI: Use for hit content)
    boolean True Computes the value of this facet for each hit.
    maxSliceCategories
    (UI: Max categories per slices)
    int 0 Maximum number of categories returned by each slice for one facet.
    0 means no limit.

    Warning: Results merged by the search server may be incorrect.
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories by decreasing number of documents that match the query and have this facet.
    • Date: Sorts the categories by:
      • Default with decreasing year, increasing month, increasing day
      • Reverse with increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer and sorts by that value. If the category is prefixed by a number, the prefix is parsed. If parsing fails, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), sorts the categories by increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The weighting method is similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    rangeSize
    (UI: Range size)
    double 0.0 Size of each range.
    0 means that the resulting facet will contain one range for each unique value.
    aboveMax
    (UI: Above max range)
    boolean False Creates a range [max;] that includes all values above the max.
    belowMin
    (UI: Below min range)
    boolean False Creates a range [;min] that includes all values below the min.
    rangeTitleFormat
    (UI: Range title format)
    string [$;$] Format for the generated range titles.

    The first dollar sign will be replaced with the range's lower bound, and the second dollar sign will be replaced with the range's upper bound.

    To write a dollar sign, use \$.
    aboveTitleFormat
    (UI: Above max range title format)
    string [$;] Format for the generated "above max" range.

    The dollar sign will be replaced with the range's lower bound.

    To write a dollar sign, use \$.
    belowTitleFormat
    (UI: Below min range title format)
    string [;$] Format for the generated "below min" range.

    The dollar sign will be replaced with the range's upper bound.

    To write a dollar sign, use \$.
    singletonTitleFormat
    (UI: Singleton range title format)
    string $ Format for the generated single value range titles (like [3;3]).

    The dollar sign will be replaced with the value.

    To write a dollar sign, use \$.
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.
    index int - Internal.
    dataModelState string -
    dataModelClass string - Name of the DataModelClass that generated this facet. Available when dataModelState is "auto" or "customized".
    dataModelProperty string - Name of the DataModelProperty that generated this facet. Available when dataModelState is "auto" or "customized".

    Nested elements
    • fromDataModelFacet
    • AdditionalFacetView*
    • AggregationFunction*
    Type Wrapper Description
    Facet fromDataModel
    AdditionalFacetView* -
    AggregationFunction* - A numerical computation to perform on hits matching each value of the facet.
    XML representation
    <FixedRangesNumericalFacet expr="" min="" max="" lsb="0" msb="63" maxValues="100" defaultPrecision="0" virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" rangeSize="0.0" aboveMax="false" belowMin="false" rangeTitleFormat="[$;$]" aboveTitleFormat="[$;]" belowTitleFormat="[;$]" singletonTitleFormat="$" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></FixedRangesNumericalFacet>
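    To make the title-format attributes above more concrete, the fragment below sketches a fixed-range facet whose range titles read as prices. It is an illustration under assumptions, not a reference configuration: the "price" id and expression and the numeric values are hypothetical, and it assumes that aboveMax and belowMin enable the generated "above max" and "below min" ranges. Every attribute name is taken from the XML representation above.

```xml
<!-- Sketch only: "price" is a hypothetical facet id and field expression. -->
<FixedRangesNumericalFacet
    id="price"
    expr="price"
    min="0"
    max="500"
    rangeSize="100.0"
    aboveMax="true"
    belowMin="true"
    rangeTitleFormat="\$$ to \$$"
    aboveTitleFormat="\$$ and above"
    belowTitleFormat="below \$$"
    singletonTitleFormat="\$$">
</FixedRangesNumericalFacet>
```

    In each format, \$ writes a literal dollar sign and each unescaped $ is replaced with a bound, so a range with bounds 100 and 200 would get a title of the form "$100 to $200". How the bounds themselves are rendered (for example 100 versus 100.0) depends on the expression type and is not shown here.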
  • DynamicRangesNumericalFacet com.exalead.search.v30.DynamicRangesNumericalFacet
    Facet on a numerical expression
    Parent objects
    • FacetsConfig
    Attributes
    • expr string
    • min nullabledouble
    • max nullabledouble
    • lsb int
    • msb int
    • maxValues int
    • defaultPrecision int
    • virtualRoot string
    • id string
    • type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
    • refinementPolicy enum(exclusive, disjunctive, norefine)
    • implementation enum(autodetect, favor_cpu, favor_memory)
    • inNavigation boolean
    • inHits boolean
    • maxSliceCategories int
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • minimumRangeSize double
    • nbRanges int
    • policy enum(linear, geometrical, auto)
    • adjustRanges boolean
    • moreAccurate boolean
    • rangeTitleFormat string
    • exclusiveRightBracket boolean
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    • index int
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    expr
    (UI: Expression)
    string - Virtual field expression for this facet.
    min
    (UI: Exclude values below)
    nullabledouble - Excludes values below this threshold from the facet.
    max
    (UI: Exclude values above)
    nullabledouble - Excludes values above this threshold from the facet.
    lsb
    (UI: Least significant bit)
    int 0 Least significant bit of the numerical value to use.

    Only used for integral types.
    msb
    (UI: Most significant bit)
    int 63 Most significant bit of the numerical value to use.

    Only used for integral types.
    maxValues
    (UI: Max. categories)
    int 100 Maximum number of displayed values.
    defaultPrecision
    (UI: Default precision)
    int 0 Default precision, in number of digits after the decimal point, for float-typed expressions with double precision (e.g. "#random()").

    For example, if defaultPrecision is 2, 0.011 and 0.012 are considered equal, and categories are displayed with two decimal digits (e.g. [0.00;0.99]).
    virtualRoot
    (UI: Virtual root)
    string - Path for the resulting root 'virtual' category.
    When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path.

    For example: Top/VirtualRoot/[0;5]
    id string - Unique identifier of this facet.

    Used in Mashup-Builder and client applications to identify the facet.
    type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue) Used for convenience by introspection code.
    refinementPolicy
    (UI: Refinement policy)
    enum(exclusive, disjunctive, norefine) exclusive Defines the Refinements panel behavior.
    • Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected.
    • Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected.
    • No refine: Allows you to display all categories without enabling refinements.
    implementation
    (UI: Implementation)
    enum(autodetect, favor_cpu, favor_memory) autodetect Internal algorithm for implementation of this facet.

    Value can be null or one of
    • autodetect
    • favor_cpu
    • favor_memory
    • "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index).
    • "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set).
    • "autodetect" automatically selects a suitable policy based on the result set and facet data.
    inNavigation
    (UI: Use for navigation)
    boolean True Computes navigation on this facet.
    inHits
    (UI: Use for hit content)
    boolean True Computes the value of this facet for each hit.
    maxSliceCategories
    (UI: Max categories per slices)
    int 0 Maximum number of categories returned by each slice for one facet.
    0 means no limit.

    Warning: Results merged by the search server may be incorrect.
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories in decreasing order of the number of documents that match the query and have this facet.
    • Date: Sorts the categories by:
      • Default: decreasing year, increasing month, increasing day
      • Reverse: increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number, it parses the prefix. If parsing fails, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), sorts the categories by increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents that match the query and have this facet, and the total number of documents that have the facet. The weighting method is similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    minimumRangeSize double 1.0 Size of each bin used to compute cardinality.
    nbRanges int 5 The maximal number of ranges to output.
    policy enum(linear, geometrical, auto) linear The policy used to generate the ranges.
    Value can be null or one of
    • linear
    • geometrical
    • auto
    adjustRanges boolean True Tries to adjust the ranges on multiples of 10.
    moreAccurate boolean False Better linear/geometrical ranges, but slower.
    rangeTitleFormat string [$;$] Format for the generated range titles.

    The first dollar sign will be replaced with the range's lower bound, and the second dollar sign will be replaced with the range's upper bound.

    To write a dollar sign, use \$.
    exclusiveRightBracket
    (UI: Exclusive right bracket [a;b[)
    boolean False If true, each range is half-open, [a;b[, and does not contain hits whose value equals b.
    Otherwise, each range is inclusive, [a;b].
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.
    index int - Internal.
    dataModelState string -
    dataModelClass string - Name of the DataModelClass that generated this facet. Available when dataModelState is "auto" or "customized".
    dataModelProperty string - Name of the DataModelProperty that generated this facet. Available when dataModelState is "auto" or "customized".

    Nested elements
    • fromDataModelFacet
    • AdditionalFacetView*
    • AggregationFunction*
    Type Wrapper Description
    Facet fromDataModel
    AdditionalFacetView* -
    AggregationFunction* - A numerical computation to perform on hits matching each value of the facet.
    XML representation
    <DynamicRangesNumericalFacet expr="" min="" max="" lsb="0" msb="63" maxValues="100" defaultPrecision="0" virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" minimumRangeSize="1.0" nbRanges="5" policy="linear" adjustRanges="true" moreAccurate="false" rangeTitleFormat="[$;$]" exclusiveRightBracket="false" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></DynamicRangesNumericalFacet>
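    The range-generation attributes above can be combined as in the following sketch, which asks for up to eight geometrically spaced ranges with an exclusive upper bound. The "file_size" id and expression are hypothetical. The fragment also assumes that the title format has to be adapted by hand to show the open bracket; the reference above does not say whether this happens automatically.

```xml
<!-- Sketch only: "file_size" is a hypothetical facet id and field expression. -->
<DynamicRangesNumericalFacet
    id="file_size"
    expr="file_size"
    nbRanges="8"
    policy="geometrical"
    adjustRanges="true"
    exclusiveRightBracket="true"
    rangeTitleFormat="[$;$["
    refinementPolicy="disjunctive">
</DynamicRangesNumericalFacet>
```

    With exclusiveRightBracket set to true, a hit whose value equals the upper bound b of one range is not counted in that range. The disjunctive refinement policy lets users select several of the generated ranges at once.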
  • ExplicitGeoFacet com.exalead.search.v30.ExplicitGeoFacet
    Facet on geographical points based on disks and/or polygons
    Parent objects
    • FacetsConfig
    Attributes
    • virtualRoot string
    • id string
    • type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
    • refinementPolicy enum(exclusive, disjunctive, norefine)
    • implementation enum(autodetect, favor_cpu, favor_memory)
    • inNavigation boolean
    • inHits boolean
    • maxSliceCategories int
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • field string
    • hitsMaxCategories int
    • useGeometryAsDomainId boolean
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    • index int
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    virtualRoot
    (UI: Virtual root)
    string - Path for the resulting root 'virtual' category.
    When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path.

    For example: Top/VirtualRoot/[0;5]
    id string - Unique identifier of this facet.

    Used in Mashup-Builder and client applications to identify the facet.
    type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue) Used for convenience by introspection code.
    refinementPolicy
    (UI: Refinement policy)
    enum(exclusive, disjunctive, norefine) exclusive Defines the Refinements panel behavior.
    • Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected.
    • Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected.
    • No refine: Allows you to display all categories without enabling refinements.
    implementation
    (UI: Implementation)
    enum(autodetect, favor_cpu, favor_memory) autodetect Internal algorithm for implementation of this facet.

    Value can be null or one of
    • autodetect
    • favor_cpu
    • favor_memory
    • "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index).
    • "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set).
    • "autodetect" automatically selects a suitable policy based on the result set and facet data.
    inNavigation
    (UI: Use for navigation)
    boolean True Computes navigation on this facet.
    inHits
    (UI: Use for hit content)
    boolean True Computes the value of this facet for each hit.
    maxSliceCategories
    (UI: Max categories per slices)
    int 0 Maximum number of categories returned by each slice for one facet.
    0 means no limit.

    Warning: Results merged by the search server may be incorrect.
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories in decreasing order of the number of documents that match the query and have this facet.
    • Date: Sorts the categories by:
      • Default: decreasing year, increasing month, increasing day
      • Reverse: increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number, it parses the prefix. If parsing fails, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), sorts the categories by increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents that match the query and have this facet, and the total number of documents that have the facet. The weighting method is similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    field string - The point field on which to perform the synthesis.
    hitsMaxCategories int 100 Maximum number of categories displayed in hits.
    useGeometryAsDomainId boolean True If true, the domain ID is computed from the geometric parameters. Otherwise, a unique ID is used. Recommended value: false, except for backward compatibility.
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.
    index int - Internal.
    dataModelState string -
    dataModelClass string - Name of the DataModelClass that generated this facet. Available when dataModelState is "auto" or "customized".
    dataModelProperty string - Name of the DataModelProperty that generated this facet. Available when dataModelState is "auto" or "customized".

    Nested elements
    • fromDataModelFacet
    • AdditionalFacetView*
    • AggregationFunction*
    • GeoDomain*
    Type Wrapper Description
    Facet fromDataModel
    AdditionalFacetView* -
    AggregationFunction* - A numerical computation to perform on hits matching each value of the facet.
    GeoDomain* - List of GeoDomains
    XML representation
    <ExplicitGeoFacet virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" field="" hitsMaxCategories="100" useGeometryAsDomainId="true" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></ExplicitGeoFacet>
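    As a hedged illustration, the fragment below sketches an explicit geographical facet that follows the recommendation above to set useGeometryAsDomainId to false for new configurations. The "store_area" id and "location" field name are hypothetical. The nested GeoDomain elements are only indicated by a comment, because their attributes are not described in this entry.

```xml
<!-- Sketch only: "store_area" and "location" are hypothetical names. -->
<ExplicitGeoFacet
    id="store_area"
    field="location"
    useGeometryAsDomainId="false"
    hitsMaxCategories="20"
    refinementPolicy="norefine">
  <!-- GeoDomain elements (the disks and/or polygons) would be nested here.
       Their attributes are documented under GeoDomain and are omitted. -->
</ExplicitGeoFacet>
```

    Here refinementPolicy="norefine" displays all categories without enabling refinements, and hitsMaxCategories lowers the number of categories displayed in hits from its default of 100 to 20.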
  • AutoTileGeoFacet com.exalead.search.v30.AutoTileGeoFacet
    Facet on a point field that groups hits into fixed-size tiles within a bounding box.
    Parent objects
    • FacetsConfig
    Attributes
    • virtualRoot string
    • id string
    • type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
    • refinementPolicy enum(exclusive, disjunctive, norefine)
    • implementation enum(autodetect, favor_cpu, favor_memory)
    • inNavigation boolean
    • inHits boolean
    • maxSliceCategories int
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • field string
    • xMin double
    • yMin double
    • xMax double
    • yMax double
    • xBinSize double
    • yBinSize double
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    • index int
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    virtualRoot
    (UI: Virtual root)
    string - Path for the resulting root 'virtual' category.
    When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path.

    For example: Top/VirtualRoot/[0;5]
    id string - Unique identifier of this facet.

    Used in Mashup-Builder and client applications to identify the facet.
    type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue) Used for convenience by introspection code.
    refinementPolicy
    (UI: Refinement policy)
    enum(exclusive, disjunctive, norefine) exclusive Defines the Refinements panel behavior.
    • Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected.
    • Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected.
    • No refine: Allows you to display all categories without enabling refinements.
    implementation
    (UI: Implementation)
    enum(autodetect, favor_cpu, favor_memory) autodetect Internal algorithm for implementation of this facet.

    Value can be null or one of
    • autodetect
    • favor_cpu
    • favor_memory
    • "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index).
    • "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set).
    • "autodetect" automatically selects a suitable policy based on the result set and facet data.
    inNavigation
    (UI: Use for navigation)
    boolean True Computes navigation on this facet.
    inHits
    (UI: Use for hit content)
    boolean True Computes the value of this facet for each hit.
    maxSliceCategories
    (UI: Max categories per slices)
    int 0 Maximum number of categories returned by each slice for one facet.
    0 means no limit.

    Warning: Results merged by the search server may be incorrect.
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories in decreasing order of the number of documents that match the query and have this facet.
    • Date: Sorts the categories by:
      • Default: decreasing year, increasing month, increasing day
      • Reverse: increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number, it parses the prefix. If parsing fails, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), sorts the categories by increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents that match the query and have this facet, and the total number of documents that have the facet. The weighting method is similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    field string - The point field on which to perform the synthesis.
    xMin double -90 Minimum bounding box X value.
    yMin double -180 Minimum bounding box Y value.
    xMax double 90 Maximum bounding box X value.
    yMax double 180 Maximum bounding box Y value.
    xBinSize double 0.1 Size of a tile in the X direction.
    yBinSize double 0.1 Size of a tile in the Y direction.
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.
    index int - Internal.
    dataModelState string -
    dataModelClass string - Name of the DataModelClass that generated this facet. Available when dataModelState is "auto" or "customized".
    dataModelProperty string - Name of the DataModelProperty that generated this facet. Available when dataModelState is "auto" or "customized".

    Nested elements
    • fromDataModelFacet
    • AdditionalFacetView*
    • AggregationFunction*
    Type Wrapper Description
    Facet fromDataModel
    AdditionalFacetView* -
    AggregationFunction* - A numerical computation to perform on hits matching each value of the facet.
    XML representation
    <AutoTileGeoFacet virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" field="" xMin="-90" yMin="-180" xMax="90" yMax="180" xBinSize="0.1" yBinSize="0.1" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></AutoTileGeoFacet>
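    The bounding box and bin size together determine how many tiles the facet can produce. Assuming the number of tiles along an axis is the extent divided by the bin size, the default values give (90 - (-90)) / 0.1 = 1800 tiles in X and (180 - (-180)) / 0.1 = 3600 tiles in Y, that is 6,480,000 possible tiles. The sketch below restricts the box and uses a coarser grid; the "location" field name, the "store_tiles" id and the coordinates are hypothetical.

```xml
<!-- Sketch only: "store_tiles" and "location" are hypothetical names.
     X extent: 49.5 - 48.0 = 1.5, so 1.5 / 0.05 = 30 tiles.
     Y extent: 3.5 - 1.5 = 2.0, so 2.0 / 0.05 = 40 tiles.
     Total: 30 * 40 = 1200 possible tiles (under the assumption above). -->
<AutoTileGeoFacet
    id="store_tiles"
    field="location"
    xMin="48.0"
    xMax="49.5"
    yMin="1.5"
    yMax="3.5"
    xBinSize="0.05"
    yBinSize="0.05"
    maxCategories="50">
</AutoTileGeoFacet>
```

    The default ranges of -90 to 90 for X and -180 to 180 for Y suggest that X carries latitude and Y carries longitude, but the reference above does not state this explicitly, so check it against your data before relying on it.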
  • AutoTileGeoV2Facet com.exalead.search.v30.AutoTileGeoV2Facet
    Facet on a geov2 field that groups hits into a fixed number of tiles within a bounding box.
    Parent objects
    • FacetsConfig
    Attributes
    • virtualRoot string
    • id string
    • type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
    • refinementPolicy enum(exclusive, disjunctive, norefine)
    • implementation enum(autodetect, favor_cpu, favor_memory)
    • inNavigation boolean
    • inHits boolean
    • maxSliceCategories int
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • field string
    • xMin double
    • yMin double
    • xMax double
    • yMax double
    • xNbCells int
    • yNbCells int
    • hitsMaxCategories int
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    • index int
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    virtualRoot
    (UI: Virtual root)
    string - Path for the resulting root 'virtual' category.
    When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path.

    For example: Top/VirtualRoot/[0;5]
    id string - Unique identifier of this facet.

    Used in Mashup-Builder and client applications to identify the facet.
    type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue) Used for convenience by introspection code.
    refinementPolicy
    (UI: Refinement policy)
    enum(exclusive, disjunctive, norefine) exclusive Defines the Refinements panel behavior.
    • Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected.
    • Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected.
    • No refine: Allows you to display all categories without enabling refinements.
    implementation
    (UI: Implementation)
    enum(autodetect, favor_cpu, favor_memory) autodetect Internal algorithm for implementation of this facet.

    Value can be null or one of
    • autodetect
    • favor_cpu
    • favor_memory
    • "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index).
    • "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set).
    • "autodetect" automatically selects a suitable policy based on the result set and facet data.
    inNavigation
    (UI: Use for navigation)
    boolean True Computes navigation on this facet.
    inHits
    (UI: Use for hit content)
    boolean True Computes the value of this facet for each hit.
    maxSliceCategories
    (UI: Max categories per slices)
    int 0 Maximum number of categories returned by each slice for one facet.
    0 means no limit.

    Warning: Results merged by the search server may be incorrect.
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories in decreasing order of the number of documents that match the query and have this facet.
    • Date: Sorts the categories by:
      • Default: decreasing year, increasing month, increasing day
      • Reverse: increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number, it parses the prefix. If parsing fails, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), sorts the categories by increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents that match the query and have this facet, and the total number of documents that have the facet. The weighting method is similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    field string - The geov2 field on which to perform the synthesis.
    xMin double -90 Minimum bounding box X value.
    yMin double -180 Minimum bounding box Y value.
    xMax double 90 Maximum bounding box X value.
    yMax double 180 Maximum bounding box Y value.
    xNbCells int 10 Number of tiles in the X direction.
    yNbCells int 10 Number of tiles in the Y direction.
    hitsMaxCategories int 100 Maximum number of categories displayed in hits.
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.
    index int - Internal.
    dataModelState string -
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this facet.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this facet.

    Nested elements
    • fromDataModelFacet
    • AdditionalFacetView*
    • AggregationFunction*
    Type Wrapper Description
    Facet fromDataModel
    AdditionalFacetView* -
    AggregationFunction* - A numerical computation to perform on hits matching each value of the facet.
    XML representation
    <AutoTileGeoV2Facet virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" field="" xMin="-90" yMin="-180" xMax="90" yMax="180" xNbCells="10" yNbCells="10" hitsMaxCategories="100" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></AutoTileGeoV2Facet>
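    As an illustration of the bounding box and grid attributes, the fragment below is a minimal sketch of an AutoTileGeoV2Facet restricted to a smaller area. The id, virtualRoot, and field values are hypothetical, and omitted attributes are assumed to keep the defaults listed above. The default ranges (-90 to 90 for X, -180 to 180 for Y) suggest that X carries latitude and Y carries longitude; this is an inference from the defaults, not something the reference states.

    ```xml
    <!-- Sketch only: id, virtualRoot, and field are hypothetical names. -->
    <!-- Assumption: X is latitude and Y is longitude, inferred from the default ranges. -->
    <AutoTileGeoV2Facet id="store_tiles" virtualRoot="Top/StoreTiles" field="store_location"
        refinementPolicy="exclusive" sortFunction="count"
        xMin="41" yMin="-5" xMax="51" yMax="10"
        xNbCells="20" yNbCells="30"
        minDocsPerCategory="1" maxCategories="0"></AutoTileGeoV2Facet>
    ```

    With these values the box spans 10 units in X and 15 units in Y. If the tiles are evenly sized (an assumption), the 20 by 30 grid gives tiles of 0.5 by 0.5 units.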
  • DynamicDateFacet com.exalead.search.v30.DynamicDateFacet
    A DynamicDateFacet generates at most 'maxCategories' categories, scaling the resolution to the day, week, month, or year level.
    Parent objects
    • FacetsConfig
    Attributes
    • virtualRoot string
    • id string
    • type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
    • refinementPolicy enum(exclusive, disjunctive, norefine)
    • implementation enum(autodetect, favor_cpu, favor_memory)
    • inNavigation boolean
    • inHits boolean
    • maxSliceCategories int
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • expr string
    • yearOutputFormat string
    • quarterOutputFormat string
    • monthOutputFormat string
    • weekOutputFormat string
    • dayOutputFormat string
    • hourOutputFormat string
    • minuteOutputFormat string
    • secondOutputFormat string
    • enableYearLevel boolean
    • enableQuarterLevel boolean
    • enableMonthLevel boolean
    • enableWeekLevel boolean
    • enableDayLevel boolean
    • enableHourLevel boolean
    • enableMinuteLevel boolean
    • enableSecondLevel boolean
    • enableISO8601Compliance boolean
    • generateMissingIntervals boolean
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    • index int
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    virtualRoot
    (UI: Virtual root)
    string - Path for the resulting root 'virtual' category.
    When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path.

    For example: Top/VirtualRoot/[0;5]
    id string - Unique identifier of this facet.

    Used in Mashup Builder and client applications to identify the facet.
    type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue) - Used for convenience in introspection code.
    refinementPolicy
    (UI: Refinement policy)
    enum(exclusive, disjunctive, norefine) exclusive Defines the Refinements panel behavior.
    • Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected.
    • Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected.
    • No refine: Allows you to display all categories without enabling refinements.
    implementation
    (UI: Implementation)
    enum(autodetect, favor_cpu, favor_memory) autodetect Internal algorithm for implementation of this facet.

    Value can be null or one of: autodetect, favor_cpu, favor_memory.
    • "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index).
    • "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set).
    • "autodetect" automatically selects a suitable policy based on the result set and facet data.
    inNavigation
    (UI: Use for navigation)
    boolean True Computes navigation on this facet.
    inHits
    (UI: Use for hit content)
    boolean True Computes the value of this facet for each hit.
    maxSliceCategories
    (UI: Max categories per slices)
    int 0 Maximum number of categories returned by each slice for one facet.
    0 means no limit.

    Warning: Results merged by the search server may be incorrect.
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories by decreasing number of documents that match the query and have this facet.
    • Date: Sorts the categories by:
      • Default with decreasing year, increasing month, increasing day
      • Reverse with increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer and sorts by that number. If the category is prefixed by a number, the prefix is parsed. If parsing fails, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), sorts the categories by increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The weighting method is similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    expr string - The date field or virtual expr to use.
    yearOutputFormat string %Y The output format used to generate year-based categories.
    quarterOutputFormat string %Y %Q The output format used to generate quarter-based categories (use %Q to specify the quarter number).
    monthOutputFormat string %b %Y The output format used to generate month-based categories.
    weekOutputFormat string week %V %Y The output format used to generate week-based categories.
    dayOutputFormat string %Y/%m/%d The output format used to generate day-based categories.
    hourOutputFormat string %Y/%m/%d %H The output format used to generate hour-based categories.
    minuteOutputFormat string %Y/%m/%d %H:%M The output format used to generate minute-based categories.
    secondOutputFormat string %Y/%m/%d %H:%M:%S The output format used to generate second-based categories.
    enableYearLevel boolean True Enables the year level.
    enableQuarterLevel boolean False Enables the quarter level.
    enableMonthLevel boolean True Enables the month level.
    enableWeekLevel boolean False Enables the week level.
    enableDayLevel boolean True Enables the day level.
    enableHourLevel boolean False Enables the hour level.
    enableMinuteLevel boolean False Enables the minute level.
    enableSecondLevel boolean False Enables the second level.
    enableISO8601Compliance boolean True Enables the ISO 8601 standard instead of the American standard.

    In ISO 8601, weeks start on Monday and a few rules determine the first and last weeks of each year.

    Use compatible output formats: typically %V (ISO 8601) rather than %U (American). Otherwise, week numbering will not be consistent with grouping.
    generateMissingIntervals boolean True Generates missing dates, setting their count to 0 and their aggregation values to NO_VALUE.
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.
    index int - Internal.
    dataModelState string -
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this facet.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this facet.

    Nested elements
    • fromDataModelFacet
    • AdditionalFacetView*
    • AggregationFunction*
    Type Wrapper Description
    Facet fromDataModel
    AdditionalFacetView* -
    AggregationFunction* - A numerical computation to perform on hits matching each value of the facet.
    XML representation
    <DynamicDateFacet virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" expr="" yearOutputFormat="%Y" quarterOutputFormat="%Y %Q" monthOutputFormat="%b %Y" weekOutputFormat="week %V %Y" dayOutputFormat="%Y/%m/%d" hourOutputFormat="%Y/%m/%d %H" minuteOutputFormat="%Y/%m/%d %H:%M" secondOutputFormat="%Y/%m/%d %H:%M:%S" enableYearLevel="true" enableQuarterLevel="false" enableMonthLevel="true" enableWeekLevel="false" enableDayLevel="true" enableHourLevel="false" enableMinuteLevel="false" enableSecondLevel="false" enableISO8601Compliance="true" generateMissingIntervals="true" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></DynamicDateFacet>
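    As an illustration of the level and format attributes, the fragment below is a minimal sketch of a DynamicDateFacet that adds the week level to the default year, month, and day levels. The id, virtualRoot, and expr values are hypothetical, and omitted attributes are assumed to keep the defaults listed above. The week format keeps %V because enableISO8601Compliance is true.

    ```xml
    <!-- Sketch only: id, virtualRoot, and expr are hypothetical names. -->
    <DynamicDateFacet id="publication_date" virtualRoot="Top/PublicationDate" expr="publication_date"
        enableYearLevel="true" enableMonthLevel="true" enableWeekLevel="true" enableDayLevel="true"
        enableISO8601Compliance="true" weekOutputFormat="week %V %Y"
        generateMissingIntervals="true" maxCategories="12"></DynamicDateFacet>
    ```

    Here maxCategories="12" asks the facet to generate at most 12 categories and to pick the resolution accordingly among the enabled levels.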
  • DateFacet com.exalead.search.v30.DateFacet
    Configuration for a facet.
    Parent objects
    • FacetsConfig
    Attributes
    • virtualRoot string
    • id string
    • type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
    • refinementPolicy enum(exclusive, disjunctive, norefine)
    • implementation enum(autodetect, favor_cpu, favor_memory)
    • inNavigation boolean
    • inHits boolean
    • maxSliceCategories int
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • expr string
    • startExpr string
    • endExpr string
    • beforeStart boolean
    • afterEnd boolean
    • withYear boolean
    • yearDesc boolean
    • withMonth boolean
    • monthDesc boolean
    • withWeek boolean
    • weekDesc boolean
    • withDay boolean
    • dayDesc boolean
    • withHour boolean
    • hourDesc boolean
    • withMinute boolean
    • minuteDesc boolean
    • withSecond boolean
    • secondDesc boolean
    • useAsFirstLevel int
    • maxCategoryTreeDepth int
    • maxCategoryTreeDepthFromRoot int
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    • index int
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    virtualRoot
    (UI: Virtual root)
    string - Path for the resulting root 'virtual' category.
    When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path.

    For example: Top/VirtualRoot/[0;5]
    id string - Unique identifier of this facet.

    Used in Mashup Builder and client applications to identify the facet.
    type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue) - Used for convenience in introspection code.
    refinementPolicy
    (UI: Refinement policy)
    enum(exclusive, disjunctive, norefine) exclusive Defines the Refinements panel behavior.
    • Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected.
    • Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected.
    • No refine: Allows you to display all categories without enabling refinements.
    implementation
    (UI: Implementation)
    enum(autodetect, favor_cpu, favor_memory) autodetect Internal algorithm for implementation of this facet.

    Value can be null or one of: autodetect, favor_cpu, favor_memory.
    • "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index).
    • "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set).
    • "autodetect" automatically selects a suitable policy based on the result set and facet data.
    inNavigation
    (UI: Use for navigation)
    boolean True Computes navigation on this facet.
    inHits
    (UI: Use for hit content)
    boolean True Computes the value of this facet for each hit.
    maxSliceCategories
    (UI: Max categories per slices)
    int 0 Maximum number of categories returned by each slice for one facet.
    0 means no limit.

    Warning: Results merged by the search server may be incorrect.
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories by decreasing number of documents that match the query and have this facet.
    • Date: Sorts the categories by:
      • Default with decreasing year, increasing month, increasing day
      • Reverse with increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer and sorts by that number. If the category is prefixed by a number, the prefix is parsed. If parsing fails, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), sorts the categories by increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The weighting method is similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    expr string - The date field or virtual expr on which the synthesis must be performed.
    startExpr string #datetime(0) A CONSTANT virtual expression that evaluates to the time at which the synthesis starts (e.g. #now(), #fromunixts(1304865423) or #datetime(1983,2,28)).
    endExpr string #datetime(10000) A CONSTANT virtual expression that evaluates to the time at which the synthesis ends.
    beforeStart boolean False Creates a virtual category for all dates before the start.
    afterEnd boolean False Creates a virtual category for all dates after the end.
    withYear boolean True Creates a virtual category for each year.
    yearDesc boolean True
    withMonth boolean True Creates a virtual category for each month.
    monthDesc boolean False
    withWeek boolean False Creates a virtual category for each week.
    weekDesc boolean False
    withDay boolean True Creates a virtual category for each day.
    dayDesc boolean False
    withHour boolean False Creates a virtual category for each hour.
    hourDesc boolean False
    withMinute boolean False Creates a virtual category for each minute.
    minuteDesc boolean False
    withSecond boolean False Creates a virtual category for each second.
    secondDesc boolean False
    useAsFirstLevel int 0 Used internally by multidim refinements to tell which level is the first level below root.
    maxCategoryTreeDepth
    (UI: Max. tree depth from current refinement)
    int 0 Maximum depth of the returned category tree.

    Whatever the level of refinement, no more than this number of levels is retrieved (this ensures that maxCategoryTreeDepthFromRoot is not exceeded).
    maxCategoryTreeDepthFromRoot
    (UI: Max. tree depth from root)
    int 0 Maximum depth of the returned category tree, starting from the global root of the category.

    Setting this attribute to N means that no categories will be kept below this level.
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.
    index int - Internal.
    dataModelState string -
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this facet.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this facet.

    Nested elements
    • fromDataModelFacet
    • AdditionalFacetView*
    • AggregationFunction*
    Type Wrapper Description
    Facet fromDataModel
    AdditionalFacetView* -
    AggregationFunction* - A numerical computation to perform on hits matching each value of the facet.
    XML representation
    <DateFacet virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" expr="" startExpr="#datetime(0)" endExpr="#datetime(10000)" beforeStart="false" afterEnd="false" withYear="true" yearDesc="true" withMonth="true" monthDesc="false" withWeek="false" weekDesc="false" withDay="true" dayDesc="false" withHour="false" hourDesc="false" withMinute="false" minuteDesc="false" withSecond="false" secondDesc="false" useAsFirstLevel="0" maxCategoryTreeDepth="0" maxCategoryTreeDepthFromRoot="0" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></DateFacet>
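    As an illustration of the start and end expressions, the fragment below is a minimal sketch of a DateFacet bounded in time. The id, virtualRoot, and expr values are hypothetical, and omitted attributes are assumed to keep the defaults listed above. The startExpr value reuses one of the examples given for that attribute; using #now() as endExpr is an assumption based on endExpr accepting the same kind of constant virtual expression.

    ```xml
    <!-- Sketch only: id, virtualRoot, and expr are hypothetical names. -->
    <!-- Assumption: endExpr accepts #now(), as startExpr does. -->
    <DateFacet id="last_modified" virtualRoot="Top/LastModified" expr="last_modified"
        startExpr="#datetime(1983,2,28)" endExpr="#now()"
        beforeStart="true" afterEnd="false"
        withYear="true" yearDesc="true"
        withMonth="true" monthDesc="false"
        withDay="false"></DateFacet>
    ```

    With beforeStart="true", dates earlier than the start are grouped into one virtual category. With withDay="false", no day-level categories are created.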
  • ValueFacet com.exalead.search.v30.ValueFacet
    Configuration for a facet.
    Parent objects
    • FacetsConfig
    Attributes
    • virtualRoot string
    • id string
    • type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
    • refinementPolicy enum(exclusive, disjunctive, norefine)
    • implementation enum(autodetect, favor_cpu, favor_memory)
    • inNavigation boolean
    • inHits boolean
    • maxSliceCategories int
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • field string
    • hitsMaxCategories int
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    • index int
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    virtualRoot
    (UI: Virtual root)
    string - Path for the resulting root 'virtual' category.
    When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path.

    For example: Top/VirtualRoot/[0;5]
    id string - Unique identifier of this facet.

    Used in Mashup Builder and client applications to identify the facet.
    type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue) - Used for convenience in introspection code.
    refinementPolicy
    (UI: Refinement policy)
    enum(exclusive, disjunctive, norefine) exclusive Defines the Refinements panel behavior.
    • Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected.
    • Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected.
    • No refine: Allows you to display all categories without enabling refinements.
    implementation
    (UI: Implementation)
    enum(autodetect, favor_cpu, favor_memory) autodetect Internal algorithm for implementation of this facet.

    Value can be null or one of: autodetect, favor_cpu, favor_memory.
    • "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index).
    • "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set).
    • "autodetect" automatically selects a suitable policy based on the result set and facet data.
    inNavigation
    (UI: Use for navigation)
    boolean True Computes navigation on this facet.
    inHits
    (UI: Use for hit content)
    boolean True Computes the value of this facet for each hit.
    maxSliceCategories
    (UI: Max categories per slices)
    int 0 Maximum number of categories returned by each slice for one facet.
    0 means no limit.

    Warning: Results merged by the search server may be incorrect.
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories by decreasing number of documents that match the query and have this facet.
    • Date: Sorts the categories by:
      • Default with decreasing year, increasing month, increasing day
      • Reverse with increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer and sorts by that number. If the category is prefixed by a number, the prefix is parsed. If parsing fails, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), sorts the categories by increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The weighting method is similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    field string - The Value field on which to perform the synthesis.
    hitsMaxCategories int 100 Maximum number of categories displayed in hits.
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.
    index int - Internal.
    dataModelState string -
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this facet.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this facet.

    Nested elements
    • fromDataModelFacet
    • AdditionalFacetView*
    • AggregationFunction*
    Type Wrapper Description
    Facet fromDataModel
    AdditionalFacetView* -
    AggregationFunction* - A numerical computation to perform on hits matching each value of the facet.
    XML representation
    <ValueFacet virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" field="" hitsMaxCategories="100" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></ValueFacet>
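    As an illustration of explicit sorting, the fragment below is a minimal sketch of a ValueFacet whose categories follow a fixed order. The id, virtualRoot, field, and category values are hypothetical, and omitted attributes are assumed to keep the defaults listed above. The last value contains a comma, so it is escaped with a backslash as described for explicitSortOrderValues.

    ```xml
    <!-- Sketch only: id, virtualRoot, field, and the listed values are hypothetical. -->
    <ValueFacet id="priority" virtualRoot="Top/Priority" field="priority"
        refinementPolicy="disjunctive"
        sortFunction="explicit"
        explicitSortOrderValues="High,Medium,Low,None\, unassigned"
        hitsMaxCategories="10"></ValueFacet>
    ```

    With refinementPolicy="disjunctive", users can select several of these categories at once, and all categories remain displayed after a selection.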
  • RelatedTermFacet com.exalead.search.v30.RelatedTermFacet
    A facet implementing a related-term specific synthesis.
    Parent objects
    • FacetsConfig
    Attributes
    • field string
    • hitsMaxCategories int
    • virtualRoot string
    • id string
    • type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
    • refinementPolicy enum(exclusive, disjunctive, norefine)
    • implementation enum(autodetect, favor_cpu, favor_memory)
    • inNavigation boolean
    • inHits boolean
    • maxSliceCategories int
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • dictionary string
    • minCorpusFrequency int
    • corpusLowPassFilter double
    • localLowPassFilter double
    • blacklist string
    • maxRelatedTermsHitsPerSlice int
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    • index int
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    field string - The Value field on which to perform the synthesis.
    hitsMaxCategories int 100 Maximum number of categories displayed in hits.
    virtualRoot
    (UI: Virtual root)
    string - Path for the resulting root 'virtual' category.
    When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path.

    For example: Top/VirtualRoot/[0;5]
    id string - Unique identifier of this facet.

    Used in Mashup Builder and client applications to identify the facet.
    type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue) - Used for convenience in introspection code.
    refinementPolicy
    (UI: Refinement policy)
    enum(exclusive, disjunctive, norefine) exclusive Defines the Refinements panel behavior.
    • Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected.
    • Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected.
    • No refine: Allows you to display all categories without enabling refinements.
    implementation
    (UI: Implementation)
    enum(autodetect, favor_cpu, favor_memory) autodetect Internal algorithm for implementation of this facet.

    Value can be null or one of: autodetect, favor_cpu, favor_memory.
    • "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index).
    • "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set).
    • "autodetect" automatically selects a suitable policy based on the result set and facet data.
    inNavigation
    (UI: Use for navigation)
    boolean True Computes navigation on this facet.
    inHits
    (UI: Use for hit content)
    boolean True Computes the value of this facet for each hit.
    maxSliceCategories
    (UI: Max categories per slices)
    int 0 Maximum number of categories returned by each slice for one facet.
    0 means no limit.

    Warning: When a limit is set, results merged by the search server may be incorrect.
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories by decreasing number of documents that match the query and have this facet.
    • Date: Sorts the categories by:
      • Default with decreasing year, increasing month, increasing day
      • Reverse with increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number, the prefix is parsed. If parsing fails, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), sorts the categories by increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents that match the query and have this facet, and the total number of documents that have the facet. The weighting method is similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    dictionary string - The dictionary containing keywords.
    minCorpusFrequency int 2 Minimum number of occurrences in the whole index for a term to be possibly selected for synthesis.
    corpusLowPassFilter double 0.5 Filters out terms occurring more than this threshold in the whole index.
    localLowPassFilter double 0.9 Filters out terms occurring more than this threshold in the result set.
    blacklist string - The related-terms blocklist.
    maxRelatedTermsHitsPerSlice
    (UI: Max hits per slice)
    int 5000 Maximum number of hits per slice used to compute related terms synthesis. Set to 0 to process all hits. Note that the number of hits processed per slice never exceeds the maxHits value of FacetsConfig.
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.
    index int - Internal.
    dataModelState string -
    dataModelClass string - If dataModelState is "auto" or "customized", contains the name of the DataModelClass that generated this facet.
    dataModelProperty string - If dataModelState is "auto" or "customized", contains the name of the DataModelProperty that generated this facet.

    Nested elements
    • fromDataModelFacet
    • AdditionalFacetView*
    • AggregationFunction*
    Type Wrapper Description
    Facet fromDataModel
    AdditionalFacetView* -
    AggregationFunction* - A numerical computation to perform on hits matching each value of the facet.
    XML representation
    <RelatedTermFacet field="" hitsMaxCategories="100" virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" dictionary="" minCorpusFrequency="2" corpusLowPassFilter="0.5" localLowPassFilter="0.9" maxRelatedTermsHitsPerSlice="5000" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></RelatedTermFacet>
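    As an illustration of how the attributes above combine, here is a minimal sketch of a related-terms facet that is stricter than the defaults. The id "related_terms", the field name "text", and the root path "Top/RelatedTerms" are hypothetical values chosen for the example; only the attribute names and enum values come from the tables above.

    ```xml
    <!-- Hypothetical values: "related_terms", "text" and "Top/RelatedTerms" are illustrative only. -->
    <RelatedTermFacet id="related_terms"
                      field="text"
                      virtualRoot="Top/RelatedTerms"
                      refinementPolicy="disjunctive"
                      sortFunction="relevancy"
                      minCorpusFrequency="5"
                      corpusLowPassFilter="0.3"
                      localLowPassFilter="0.9"
                      maxRelatedTermsHitsPerSlice="1000"
                      maxCategories="20" />
    ```

    Compared with the defaults, raising minCorpusFrequency from 2 to 5 and lowering corpusLowPassFilter from 0.5 to 0.3 both narrow the set of terms that can be selected, and lowering maxRelatedTermsHitsPerSlice from 5000 to 1000 reduces the number of hits examined per slice.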
  • ConcatValueFacet com.exalead.search.v30.ConcatValueFacet
    Configuration for a facet.
    Parent objects
    • FacetsConfig
    Attributes
    • virtualRoot string
    • id string
    • type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
    • refinementPolicy enum(exclusive, disjunctive, norefine)
    • implementation enum(autodetect, favor_cpu, favor_memory)
    • inNavigation boolean
    • inHits boolean
    • maxSliceCategories int
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • fieldA string
    • fieldB string
    • hitsMaxCategories int
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    • index int
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    virtualRoot
    (UI: Virtual root)
    string - Path for the resulting root 'virtual' category.
    When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path.

    For example: Top/VirtualRoot/[0;5]
    id string - Unique identifier of this facet.

    Used in Mashup Builder and client applications to identify the facet.
    type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue) Used for convenience in various introspection code
    refinementPolicy
    (UI: Refinement policy)
    enum(exclusive, disjunctive, norefine) exclusive Defines the Refinements panel behavior.
    • Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected.
    • Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected.
    • No refine: Allows you to display all categories without enabling refinements.
    implementation
    (UI: Implementation)
    enum(autodetect, favor_cpu, favor_memory) autodetect Internal algorithm for implementation of this facet.

    Value can be null or one of: autodetect, favor_cpu, favor_memory.
    • "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index).
    • "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set).
    • "autodetect" automatically selects a suitable policy based on the result set and facet data.
    inNavigation
    (UI: Use for navigation)
    boolean True Computes navigation on this facet.
    inHits
    (UI: Use for hit content)
    boolean True Computes the value of this facet for each hit.
    maxSliceCategories
    (UI: Max categories per slices)
    int 0 Maximum number of categories returned by each slice for one facet.
    0 means no limit.

    Warning: When a limit is set, results merged by the search server may be incorrect.
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories by decreasing number of documents that match the query and have this facet.
    • Date: Sorts the categories by:
      • Default with decreasing year, increasing month, increasing day
      • Reverse with increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number, the prefix is parsed. If parsing fails, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), sorts the categories by increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents that match the query and have this facet, and the total number of documents that have the facet. The weighting method is similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    fieldA string - The first Value field on which to perform the synthesis.
    fieldB string - The second Value field on which to perform the synthesis.
    hitsMaxCategories int 100 Maximum number of categories displayed in hits.
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.
    index int - Internal.
    dataModelState string -
    dataModelClass string - If dataModelState is "auto" or "customized", contains the name of the DataModelClass that generated this facet.
    dataModelProperty string - If dataModelState is "auto" or "customized", contains the name of the DataModelProperty that generated this facet.

    Nested elements
    • fromDataModelFacet
    • AdditionalFacetView*
    • AggregationFunction*
    Type Wrapper Description
    Facet fromDataModel
    AdditionalFacetView* -
    AggregationFunction* - A numerical computation to perform on hits matching each value of the facet.
    XML representation
    <ConcatValueFacet virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" fieldA="" fieldB="" hitsMaxCategories="100" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></ConcatValueFacet>
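    A minimal sketch of a ConcatValueFacet that sets the two field attributes together with a few of the display limits described above. The id "brand_model", the field names "brand" and "model", and the root path are hypothetical; they assume that two Value fields with those names exist in the index.

    ```xml
    <!-- Hypothetical values: "brand_model", "brand", "model" and "Top/BrandModel" are illustrative only. -->
    <ConcatValueFacet id="brand_model"
                      fieldA="brand"
                      fieldB="model"
                      virtualRoot="Top/BrandModel"
                      refinementPolicy="exclusive"
                      sortFunction="alphanum"
                      reverse="true"
                      minDocsPerCategory="2"
                      maxCategories="50" />
    ```

    With these settings, categories containing fewer than 2 documents are left out of the Refinements panel, at most 50 categories are displayed, and the ascending default of the alphanum sort is reversed.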
  • LuaFacet com.exalead.search.v30.LuaFacet
    A virtual facet implemented using the Lua scripting language.
    Parent objects
    • FacetsConfig
    Attributes
    • virtualRoot string
    • id string
    • type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
    • refinementPolicy enum(exclusive, disjunctive, norefine)
    • implementation enum(autodetect, favor_cpu, favor_memory)
    • inNavigation boolean
    • inHits boolean
    • maxSliceCategories int
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • luaCode string
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    • index int
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    virtualRoot
    (UI: Virtual root)
    string - Path for the resulting root 'virtual' category.
    When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path.

    For example: Top/VirtualRoot/[0;5]
    id string - Unique identifier of this facet.

    Used in Mashup Builder and client applications to identify the facet.
    type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue) Used for convenience in various introspection code
    refinementPolicy
    (UI: Refinement policy)
    enum(exclusive, disjunctive, norefine) exclusive Defines the Refinements panel behavior.
    • Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected.
    • Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected.
    • No refine: Allows you to display all categories without enabling refinements.
    implementation
    (UI: Implementation)
    enum(autodetect, favor_cpu, favor_memory) autodetect Internal algorithm for implementation of this facet.

    Value can be null or one of: autodetect, favor_cpu, favor_memory.
    • "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index).
    • "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set).
    • "autodetect" automatically selects a suitable policy based on the result set and facet data.
    inNavigation
    (UI: Use for navigation)
    boolean True Computes navigation on this facet.
    inHits
    (UI: Use for hit content)
    boolean True Computes the value of this facet for each hit.
    maxSliceCategories
    (UI: Max categories per slices)
    int 0 Maximum number of categories returned by each slice for one facet.
    0 means no limit.

    Warning: When a limit is set, results merged by the search server may be incorrect.
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories by decreasing number of documents that match the query and have this facet.
    • Date: Sorts the categories by:
      • Default with decreasing year, increasing month, increasing day
      • Reverse with increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number, the prefix is parsed. If parsing fails, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), sorts the categories by increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents that match the query and have this facet, and the total number of documents that have the facet. The weighting method is similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    luaCode string - The Lua code for the facet.
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.
    index int - Internal.
    dataModelState string -
    dataModelClass string - If dataModelState is "auto" or "customized", contains the name of the DataModelClass that generated this facet.
    dataModelProperty string - If dataModelState is "auto" or "customized", contains the name of the DataModelProperty that generated this facet.

    Nested elements
    • fromDataModelFacet
    • AdditionalFacetView*
    • AggregationFunction*
    Type Wrapper Description
    Facet fromDataModel
    AdditionalFacetView* -
    AggregationFunction* - A numerical computation to perform on hits matching each value of the facet.
    XML representation
    <LuaFacet virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" luaCode="" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></LuaFacet>
  • Hierarchical2DFacet com.exalead.search.v30.Hierarchical2DFacet
    Configuration for a facet.
    Parent objects
    • FacetsConfig
    Attributes
    • virtualRoot string
    • id string
    • type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
    • refinementPolicy enum(exclusive, disjunctive, norefine)
    • implementation enum(autodetect, favor_cpu, favor_memory)
    • inNavigation boolean
    • inHits boolean
    • maxSliceCategories int
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • id1 string
    • id2 string
    • withDimSwitch boolean
    • secondarySortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • secondaryReverse boolean
    • secondarySortAggregationFunction string
    • secondaryExplicitSortOrderValues string
    • hierarchical2DFacetRepresentation boolean
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    • index int
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    virtualRoot
    (UI: Virtual root)
    string - Path for the resulting root 'virtual' category.
    When using a virtual facet (a facet which is not stored in a category field), all created categories are attached to a virtual root path.

    For example: Top/VirtualRoot/[0;5]
    id string - Unique identifier of this facet.

    Used in Mashup Builder and client applications to identify the facet.
    type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue) Used for convenience in various introspection code
    refinementPolicy
    (UI: Refinement policy)
    enum(exclusive, disjunctive, norefine) exclusive Defines the Refinements panel behavior.
    • Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected.
    • Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected.
    • No refine: Allows you to display all categories without enabling refinements.
    implementation
    (UI: Implementation)
    enum(autodetect, favor_cpu, favor_memory) autodetect Internal algorithm for implementation of this facet.

    Value can be null or one of: autodetect, favor_cpu, favor_memory.
    • "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index).
    • "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set).
    • "autodetect" automatically selects a suitable policy based on the result set and facet data.
    inNavigation
    (UI: Use for navigation)
    boolean True Computes navigation on this facet.
    inHits
    (UI: Use for hit content)
    boolean True Computes the value of this facet for each hit.
    maxSliceCategories
    (UI: Max categories per slices)
    int 0 Maximum number of categories returned by each slice for one facet.
    0 means no limit.

    Warning: When a limit is set, results merged by the search server may be incorrect.
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories by decreasing number of documents that match the query and have this facet.
    • Date: Sorts the categories by:
      • Default with decreasing year, increasing month, increasing day
      • Reverse with increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number, the prefix is parsed. If parsing fails, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), sorts the categories by increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents that match the query and have this facet, and the total number of documents that have the facet. The weighting method is similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    id1 string - ID of the first facet on which to perform the synthesis.
    id2 string - ID of the second facet on which to perform the synthesis.
    withDimSwitch boolean True Adds a '__dimension_switch__' virtual category between categories of the first and second dimensions.
    secondarySortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count The function to use to sort on the second dimension (see 'sortFunction' in Facet).
    secondaryReverse boolean False Reverses the secondary sort (see 'reverse' in Facet).
    secondarySortAggregationFunction string - The aggregation function used to sort by aggregation values.
    secondaryExplicitSortOrderValues string - The ordered list of values used for explicit sort on the second dimension, comma-separated (use the backslash to escape a comma).
    hierarchical2DFacetRepresentation boolean True Enables the use of asHierarchical2DFacet() in search client.
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.
    index int - Internal.
    dataModelState string -
    dataModelClass string - If dataModelState is "auto" or "customized", contains the name of the DataModelClass that generated this facet.
    dataModelProperty string - If dataModelState is "auto" or "customized", contains the name of the DataModelProperty that generated this facet.

    Nested elements
    • fromDataModelFacet
    • AdditionalFacetView*
    • AggregationFunction*
    Type Wrapper Description
    Facet fromDataModel
    AdditionalFacetView* -
    AggregationFunction* - A numerical computation to perform on hits matching each value of the facet.
    XML representation
    <Hierarchical2DFacet virtualRoot="" id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" id1="" id2="" withDimSwitch="true" secondarySortFunction="count" secondaryReverse="false" hierarchical2DFacetRepresentation="true" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></Hierarchical2DFacet>
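    A minimal sketch of a two-dimensional facet with a different sort on each dimension. The ids "country_by_year", "country", and "year" and the root path are hypothetical; the example assumes that two facets with the ids "country" and "year" are already declared in the same FacetsConfig.

    ```xml
    <!-- Hypothetical values: "country_by_year", "country", "year" and "Top/CountryByYear" are illustrative only. -->
    <Hierarchical2DFacet id="country_by_year"
                         id1="country"
                         id2="year"
                         virtualRoot="Top/CountryByYear"
                         withDimSwitch="true"
                         sortFunction="count"
                         secondarySortFunction="alphanum"
                         secondaryReverse="true"
                         hierarchical2DFacetRepresentation="true" />
    ```

    Here sortFunction applies to the first dimension and secondarySortFunction with secondaryReverse to the second one, so the first dimension is ordered by document count while the second is ordered lexicographically with the ascending default reversed.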
  • OptimizedEnumFacet com.exalead.search.v30.OptimizedEnumFacet
    Configuration for a facet.
    Parent objects
    • FacetsConfig
    Attributes
    • id string
    • type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
    • refinementPolicy enum(exclusive, disjunctive, norefine)
    • implementation enum(autodetect, favor_cpu, favor_memory)
    • inNavigation boolean
    • inHits boolean
    • maxSliceCategories int
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • virtualRoot string
    • enumFacetId string
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    • index int
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    id string - Unique identifier of this facet.

    Used in Mashup Builder and client applications to identify the facet.
    type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue) Used for convenience in various introspection code
    refinementPolicy
    (UI: Refinement policy)
    enum(exclusive, disjunctive, norefine) exclusive Defines the Refinements panel behavior.
    • Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected.
    • Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected.
    • No refine: Allows you to display all categories without enabling refinements.
    implementation
    (UI: Implementation)
    enum(autodetect, favor_cpu, favor_memory) autodetect Internal algorithm for implementation of this facet.

    Value can be null or one of: autodetect, favor_cpu, favor_memory.
    • "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index).
    • "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set).
    • "autodetect" automatically selects a suitable policy based on the result set and facet data.
    inNavigation
    (UI: Use for navigation)
    boolean True Computes navigation on this facet.
    inHits
    (UI: Use for hit content)
    boolean True Computes the value of this facet for each hit.
    maxSliceCategories
    (UI: Max categories per slices)
    int 0 Maximum number of categories returned by each slice for one facet.
    0 means no limit.

    Warning: Results merged by the search server may be incorrect.
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories by decreasing number of documents that match the query and have this facet.
    • Date: Sorts the categories by:
      • Default with decreasing year, increasing month, increasing day
      • Reverse with increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number, the prefix is parsed. If parsing fails, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a weighting scheme similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    virtualRoot string - The root of the resulting 'virtual' category.
    enumFacetId string - Id of the referred EnumFacet.
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.
    index int - Internal.
    dataModelState string -
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

    Nested elements
    • fromDataModelFacet
    • AdditionalFacetView*
    • AggregationFunction*
    Type Wrapper Description
    Facet fromDataModel
    AdditionalFacetView* -
    AggregationFunction* - A numerical computation to perform on hits matching each value of the facet.
    XML representation
    <OptimizedEnumFacet id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" virtualRoot="" enumFacetId="" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></OptimizedEnumFacet>
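    As an illustration of the sorting and refinement attributes above, the following sketch configures an explicit category order with multi-select refinement. The facet id, the enumFacetId, and the category values are hypothetical; only the attribute names come from this reference. The backslash keeps "Paris, France" as a single value in the comma-separated list.

    ```xml
    <!-- Hypothetical ids and values, for illustration only -->
    <OptimizedEnumFacet id="city_enum" enumFacetId="city"
        refinementPolicy="disjunctive"
        sortFunction="explicit"
        explicitSortOrderValues="Paris\, France,London,Berlin"
        maxCategories="10" >
    </OptimizedEnumFacet>
    ```

    Attributes that are left out keep the default values listed in the table above.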
  • MultiDimensionFacet com.exalead.search.v30.MultiDimensionFacet
    Configuration for a facet.
    Parent objects
    • FacetsConfig
    Attributes
    • id string
    • type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue)
    • refinementPolicy enum(exclusive, disjunctive, norefine)
    • implementation enum(autodetect, favor_cpu, favor_memory)
    • inNavigation boolean
    • inHits boolean
    • maxSliceCategories int
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • aggregateScores boolean
    • virtualRoot string
    • additionalTreeRepresentation boolean
    • withAdditionalTreeRepresentationDimSwitch boolean
    • minDocsPerCategory int
    • maxCategories int
    • maxCategoriesPerLevel int
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • reverse boolean
    • index int
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    id string - Unique identifier of this facet.

    Used in Mashup-Builder and client applications to identify the facet.
    type enum(category, h2d, num_explicit, num_fixed, num_dynamic, geo, autotile, value, multi, enum, date, dyndate, autotilev2, concatvalue) Used for convenience in various introspection code
    refinementPolicy
    (UI: Refinement policy)
    enum(exclusive, disjunctive, norefine) exclusive Defines the Refinements panel behavior.
    • Disjunctive: Allows you to select multiple categories at once for refinement (it creates an OR operation). Continues to display all categories when one is selected.
    • Exclusive: Only allows you to select one category for refinement. Hides other categories when one is selected.
    • No refine: Allows you to display all categories without enabling refinements.
    implementation
    (UI: Implementation)
    enum(autodetect, favor_cpu, favor_memory) autodetect Internal algorithm for implementation of this facet.

    Value can be null or one of:
    • autodetect
    • favor_cpu
    • favor_memory
    • "favor_cpu" consumes less CPU, at the expense of additional RAM consumption (RAM is consumed for each facet value in the index).
    • "favor_memory" reduces RAM consumption (RAM is only consumed for each facet value in the result set).
    • "autodetect" automatically selects a suitable policy based on the result set and facet data.
    inNavigation
    (UI: Use for navigation)
    boolean True Computes navigation on this facet.
    inHits
    (UI: Use for hit content)
    boolean True Computes the value of this facet for each hit.
    maxSliceCategories
    (UI: Max categories per slices)
    int 0 Maximum number of categories returned by each slice for one facet.
    0 means no limit.

    Warning: Results merged by the search server may be incorrect.
    sortFunction
    (UI: Sort by)
    enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count Specifies a sorting function:
    • Aggregation Value: Sorts the categories using one of the aggregation functions specified in "Aggregations". The "Function Name" field that appears below lets you enter the aggregation function name. Default sorting direction is descending.
    • Alphanum: Sorts the categories lexicographically. The category path is used here, not the title. Default sorting direction is ascending.
    • Count: Sorts the categories by decreasing number of documents that match the query and have this facet.
    • Date: Sorts the categories by:
      • Default with decreasing year, increasing month, increasing day
      • Reverse with increasing year, decreasing month, decreasing day
    • Explicit order: Sorts the categories using an explicit order. The "Explicit order" field that appears below lets you specify a comma-separated list of values.
    • Latitude: Sorts the categories by latitude, using the average of points.
    • Longitude: Sorts the categories by longitude, using the average of points.
    • Num: Tries to parse the category path as an integer, and sorts decreasingly. If the category is prefixed by a number, the prefix is parsed. If parsing fails, it falls back to lexicographical sorting. Default sorting direction is ascending.
    • Range: If the categories are ranges in the form [a;b] (as for NumericalFacet), it sorts the categories per increasing midrange value. Default sorting direction is ascending.
    • Relevance: Sorts the categories by decreasing relevance. Relevance takes into account both the number of documents matching the query and having this facet, and the total number of documents having the facet. The idea is to use a weighting scheme similar to classical TF-IDF.
    aggregateScores
    (UI: Aggregate scores)
    boolean False If selected, the category score will be the sum of the scores of all its hits.
    virtualRoot string - The virtual root used for the tree representation.
    additionalTreeRepresentation boolean False
    withAdditionalTreeRepresentationDimSwitch boolean False Adds a '__dimension_switch__' virtual category between each dimension of the tree representation.

    This is required if you are working with hierarchical dimensions.
    minDocsPerCategory
    (UI: Min. docs per category)
    int 1 Minimum number of docs in a category for it to be included in the Refinements panel.
    maxCategories
    (UI: Max. categories)
    int 0 Maximum number of categories to display in the facet.
    0 means all categories will be displayed.
    maxCategoriesPerLevel
    (UI: Max. categories per level)
    int 100 Maximum number of facet values in each level of the categories hierarchy.
    0 means that all values are selected.

    The sortFunction is used to select the "best" facet values for each level.
    sortAggregationFunction
    (UI: Aggregation function)
    string - Aggregation function used to sort by aggregation values.
    explicitSortOrderValues
    (UI: Explicit order)
    string - Ordered list of values used for explicit sort, comma-separated (use the backslash to escape a comma).
    reverse
    (UI: Reverse)
    boolean False Reverse sorting order.
    index int - Internal.
    dataModelState string -
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this facet.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this facet.

    Nested elements
    • fromDataModelFacet
    • AdditionalFacetView*
    • AggregationFunction*
    • MultiFacetDimension*
    Type Wrapper Description
    Facet fromDataModel
    AdditionalFacetView* -
    AggregationFunction* - A numerical computation to perform on hits matching each value of the facet.
    MultiFacetDimension* -
    XML representation
    <MultiDimensionFacet id="" type="" refinementPolicy="exclusive" implementation="autodetect" inNavigation="true" inHits="true" maxSliceCategories="0" sortFunction="count" aggregateScores="false" virtualRoot="" additionalTreeRepresentation="false" withAdditionalTreeRepresentationDimSwitch="false" minDocsPerCategory="1" maxCategories="0" maxCategoriesPerLevel="100" reverse="false" index="" dataModelState="" dataModelClass="" dataModelProperty="" ></MultiDimensionFacet>
  • MultiFacetDimension com.exalead.search.v30.MultiFacetDimension
    Determines the features of one dimension for a MultiDimensionFacet.
    Parent objects
    • MultiDimensionFacet
    Attributes
    • id string
    • sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng)
    • reverse boolean
    • sortAggregationFunction string
    • explicitSortOrderValues string
    • maxCategories int
    Name Type Default value Description
    id string - The existing 1d facet id to use for this dimension.
    sortFunction enum(count, relevancy, alphanum, num, range, date, aggregation, explicit, lat, lng) count The sort function on this dimension (does not apply to other dimensions).
    reverse boolean False Reverses the sort (see 'reverse' in Facet).
    sortAggregationFunction string - The aggregation function used to sort by aggregation values.
    explicitSortOrderValues string - The ordered list of values used to sort, comma-separated (use the backslash to escape a comma).
    maxCategories int 0 The maximum number of categories after sorting in this dimension.

    XML representation
    <MultiFacetDimension id="" sortFunction="count" reverse="false" maxCategories="0" ></MultiFacetDimension>
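    A minimal sketch of a two-dimension facet, assuming two existing one-dimensional facets with the hypothetical ids "brand" and "year". Each MultiFacetDimension is nested in the MultiDimensionFacet and carries its own sort settings, which do not apply to the other dimension.

    ```xml
    <!-- Hypothetical facet ids; "brand" and "year" are assumed to exist as 1d facets -->
    <MultiDimensionFacet id="brand_by_year" refinementPolicy="exclusive" sortFunction="count" >
        <!-- First dimension: sorted by count, limited to 10 categories after sorting -->
        <MultiFacetDimension id="brand" sortFunction="count" maxCategories="10" ></MultiFacetDimension>
        <!-- Second dimension: Num sort with the order reversed, default maxCategories -->
        <MultiFacetDimension id="year" sortFunction="num" reverse="true" maxCategories="0" ></MultiFacetDimension>
    </MultiDimensionFacet>
    ```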
  • HitConfig com.exalead.search.v30.HitConfig
    No documentation for this element.
    Parent objects
    • SearchLogic
    Attributes
    • fullHits int
    Name Type Default value Description
    fullHits int 10 Number of full hits to retrieve per page.
    -1 means unlimited.

    Nested elements
    • AdvancedPipeline
    • DataModelClassMetaInformation
    • Meta*
    • Metas*
    Type Wrapper Description
    AdvancedPipeline -
    DataModelClassMetaInformation -
    Meta* -
    Metas* -
    XML representation
    <HitConfig fullHits="10" ></HitConfig>
  • AdvancedPipeline com.exalead.search.v30.AdvancedPipeline
    No documentation for this element.
    Nested elements
    • CSVMultiField*
    • HitOperation*
    • MultiContextField*
    • RankingField*
    • RetrievedField*
    Type Wrapper Description
    CSVMultiField* -
    HitOperation* -
    MultiContextField* -
    RankingField* -
    RetrievedField* -
    XML representation
    <AdvancedPipeline ></AdvancedPipeline>
  • CSVMultiField com.exalead.search.v30.CSVMultiField
    No documentation for this element.
    Parent objects
    • AdvancedPipeline
    Attributes
    • indexField string
    Name Type Default value Description
    indexField string - CSV index field to retrieve.

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* -
    XML representation
    <CSVMultiField indexField="" ></CSVMultiField>
  • StandardDecodingOperation com.exalead.search.v30.StandardDecodingOperation
    Decodes URL- or IDNA-encoded strings.
    Parent objects
    • Meta
    Attributes
    • name string
    • encoding enum(url, idna)
    Name Type Default value Description
    name string - Name of the meta subject of the operation.

    Not required when this is inside a 'Meta' tag.
    encoding enum(url, idna) Value can be one of
    • url
    • idna

    XML representation
    <StandardDecodingOperation name="" encoding="" ></StandardDecodingOperation>
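    For illustration, a meta holding URL-encoded values could be decoded as follows. The meta name "url" is hypothetical. Because the operation is nested in a Meta element, its own name attribute is omitted, as noted in the table above.

    ```xml
    <!-- Hypothetical meta name -->
    <Meta name="url" >
        <StandardDecodingOperation encoding="url" ></StandardDecodingOperation>
    </Meta>
    ```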
  • ValueDisplaySelectionOperation com.exalead.search.v30.ValueDisplaySelectionOperation
    No documentation for this element.
    Parent objects
    • Meta
    Attributes
    • name string
    • minValues int
    • maxValues int
    • maxBytesToProcess int
    • relaxAndNodesPolicy enum(never, toplevel, always)
    • facetIds string
    • extraPrefixHandlers string
    • analysisConfig string
    • analysisPipeline string
    Name Type Default value Description
    name string - Name of the meta subject of the operation.

    Not required when this is inside a 'Meta' tag.
    minValues int 0 Minimum number of values to select.
    maxValues int 50 Maximum number of values to select.
    maxBytesToProcess int 131072 Maximum number of bytes to process in the input text. The remaining bytes are not taken into account for the value selection operation. (0=deactivated)
    relaxAndNodesPolicy enum(never, toplevel, always) never Relax query AND nodes (handle them as if they were OR nodes).
    Value can be null or one of
    • never
    • toplevel
    • always
    facetIds string
    extraPrefixHandlers string
    analysisConfig string - Analysis config in which analysisPipeline is searched for tokenization config and extra semantic processors.
    analysisPipeline string - Pipeline in analysisConfig searched for tokenization config and extra semantic processors.

    XML representation
    <ValueDisplaySelectionOperation name="" minValues="0" maxValues="50" maxBytesToProcess="131072" relaxAndNodesPolicy="never" facetIds="" extraPrefixHandlers="" analysisConfig="" analysisPipeline="" ></ValueDisplaySelectionOperation>
  • SnippetOperation com.exalead.search.v30.SnippetOperation
    No documentation for this element.
    Parent objects
    • Meta
    Attributes
    • name string
    • minLength int
    • maxLength int
    • maxSentenceSegmentLength int
    • minNbWordsInRelevantSentence int
    • maxSentenceSegments int
    • maxBytesToProcess int
    • removeDuplicateSegments boolean
    • maxConsecutiveSeparators int
    • splitOnSentences boolean
    • highlight boolean
    • minimalPatternHighlight boolean
    • relaxAndNodesPolicy enum(never, toplevel, always)
    • ifMeta string
    • ifMetasMatch string
    • highlightFacetIds string
    • highlightExtraPrefixHandlers string
    • analysisConfig string
    • analysisPipeline string
    Name Type Default value Description
    name string - Name of the meta subject of the operation.

    Not required when this is inside a 'Meta' tag.
    minLength int 100 Minimum length of the snippet (in characters).
    maxLength int 150 Maximum length of the snippet (in characters).
    maxSentenceSegmentLength int 150 Maximum length of sentence segments in the snippet (in characters).
    minNbWordsInRelevantSentence int 10 Minimum number of words for a sentence to be considered for snippetization.
    maxSentenceSegments int 3 Maximum number of sentence segments in the snippet.
    maxBytesToProcess int 131072 Maximum number of bytes to process in the input text. The remaining bytes are not taken into account for the snippet operation. (0=deactivated)
    removeDuplicateSegments boolean False Remove duplicate segments from the snippet.

    Using this option with a large number of segments may have a negative impact on performance.
    maxConsecutiveSeparators int 0 Maximum number of consecutive separators. When there are more than this number of separators, they are replaced with a single space character.
    splitOnSentences boolean True Don't span over sentence boundaries.
    highlight boolean True Highlight the snippet's tokens matching the query.
    minimalPatternHighlight boolean False If true, only highlights the alphanumerical part of the pattern. For example, *foo* will only highlight foo in the word football.
    relaxAndNodesPolicy enum(never, toplevel, always) toplevel Relax query AND nodes (handle them as if they were OR nodes).
    Value can be null or one of
    • never
    • toplevel
    • always
    ifMeta string - Fallback meta used if one of the ifMetasMatch metas matches the query.
    ifMetasMatch string Comma-separated list of metas to test against the user query.

    If a match is found, the summary is filled with the ifMeta value.
    highlightFacetIds string Comma-separated list of facet ids for which you want to highlight the values of the refines in the current meta.
    highlightExtraPrefixHandlers string Comma-separated list of prefix handlers for which you want to highlight the query in the current meta.
    For example, with highlightExtraPrefixHandlers="spellslike", the query spellslike:foo highlights foo and its alternate forms in the current meta.
    analysisConfig string - Analysis config in which analysisPipeline is searched for tokenization config and extra semantic processors.
    analysisPipeline string - Pipeline in analysisConfig searched for tokenization config and extra semantic processors.

    XML representation
    <SnippetOperation name="" minLength="100" maxLength="150" maxSentenceSegmentLength="150" minNbWordsInRelevantSentence="10" maxSentenceSegments="3" maxBytesToProcess="131072" removeDuplicateSegments="false" maxConsecutiveSeparators="0" splitOnSentences="true" highlight="true" minimalPatternHighlight="false" relaxAndNodesPolicy="toplevel" ifMetasMatch="" highlightFacetIds="" highlightExtraPrefixHandlers="" analysisConfig="" analysisPipeline="" ></SnippetOperation>
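    A sketch of a snippet configuration, assuming hypothetical metas named "text", "title", and "description". With these settings, if the user query matches the "title" meta, the summary is filled with the value of "description", following the ifMeta and ifMetasMatch descriptions above. The numeric values are arbitrary examples.

    ```xml
    <!-- Hypothetical meta names; lengths are example values -->
    <Meta name="text" >
        <SnippetOperation minLength="80" maxLength="200" maxSentenceSegments="2"
            removeDuplicateSegments="true"
            ifMetasMatch="title" ifMeta="description" ></SnippetOperation>
    </Meta>
    ```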
  • OctreeJSONOperation com.exalead.search.v30.OctreeJSONOperation
    No documentation for this element.
    Parent objects
    • Meta
    Attributes
    • name string
    Name Type Default value Description
    name string - Name of the meta subject of the operation.

    Not required when this is inside a 'Meta' tag.

    XML representation
    <OctreeJSONOperation name="" ></OctreeJSONOperation>
  • MatrixOperation com.exalead.search.v30.MatrixOperation
    No documentation for this element.
    Parent objects
    • Meta
    Attributes
    • name string
    Name Type Default value Description
    name string - Name of the meta subject of the operation.

    Not required when this is inside a 'Meta' tag.

    XML representation
    <MatrixOperation name="" ></MatrixOperation>
  • DateFormatOperation com.exalead.search.v30.DateFormatOperation
    No documentation for this element.
    Parent objects
    • Meta
    Attributes
    • name string
    • outputFormat string
    Name Type Default value Description
    name string - Name of the meta subject of the operation.

    Not required when this is inside a 'Meta' tag.
    outputFormat string %Y/%m/%d Date format

    XML representation
    <DateFormatOperation name="" outputFormat="%Y/%m/%d" ></DateFormatOperation>
  • TimeFormatOperation com.exalead.search.v30.TimeFormatOperation
    No documentation for this element.
    Parent objects
    • Meta
    Attributes
    • name string
    • outputFormat string
    Name Type Default value Description
    name string - Name of the meta subject of the operation.

    Not required when this is inside a 'Meta' tag.
    outputFormat string %Y/%m/%d %H:%M:%S Datetime format

    XML representation
    <TimeFormatOperation name="" outputFormat="%Y/%m/%d %H:%M:%S" ></TimeFormatOperation>
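    The default values of DateFormatOperation and TimeFormatOperation suggest strftime-style conversion specifiers. Under that assumption, which this reference does not state explicitly, custom output formats could be sketched as follows. The meta names are hypothetical.

    ```xml
    <!-- Hypothetical meta names; specifiers assume strftime-style semantics -->
    <HitConfig fullHits="10" >
        <Meta name="lastmodifieddate" >
            <DateFormatOperation outputFormat="%d/%m/%Y" ></DateFormatOperation>
        </Meta>
        <Meta name="crawl_time" >
            <TimeFormatOperation outputFormat="%Y-%m-%d %H:%M" ></TimeFormatOperation>
        </Meta>
    </HitConfig>
    ```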
  • PrintfOperation com.exalead.search.v30.PrintfOperation
    No documentation for this element.
    Parent objects
    • Meta
    Attributes
    • name string
    • outputFormat string
    Name Type Default value Description
    name string - Name of the meta subject of the operation.

    Not required when this is inside a 'Meta' tag.
    outputFormat string - Prints Meta according to this format, controls the output as in C printf.

    XML representation
    <PrintfOperation name="" outputFormat="" ></PrintfOperation>
  • HighlightOperation com.exalead.search.v30.HighlightOperation
    No documentation for this element.
    Parent objects
    • Meta
    Attributes
    • name string
    • highlightFacetIds string
    • highlightExtraPrefixHandlers string
    • minNbWordsInRelevantSentence int
    • maxBytesToProcess int
    • maxConsecutiveSeparators int
    • relaxAndNodesPolicy enum(never, toplevel, always)
    • minimalPatternHighlight boolean
    • analysisConfig string
    • analysisPipeline string
    Name Type Default value Description
    name string - Name of the meta subject of the operation.

    Not required when this is inside a 'Meta' tag.
    highlightFacetIds string
    highlightExtraPrefixHandlers string
    minNbWordsInRelevantSentence int 10 Minimum number of words for a sentence to be considered for snippetization.
    maxBytesToProcess int 131072 Maximum number of bytes to process in the input text. The remaining bytes are not taken into account for the operation. (0=deactivated)
    maxConsecutiveSeparators int 0 Maximum number of consecutive separators. When there are more than this number of separators, they are replaced with a single space character.
    relaxAndNodesPolicy enum(never, toplevel, always) toplevel Relax query AND nodes (handle them as if they were OR nodes).
    Value can be null or one of
    • never
    • toplevel
    • always
    minimalPatternHighlight boolean False If true, only highlights the alphanumerical part of the pattern. For example, *foo* will only highlight foo in the word football.
    analysisConfig string - Analysis config in which analysisPipeline is searched for tokenization config and extra semantic processors.
    analysisPipeline string - Pipeline in analysisConfig searched for tokenization config and extra semantic processors.

    XML representation
    <HighlightOperation name="" highlightFacetIds="" highlightExtraPrefixHandlers="" minNbWordsInRelevantSentence="10" maxBytesToProcess="131072" maxConsecutiveSeparators="0" relaxAndNodesPolicy="toplevel" minimalPatternHighlight="false" analysisConfig="" analysisPipeline="" ></HighlightOperation>
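    As a sketch, the following highlights query matches in a hypothetical "title" meta. With minimalPatternHighlight enabled, a pattern query such as *foo* highlights only foo in the word football, as described above. Setting relaxAndNodesPolicy to "always" is an example choice, not a recommendation.

    ```xml
    <!-- Hypothetical meta name -->
    <Meta name="title" >
        <HighlightOperation minimalPatternHighlight="true" relaxAndNodesPolicy="always" ></HighlightOperation>
    </Meta>
    ```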
  • TruncateOperation com.exalead.search.v30.TruncateOperation
    Truncates a meta value if it exceeds a configured threshold value.
    Parent objects
    • Meta
    Attributes
    • name string
    • maxLength int
    • strict boolean
    Name Type Default value Description
    name string - Name of the meta subject of the operation.

    Not required when this is inside a 'Meta' tag.
    maxLength int 150 Max character length after which the content will be truncated.
    strict boolean False Truncates even if a blank character is not met.

    XML representation
    <TruncateOperation name="" maxLength="150" strict="false" ></TruncateOperation>
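    For illustration, the following truncates a hypothetical "description" meta once it exceeds 80 characters. With strict set to true, the value is truncated even if no blank character is met.

    ```xml
    <!-- Hypothetical meta name; 80 is an example threshold -->
    <Meta name="description" >
        <TruncateOperation maxLength="80" strict="true" ></TruncateOperation>
    </Meta>
    ```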
  • CustomMetaOperation com.exalead.search.v30.CustomMetaOperation
    No documentation for this element.
    Parent objects
    • Meta
    Attributes
    • name string
    • classId string
    Name Type Default value Description
    name string - Name of the meta subject of the operation.

    Not required when this is inside a 'Meta' tag.
    classId string - The fully-qualified name of the class performing the operation.

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* -
    XML representation
    <CustomMetaOperation name="" classId="" ></CustomMetaOperation>
  • InsertHitInfoFromFacets com.exalead.search.v30.InsertHitInfoFromFacets
    Insert hit information retrieved from facets.
    Nested elements
    • replaceKeyValue*
    • addKeyValue*
    Type Wrapper Description
    KeyValue* replace Key = hit info key to create.
    Value = facet id to fetch from then remove
    KeyValue* add Key = hit info key to create.
    Value = facet id to fetch from
    XML representation
    <InsertHitInfoFromFacets ></InsertHitInfoFromFacets>
  • SourceFetchabilityInfosInserter com.exalead.search.v30.SourceFetchabilityInfosInserter
    Inserts into the hit information whether the source supports fetch, thumbnails, and preview.
    XML representation
    <SourceFetchabilityInfosInserter ></SourceFetchabilityInfosInserter>
  • CustomHitOperation com.exalead.search.v30.CustomHitOperation
    No documentation for this element.
    Parent objects
    • AdvancedPipeline
    Attributes
    • classId string
    Name Type Default value Description
    classId string - The fully-qualified name of the class performing the operation.

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* -
    XML representation
    <CustomHitOperation classId="" ></CustomHitOperation>
  • SimilarDocumentOperation com.exalead.search.v30.SimilarDocumentOperation
    No documentation for this element.
    Parent objects
    • AdvancedPipeline
    Attributes
    • prefixHandlerName string
    • metaName string
    • minNumberOfKeywordPerDoc long
    • similarityVirtualField string
    • similarityThreshold double
    • languageConstraint boolean
    • keywordsMetaName string
    Name Type Default value Description
    prefixHandlerName string similar The prefix handler name used in the generated query.
    metaName string similardoc_query Name of the generated meta containing the resulting query.
    minNumberOfKeywordPerDoc long 4 Documents with fewer keywords than this constant are ignored.
    Value must be greater than or equal to 0.
    similarityVirtualField string similarity Name of the virtual field computing the final similarity score.
    similarityThreshold double - The minimum similarity score for two documents to be considered similar.
    Value must be between 0 and 1.
    languageConstraint boolean True Forces similar documents to be in the same language.
    keywordsMetaName string keyword Name of the retrieved meta containing the keywords.

    XML representation
    <SimilarDocumentOperation prefixHandlerName="similar" metaName="similardoc_query" minNumberOfKeywordPerDoc="4" similarityVirtualField="similarity" similarityThreshold="" languageConstraint="true" keywordsMetaName="keyword" ></SimilarDocumentOperation>
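    A minimal sketch of a similar-document setup nested in an AdvancedPipeline. The threshold 0.6 and the keyword minimum 6 are arbitrary example values within the documented ranges; the remaining attributes keep their defaults.

    ```xml
    <!-- Example values only; similarityThreshold must be between 0 and 1 -->
    <HitConfig fullHits="10" >
        <AdvancedPipeline>
            <SimilarDocumentOperation similarityThreshold="0.6"
                minNumberOfKeywordPerDoc="6"
                languageConstraint="false" ></SimilarDocumentOperation>
        </AdvancedPipeline>
    </HitConfig>
    ```

    With languageConstraint set to false, similar documents are no longer forced to be in the same language.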
  • MultiContextField com.exalead.search.v30.MultiContextField
    No documentation for this element.
    Parent objects
    • AdvancedPipeline
    Attributes
    • indexField string
    • name string
    • type string
    • contextName string
    • keepMetaName boolean
    Name Type Default value Description
    indexField string - Index field or Virtual Expression to retrieve.
    name string - Name of the meta to create from this field.

    If not set, the index field name is used.
    type string - The type of the field to retrieve.
    contextName string -
    keepMetaName boolean True When set to true, a key:value stored in the field creates a meta "key" whose value is "value". Otherwise, it generates a meta "name" whose value is "value".

    Nested elements
    • MatchRule*
    Type Wrapper Description
    MatchRule* - List of rules to define which document metas should be mapped to this dynamic property.
    XML representation
    <MultiContextField indexField="" name="" type="" contextName="" keepMetaName="true" ></MultiContextField>
  • MatchRule com.exalead.search.v30.MatchRule
    No documentation for this element.
    Parent objects
    • DynamicFieldElementSource
    • MultiContextField
    Attributes
    • pattern string
    • mode enum(prefix, suffix, substring, exact)
    Name Type Default value Description
    pattern string - If null, the property name is used.
    mode enum(prefix, suffix, substring, exact)

    XML representation
    <MatchRule pattern="" mode="" ></MatchRule>
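    A sketch of a MultiContextField with one MatchRule, assuming a hypothetical index field "dynamic_metas" and metas whose names start with "author_". The exact matching semantics of each mode are not detailed in this reference, so the prefix interpretation is an assumption.

    ```xml
    <!-- Hypothetical index field and pattern -->
    <AdvancedPipeline>
        <MultiContextField indexField="dynamic_metas" keepMetaName="true" >
            <MatchRule pattern="author_" mode="prefix" ></MatchRule>
        </MultiContextField>
    </AdvancedPipeline>
    ```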
  • RankingField com.exalead.search.v30.RankingField
    No documentation for this element.
    Parent objects
    • AdvancedPipeline
    Attributes
    • expr string
    Name Type Default value Description
    expr string -

    XML representation
    <RankingField expr="" ></RankingField>
  • RetrievedField com.exalead.search.v30.RetrievedField
    No documentation for this element.
    Parent objects
    • AdvancedPipeline
    Attributes
    • indexField string
    • name string
    • type string
    • multivalued boolean
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    indexField string - Index field or Virtual Expression to retrieve.
    name string - Name of the meta to create from this field.

    If not set, the index field name is used.
    type string - The type of the field to retrieve.
    multivalued boolean False Whether the field to retrieve is multivalued.
    dataModelState string - Indicates whether this retrieved field is managed by a data model. Possible values: null, auto, customized, error.
    If null, this retrieved field is not related to a data model.
    If "auto", this retrieved field is auto-generated by a data model.
    If "customized", this retrieved field was auto-generated by a data model and then customized.
    If "error", there is a conflict between this retrieved field and the data model.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this RetrievedField.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this RetrievedField.

    Nested elements
    • fromDataModelRetrievedField
    Type Wrapper Description
    RetrievedField fromDataModel If dataModelState is "customized", you will find here the original retrieved field generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <RetrievedField indexField="" name="" type="" multivalued="false" dataModelState="" dataModelClass="" dataModelProperty="" ></RetrievedField>
  • DataModelClassMetaInformation com.exalead.search.v30.DataModelClassMetaInformation
    No documentation for this element.
    Parent objects
    • HitConfig
    Attributes
    • categoryFieldName string
    • categoryRoot string
    • categoryForm enum(exact, lowercase, normalized)
    Name Type Default value Description
    categoryFieldName string categories
    categoryRoot string Top/dataModelClass
    categoryForm enum(exact, lowercase, normalized) normalized

    XML representation
    <DataModelClassMetaInformation categoryFieldName="categories" categoryRoot="Top/dataModelClass" categoryForm="normalized" ></DataModelClassMetaInformation>
  • Meta com.exalead.search.v30.Meta
    No documentation for this element.
    Parent objects
    • HitConfig
    Attributes
    • name string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    Name Type Default value Description
    name string -
    dataModelState string - Indicates whether this meta is managed by a data model. Possible values: null, auto, customized, error.
    If null, this meta is not related to a data model.
    If "auto", this meta is auto-generated by a data model.
    If "customized", this meta was auto-generated by a data model and then customized.
    If "error", there is a conflict between this meta and the data model.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this meta.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this meta.

    Nested elements
    • fromDataModelMeta
    • MetaSource*
    • MetaSpecificOperation*
    Type Wrapper Description
    Meta fromDataModel
    MetaSource* -
    MetaSpecificOperation* -
    XML representation
    <Meta name="" dataModelState="" dataModelClass="" dataModelProperty="" ></Meta>
  • FieldSource com.exalead.search.v30.FieldSource
    Specify from which index field the meta will be retrieved.
    Parent objects
    • Meta
    Attributes
    • indexField string
    Name Type Default value Description
    indexField string - Index field, Virtual field or Virtual Expression for this meta source.

    XML representation
    <FieldSource indexField="" ></FieldSource>
  • DynamicFieldElementSource com.exalead.search.v30.DynamicFieldElementSource
    Specifies from which index dynamic field the meta will be retrieved.
    Parent objects
    • Meta
    Attributes
    • indexField string
    • name string
    • keepMetaName boolean
    Name Type Default value Description
    indexField string - Dynamic index field for this meta source.
    name string - Specifies the meta name used to retrieve the value in the Dynamic Index Field.

    If null, it retrieves all meta/value pairs.
    keepMetaName boolean True When set to true, a key:value pair stored in the field creates a meta "key" whose value is "value". Otherwise, it generates a meta "name" whose value is "value".

    Nested elements
    • MatchRule*
    Type Wrapper Description
    MatchRule* - List of rules to define which metas contained in this dynamic field will be processed and displayed.
    XML representation
    <DynamicFieldElementSource indexField="" name="" keepMetaName="true" ></DynamicFieldElementSource>
  • MultiFieldElementSource com.exalead.search.v30.MultiFieldElementSource
    Specifies from which CSV-encoded index field the meta will be retrieved.
    Parent objects
    • Meta
    Attributes
    • indexField string
    • metaNameInMultiField string
    Name Type Default value Description
    indexField string - CSV-encoded index field for this meta source.
    metaNameInMultiField string - CSV column name.

    XML representation
    <MultiFieldElementSource indexField="" metaNameInMultiField="" ></MultiFieldElementSource>
  • Metas com.exalead.search.v30.Metas
    No documentation for this element.
    Nested elements
    • MetasSource*
    Type Wrapper Description
    MetasSource* -
    XML representation
    <Metas ></Metas>
  • MultiFieldSource com.exalead.search.v30.MultiFieldSource
    Specifies from which CSV-encoded index field the metas will be retrieved.
    Retrieves all columns at once.
    Parent objects
    • Metas
    Attributes
    • indexField string
    Name Type Default value Description
    indexField string - CSV-encoded index field for this meta source.

    XML representation
    <MultiFieldSource indexField="" ></MultiFieldSource>
  • LimitsConfig com.exalead.search.v30.LimitsConfig
    No documentation for this element.
    Parent objects
    • SearchLogic
    Attributes
    • maxKeptHits int
    • maxHitsPerSlice int
    • mainHeapFlushInterval int
    • sliceHeapFlushInterval int
    • maxQueryTimeMS int
    • maxFetchedHits int
    • enoughSlicesPercentage int
    • interruptGraceDelayMS int
    • maxTotalTimeMS int
    • fullGraceDelayMS int
    • hitsSampling double
    • usesConsecutiveQueries boolean
    • maxPathsToConsider long
    • needSecuBlockSize long
    Name Type Default value Description
    maxKeptHits int - Number of "best" hits that will be kept for navigation and full hits.

    It limits the number of hits that can be used for navigation and full hits fetch. Reducing this parameter can slightly increase performance in some situations (many slices, IO-bound). The "nhits" value in answer is at most equal to this limit.

    0 means no limit.

    Note: Heapsort executor is used if this setting is defined in ranked mode. If not in ranked mode, this setting is ignored.
    maxHitsPerSlice int - Maximum number of hits that a single index slice can fetch.
    mainHeapFlushInterval int - Unused setting
    sliceHeapFlushInterval int - Each time the slice heap receives this number of hits, it will be flushed to the main heap.
    This parameter only impacts performance.
    If value is 0, the system will pick up an appropriate value.

    Note: Only used in unranked and heapsort. Not used in localsort.
    maxQueryTimeMS int - Maximum time spent in the "hits matching" part of the search (the internal "query" phase).

    After this delay, hits matching is interrupted and partial results are returned.

    0 means no limit.
    maxFetchedHits int - When this total number of hits matching the query have been found (on any slice), the query is interrupted.

    These hits are ranked and the "best" ones are selected for navigation and full hits. Reducing this parameter improves performance but it means that a potentially "better" hit located further in the index will not be found. Above this limit, the "nmatches" value in answer will be estimated.

    0 means no limit.
    enoughSlicesPercentage int 100 The query is interrupted if at least this percentage of active slices finished fetching hits.
    interruptGraceDelayMS int 100 After the query is interrupted, the slices have a small time period to flush their heaps.
    Hits that arrive to the main heap after this time period are dropped.
    maxTotalTimeMS int - Maximum time for the entire search (hits matching, navigation and hits computation).

    After this delay, the search query is aborted.

    0 means no limit.
    fullGraceDelayMS int 500 Additional time after maxTotalTime before forcing the full hits processor to stop.
    Note that this elapsed time is only checked between two full hits.
    hitsSampling double 100.0 Only fetch a sample of the matches. This is useful to improve search performance while still getting a good hits distribution.
    usesConsecutiveQueries boolean False Set this option to True when you plan to use consecutive queries, so that deduplication occurs.
    maxPathsToConsider long 20000000 If this value is not zero, when doing expand queries without using the "expand=true" mode, the slices will only consider the first maxPathsToConsider paths they find when deciding which DIDs are contained in the graph.
    needSecuBlockSize long 0 If this value is not zero, when doing attrgraph queries, each document that needs to be postfiltered will have its Fullhit rewritten once said number of documents to postfilter has been reached.

    Nested elements
    • ContentGroupLimitsConfig*
    Type Wrapper Description
    ContentGroupLimitsConfig* - Optional specific per content group limits.
    XML representation
    <LimitsConfig maxKeptHits="" maxHitsPerSlice="" mainHeapFlushInterval="" sliceHeapFlushInterval="" maxQueryTimeMS="" maxFetchedHits="" enoughSlicesPercentage="100" interruptGraceDelayMS="100" maxTotalTimeMS="" fullGraceDelayMS="500" hitsSampling="100.0" usesConsecutiveQueries="false" maxPathsToConsider="20000000" needSecuBlockSize="0" ></LimitsConfig>
  • ContentGroupLimitsConfig com.exalead.search.v30.ContentGroupLimitsConfig
    Specific search limits for a content group. May override global limits.
    Parent objects
    • LimitsConfig
    Attributes
    • contentGroup string
    • maxHitsPerSlice int
    Name Type Default value Description
    contentGroup string - Name of the content group.
    maxHitsPerSlice int - On each slice, when this number of hits matching the query have been found, the query is interrupted.

    These hits are returned to the searcher, ranked, and the "best" ones are selected for navigation and full hits. Reducing this parameter improves performance but it means that a potentially "better" hit located further in the slice would not be found. If this limit triggers on any slice, the "nmatches" value in answer will be estimated.

    0 means no limit.

    XML representation
    <ContentGroupLimitsConfig contentGroup="" maxHitsPerSlice="" ></ContentGroupLimitsConfig>
  • PostFilterConfig com.exalead.mercury.mami.search.v20.PostFilterConfig
    No documentation for this element.
    Parent objects
    • SearchLogic
    Attributes
    • additionalHits int
    • blacklistUrl string
    • maxQueryStart int
    Name Type Default value Description
    additionalHits int - The number of additional hits to retrieve
    blacklistUrl string -
    maxQueryStart int 100

    XML representation
    <PostFilterConfig additionalHits="" blacklistUrl="" maxQueryStart="100" ></PostFilterConfig>
  • QueryRestrictionConfig com.exalead.mercury.mami.search.v20.QueryRestrictionConfig
    No documentation for this element.
    Parent objects
    • SearchLogic
    Attributes
    • blackListURL string
    Name Type Default value Description
    blackListURL string - URL to a query-blocklist resource

    Nested elements
    • NodeRestrictionConfig
    • OperatorRestrictionConfig
    • QueryTermCountRestrictionConfig
    Type Wrapper Description
    NodeRestrictionConfig -
    OperatorRestrictionConfig -
    QueryTermCountRestrictionConfig -
    XML representation
    <QueryRestrictionConfig blackListURL="" ></QueryRestrictionConfig>
  • NodeRestrictionConfig com.exalead.mercury.mami.search.v20.NodeRestrictionConfig
    Configure node restriction policy for search queries
    Nested elements
    • RestrictedNode*
    Type Wrapper Description
    RestrictedNode* -
    XML representation
    <NodeRestrictionConfig ></NodeRestrictionConfig>
  • RestrictedNode com.exalead.mercury.mami.search.v20.RestrictedNode
    Restrict occurrences of a given node within a search query
    Parent objects
    • NodeRestrictionConfig
    Attributes
    • node enum(AlphanumValue, AlphanumBWPattern, NumericalValue, CategoryValue, TrueNode, FalseNode, NoOp)
    • maximumCount int
    • action enum(discard, cleanup)
    Name Type Default value Description
    node enum(AlphanumValue, AlphanumBWPattern, NumericalValue, CategoryValue, TrueNode, FalseNode, NoOp)
    maximumCount int -1 Maximum number of occurrences allowed for a given node.
    action enum(discard, cleanup) discard

    XML representation
    <RestrictedNode node="" maximumCount="-1" action="discard" ></RestrictedNode>
  • OperatorRestrictionConfig com.exalead.mercury.mami.search.v20.OperatorRestrictionConfig
    Configure operator restriction policy for search queries
    Nested elements
    • OperatorCountRestrictionConfig
    • RestrictedOperator*
    Type Wrapper Description
    OperatorCountRestrictionConfig -
    RestrictedOperator* -
    XML representation
    <OperatorRestrictionConfig ></OperatorRestrictionConfig>
  • OperatorCountRestrictionConfig com.exalead.mercury.mami.search.v20.OperatorCountRestrictionConfig
    Configure operator count restriction policy for search queries
    Parent objects
    • OperatorRestrictionConfig
    Attributes
    • maximumOperatorCount int
    • maximumProximityOperatorCount int
    • action enum(discard, cleanup)
    Name Type Default value Description
    maximumOperatorCount int -1 Maximum number of operators allowed in a query.
    maximumProximityOperatorCount int -1 Maximum number of proximity operators allowed in a query.

    The proximity operators are: NEAR NEXT AFTER BEFORE
    action enum(discard, cleanup) discard Action when maximumOperatorCount is reached: drop query or remove extra operators.

    The 'cleanup' mode does not guarantee the order in which extra operators are discarded.

    XML representation
    <OperatorCountRestrictionConfig maximumOperatorCount="-1" maximumProximityOperatorCount="-1" action="discard" ></OperatorCountRestrictionConfig>
  • RestrictedOperator com.exalead.mercury.mami.search.v20.RestrictedOperator
    Restrict occurrences of a given operator within a search query
    Parent objects
    • OperatorRestrictionConfig
    Attributes
    • operator enum(AND, AT, FUZZYAND, OR, XOR, BOR, NEAR, NOT, NEXT, SEQUENCE, SPLIT, BUTNOT, INNERJOIN, ALL, OPT)
    • maximumArity int
    • action enum(discard, cleanup, textual)
    Name Type Default value Description
    operator enum(AND, AT, FUZZYAND, OR, XOR, BOR, NEAR, NOT, NEXT, SEQUENCE, SPLIT, BUTNOT, INNERJOIN, ALL, OPT)
    maximumArity int -1 Maximum number of children allowed for a given operator.
    action enum(discard, cleanup, textual) discard

    XML representation
    <RestrictedOperator operator="" maximumArity="-1" action="discard" ></RestrictedOperator>
  • QueryTermCountRestrictionConfig com.exalead.mercury.mami.search.v20.QueryTermCountRestrictionConfig
    Configure term count restriction policy for search queries
    Parent objects
    • QueryRestrictionConfig
    Attributes
    • maximumTermCount int
    • action enum(discard, cleanup)
    Name Type Default value Description
    maximumTermCount int -1 Maximum number of alphanumeric tokens allowed in a query.
    action enum(discard, cleanup) discard Action if maximumTermCount is reached: drop query or cleanup extra values.

    XML representation
    <QueryTermCountRestrictionConfig maximumTermCount="-1" action="discard" ></QueryTermCountRestrictionConfig>
  • RelatedTermsSynthesisConfig com.exalead.search.v30.RelatedTermsSynthesisConfig
    No documentation for this element.
    Parent objects
    • SearchLogic
    Attributes
    • dictionaryName string
    • enabled boolean
    • number int
    • minCorpusFrequencyFilter int
    • corpusLowPassFilter double
    • optimizedForExhaustiveRankedSynthesis boolean
    • sortFunction enum(count, alphanum)
    • localLowPassFilter double
    • blacklist string
    • indexField string
    • maxSliceCategories int
    • maxRelatedTermsHitsPerSlice int
    Name Type Default value Description
    dictionaryName string - The dictionary containing keywords.
    enabled boolean False Enables related terms synthesis.
    number int 10 Maximum number of related terms computed for a query.
    minCorpusFrequencyFilter int 2 Minimum number of occurrences in the whole index for a term to be possibly selected for synthesis.
    corpusLowPassFilter double 0.5 Filters out terms occurring more than this threshold in the whole index.
    optimizedForExhaustiveRankedSynthesis boolean False If true, each slice will sort the DIDs before computing the synthesis to improve attributes fetching locality.
    sortFunction enum(count, alphanum) count Sort function
    Value can be one of
    • count
    • alphanum
    localLowPassFilter double 0.9 Filters out terms occurring more than this threshold in the result set.
    blacklist string - References the related terms blocklist resource.
    indexField string keywordv2 The index field where related-terms have been indexed.
    maxSliceCategories
    (UI: Max categories per slices)
    int 200 Maximum number of categories returned by each slice for one facet.
    0 means no limit.

    Warning: Results merged by the search server may be incorrect.
    maxRelatedTermsHitsPerSlice
    (UI: Max hits per slice)
    int 0 Maximum number of hits per slice used to compute related terms synthesis. Set to 0 for exhaustive. Note that the number of hits processed per slice to compute related terms synthesis will never exceed FacetsConfig's maxHits.

    XML representation
    <RelatedTermsSynthesisConfig dictionaryName="" enabled="false" number="10" minCorpusFrequencyFilter="2" corpusLowPassFilter="0.5" optimizedForExhaustiveRankedSynthesis="false" sortFunction="count" localLowPassFilter="0.9" blacklist="" maxSliceCategories="200" maxRelatedTermsHitsPerSlice="0" ></RelatedTermsSynthesisConfig>
  • RelevanceConfig com.exalead.search.v30.RelevanceConfig
    How to score and rank documents.
    A document rank, like a query result, can depend on many parameters:
    • the static document rank
    • the query term relative frequency
    • the form, position and rank of the query term in the matching document
    • proximity of query terms in the matching document
    ...
    Parent objects
    • SearchLogic
    Attributes
    • enabled boolean
    • exactPhraseScoreMergePolicy enum(SUM, MAX)
    Name Type Default value Description
    enabled boolean True Enables all relevance calculation.

    Globally disabling relevance calculation disables the following features:
    • term scoring
    • proximity scoring
    • sorting
    • node properties
    • using ranking elements for faceting
    • using ranking elements for facet aggregations
    • retrieving ranking elements
    Note: If your query contains grouping, set either by a Search API parameter or in the product configuration (Administration Console > Search Logic), relevance calculation is forced to true.
    Note: If your query contains a refinement on a disjunctive facet, relevance calculation is forced to true.
    In some cases, disabling relevance calculation provides significant performance boosts, notably for exhaustive counting and faceting.
    exactPhraseScoreMergePolicy enum(SUM, MAX) SUM Specifies how to merge the underlying word scores of an exact phrase.

    Setting the policy to 'MAX' will score the phrase as a single term.

    Nested elements
    • GroupConfig*
    • ProximityBoostConfig
    • SortConfig
    • TermScoreConfig
    Type Wrapper Description
    GroupConfig* - Configures collapsing (grouping) of hits in search results.
    ProximityBoostConfig - How to boost a document score depending on the proximity of the query terms.
    SortConfig - Configures the fields to use to sort documents in the result list.
    TermScoreConfig - How to compute the score of a term in a matching document.
    XML representation
    <RelevanceConfig enabled="true" exactPhraseScoreMergePolicy="SUM" ></RelevanceConfig>
  • GroupConfig com.exalead.search.v30.GroupConfig
    Collapses hits in search results, by grouping hits with the same values, then sorting them within a group to keep only the top N hits. The other hits for the group do not display in the search results.

    The GroupBy child elements specify the grouping criteria. The order of GroupBy child elements does not affect sort order. The SortHitsInGroupBy child elements define the lexicographical order used to sort hits within the group. By default, the SortConfig order is used.

    The Aggregation child elements specify the summary operation (SUM, MIN, MAX, and so on) to perform on all hits (discarded + top/bottom N) within a group.
    Parent objects
    • RelevanceConfig
    Attributes
    • topN int
    Name Type Default value Description
    topN int 1 Maximum number of hits to display for a group. The remaining hits do not display in search results. Must be strictly positive.

    Nested elements
    • Aggregation*
    • GroupBy*
    • SortHitsInGroupBy*
    Type Wrapper Description
    Aggregation* - Aggregation(s) to be evaluated sequentially on all the hits within a group, and stored as a virtual expression meta. To display the aggregation value in the grouped hit(s), add this meta name as a Hit content meta.
    GroupBy* - Virtual expressions used to group, or collapse, hits in search results. Hits with the same value for an expression form one group. Hits with undefined values for the expression are never grouped. If empty, displays all hits in the search results. As with any virtual expression, it requires RAM-based index fields.
    SortHitsInGroupBy* - Ordered list of expressions for sorting hits. The sort direction identifies the top N hits of a group.
    Hits with defined values are always listed higher than hits with undefined values. If empty, it uses the SortConfig by default. As with any virtual expression, it requires RAM-based index fields.
    XML representation
    <GroupConfig topN="1" ></GroupConfig>
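    The collapsing behavior described above can be sketched in Python. This is a simplified illustration, not the product's implementation: the function collapse and the dictionary-based hits are hypothetical, a single numeric sort expression in descending order is assumed, and the final ordering of the collapsed result list (which the product derives from its sort configuration) is not modeled.

```python
from collections import defaultdict

def collapse(hits, group_key, sort_key, top_n=1):
    """Keep the top_n hits of each group (illustrative only)."""
    if top_n < 1:
        raise ValueError("topN must be strictly positive")
    groups = defaultdict(list)
    ungrouped = []
    for hit in hits:
        value = hit.get(group_key)
        if value is None:
            # Hits with an undefined grouping value are never grouped.
            ungrouped.append(hit)
        else:
            groups[value].append(hit)
    kept = []
    for members in groups.values():
        # Assumption: one sort expression, descending, decides the top N.
        members.sort(key=lambda h: h[sort_key], reverse=True)
        kept.extend(members[:top_n])
    return kept + ungrouped

hits = [
    {"id": "H1", "group_field": 12, "sort_field": 100},
    {"id": "H2", "group_field": 12, "sort_field": 50},
    {"id": "H3", "sort_field": 10},  # no group_field: never grouped
    {"id": "H4", "group_field": 7, "sort_field": 5},
]
collapsed_ids = [h["id"] for h in collapse(hits, "group_field", "sort_field")]
```

    With topN left at 1, H2 is hidden behind H1 because both share group_field=12, while H3 and H4 are kept.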
  • Aggregation com.exalead.search.v30.Aggregation
    Combines the values of the specified expr for all the hits of the group and outputs the result to the specified meta.
    For example, if you have hits:
    • H1 with values: sort_field=100, group_field=12, value_field=57
    • H2 with values: sort_field=50, group_field=12, value_field=23


    With a sort by sort_field and a group by group_field, the GroupBy will only keep the hit H1 because its sort_field has the best value.
    • with a function on value_field of type "MIN", the value of value_field in the grouped hit will be 23 (minimum value over grouped hits)
    • with a function on value_field of type "MAX", the value of value_field in the grouped hit will be 57 (maximum value over grouped hits)
    • with a function on value_field of type "SUM", the value of value_field in the grouped hit will be 80 (sum of values for grouped hits)
    • with a function on value_field of type "AVG", the value of value_field in the grouped hit will be 40 (average of values for grouped hits)
    Parent objects
    • GroupConfig
    Attributes
    • expr string
    • aggregationField string
    • function enum(AVG, MAX, MIN, SUM, STDDEV, CONCAT)
    • separator string
    Name Type Default value Description
    expr string - Field to compute on the grouped hits.
    aggregationField string -
    function enum(AVG, MAX, MIN, SUM, STDDEV, CONCAT) MAX Aggregation function to apply on the specified expression.
    • MIN/MAX use the lowest/highest value of all hits in the group
    • SUM adds the values of all hits in the group
    • AVG calculates the mean of values of all hits in the group
    • STDDEV calculates the standard deviation of values of all hits in the group
    • CONCAT joins all the values from all the hits in the group. Only works with alphanumerical values.
    separator string - Character to separate concatenated strings.

    XML representation
    <Aggregation expr="" function="MAX" separator="" ></Aggregation>
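    The H1/H2 example above can be reproduced with a short Python sketch. The function aggregate is hypothetical and only mirrors the documented semantics of each function value. Whether STDDEV is a population or sample standard deviation is not stated in this reference; the sketch assumes the population form.

```python
import statistics

def aggregate(values, function="MAX", separator=""):
    """Apply an Aggregation-like function to the values of one group (illustrative only)."""
    if function == "MIN":
        return min(values)
    if function == "MAX":
        return max(values)
    if function == "SUM":
        return sum(values)
    if function == "AVG":
        return sum(values) / len(values)
    if function == "STDDEV":
        # Assumption: population standard deviation.
        return statistics.pstdev(values)
    if function == "CONCAT":
        # CONCAT only applies to alphanumerical values.
        return separator.join(values)
    raise ValueError(f"unknown function: {function}")

# value_field of H1 and H2, which share group_field=12
value_field = [57, 23]
results = {f: aggregate(value_field, f) for f in ("MIN", "MAX", "SUM", "AVG")}
# results: MIN=23, MAX=57, SUM=80, AVG=40.0
```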
  • GroupBy com.exalead.search.v30.GroupBy
    A comparative element used to identify groups of documents.
    Parent objects
    • GroupConfig
    Attributes
    • expr string
    • LSB int
    • MSB int
    • limit int
    Name Type Default value Description
    expr string - Virtual expression to compare.
    LSB int 0 Position of the least significant bit to be considered. Applies to integer values only, ignored otherwise.
    MSB int 63 Position of the most significant bit to be considered. Applies to integer values only, ignored otherwise.
    limit int 0 Maximum number of characters to consider when comparing string values. 0 means the string is compared till the end.
    Applies to alphanumerical values only, ignored otherwise.

    XML representation
    <GroupBy expr="" LSB="0" MSB="63" limit="0" ></GroupBy>
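    One way to read LSB, MSB and limit is as a reduction of the compared value before grouping. The Python sketch below rests on assumptions that this reference does not spell out: bit positions are counted from 0 at the least significant bit, the range LSB..MSB is inclusive, and integers are treated as unsigned 64-bit values. The function group_key is hypothetical.

```python
def group_key(value, lsb=0, msb=63, limit=0):
    """Reduce a value to the part that is compared for grouping (illustrative only)."""
    if isinstance(value, int):
        width = msb - lsb + 1
        if width <= 0:
            raise ValueError("MSB must be greater than or equal to LSB")
        # Keep bits lsb..msb (inclusive) and shift them down to position 0.
        return (value >> lsb) & ((1 << width) - 1)
    if isinstance(value, str):
        # limit == 0 means the whole string is compared.
        return value if limit == 0 else value[:limit]
    return value
```

    Under these assumptions, group_key(1000, lsb=4) and group_key(1007, lsb=4) are equal, so two integer values that differ only in their four lowest bits would fall into the same group. Likewise, limit=3 would group "abcdef" and "abcxyz" together.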
  • SortHitsInGroupBy com.exalead.search.v30.SortHitsInGroupBy
    A comparative element used to sort documents of a group.
    Parent objects
    • SortConfig
    Attributes
    • name string
    • expr string
    • LSB int
    • MSB int
    • limit int
    • min long
    • order enum(asc, desc)
    • active boolean
    Name Type Default value Description
    name string - Unique sort name.
    expr string - Virtual expression to compare.
    LSB int 0 Position of the least significant bit to be considered. Applies to integer values only.
    MSB int 63 Position of the most significant bit to be considered. Applies to integer values only.
    limit int 0 Maximum number of characters to use when matching string values. 0 means the entire string is used for matching.
    Applies to alphanumerical values only.
    min long 0 Minimum value needed for the document to be returned by the query.
    Applies to integer and double values only. 0 means keep all documents.
    Discards documents with a value below the minimum.
    Keeps documents with undefined value for this field.
    order enum(asc, desc) desc Sort order
    Value can be null or one of
    • asc
    • desc
    active boolean True Defines if the sort is applied by default.

    XML representation
    <SortHitsInGroupBy name="" expr="" LSB="0" MSB="63" limit="0" min="0" order="desc" active="true" ></SortHitsInGroupBy>
  • ProximityBoostConfig com.exalead.search.v30.ProximityBoostConfig
    Controls the behavior of the @proximity element
    Parent objects
    • RelevanceConfig
    Attributes
    • enabled boolean
    • maxDistance int
    Name Type Default value Description
    enabled boolean True Enables proximity calculation globally. When disabled, @proximity is always 1.0
    maxDistance int 100 Do not apply proximity boost if words are separated by more than 'maxDistance' words.

    XML representation
    <ProximityBoostConfig enabled="true" maxDistance="100" ></ProximityBoostConfig>
  • SortConfig com.exalead.search.v30.SortConfig
    Define the order of returned documents.
    Parent objects
    • RelevanceConfig
    Attributes
    • discardInvalidSort boolean
    Name Type Default value Description
    discardInvalidSort boolean False Automatically discards sortBy criteria that are not allowed, rather than returning an error.

    Nested elements
    • SortBy*
    Type Wrapper Description
    SortBy* - Ordered list of expressions that defines a lexicographical order (documents having the same value for the first criterion are then ordered using the second criterion, and so on).
    XML representation
    <SortConfig discardInvalidSort="false" ></SortConfig>
  • SortBy com.exalead.search.v30.SortBy
    Configures a sort criterion. An expression is calculated for every document matching the query and the results are then sorted according to calculated values.
    Parent objects
    • SortConfig
    Attributes
    • name string
    • expr string
    • LSB int
    • MSB int
    • limit int
    • min long
    • order enum(asc, desc)
    • active boolean
    Name Type Default value Description
    name string - Unique sort name.
    expr string - Virtual expression to compare.
    LSB int 0 Position of the least significant bit to be considered. Applies to integer values only.
    MSB int 63 Position of the most significant bit to be considered. Applies to integer values only.
    limit int 0 Maximum number of characters to use when matching string values. 0 means the entire string is used for matching.
    Applies to alphanumerical values only.
    min long 0 Minimum value needed for the document to be returned by the query.
    Applies to integer and double values only. 0 means keep all documents.
    Discards documents with a value below the minimum.
    Keeps documents with undefined value for this field.
    order enum(asc, desc) desc Sort order
    Value can be null or one of
    • asc
    • desc
    active boolean True Defines if the sort is applied by default.

    XML representation
    <SortBy name="" expr="" LSB="0" MSB="63" limit="0" min="0" order="desc" active="true" ></SortBy>
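    The interplay of several SortBy criteria, the order attribute and the min filter can be sketched in Python. This is an illustration under stated assumptions, not the engine's algorithm: apply_sort and the dictionary-based documents are hypothetical, and placing documents with an undefined value last is an assumption for SortBy. The min rules (0 keeps everything, undefined values are kept) follow the attribute description above.

```python
def apply_sort(docs, criteria):
    """Filter by 'min', then sort lexicographically by the criteria (illustrative only)."""
    def passes(doc):
        for c in criteria:
            minimum = c.get("min", 0)
            value = doc.get(c["expr"])
            # min == 0 keeps all documents; undefined values are always kept.
            if minimum != 0 and value is not None and value < minimum:
                return False
        return True

    kept = [d for d in docs if passes(d)]
    # Stable sorts applied from the last criterion to the first
    # produce a lexicographical order.
    for c in reversed(criteria):
        expr = c["expr"]
        descending = c.get("order", "desc") == "desc"
        defined = [d for d in kept if d.get(expr) is not None]
        undefined = [d for d in kept if d.get(expr) is None]
        defined.sort(key=lambda d: d[expr], reverse=descending)
        kept = defined + undefined  # assumption: undefined values go last
    return kept

docs = [
    {"id": "a", "price": 10, "date": 3},
    {"id": "b", "price": 10, "date": 5},
    {"id": "c", "price": 20, "date": 1},
    {"id": "d", "price": 2, "date": 9},
    {"id": "e", "date": 4},  # price undefined
]
criteria = [
    {"expr": "price", "order": "desc", "min": 5},
    {"expr": "date", "order": "asc"},
]
sorted_ids = [d["id"] for d in apply_sort(docs, criteria)]
```

    Here "d" is discarded because its price is below min, "a" precedes "b" because the second criterion breaks their tie on price, and "e" is kept despite having no price.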
  • TermScoreConfig com.exalead.search.v30.TermScoreConfig
    No documentation for this element.
    Parent objects
    • RelevanceConfig
    Attributes
    • type enum(RANK_TFIDF, TFIDF, RANK, BM25, BM25F, NO_RANKING, IDF, RANK_IDF, CUSTOM, CUSTOM_LUA)
    • scoreMap string
    • customExpr string
    • customLuaScript string
    Name Type Default value Description
    type enum(RANK_TFIDF, TFIDF, RANK, BM25, BM25F, NO_RANKING, IDF, RANK_IDF, CUSTOM, CUSTOM_LUA) NO_RANKING Selects the ranking policy for the terms.
    Value can be null or one of
    • RANK_TFIDF
    • TFIDF
    • RANK
    • BM25
    • BM25F
    • NO_RANKING
    • IDF
    • RANK_IDF
    • CUSTOM
    • CUSTOM_LUA
    scoreMap string - Comma-separated list of target weights for the ranking scores.

    customExpr string -
    customLuaScript string -

    XML representation
    <TermScoreConfig type="NO_RANKING" scoreMap="" customExpr="" customLuaScript="" ></TermScoreConfig>
  • Sort com.exalead.search.v30.Sort
    No documentation for this element.
    Parent objects
    • SearchLogic
    Attributes
    • secondarySortKey boolean
    • indexField string
    • ascending boolean
    • LSB int
    • MSB int
    • hasInvalidValue boolean
    • invalidValue long
    Name Type Default value Description
    secondarySortKey boolean False Allows disambiguating documents having the same score, on a configurable key. The rest of the comparator configuration is similar to the primary sort.

    If this option is enabled, the documents will be compared by score, and in case of equality, only by the key extracted from the configurable index field.
    indexField string - Index field on which we want to sort.

    You can only sort on an unsigned index field.
    ascending boolean False Inverts the total order relation.
    LSB int 0 Least Significant Bit of the key inside the field.
    MSB int 63 Most Significant Bit of the key inside the field.
    hasInvalidValue boolean False Whether one of the keys should be considered as excluded from the set and always put at the end.
    invalidValue long - If hasInvalidValue is true, what is the invalid value?

    XML representation
    <Sort secondarySortKey="false" indexField="" ascending="false" LSB="0" MSB="63" hasInvalidValue="false" invalidValue="" ></Sort>
  • SpellCheckerConfig com.exalead.search.v30.SpellCheckerConfig
    No documentation for this element.
    Parent objects
    • SearchLogic
    Attributes
    • enabled boolean
    • blacklistedPrefixHandlers string
    • enabledRefine boolean
    • computeCorrectedNHits boolean
    • hitThreshold int
    • cancelBadCorrections boolean
    • autoCorrect boolean
    • graceDelayMS int
    • querySizeLimit int
    • dictionaryName string
    Name Type Default value Description
    enabled
    (UI: Enable)
    boolean False Enables spellcheck on queries.
    blacklistedPrefixHandlers string A list of comma-separated prefix handler names defining which parts of the query will NOT be spell checked.
    enabledRefine
    (UI: Use with facets)
    boolean False Suggests alternate expressions even if facets have been used to filter hits.
    computeCorrectedNHits
    (UI: Calculate no. hits)
    boolean False Calculates and displays the number of hits that will be returned for each suggested expression.

    This can impact performance as CloudView must perform the query for each suggestion.

    Warning: This option is not compatible with disjunctive facets. The count will not report any values.
    hitThreshold
    (UI: Disable at N hits)
    int 0 Only performs spell check if the query returns fewer than N hits.

    However, if N=0, spell check is always performed.

    For example, if N=4, spell check is only activated for queries that return 3 or fewer hits.
    cancelBadCorrections
    (UI: Remove weak suggestions)
    boolean False Leaves out alternate expressions that would return fewer hits than the original query.
    This requires computeCorrectedNHits to be true.
    autoCorrect
    (UI: Automatically correct)
    boolean False Automatically performs (uncorrected query) OR (corrected query 1) OR (corrected query 2) ...
    If cancelBadCorrections and autoCorrect are both true, the query is performed three times:
    • The uncorrected query is performed to calculate the number of hits.
    • The corrected queries are performed to calculate the number of hits.
    • The corrected queries that yield more hits than the uncorrected query are added to the uncorrected query, and the resulting query is performed and returned to the user.
    graceDelayMS
    (UI: Timeout(ms))
    int 50 Maximum time to wait for spell checking after query processing has finished.

    If the spell check exceeds this time, it is cancelled and a warning displays.

    If computeCorrectedNHits is enabled, the additional time this takes will leave less time for spell check to finish.

    If 'Automatically correct' is selected, this timeout is ignored.
    querySizeLimit int 16 Queries with more than this count of chunks are considered computer-generated queries for which the spellchecking is disabled.
    dictionaryName
    (UI: Dictionary)
    string - Dictionary used for spell checking and for allow/block lists.

    Nested elements
    • SpellCheckParameters
    Type Wrapper Description
    SpellCheckParameters - Low-level parameters.
    XML representation
    <SpellCheckerConfig enabled="false" blacklistedPrefixHandlers="" enabledRefine="false" computeCorrectedNHits="false" hitThreshold="0" cancelBadCorrections="false" autoCorrect="false" graceDelayMS="50" querySizeLimit="16" dictionaryName="" ></SpellCheckerConfig>
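    The interplay of hitThreshold, querySizeLimit, cancelBadCorrections and autoCorrect can be sketched in Python as follows. This is only one reading of the descriptions above, not CloudView's implementation. The function names and the (expression, hits) tuple shape are hypothetical.

```python
def should_spell_check(enabled, n_hits, n_chunks,
                       hit_threshold=0, query_size_limit=16):
    """Decide whether spell check runs for a query."""
    if not enabled:
        return False
    # Long queries are considered computer-generated.
    if n_chunks > query_size_limit:
        return False
    # N=0 means spell check is always performed.
    if hit_threshold == 0:
        return True
    return n_hits < hit_threshold


def filter_suggestions(original_hits, suggestions, cancel_bad_corrections=False):
    """Drop suggestions returning fewer hits than the original query.

    suggestions is a list of (expression, hits) pairs, which presupposes
    that computeCorrectedNHits is enabled.
    """
    if not cancel_bad_corrections:
        return list(suggestions)
    return [(expr, hits) for expr, hits in suggestions if hits >= original_hits]


def auto_corrected_query(original, corrections):
    """Build (uncorrected query) OR (corrected query 1) OR ..."""
    return " OR ".join(f"({q})" for q in [original, *corrections])
```

    For example, with hitThreshold=4 a query returning 3 hits is spell checked, while one returning 4 hits is not.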
  • SpellCheckParameters com.exalead.linguistic.v10.SpellCheckParameters
    No documentation for this element.
    Parent objects
    • SpellCheckerConfig
    Attributes
    • maxApprox int
    • insertion double
    • deletion double
    • substitution double
    • transposition double
    • first double
    • blank double
    • phoneticDist double
    • precision double
    • minLengthCandidate int
    • minLengthCandidate2 int
    • minLengthMultiWordCandidate int
    • minLengthMultiWordCandidate2 int
    • minLengthInsideMultiWordCandidate int
    • minLengthInsideMultiWordCandidate2 int
    • minNGramLengthForDist2 int
    • filterWhiteList boolean
    • filterPhonetic boolean
    • tokenizeResults boolean
    • maxSuggestions int
    • allSuggestionsMustAlwaysBeRetrieved boolean
    • whitelist string
    • blacklist string
    • timeoutMS int
    Name Type Default value Description
    maxApprox
    (UI: Max spellings per word)
    int 40 Maximum number of alternate spellings to retrieve for each word.
    insertion
    (UI: Insertion)
    double 0.8 Damerau-Levenshtein distance incurred for inserting a letter.
    deletion
    (UI: Deletion)
    double 0.9 Damerau-Levenshtein distance incurred for deleting a letter.
    substitution
    (UI: Substitution)
    double 0.9 Damerau-Levenshtein distance incurred for substituting a letter.
    transposition
    (UI: Transposing)
    double 0.9 Damerau-Levenshtein distance incurred for transposing letters.
    first
    (UI: First letter)
    double 0.1 Additional Damerau-Levenshtein distance incurred for any operation on the first letter of a word.
    blank
    (UI: Blank space)
    double 0.5 Damerau-Levenshtein distance incurred for inserting or deleting a space.
    phoneticDist
    (UI: Phonetic matches)
    double 0.1 Damerau-Levenshtein distance incurred for phonetic matches.
    precision
    (UI: Precision)
    double 1.0 The higher the precision, the less often alternatives are proposed; the lower, the more recall.
    minLengthCandidate
    (UI: Min word length for distance 1)
    int 4 Minimum length for a word to trigger a distance 1 spell check.
    minLengthCandidate2
    (UI: Min word length for distance 2)
    int 8 Minimum length for a word to trigger a distance 2 spell check.
    minLengthMultiWordCandidate
    (UI: Min word length anywhere for distance 1)
    int 2 Minimum length for a word anywhere in a multi-word expression to trigger a distance 1 spell check.
    minLengthMultiWordCandidate2
    (UI: Min word length anywhere for distance 2)
    int 8 Minimum length for a word anywhere in a multi-word expression to trigger a distance 2 spell check.
    minLengthInsideMultiWordCandidate
    (UI: Min word length mid-expression for distance 1)
    int 1 Minimum length for a word in the middle of a multi-word expression to trigger a distance 1 spell check.
    minLengthInsideMultiWordCandidate2
    (UI: Min word length mid-expression for distance 2)
    int 6 Minimum length for a word in the middle of a multi-word expression to trigger a distance 2 spell check.
    minNGramLengthForDist2
    (UI: Min expression length for distance 2)
    int 12 Minimum length for a multi-word expression to trigger a distance 2 spell check.
    filterWhiteList
    (UI: Apply min lengths to allow list)
    boolean False Applies minimum length criteria on allow list entries.
    filterPhonetic
    (UI: Apply min lengths to phonetic)
    boolean True Applies the minimum length criteria on phonetic matches.
    tokenizeResults
    (UI: Tokenize results)
    boolean False Tokenizes the suggested expression and annotates each token with the corresponding word from the original query. This allows for additional processing after spellchecking, such as lemmatization and normalization.
    For example, for a suggested expression of 3 tokens:
    • If true: each token has an annotation representing the associated word from the original user query.
    • If false: the first token is annotated with 3 tokens and a length of 3.
    maxSuggestions
    (UI: Max. suggestions)
    int 1 Suggests no more than N alternate expressions.
    allSuggestionsMustAlwaysBeRetrieved
    (UI: Keep poor suggestions)
    boolean False Suggests alternate expressions even when they have a lower score than the corresponding expression from the original query. The score is determined by an algorithm that considers both the Damerau-Levenshtein distance and frequency in the corpus.
    whitelist string - References a dictionary's spellcheck allow list resource.
    Used to always suggest expressions for the listed words.
    blacklist string - References a dictionary's spellcheck block list resource.
    Used to never suggest expressions for the listed words.
    timeoutMS
    (UI: timeout)
    int 0 Time allotted to the computing of spell suggestions.

    Nested elements
    • SpellCheckBlackList
    • SpellCheckWhiteList
    Type Wrapper Description
    SpellCheckBlackList - Never suggest expressions for these words.
    SpellCheckWhiteList - Always suggest expressions for these words.
    XML representation
    <SpellCheckParameters maxApprox="40" insertion="0.8" deletion="0.9" substitution="0.9" transposition="0.9" first="0.1" blank="0.5" phoneticDist="0.1" precision="1.0" minLengthCandidate="4" minLengthCandidate2="8" minLengthMultiWordCandidate="2" minLengthMultiWordCandidate2="8" minLengthInsideMultiWordCandidate="1" minLengthInsideMultiWordCandidate2="6" minNGramLengthForDist2="12" filterWhiteList="false" filterPhonetic="true" tokenizeResults="false" maxSuggestions="1" allSuggestionsMustAlwaysBeRetrieved="false" whitelist="" blacklist="" timeoutMS="0" ></SpellCheckParameters>
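    To give a feel for how the per-operation costs combine, here is a minimal Python sketch of a weighted Damerau-Levenshtein distance (optimal string alignment variant) using the default costs listed above. How CloudView actually applies the "first" penalty and in which direction insertions and deletions are counted is not stated here. The sketch assumes the query word is the source, the candidate is the target, and "first" is added whenever an operation touches the first letter.

```python
def weighted_distance(source, target, insertion=0.8, deletion=0.9,
                      substitution=0.9, transposition=0.9, first=0.1):
    """Weighted optimal-string-alignment distance from source to target."""
    n, m = len(source), len(target)
    d = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        d[i][0] = d[i - 1][0] + deletion + (first if i == 1 else 0.0)
    for j in range(1, m + 1):
        d[0][j] = d[0][j - 1] + insertion + (first if j == 1 else 0.0)
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            candidates = [
                # Delete source[i-1].
                d[i - 1][j] + deletion + (first if i == 1 else 0.0),
                # Insert target[j-1].
                d[i][j - 1] + insertion + (first if j == 1 else 0.0),
            ]
            if source[i - 1] == target[j - 1]:
                candidates.append(d[i - 1][j - 1])
            else:
                candidates.append(d[i - 1][j - 1] + substitution
                                  + (first if i == 1 else 0.0))
            # Swap two adjacent letters.
            if (i > 1 and j > 1 and source[i - 1] == target[j - 2]
                    and source[i - 2] == target[j - 1]):
                candidates.append(d[i - 2][j - 2] + transposition
                                  + (first if i == 2 else 0.0))
            d[i][j] = min(candidates)
    return d[n][m]
```

    Under these assumptions, a missing letter ("clod" for "cloud") costs one insertion, while a wrong first letter ("xloud") costs a substitution plus the first-letter penalty, which makes corrections of the first letter slightly less likely.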
  • SpellCheckBlackList com.exalead.linguistic.v10.SpellCheckBlackList
    No documentation for this element.
    Nested elements
    • Entry*
    Type Wrapper Description
    Entry* -
    XML representation
    <SpellCheckBlackList ></SpellCheckBlackList>
  • Entry com.exalead.linguistic.v10.Entry
    No documentation for this element.
    Parent objects
    • SpellCheckBlackList
    Attributes
    • lang iso code
    • expression string
    • level enum(exact, lowercase, normalized)
    Name Type Default value Description
    lang iso code -
    expression string -
    level enum(exact, lowercase, normalized) normalized

    XML representation
    <Entry lang="" expression="" level="normalized" ></Entry>
  • SpellCheckWhiteList com.exalead.linguistic.v10.SpellCheckWhiteList
    No documentation for this element.
    Nested elements
    • SpellCheckEntry*
    Type Wrapper Description
    SpellCheckEntry* -
    XML representation
    <SpellCheckWhiteList ></SpellCheckWhiteList>
  • SpellCheckEntry com.exalead.linguistic.v10.SpellCheckEntry
    No documentation for this element.
    Parent objects
    • SpellCheckWhiteList
    Attributes
    • lang iso code
    • expression string
    • level enum(exact, lowercase, normalized)
    • suggestion string
    Name Type Default value Description
    lang iso code -
    expression string -
    level enum(exact, lowercase, normalized) normalized
    suggestion string -

    XML representation
    <SpellCheckEntry lang="" expression="" level="normalized" suggestion="" ></SpellCheckEntry>
  • TimezoneConfig com.exalead.mercury.mami.search.v20.TimezoneConfig
    No documentation for this element.
    Parent objects
    • SearchLogic
    Attributes
    • add boolean
    • hours int
    • minutes int
    Name Type Default value Description
    add boolean True If true, the hours and minutes will be added. Otherwise, they will be subtracted.
    hours int 0 Difference of hours to apply.
    minutes int 0 Difference of minutes to apply.

    XML representation
    <TimezoneConfig add="true" hours="0" minutes="0" ></TimezoneConfig>
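    The three attributes amount to a signed offset. A minimal Python sketch, assuming the offset is applied to a date value (the function name is hypothetical):

```python
from datetime import datetime, timedelta


def apply_timezone_config(value, add=True, hours=0, minutes=0):
    """Shift a datetime by the configured hours and minutes."""
    offset = timedelta(hours=hours, minutes=minutes)
    return value + offset if add else value - offset
```

    For instance, add="false" hours="5" minutes="30" moves 12:00 to 06:30.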
  • UQLConfig com.exalead.mercury.mami.search.v20.UQLConfig
    No documentation for this element.
    Parent objects
    • SearchLogic
    Attributes
    • defaultPrefixHandler string
    • missingPrefixBehaviour enum(error, remove, ignore, and, replace_by_false)
    • phraseQueryTriggerChars string
    • queryTemplate string
    • defaultLanguage string
    Name Type Default value Description
    defaultPrefixHandler string text Prefix handler to use for content that is not contained within a specific prefix handler.
    missingPrefixBehaviour enum(error, remove, ignore, and, replace_by_false) error Action to perform when a prefix:value query does not reference a valid prefix:
    • error: fails the query
    • remove: removes the whole prefix node
    • ignore: treats the query as a plain "value" without a prefix
    • and: performs a query "prefix AND value"
    • replace_by_false: replaces the whole node by a false node
    phraseQueryTriggerChars string .-& List of chars inside a word that trigger a sequence match, for example, a.b will be rewritten as "a b".
    queryTemplate string #and(#query(_default_) #query(refine) #query(security) #query(restriction) #query(date_restriction) #query(geo_restriction)) Default query template to use (conjunction of query, refine, security and content restriction by default)
    defaultLanguage string xx Default language of the query

    Nested elements
    • LogicRunnerCustomization
    • ParsingConfig
    • PatternsExpansionConfig
    • QueryExpansionConfig
    • QueryExpansionModule*
    • QueryPrefixHandler*
    • SWConfig
    • SemanticQueryAnalysisConfig
    Type Wrapper Description
    LogicRunnerCustomization -
    ParsingConfig -
    PatternsExpansionConfig -
    QueryExpansionConfig -
    QueryExpansionModule* -
    QueryPrefixHandler* -
    SWConfig -
    SemanticQueryAnalysisConfig - Semantic analysis and rewriting of queries
    XML representation
    <UQLConfig defaultPrefixHandler="text" missingPrefixBehaviour="error" phraseQueryTriggerChars=".-&" queryTemplate="#and(#query(_default_) #query(refine) #query(security) #query(restriction) #query(date_restriction) #query(geo_restriction))" defaultLanguage="xx" ></UQLConfig>
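    The missingPrefixBehaviour values and the phraseQueryTriggerChars rewrite can be illustrated with a short Python sketch. It works on plain strings rather than on the real query tree, and it uses None and False as stand-ins for a removed node and a false node. These representations and the function names are assumptions for illustration only.

```python
def handle_missing_prefix(prefix, value, behaviour="error"):
    """Resolve a prefix:value query whose prefix is not a valid prefix handler."""
    if behaviour == "error":
        raise ValueError(f"unknown prefix handler: {prefix}")
    if behaviour == "remove":
        return None  # the whole prefix node is removed
    if behaviour == "ignore":
        return value  # treated as a plain value
    if behaviour == "and":
        return f"{prefix} AND {value}"
    if behaviour == "replace_by_false":
        return False  # stand-in for a false node
    raise ValueError(f"unsupported behaviour: {behaviour}")


def rewrite_trigger_chars(word, trigger_chars=".-&"):
    """Rewrite a word containing an inner trigger char as a quoted sequence."""
    inner = word[1:-1]
    if not any(c in trigger_chars for c in inner):
        return word
    for c in trigger_chars:
        word = word.replace(c, " ")
    return '"' + " ".join(word.split()) + '"'
```

    With the default trigger characters, a.b becomes "a b", while a word without inner trigger characters is left unchanged.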
  • LogicRunnerCustomization com.exalead.mercury.mami.search.v20.LogicRunnerCustomization
    No documentation for this element.
    Nested elements
    • globalPreParseProcessors CustomProcessor*
    • preParseProcessors CustomProcessor*
    • preLinguisticProcessors CustomProcessor*
    • preTransformProcessors CustomProcessor*
    • preMapProcessors CustomProcessor*
    • postMapProcessors CustomProcessor*
    • globalPostParseProcessors CustomProcessor*
    • globalFinalProcessors CustomProcessor*
    Type Wrapper Description
    CustomProcessor* globalPreParseProcessors
    CustomProcessor* preParseProcessors
    CustomProcessor* preLinguisticProcessors
    CustomProcessor* preTransformProcessors
    CustomProcessor* preMapProcessors
    CustomProcessor* postMapProcessors
    CustomProcessor* globalPostParseProcessors
    CustomProcessor* globalFinalProcessors
    XML representation
    <LogicRunnerCustomization ></LogicRunnerCustomization>
  • CustomProcessor com.exalead.mercury.mami.search.v20.CustomProcessor
    No documentation for this element.
    Parent objects
    • globalFinalProcessors LogicRunnerCustomization
    • globalPostParseProcessors LogicRunnerCustomization
    • globalPreParseProcessors LogicRunnerCustomization
    • postMapProcessors LogicRunnerCustomization
    • preLinguisticProcessors LogicRunnerCustomization
    • preMapProcessors LogicRunnerCustomization
    • preParseProcessors LogicRunnerCustomization
    • preTransformProcessors LogicRunnerCustomization
    Attributes
    • classId string
    Name Type Default value Description
    classId string - Must implement com.exalead.search.query.processors.CustomQueryProcessor

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* -
    XML representation
    <CustomProcessor classId="" ></CustomProcessor>
  • ParsingConfig com.exalead.search.v30.ParsingConfig
    No documentation for this element.
    Parent objects
    • UQLConfig
    Attributes
    • allowOptions boolean
    • nearDistance int
    • fallbackOnError boolean
    • fallbackReportMode enum(quiet, local, global)
    • maxQuerySize int
    • oversizedQueryReportMode enum(quiet, local, global)
    • implicitSequenceOperator enum(AND, OR)
    Name Type Default value Description
    allowOptions boolean False Allows word options in user query.
    nearDistance int 16 Default distance for NEAR queries
    fallbackOnError boolean False If the standard parsing fails, parses the query with a relaxed grammar, meaning it will split words on white spaces.

    Note that in some cases, operators and prefix handlers will not be taken into account for the query.

    For example, a query that ends with an AND will normally return an error.

    If this option is selected, CloudView will search for "AND" instead of considering it as an operator.
    See also missingPrefixBehaviour in UQLConfig
    fallbackReportMode enum(quiet, local, global) local Specifies how to report a search error in 'Troubleshooting > Logs':
    • quiet: No log.
    • local: Error is reported in the Process log.
    • global: Error is reported in both Process and Global logs.
    maxQuerySize int - Queries larger than N are rejected.
    0 means no limit. This attribute is not implemented.
    oversizedQueryReportMode enum(quiet, local, global) local Specifies how to report an oversized query error in 'Troubleshooting > Logs':
    • quiet: No log.
    • local: Error is reported in the Process log.
    • global: Error is reported in both Process and Global logs.
    implicitSequenceOperator enum(AND, OR) AND The default query operator. If a user enters this query: exalead cloudview (without quotes), by default, it will be parsed as "exalead AND cloudview".

    XML representation
    <ParsingConfig allowOptions="false" nearDistance="16" fallbackOnError="false" fallbackReportMode="local" maxQuerySize="" oversizedQueryReportMode="local" implicitSequenceOperator="AND" ></ParsingConfig>
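    The effect of implicitSequenceOperator can be shown with a deliberately simplified Python sketch. It only knows the AND and OR operators and splits on white space, which is far less than the real query grammar handles. The function name is hypothetical.

```python
OPERATORS = {"AND", "OR"}


def apply_implicit_operator(query, implicit_sequence_operator="AND"):
    """Insert the implicit operator between adjacent words of a query."""
    out = []
    for token in query.split():
        # Two consecutive non-operator tokens get the implicit operator.
        if out and out[-1] not in OPERATORS and token not in OPERATORS:
            out.append(implicit_sequence_operator)
        out.append(token)
    return " ".join(out)
```

    With the default setting, exalead cloudview is read as exalead AND cloudview. Setting the attribute to OR widens the match to documents containing either word.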
  • PatternsExpansionConfig com.exalead.search.v30.PatternsExpansionConfig
    No documentation for this element.
    Parent objects
    • UQLConfig
    Attributes
    • performRegexpExpansion boolean
    • minPrefixSize int
    • minSuffixSize int
    • expansionTimeout long
    • allLanguages boolean
    • rejectedPatternAction enum(normalize, remove, error, keep)
    • rejectedPatternReportMode enum(quiet, local, global)
    • errorAction enum(normalize, remove, error)
    • errorReportMode enum(quiet, local, global)
    Name Type Default value Description
    performRegexpExpansion boolean False Allows query expansion on regular expressions.
    minPrefixSize int 3 Minimum number of characters required before wildcard (*) for pattern expansion to be carried out.
    minSuffixSize int 3 Minimum number of characters required after wildcard (*) for pattern expansion to be carried out.
    expansionTimeout long - Query expansion timeout for each pattern.
    allLanguages boolean False Performs query expansion on all languages, whatever the language of the original query.
    rejectedPatternAction enum(normalize, remove, error, keep) error Action to perform when a pattern is rejected (because the prefix or suffix is too short, or the regular expression is disabled):
    • normalize: Transforms the pattern into a simple string match.
    • remove: Removes the pattern from the query.
    • error: Returns an error.
    rejectedPatternReportMode enum(quiet, local, global) local Specifies how to report a rejected pattern error in 'Troubleshooting > Logs':
    • quiet: No log.
    • local: Error is reported in the Process log.
    • global: Error is reported in both Process and Global logs.
    errorAction enum(normalize, remove, error) error Action to take when expansion fails:
    • normalize: Transforms the pattern into a simple string match.
    • remove: Removes the pattern from the query.
    • error: Returns an error.
    errorReportMode enum(quiet, local, global) local Specifies how to report an expansion error in 'Troubleshooting > Logs':
    • quiet: No log.
    • local: Error is reported in the Process log.
    • global: Error is reported in both Process and Global logs.

    XML representation
    <PatternsExpansionConfig performRegexpExpansion="false" minPrefixSize="3" minSuffixSize="3" expansionTimeout="" allLanguages="false" rejectedPatternAction="error" rejectedPatternReportMode="local" errorAction="error" errorReportMode="local" ></PatternsExpansionConfig>
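    The following Python sketch illustrates how the size limits and rejectedPatternAction could fit together. It rests on two assumptions that the reference does not confirm: a wildcard pattern is accepted when at least one side of the wildcard is long enough, and "normalize" simply drops the wildcard. The "keep" value is left out because its behavior is not described above. Function names are hypothetical.

```python
def is_pattern_accepted(pattern, min_prefix_size=3, min_suffix_size=3):
    """Check a wildcard pattern against the minimum prefix and suffix sizes."""
    if "*" not in pattern:
        return True
    before = pattern.split("*")[0]       # characters before the first wildcard
    after = pattern.rsplit("*", 1)[1]    # characters after the last wildcard
    return len(before) >= min_prefix_size or len(after) >= min_suffix_size


def apply_rejected_action(pattern, action="error"):
    """Apply rejectedPatternAction to a rejected pattern."""
    if action == "normalize":
        return pattern.replace("*", "")  # assumed form of a simple string match
    if action == "remove":
        return None
    if action == "error":
        raise ValueError(f"rejected pattern: {pattern}")
    raise NotImplementedError(f"action not covered by this sketch: {action}")
```

    Under these assumptions, clo* is expanded with the default sizes, whereas cl* is rejected and handled according to rejectedPatternAction.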
  • QueryExpansionConfig com.exalead.search.v30.QueryExpansionConfig
    No documentation for this element.
    Parent objects
    • UQLConfig
    Attributes
    • timeoutMS int
    • unavailableDictionaryAction enum(error, ignore)
    • unavailableDictionaryReportMode enum(quiet, global, local)
    • detectCJ boolean
    • langDetect boolean
    • langDetectThreshold double
    • longFormsPruningThreshold int
    • performExactPhraseExpansion boolean
    • exactPhraseExpansionModulesBlackList string
    Name Type Default value Description
    timeoutMS int 500
    unavailableDictionaryAction enum(error, ignore) error Action to execute if the dictionary cannot be contacted for expansion.
    • error: fails the query
    • ignore: does not perform expansion, then continues
    unavailableDictionaryReportMode enum(quiet, global, local) global Value can be null or one of
    • quiet
    • global
    • local
    detectCJ boolean False Tries to detect Chinese and Japanese in the query and override the language of the request accordingly.

    If false, it uses the user-specified language for expansion.
    langDetect boolean True Tries to detect the text language from its unicode range.

    If false, it uses the user-specified language for expansion (xx is the default one).
    langDetectThreshold double 0.6 If langDetect is True, it tries to detect the text language.

    That language is used only if it accounts for a proportion of the string length greater than langDetectThreshold.
    longFormsPruningThreshold int 20 Prunes forms spanning several shorter forms from query nodes larger than N tokens, to avoid combination.
    performExactPhraseExpansion boolean False Activates query expansion on expressions contained within quotation marks.
    In this case, a number of query expansion modules can be disabled by exactPhraseExpansionModulesBlackList.
    exactPhraseExpansionModulesBlackList string If performExactPhraseExpansion is true, this list of comma-separated query expansion module names defines which modules are disabled when expanding exact phrases.

    Nested elements
    • LangDetectMapping
    Type Wrapper Description
    LangDetectMapping -
    XML representation
    <QueryExpansionConfig timeoutMS="500" unavailableDictionaryAction="error" unavailableDictionaryReportMode="global" detectCJ="false" langDetect="true" langDetectThreshold="0.6" longFormsPruningThreshold="20" performExactPhraseExpansion="false" exactPhraseExpansionModulesBlackList="" ></QueryExpansionConfig>
  • LangDetectMapping com.exalead.linguistic.v10.LangDetectMapping
    Maps a Unicode range to a default language for automatic language attribution.
    Parent objects
    • QueryExpansionConfig
    Attributes
    • defaultDefault iso code
    • defaultArabic iso code
    • defaultCyrillic iso code
    Name Type Default value Description
    defaultDefault iso code xx
    defaultArabic iso code ar
    defaultCyrillic iso code ru

    XML representation
    <LangDetectMapping defaultDefault="xx" defaultArabic="ar" defaultCyrillic="ru" ></LangDetectMapping>
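    Taken together, langDetect, langDetectThreshold and LangDetectMapping suggest a detection step along the lines of the Python sketch below. The Unicode blocks used (Arabic U+0600 to U+06FF, Cyrillic U+0400 to U+04FF) and the per-character counting are assumptions, since the exact ranges CloudView inspects are not given here.

```python
def detect_language(text, threshold=0.6, default_default="xx",
                    default_arabic="ar", default_cyrillic="ru"):
    """Pick a language from the share of characters in a Unicode range."""
    if not text:
        return default_default
    arabic = sum(1 for c in text if "\u0600" <= c <= "\u06ff")
    cyrillic = sum(1 for c in text if "\u0400" <= c <= "\u04ff")
    length = len(text)
    # The range must cover more than `threshold` of the string length.
    if arabic / length > threshold:
        return default_arabic
    if cyrillic / length > threshold:
        return default_cyrillic
    return default_default
```

    A query mixing a few Cyrillic characters into mostly Latin text stays below the 0.6 threshold and falls back to defaultDefault.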
  • CustomQueryExpansionModule com.exalead.mercury.mami.search.v20.CustomQueryExpansionModule
    Serves as a function for the query expansion config in query prefix handlers. See queryExpansionConfig in QueryPrefixHandler.
    Parent objects
    • UQLConfig
    Attributes
    • name string
    • classId string
    Name Type Default value Description
    name string - Name of the module. Referenced to target this module in query expansion config.
    classId string - Full name of the Java class implementing the custom module.

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* -
    XML representation
    <CustomQueryExpansionModule name="" classId="" ></CustomQueryExpansionModule>
  • PhoneticModule com.exalead.mercury.mami.search.v20.PhoneticModule
    Expands the query using a phonetic resource.
    Parent objects
    • UQLConfig
    Attributes
    • name string
    • language string
    • resourceFile string
    Name Type Default value Description
    name string - Name of the module. Referenced to target this module in query expansion config.
    language string - Optional language code. If not specified, all are supported.
    resourceFile string - Optional phonetic rules file. If not specified, the default rules file is used.

    XML representation
    <PhoneticModule name="" language="en" resourceFile="" ></PhoneticModule>
  • ApproximateModule com.exalead.mercury.mami.search.v20.ApproximateModule
    Serves as a function for the query expansion config in query prefix handlers. See queryExpansionConfig in QueryPrefixHandler.
    Parent objects
    • UQLConfig
    Attributes
    • name string
    Name Type Default value Description
    name string - Name of the module. Referenced to target this module in query expansion config.

    XML representation
    <ApproximateModule name="" ></ApproximateModule>
  • StemmingModule com.exalead.mercury.mami.search.v20.StemmingModule
    Extracts the root of a word.
    Parent objects
    • UQLConfig
    Attributes
    • name string
    • resourceDir string
    Name Type Default value Description
    name string - Name of the module. Referenced to target this module in query expansion config.
    resourceDir string - Optional resource directory. This attribute is not implemented.

    XML representation
    <StemmingModule name="" resourceDir="" ></StemmingModule>
  • RelatedTermModule com.exalead.mercury.mami.search.v20.RelatedTermModule
    Expands the query using related terms.
    Related terms are popular n-grams detected at indexing time. They make it possible to match popular concepts spanning several words.
    Parent objects
    • UQLConfig
    Attributes
    • name string
    • minSpan int
    • maxSpan int
    • keepLongestMatch boolean
    • withPartOfSpeech boolean
    Name Type Default value Description
    name string - Name of the module. Referenced to target this module in query expansion config.
    minSpan int 3 Minimum number of words (excluding stop words) in a generated related term.
    maxSpan int 6 Maximum number of words (excluding stop words) in a generated related term.
    keepLongestMatch boolean True Keeps only the longest match.

    For example, if you have 5 tokens ('a', 'b', 'c', 'd', 'e') and 4 related terms ('a', 'a-c', 'b-c-d' and 'd-e'), this option keeps only 'b-c-d' and removes all other related terms.
    withPartOfSpeech boolean True Automatically adds a PartOfSpeechTagger to the list of processors.

    XML representation
    <RelatedTermModule name="" minSpan="3" maxSpan="6" keepLongestMatch="true" withPartOfSpeech="true" ></RelatedTermModule>
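    The keepLongestMatch example can be replayed with a very small Python sketch. It assumes each related term is written as its words joined by '-' and that "longest" means the term with the most words. How ties or non-overlapping terms are treated is not specified in this reference, so the sketch simply keeps the first longest term.

```python
def keep_longest_match(related_terms):
    """Keep only the related term spanning the most words."""
    if not related_terms:
        return []
    longest = max(related_terms, key=lambda term: len(term.split("-")))
    return [longest]
```

    Applied to the terms 'a', 'a-c', 'b-c-d' and 'd-e', only 'b-c-d' remains.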
  • NGramModule com.exalead.mercury.mami.search.v20.NGramModule
    Serves as a function for the query expansion config in query prefix handlers. See queryExpansionConfig in QueryPrefixHandler.
    Parent objects
    • UQLConfig
    Attributes
    • name string
    • minSpan int
    • maxSpan int
    • keepLongestMatch boolean
    • withLemmatizer boolean
    • withStopWordDict boolean
    • addCanonicalForm boolean
    Name Type Default value Description
    name string - Name of the module. Referenced to target this module in query expansion config.
    minSpan int 2 Minimum number of words (excluding stop words) in a generated related term.
    maxSpan int 3 Maximum number of words (excluding stop words) in a generated related term.
    keepLongestMatch boolean True Keeps only the longest match.
    For example, if you have 5 tokens ('a', 'b', 'c', 'd', 'e') and 4 related terms ('a', 'a-c', 'b-c-d' and 'd-e'), this option keeps only 'b-c-d' and removes all other related terms.
    withLemmatizer boolean True Adds a Lemmatizer to the list of processors automatically.
    withStopWordDict boolean True Adds a StopWord dictionary to the list of processors automatically.
    addCanonicalForm boolean False

    XML representation
    <NGramModule name="" minSpan="2" maxSpan="3" keepLongestMatch="true" withLemmatizer="true" withStopWordDict="true" addCanonicalForm="false" ></NGramModule>
  • LemmatizationModule com.exalead.mercury.mami.search.v20.LemmatizationModule
    Semantic module expanding the query using the lemmatized forms of terms.
    The lemmatized form is the semantic root of a word.
    Parent objects
    • UQLConfig
    Attributes
    • name string
    • language string
    • resourceDir string
    Name Type Default value Description
    name string - Name of the module. Referenced to target this module in query expansion config.
    language string - Language in which the lemmatization should be enabled.

    If null, all languages are enabled.
    resourceDir string - Optional resource directory. The default value is resource:///lemmatizer/LANGUAGE

    XML representation
    <LemmatizationModule name="" language="en" resourceDir="" ></LemmatizationModule>
  • SemanticProcessorModule com.exalead.mercury.mami.search.v20.SemanticProcessorModule
    Query expansion module embedding a semantic processing pipeline.

    The resulting annotation can be used to expand the query.
    Parent objects
    • UQLConfig
    Attributes
    • name string
    • annotation string
    • indexKind int
    Name Type Default value Description
    name string - Name of the module. Referenced to target this module in query expansion config.
    annotation string - The annotation name to use for expansion.
    indexKind int 2 The targeted index kind (0: exact search, 1: lowercase search, 2: normalized search).

    Nested elements
    • SemanticProcessor*
    Type Wrapper Description
    SemanticProcessor* - The underlying semantic processors plugged in the pipeline.
    XML representation
    <SemanticProcessorModule name="" annotation="" indexKind="2" ></SemanticProcessorModule>
  • SynonymsModule com.exalead.mercury.mami.search.v20.SynonymsModule
    Synonyms module broadening the scope of a user query.
    Parent objects
    • UQLConfig
    Attributes
    • name string
    • resourceFile string
    • scoreMultiWordsExpressionsAsSingleWords boolean
    Name Type Default value Description
    name string - Name of the module. Referenced to target this module in query expansion config.
    resourceFile string - The compiled synonyms configuration file, cannot be null.
    scoreMultiWordsExpressionsAsSingleWords boolean False If true, a multi-word synonym is given the maximum score of the underlying words multiplied by the number of original words spanned.

    By default, the sum of the underlying word scores is calculated.

    XML representation
    <SynonymsModule name="" resourceFile="" scoreMultiWordsExpressionsAsSingleWords="false" ></SynonymsModule>
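    The two scoring modes of scoreMultiWordsExpressionsAsSingleWords differ as in the Python sketch below. The per-word scores are made-up inputs and the function name is hypothetical. Only the max-times-count versus sum arithmetic comes from the description above.

```python
def multi_word_synonym_score(word_scores, score_as_single_words=False):
    """Score a multi-word synonym from the scores of the words it spans."""
    if not word_scores:
        return 0.0
    if score_as_single_words:
        # Maximum underlying score times the number of original words spanned.
        return max(word_scores) * len(word_scores)
    # Default: sum of the underlying word scores.
    return sum(word_scores)
```

    For word scores 2.0, 3.0 and 1.0, the default mode yields 6.0 and the single-word mode yields 9.0, so enabling the option never lowers the score of a multi-word synonym.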
  • NormalizationExceptionsModule com.exalead.mercury.mami.search.v20.NormalizationExceptionsModule
    Defines a list of words that should not be normalized.
    Parent objects
    • UQLConfig
    Attributes
    • name string
    • resourceFile string
    Name Type Default value Description
    name string - Name of the module. Referenced to target this module in query expansion config.
    resourceFile string resource://normalization-exceptions/synonyms.bin The compiled normalization exceptions, cannot be null.

    XML representation
    <NormalizationExceptionsModule name="" resourceFile="resource://normalization-exceptions/synonyms.bin" ></NormalizationExceptionsModule>
  • URLMatcherModule com.exalead.mercury.mami.search.v20.URLMatcherModule
    Serves as a function for the query expansion config in query prefix handlers. See queryExpansionConfig in QueryPrefixHandler.
    Parent objects
    • UQLConfig
    Attributes
    • name string
    Name Type Default value Description
    name string - Name of the module. Referenced to target this module in query expansion config.

    XML representation
    <URLMatcherModule name="" ></URLMatcherModule>
  • TemplatePrefixHandler com.exalead.mercury.mami.search.v20.TemplatePrefixHandler
    Replaces its content with the expansion of its template attribute, in which the __QUERY__ placeholder has itself been replaced by the content.
    This prefix handler can be used as default prefix handler with some limitations (not compatible with linguistic prefix handlers like spellslike:).
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • template string
    Name Type Default value Description
    name string - Name of the prefix handler that should be handled.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False If true, the prefix handler matches every prefix that begins with the value of "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    template string - Template to use to replace the content of this prefix handler.

    Occurrences of __QUERY__ inside of this template are replaced by the content itself.

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", contains the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <TemplatePrefixHandler name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" template="" ></TemplatePrefixHandler>
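    As an illustration of the __QUERY__ substitution, the fragment below sketches a handler that fans one prefix out to two others. The handler name "anywhere", the target prefixes "title" and "text", and the OR-combined template are assumptions made for this example, not values shipped with the product; the template must be valid query syntax for your own configuration.

    ```xml
    <!-- Hypothetical handler: for the user query anywhere:report, the
         __QUERY__ placeholder in the template is replaced by "report". -->
    <TemplatePrefixHandler name="anywhere" prefixMatch="false"
        template="title:__QUERY__ OR text:__QUERY__" ></TemplatePrefixHandler>
    ```

    Under these assumptions, anywhere:report would be rewritten to title:report OR text:report before the rest of the query processing takes place.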
  • CategoryPrefixHandler com.exalead.mercury.mami.search.v20.CategoryPrefixHandler
    Searches for paths in a category field.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • indexField string
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • root string
    • form enum(exact, lowercase, normalized)
    • tokenizationConfig string
    Name Type Default value Description
    indexField string - Index field to be targeted by this prefix.
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False If true, the prefix handler matches every prefix that begins with the value of "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    root string - Root category for this prefix.
    form enum(exact, lowercase, normalized) normalized Form of the category path.
    Value can be null or one of
    • exact
    • lowercase
    • normalized
    tokenizationConfig string tok0 Tokenization config to use.

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", contains the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <CategoryPrefixHandler indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" root="" form="normalized" tokenizationConfig="tok0" ></CategoryPrefixHandler>
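    The fragment below sketches a filled-in category prefix handler. The index field "categories", the prefix name "source", and the root path "Top/source" are hypothetical values chosen for illustration; substitute the category field and root used in your own index.

    ```xml
    <!-- Hypothetical values: "categories" index field, "Top/source" root.
         form="normalized" is the documented default, shown for clarity. -->
    <CategoryPrefixHandler indexField="categories" name="source"
        root="Top/source" form="normalized"
        tokenizationConfig="tok0" ></CategoryPrefixHandler>
    ```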
  • RefinePrefixHandler com.exalead.mercury.mami.search.v20.RefinePrefixHandler
    Base class to configure a prefix handler targeting an index field.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • indexField string
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • excluded boolean
    Name Type Default value Description
    indexField string - Index field to be targeted by this prefix.
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False If true, the prefix handler matches every prefix that begins with the value of "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    excluded boolean False Specifies whether the category must be included or excluded.

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", contains the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <RefinePrefixHandler indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" excluded="false" ></RefinePrefixHandler>
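    To make the prefixMatch behavior concrete, the sketch below reuses the name="prop" example from the attribute table. The indexField value "categories" is a hypothetical placeholder.

    ```xml
    <!-- name and prefixMatch follow the example in the attribute table;
         the indexField value is hypothetical. With prefixMatch="true",
         queries such as props:value and property:value are both routed
         to this handler. -->
    <RefinePrefixHandler indexField="categories" name="prop"
        prefixMatch="true" excluded="false" ></RefinePrefixHandler>
    ```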
  • DynamicNumericalPrefixHandler com.exalead.mercury.mami.search.v20.DynamicNumericalPrefixHandler
    Searches for numerical values in a dynamic field.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • meta string
    • invalidValueAction enum(remove, ignore, error)
    • indexField string
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • defaultOperator enum(eq, le, lt, ge, gt, ne, range)
    Name Type Default value Description
    meta string - Meta name to search within.

    If null, you need to use the syntax prefix:meta:VALUE
    invalidValueAction enum(remove, ignore, error) error What to do when an invalid numerical value is passed:
    • remove: removes the whole prefix
    • ignore: (not documented)
    • error: fails the query
    indexField string - Index field to be targeted by this prefix.
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False If true, the prefix handler matches every prefix that begins with the value of "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    defaultOperator enum(eq, le, lt, ge, gt, ne, range) eq Default numerical operator to use if the field is given with ':'

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", contains the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <DynamicNumericalPrefixHandler invalidValueAction="error" indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" defaultOperator="eq" ></DynamicNumericalPrefixHandler>
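    The two variants below sketch how the meta attribute changes the query syntax. The index field "dynamic_numbers", the meta "weight", and the prefix names are hypothetical. The first variant assumes that, when meta is set, the value follows the prefix directly.

    ```xml
    <!-- Variant 1 (hypothetical names): meta is fixed in the configuration,
         so a query such as weight:10 is assumed to target the "weight" meta. -->
    <DynamicNumericalPrefixHandler meta="weight" invalidValueAction="error"
        indexField="dynamic_numbers" name="weight"
        defaultOperator="eq" ></DynamicNumericalPrefixHandler>

    <!-- Variant 2: meta is left unset, so the meta name is given in the
         query with the prefix:meta:VALUE syntax, for example num:weight:10. -->
    <DynamicNumericalPrefixHandler invalidValueAction="error"
        indexField="dynamic_numbers" name="num"
        defaultOperator="eq" ></DynamicNumericalPrefixHandler>
    ```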
  • DynamicDatePrefixHandler com.exalead.mercury.mami.search.v20.DynamicDatePrefixHandler
    Searches for dates and/or times in a dynamic field.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • meta string
    • invalidValueAction enum(remove, ignore, error)
    • indexField string
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • time boolean
    • defaultOperator enum(eq, le, lt, ge, gt, ne, range)
    Name Type Default value Description
    meta string - Meta name to search within.

    If null, you need to use the syntax prefix:meta:VALUE
    invalidValueAction enum(remove, ignore, error) error What to do when an invalid value is passed:
    • remove: removes the whole prefix
    • ignore: (not documented)
    • error: fails the query
    indexField string - Index field to be targeted by this prefix.
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False If true, the prefix handler matches every prefix that begins with the value of "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    time boolean False Allows a "date+time" input.
    defaultOperator enum(eq, le, lt, ge, gt, ne, range) eq Default numerical operator to use if the field is given with ':'

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", contains the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <DynamicDatePrefixHandler invalidValueAction="error" indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" time="false" defaultOperator="eq" ></DynamicDatePrefixHandler>
  • DynamicFullTextPrefixHandler (deprecated) com.exalead.mercury.mami.search.v20.DynamicFullTextPrefixHandler
    Root class for prefix handlers searching for alphanumerical values in a dynamic field.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • semanticContent boolean
    • indexFields string
    • performMAX boolean
    • patternMaxMatches int
    • tokenizationConfig string
    • meta string
    • invalidValueAction enum(remove, ignore, error)
    • indexField string
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • matchingMode enum(exact, lowercase, normalized, exact_lowercase, exact_lowercase_normalized, lowercase_normalized)
    Name Type Default value Description
    semanticContent boolean true Is the content of this prefix "semantic"?

    If false, the content is neither tokenized nor passed through the linguistic expansion phase.
    indexFields string - Comma-separated list of index fields to be targeted by this prefix handler.
    performMAX boolean False Should we perform a MAX-like query rather than simple OR when targeting multiple fields?
    patternMaxMatches int 50 Max number of matches for the patterns under this prefix handler.
    tokenizationConfig string tok0 Tokenization config to use.
    meta string - Meta name to search within.

    If null, you need to use the syntax prefix:meta:VALUE
    invalidValueAction enum(remove, ignore, error) error What to do when an invalid value is passed:
    • remove: removes the whole prefix
    • ignore: (not documented)
    • error: fails the query
    indexField string - Index field to be targeted by this prefix.
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False If true, the prefix handler matches every prefix that begins with the value of "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    matchingMode enum(exact, lowercase, normalized, exact_lowercase, exact_lowercase_normalized, lowercase_normalized) normalized The matching mode defines how the query words are looked up in the index.
    Value can be null or one of
    • exact
    • lowercase
    • normalized
    • exact_lowercase
    • exact_lowercase_normalized
    • lowercase_normalized

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", contains the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <DynamicFullTextPrefixHandler semanticContent="true" indexFields="" performMAX="false" patternMaxMatches="50" tokenizationConfig="tok0" invalidValueAction="error" indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" matchingMode="normalized" ></DynamicFullTextPrefixHandler>
  • DynamicRawTextPrefixHandler com.exalead.mercury.mami.search.v20.DynamicRawTextPrefixHandler
    Root class for prefix handlers searching for alphanumerical values in a dynamic field.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • semanticContent boolean
    • indexFields string
    • performMAX boolean
    • patternMaxMatches int
    • tokenizationConfig string
    • meta string
    • invalidValueAction enum(remove, ignore, error)
    • indexField string
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • kind int
    Name Type Default value Description
    semanticContent boolean true Is the content of this prefix "semantic"?

    If false, the content is neither tokenized nor passed through the linguistic expansion phase.
    indexFields string - Comma-separated list of index fields to be targeted by this prefix handler.
    performMAX boolean False Should we perform a MAX-like query rather than simple OR when targeting multiple fields?
    patternMaxMatches int 50 Max number of matches for the patterns under this prefix handler.
    tokenizationConfig string tok0 Tokenization config to use.
    meta string - Meta name to search within.

    If null, you need to use the syntax prefix:meta:VALUE
    invalidValueAction enum(remove, ignore, error) error What to do when an invalid value is passed:
    • remove: removes the whole prefix
    • ignore: (not documented)
    • error: fails the query
    indexField string - Index field to be targeted by this prefix.
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False If true, the prefix handler matches every prefix that begins with the value of "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    kind int -

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", contains the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <DynamicRawTextPrefixHandler semanticContent="true" indexFields="" performMAX="false" patternMaxMatches="50" tokenizationConfig="tok0" invalidValueAction="error" indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" kind="" ></DynamicRawTextPrefixHandler>
  • NumericalPrefixHandler com.exalead.mercury.mami.search.v20.NumericalPrefixHandler
    Searches for numerical values in a numerical field (integer or double).
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • defaultOperator enum(eq, le, lt, ge, gt, ne, range)
    • invalidValueAction enum(remove, ignore, error)
    • useRetrieved boolean
    • lsb int
    • msb int
    • indexField string
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    Name Type Default value Description
    defaultOperator enum(eq, le, lt, ge, gt, ne, range) eq Default numerical operator to use if the field is given with ':'
    invalidValueAction enum(remove, ignore, error) error What to do when an invalid numerical value is passed:
    • remove: removes the whole prefix
    • ignore: (not documented)
    • error: fails the query
    useRetrieved boolean False Search using the "retrieved" part of the index.

    Searching in the retrieved part of a numerical field is faster when this condition is not very restrictive (i.e. when it does not divide the result set by more than 5).

    It also provides the ability to restrict search to a range of bits in the numerical field.

    However, it can only be used on retrievable fields.

    Note that using this prefix handler is the only way to search in virtual fields.
    lsb int 0 Least significant bit to search in (only if using retrieved)
    msb int 63 Most significant bit to search in (only if using retrieved)
    indexField string - Index field to be targeted by this prefix.
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False If true, the prefix handler matches every prefix that begins with the value of "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", contains the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <NumericalPrefixHandler defaultOperator="eq" invalidValueAction="error" useRetrieved="false" lsb="0" msb="63" indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" ></NumericalPrefixHandler>
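    As a sketch of the useRetrieved, lsb, and msb attributes working together, the fragment below restricts the search to the low 16 bits of a numerical field. The field and prefix name "flags" are hypothetical, and the field is assumed to be retrievable, as useRetrieved requires.

    ```xml
    <!-- Hypothetical field and prefix names. useRetrieved="true" requires a
         retrievable field; lsb and msb restrict the search to bits 0
         through 15 of the stored value (the defaults 0 and 63 cover all
         64 bits). -->
    <NumericalPrefixHandler defaultOperator="eq" invalidValueAction="error"
        useRetrieved="true" lsb="0" msb="15"
        indexField="flags" name="flags" ></NumericalPrefixHandler>
    ```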
  • DatePrefixHandler com.exalead.mercury.mami.search.v20.DatePrefixHandler
    Searches for dates and/or times in a date field.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • defaultOperator enum(eq, le, lt, ge, gt, ne, range)
    • invalidValueAction enum(remove, ignore, error)
    • useRetrieved boolean
    • lsb int
    • msb int
    • indexField string
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • time boolean
    • format string
    Name Type Default value Description
    defaultOperator enum(eq, le, lt, ge, gt, ne, range) eq Default numerical operator to use if the field is given with ':'
    invalidValueAction enum(remove, ignore, error) error What to do when an invalid value is passed:
    • remove: removes the whole prefix
    • ignore: (not documented)
    • error: fails the query
    useRetrieved boolean False Search using the "retrieved" part of the index.

    Searching in the retrieved part of a numerical field is faster when this condition is not very restrictive (i.e. when it does not divide the result set by more than 5).

    It also provides the ability to restrict search to a range of bits in the numerical field.

    However, it can only be used on retrievable fields.

    Note that using this prefix handler is the only way to search in virtual fields.
    lsb int 0 Least significant bit to search in (only if using retrieved)
    msb int 63 Most significant bit to search in (only if using retrieved)
    indexField string - Index field to be targeted by this prefix.
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False If true, the prefix handler matches every prefix that begins with the value of "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    time boolean False Allows a "date+time" input.
    format string -

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", contains the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <DatePrefixHandler defaultOperator="eq" invalidValueAction="error" useRetrieved="false" lsb="0" msb="63" indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" time="false" format="" ></DatePrefixHandler>
  • UnixTimestampPrefixHandler com.exalead.mercury.mami.search.v20.UnixTimestampPrefixHandler
    Searches for dates and/or times in a date field, in Unix Timestamp format.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • defaultOperator enum(eq, le, lt, ge, gt, ne, range)
    • invalidValueAction enum(remove, ignore, error)
    • useRetrieved boolean
    • lsb int
    • msb int
    • indexField string
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • time boolean
    Name Type Default value Description
    defaultOperator enum(eq, le, lt, ge, gt, ne, range) eq Default numerical operator to use if the field is given with ':'
    invalidValueAction enum(remove, ignore, error) error What to do when an invalid value is passed:
    • remove: removes the whole prefix
    • ignore: (not documented)
    • error: fails the query
    useRetrieved boolean False Search using the "retrieved" part of the index.

    Searching in the retrieved part of a numerical field is faster when this condition is not very restrictive (i.e. when it does not divide the result set by more than 5).

    It also provides the ability to restrict search to a range of bits in the numerical field.

    However, it can only be used on retrievable fields.

    Note that using this prefix handler is the only way to search in virtual fields.
    lsb int 0 Least significant bit to search in (only if using retrieved)
    msb int 63 Most significant bit to search in (only if using retrieved)
    indexField string - Index field to be targeted by this prefix.
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False If true, the prefix handler matches every prefix that begins with the value of "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    time boolean True If true, generates a DateTime. Otherwise, generates a Date.

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", you will find here the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <UnixTimestampPrefixHandler defaultOperator="eq" invalidValueAction="error" useRetrieved="false" lsb="0" msb="63" indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" time="true" ></UnixTimestampPrefixHandler>
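    As an illustration of the attributes above, the following sketch declares a prefix handler whose ':' operator is interpreted as "greater than or equal". The prefix name "since" and the index field "lastmodifieddate" are hypothetical, and the fragment assumes that omitted attributes keep the defaults listed in the table.

```xml
<!-- Sketch only: "since" and "lastmodifieddate" are placeholder names. -->
<UnixTimestampPrefixHandler name="since" indexField="lastmodifieddate" defaultOperator="ge" invalidValueAction="error" time="true" ></UnixTimestampPrefixHandler>
```

    With such a configuration, a query written with ':' (for example since:<value>) would use the ge operator, and an invalid numerical value would fail the query.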
  • PositionPrefixHandler com.exalead.mercury.mami.search.v20.PositionPrefixHandler
    Searches in an alphanumerical field using the anchoring position (i.e. restricting the match to be at a specific position).
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • indexField string
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • matchingMode enum(exact, lowercase, normalized, exact_lowercase, exact_lowercase_normalized, lowercase_normalized)
    • positionMatchMode enum(exact, startsWith, endsWith)
    • tokenizationConfig string
    Name Type Default value Description
    indexField string - Index field to be targeted by this prefix.
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False The prefix handler matches all strings beginning with "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    matchingMode enum(exact, lowercase, normalized, exact_lowercase, exact_lowercase_normalized, lowercase_normalized) normalized The matching mode defines how the query words are looked up in the index.
    Value can be null or one of
    • exact
    • lowercase
    • normalized
    • exact_lowercase
    • exact_lowercase_normalized
    • lowercase_normalized
    positionMatchMode enum(exact, startsWith, endsWith) exact Position anchoring mode for the match: one of 'exact', 'startsWith' or 'endsWith'.
    tokenizationConfig string -

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", you will find here the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <PositionPrefixHandler indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" matchingMode="normalized" positionMatchMode="exact" tokenizationConfig="tok0" ></PositionPrefixHandler>
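    A possible configuration, given as a sketch rather than a reference setup: the prefix name "titlestart" and the index field "title" are hypothetical, and omitted attributes are assumed to keep their defaults.

```xml
<!-- Sketch only: "titlestart" and "title" are placeholder names. -->
<PositionPrefixHandler name="titlestart" indexField="title" matchingMode="lowercase" positionMatchMode="startsWith" ></PositionPrefixHandler>
```

    Going by the element description, a query such as titlestart:annual would then presumably match only when the word is anchored at the start of the field. The exact anchoring semantics of each mode are not described in this reference.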
  • SitePrefixHandler com.exalead.mercury.mami.search.v20.SitePrefixHandler
    Searches for parts of URLs.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • indexField string
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • additionalIndexField string
    • groupsByToRemove string
    Name Type Default value Description
    indexField string - Index field to be targeted by this prefix.
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False The prefix handler matches all strings beginning with "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    additionalIndexField string -
    groupsByToRemove string - Comma-separated names of the GroupBy nodes to remove.

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", you will find here the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <SitePrefixHandler indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" additionalIndexField="" groupsByToRemove="" ></SitePrefixHandler>
  • GeoPrefixHandler com.exalead.mercury.mami.search.v20.GeoPrefixHandler
    Sets the geographic field for WITHIN and DISTANCE searches.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • indexField string
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    Name Type Default value Description
    indexField string - Index field to be targeted by this prefix.
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False The prefix handler matches all strings beginning with "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", you will find here the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <GeoPrefixHandler indexField="" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" ></GeoPrefixHandler>
  • FullTextPrefixHandler com.exalead.mercury.mami.search.v20.FullTextPrefixHandler
    Searches in alphanumerical fields (both static and dynamic ones).
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • semanticContent boolean
    • indexFields string
    • indexFieldWeights string
    • performMAX boolean
    • patternMaxMatches int
    • usePrefixedMappings boolean
    • patternSearch boolean
    • tokenizationConfig string
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • matchingMode enum(exact, lowercase, normalized, exact_lowercase, exact_lowercase_normalized, lowercase_normalized, custom)
    • customKind int
    • useNestedPrefixAsDynamicMetaName boolean
    • forcedDynamicMetaName string
    • dictionaryName string
    Name Type Default value Description
    semanticContent boolean True Is the content of this prefix handler "semantic"?

    If false, the content will neither be tokenized nor passed through the linguistic expansion phase.
    indexFields
    (UI: Index fields)
    string - Comma-separated list of index fields targeted by this prefix handler.
    indexFieldWeights
    (UI: Index fields weights)
    string - Optional comma-separated list of weights applied to each index field targeted by this prefix handler.
    performMAX
    (UI: Perform MAX)
    boolean False Should we perform a MAX-like query rather than a simple OR when targeting multiple fields?
    patternMaxMatches
    (UI: Max pattern matches)
    int 50 Max number of matches for the patterns under this prefix handler.
    usePrefixedMappings boolean False Prefixes the query value with the prefix name followed by '#'.

    This must be used with the "prefixWithContext" option of analysis mappings.

    For example, <FullTextPrefixHandler name='stuff' prefixMatch='true' usePrefixedMappings='true' indexFields='everything'/> transforms the query stuff_number1:test into everything:stuff_number1#test
    patternSearch
    (UI: Use field dictionaries for pattern search)
    boolean False Uses index field dictionaries for pattern expansion rather than linguistic dictionaries.
    tokenizationConfig string -
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False The prefix handler matches all strings beginning with "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    matchingMode
    (UI: Matching mode)
    enum(exact, lowercase, normalized, exact_lowercase, exact_lowercase_normalized, lowercase_normalized, custom) normalized Normalization level for lookup of the words associated with this prefix handler.

    Using the 'custom' matching mode requires the custom kind to be set.

    Value can be null or one of
    • exact
    • lowercase
    • normalized
    • exact_lowercase
    • exact_lowercase_normalized
    • lowercase_normalized
    • custom
    customKind
    (UI: Custom kind)
    int - Custom indexing kind for lookup of the words associated with this prefix handler.

    This requires the matching mode to be set to 'custom'.

    Custom kinds must be used together with custom content targets at indexing time.
    useNestedPrefixAsDynamicMetaName
    (UI: Use nested prefix as dyn. meta)
    boolean False Uses a nested prefix handler as dynamic meta name to search within a dynamic alphanumerical field.

    This requires the Store meta names option to be enabled on the dynamic field.

    This is incompatible with the "forcedDynamicMetaName" option.


    For example, on a prefix handler named "myprefix" configured to target a "mydynfield" index field, a "myprefix:mymeta:myvalue" query will search for "myvalue" in the "mymeta" meta of "mydynfield".

    Note that a "myprefix:myvalue" query will not look in any meta, and will not find anything.
    forcedDynamicMetaName
    (UI: Dyn. meta name)
    string - Dynamic meta to search when targeting a dynamic alphanumerical field.

    Use this to restrict the search in a dynamic field to text that appears within this meta.


    This requires the Store meta names option to be enabled on the dynamic field.
    This is incompatible with the Use nested prefix as dyn. meta option.


    For example, with this option set to "mymeta", on a prefix handler named "myprefix" configured to target index field "mydynfield", a query "myprefix:myvalue" will search for "myvalue" in the "mymeta" meta of "mydynfield".

    You can also set a comma-separated list of metas to mimic the behavior of a list of index fields (the search is an OR over the different metas).
    dictionaryName
    (UI: Dictionary)
    string - Dictionary to target

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", you will find here the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <FullTextPrefixHandler semanticContent="true" indexFields="" indexFieldWeights="" performMAX="false" patternMaxMatches="50" usePrefixedMappings="false" patternSearch="false" tokenizationConfig="tok0" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" matchingMode="normalized" customKind="" useNestedPrefixAsDynamicMetaName="false" forcedDynamicMetaName="" dictionaryName="" ></FullTextPrefixHandler>
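    The fragment below sketches three ways the attributes above might be combined. All prefix names, index fields and weight values are hypothetical (the dynamic-field names reuse those from the examples in the table), and omitted attributes are assumed to keep their defaults.

```xml
<!-- Sketch only: prefix names, index fields and weights are placeholders. -->

<!-- 1. One prefix targeting two index fields with weights 3 and 1,
     combined with a MAX-like query instead of a simple OR. -->
<FullTextPrefixHandler name="headline" aliases="hl" indexFields="title,subtitle" indexFieldWeights="3,1" performMAX="true" matchingMode="normalized" ></FullTextPrefixHandler>

<!-- 2. Fixed meta: myprefix:myvalue searches for "myvalue"
     in the "mymeta" meta of "mydynfield". -->
<FullTextPrefixHandler name="myprefix" indexFields="mydynfield" forcedDynamicMetaName="mymeta" ></FullTextPrefixHandler>

<!-- 3. Nested prefix: the meta is named in the query itself,
     as in myprefix2:mymeta:myvalue. -->
<FullTextPrefixHandler name="myprefix2" indexFields="mydynfield" useNestedPrefixAsDynamicMetaName="true" ></FullTextPrefixHandler>
```

    Variants 2 and 3 are kept on separate handlers because the two options are documented as incompatible. Both require the Store meta names option to be enabled on the dynamic field.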
  • RawTextPrefixHandler com.exalead.mercury.mami.search.v20.RawTextPrefixHandler
    No documentation for this element.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • semanticContent boolean
    • indexFields string
    • indexFieldWeights string
    • performMAX boolean
    • patternMaxMatches int
    • usePrefixedMappings boolean
    • patternSearch boolean
    • tokenizationConfig string
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • kind int
    Name Type Default value Description
    semanticContent boolean True Is the content of this prefix handler "semantic"?

    If false, the content will neither be tokenized nor passed through the linguistic expansion phase.
    indexFields
    (UI: Index fields)
    string - Comma-separated list of index fields targeted by this prefix handler.
    indexFieldWeights
    (UI: Index fields weights)
    string - Optional comma-separated list of weights applied to each index field targeted by this prefix handler.
    performMAX
    (UI: Perform MAX)
    boolean False Should we perform a MAX-like query rather than a simple OR when targeting multiple fields?
    patternMaxMatches
    (UI: Max pattern matches)
    int 50 Max number of matches for the patterns under this prefix handler.
    usePrefixedMappings boolean False Prefixes the query value with the prefix name followed by '#'.

    This must be used with the "prefixWithContext" option of analysis mappings.

    For example, <FullTextPrefixHandler name='stuff' prefixMatch='true' usePrefixedMappings='true' indexFields='everything'/> transforms the query stuff_number1:test into everything:stuff_number1#test
    patternSearch
    (UI: Use field dictionaries for pattern search)
    boolean False Uses index field dictionaries for pattern expansion rather than linguistic dictionaries.
    tokenizationConfig string -
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False The prefix handler matches all strings beginning with "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    kind int -

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", you will find here the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <RawTextPrefixHandler semanticContent="true" indexFields="" indexFieldWeights="" performMAX="false" patternMaxMatches="50" usePrefixedMappings="false" patternSearch="false" tokenizationConfig="tok0" name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" kind="" ></RawTextPrefixHandler>
  • LinguisticOptionsPrefixHandler com.exalead.mercury.mami.search.v20.LinguisticOptionsPrefixHandler
    Sets a query expansion config on the prefix handler content, but does not actually perform search.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • dictionaryName string
    Name Type Default value Description
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False The prefix handler matches all strings beginning with "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    dictionaryName
    (UI: Dictionary)
    string - Dictionary to target

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", you will find here the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <LinguisticOptionsPrefixHandler name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" dictionaryName="" ></LinguisticOptionsPrefixHandler>
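    As a sketch of how this element might be declared: the prefix name "strict" and the query expansion config name "qe_strict" are hypothetical, and the fragment assumes that a query expansion config with that name is defined elsewhere in the configuration.

```xml
<!-- Sketch only: "strict" and "qe_strict" are placeholder names. -->
<LinguisticOptionsPrefixHandler name="strict" queryExpansionConfig="qe_strict" prefixMatch="false" ></LinguisticOptionsPrefixHandler>
```

    Per the element description, such a handler only sets the expansion config on the content that follows the prefix. It does not itself perform a search.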
  • SimilarDocumentPrefixHandler com.exalead.mercury.mami.search.v20.SimilarDocumentPrefixHandler
    Manually looks up similar values in documents.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • similarityThreshold double
    • minNumberOfKeywordPerDoc long
    • minNumberOfMatchedKeywords long
    • similarityVirtualField string
    • dictionary string
    • keywordField string
    Name Type Default value Description
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False The prefix handler matches all strings beginning with "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    similarityThreshold double 0.1 The minimum similarity score for two documents to be considered similar.
    Value must be between 0 and 1.
    minNumberOfKeywordPerDoc long 5 Documents with fewer keywords than this value are ignored.
    Value must be greater than or equal to 0.
    minNumberOfMatchedKeywords long 4 Documents that do not share at least this number of keywords with the reference document are ignored.
    Value must be greater than or equal to 0.
    similarityVirtualField string similarity Name of the virtual field computing the final similarity score.
    dictionary string dict0 Related-terms dictionary to use.
    keywordField string keywordv2 Searchable value field indexing the related-terms.

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", you will find here the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <SimilarDocumentPrefixHandler name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" similarityThreshold="0.1" minNumberOfKeywordPerDoc="5" minNumberOfMatchedKeywords="4" similarityVirtualField="similarity" dictionary="dict0" keywordField="keywordv2" ></SimilarDocumentPrefixHandler>
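    For illustration, a stricter variant of the defaults could look like the sketch below. The prefix name "similar" and the chosen threshold values are hypothetical, while the virtual field, dictionary and keyword field keep the default names listed in the table.

```xml
<!-- Sketch only: "similar" and the three threshold values are placeholders. -->
<SimilarDocumentPrefixHandler name="similar" similarityThreshold="0.3" minNumberOfKeywordPerDoc="10" minNumberOfMatchedKeywords="6" similarityVirtualField="similarity" dictionary="dict0" keywordField="keywordv2" ></SimilarDocumentPrefixHandler>
```

    Compared with the defaults (0.1, 5 and 4), these values would ignore more documents: those with fewer than 10 keywords, those sharing fewer than 6 keywords with the reference document, and those scoring below 0.3.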
  • SplitPrefixHandler com.exalead.mercury.mami.search.v20.SplitPrefixHandler
    Searches for expressions, within the bounds of separators.

    For example, with a Split prefix handler named "mypage" using separator INPAGE, mypage:(a AND b) will only match if "a" and "b" appear on the same page in the document.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • sep enum(INPARAGRAPH, INSENTENCE, INPAGE)
    Name Type Default value Description
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False The prefix handler matches all strings beginning with "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    sep enum(INPARAGRAPH, INSENTENCE, INPAGE) INPAGE Separator that bounds the match.
    Value can be null or one of
    • INPARAGRAPH
    • INSENTENCE
    • INPAGE

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", you will find here the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <SplitPrefixHandler name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" sep="INPAGE" ></SplitPrefixHandler>
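    By analogy with the "mypage" example in the element description, a sentence-level variant might be declared as follows. The prefix name "samesentence" is hypothetical, and omitted attributes are assumed to keep their defaults.

```xml
<!-- Sketch only: "samesentence" is a placeholder prefix name. -->
<SplitPrefixHandler name="samesentence" prefixMatch="false" sep="INSENTENCE" ></SplitPrefixHandler>
```

    With this handler, samesentence:(a AND b) would be expected to match only if "a" and "b" appear in the same sentence of the document.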
  • KeywordPrefixHandler com.exalead.mercury.mami.search.v20.KeywordPrefixHandler
    Searches for a specific related term by querying the dictionary to resolve its internal indexed identifier.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • keywordField string
    Name Type Default value Description
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. Possible values: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False The prefix handler matches all strings beginning with "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    keywordField string keywordv2 Name of the value-field where keywords are indexed.

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", you will find here the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <KeywordPrefixHandler name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" keywordField="keywordv2" ></KeywordPrefixHandler>
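    The interaction between "name", "aliases" and "prefixMatch" described above can be pictured with a short Python sketch. It is not CloudView code: the PrefixHandler class and the handles and split_query functions are hypothetical, and it assumes that prefixMatch applies to "name" only, as the attribute description suggests.

```python
from dataclasses import dataclass


@dataclass
class PrefixHandler:
    # Field names mirror the documented attributes; the class itself is illustrative.
    name: str
    aliases: str = ""           # comma-separated list, as in the configuration
    prefix_match: bool = False  # mirrors the prefixMatch attribute


def split_query(query: str) -> tuple[str, str]:
    """Split a 'prefix:value' query into its prefix and its value."""
    prefix, _, value = query.partition(":")
    return prefix, value


def handles(handler: PrefixHandler, query_prefix: str) -> bool:
    """Return True if the handler would accept the given query prefix."""
    alias_names = [a.strip() for a in handler.aliases.split(",") if a.strip()]
    if query_prefix in [handler.name] + alias_names:
        return True
    # Assumption: prefixMatch only extends matching on "name", not on aliases.
    return handler.prefix_match and query_prefix.startswith(handler.name)
```

    With name="prop" and prefixMatch="true", both "props:value" and "property:value" are accepted by this sketch, which matches the example given for prefixMatch.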
  • EnumFacetPrefixHandler com.exalead.mercury.mami.search.v20.EnumFacetPrefixHandler
    No documentation for this element.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • enumFacetId string
    • mode enum(exact, tolower, toupper)
    • invalidValueAction enum(remove, ignore, error)
    Name Type Default value Description
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. One of: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False The prefix handler matches all strings beginning with "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    enumFacetId string - The EnumFacet id this prefix handler refers to.
    Value must be any non-empty string
    mode enum(exact, tolower, toupper) exact What to do with the prefix handler value:
    • "exact": Does nothing
    • "tolower": Converts the value to lower case
    • "toupper": Converts the value to upper case
    invalidValueAction enum(remove, ignore, error) error What to do when an invalid value is passed:
    • remove: removes the whole prefix
    • ignore: behavior not documented
    • error: fails the query

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", you will find here the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <EnumFacetPrefixHandler name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" enumFacetId="" mode="exact" invalidValueAction="error" ></EnumFacetPrefixHandler>
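    As an illustration of the "mode" attribute, the following Python sketch applies the three documented modes to a prefix handler value. The apply_mode function is hypothetical and only mirrors the attribute description. The sketch does not cover invalidValueAction.

```python
def apply_mode(value: str, mode: str = "exact") -> str:
    """Transform a prefix handler value according to the documented modes."""
    if mode == "exact":
        return value          # "exact": does nothing
    if mode == "tolower":
        return value.lower()  # "tolower": converts the value to lower case
    if mode == "toupper":
        return value.upper()  # "toupper": converts the value to upper case
    raise ValueError(f"unknown mode: {mode!r}")
```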
  • MultiQueryPrefixHandler com.exalead.mercury.mami.search.v20.MultiQueryPrefixHandler
    Prefix handler dispatching a query to several other prefix handlers.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • uqlQueries string
    Name Type Default value Description
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. One of: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False The prefix handler matches all strings beginning with "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    uqlQueries string - Comma-separated list of additional UQL queries to build.

    Each UQL query is mapped to a specific prefix handler.

    Nested elements
    • fromDataModelQueryPrefixHandler
    • QueryPrefixHandler*
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", you will find here the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    QueryPrefixHandler* - List of prefix handlers to query, identified by their names.
    XML representation
    <MultiQueryPrefixHandler name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" uqlQueries="" ></MultiQueryPrefixHandler>
  • CustomPrefixHandler com.exalead.mercury.mami.search.v20.CustomPrefixHandler
    Prefix handler implemented using custom Java code.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • classId string
    Name Type Default value Description
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. One of: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False The prefix handler matches all strings beginning with "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    classId string - Reference of the Java class that implements the query prefix handler.

    This class must implement the interface com.exalead.search.query.prefix.CustomPrefixHandler

    Nested elements
    • fromDataModelQueryPrefixHandler
    • KeyValue*
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", you will find here the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    KeyValue* -
    XML representation
    <CustomPrefixHandler name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" classId="" ></CustomPrefixHandler>
  • UnitsOfMeasurementPrefixHandler com.exalead.mercury.mami.search.v20.UnitsOfMeasurementPrefixHandler
    Searches for documents by resolving values and converting them to the required unit of measurement.
    Parent objects
    • MultiQueryPrefixHandler
    • UQLConfig
    Attributes
    • name string
    • aliases string
    • queryExpansionConfig string
    • dataModelState string
    • dataModelClass string
    • dataModelProperty string
    • prefixMatch boolean
    • indexFieldTarget string
    • indexFieldUnitSymbol string
    • defaultUnitSymbol string
    Name Type Default value Description
    name string - Name of this prefix handler.

    This is the prefix name that can be used in user queries.

    For example, the query prefix handler named 'soundslike' allows you to write this query: 'soundslike:fowbar'
    aliases string - Comma-separated list of aliases for this prefix handler.

    Any of these aliases can be used in the query for this prefix handler.
    queryExpansionConfig string - Specifies how to expand the text content of this prefix handler.

    Note that this is sometimes meaningless, if the content is not a semantic chunk of data.
    dataModelState string - Indicates whether this query prefix handler is managed by a data model. One of: null, auto, customized.
    If null, this prefix handler is not related to a data model.
    If "auto", this prefix handler is auto-generated by a data model.
    If "customized", this prefix handler was auto-generated by a data model and then customized.
    dataModelClass string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelClass that generated this QueryPrefixHandler.
    dataModelProperty string - If dataModelState is "auto" or "customized", you will find here the name of the DataModelProperty that generated this QueryPrefixHandler.
    prefixMatch boolean False The prefix handler matches all strings beginning with "name".

    For example, a prefix handler with name="prop" prefixMatch="true" will match "props:value" and "property:value".
    indexFieldTarget string - The index field
    indexFieldUnitSymbol string - The output unit symbol
    defaultUnitSymbol string - (Optional) A default unit symbol

    Nested elements
    • fromDataModelQueryPrefixHandler
    Type Wrapper Description
    QueryPrefixHandler fromDataModel If dataModelState is "customized", you will find here the original prefix handler generated by the data model.

    Use this to easily revert to "auto" state from "customized".
    XML representation
    <UnitsOfMeasurementPrefixHandler name="" aliases="" queryExpansionConfig="" dataModelState="" dataModelClass="" dataModelProperty="" prefixMatch="false" indexFieldTarget="" indexFieldUnitSymbol="" ></UnitsOfMeasurementPrefixHandler>
  • SWConfig com.exalead.search.v30.SWConfig
    No documentation for this element.
    Parent objects
    • UQLConfig
    Attributes
    • stopWordsOnlyQueryAction enum(perform, error, empty)
    • stopWordsOnlyQueryReportMode enum(global, local, quiet)
    Name Type Default value Description
    stopWordsOnlyQueryAction enum(perform, error, empty) perform What to do when a query contains only stop words.
    One of:
    • "perform": Performs the full query.
    • "error": Sends an error.
    • "empty": Sends an empty answer.
    stopWordsOnlyQueryReportMode enum(global, local, quiet) local Specifies how to report a stop-words-only query error in 'Troubleshooting > Logs':
    • quiet: No log.
    • local: Error is reported in the Process log.
    • global: Error is reported in both Process and Global logs.

    Nested elements
    • SWList*
    Type Wrapper Description
    SWList* -
    XML representation
    <SWConfig stopWordsOnlyQueryAction="perform" stopWordsOnlyQueryReportMode="local" ></SWConfig>
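    The three values of stopWordsOnlyQueryAction can be summarized with a small Python sketch. It is only a model of the documented behavior: the function names, the exception class and the way stop words are matched (plain membership in a set) are assumptions, not the product's implementation.

```python
class StopWordsOnlyQueryError(Exception):
    """Hypothetical error raised when the action is "error"."""


def is_stop_words_only(words, stop_words):
    """A query is stop-words-only if it is non-empty and every word is a stop word."""
    return bool(words) and all(word in stop_words for word in words)


def decide(words, stop_words, action="perform"):
    """Return "perform" or "empty", or raise, following stopWordsOnlyQueryAction."""
    if not is_stop_words_only(words, stop_words):
        return "perform"  # regular queries are not affected by this setting
    if action == "error":
        raise StopWordsOnlyQueryError("query contains only stop words")
    if action == "empty":
        return "empty"    # the caller sends an empty answer
    return "perform"      # default: perform the full query
```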
  • SWList com.exalead.search.v30.SWList
    No documentation for this element.
    Parent objects
    • SWConfig
    Attributes
    • language iso code
    • resourceFile string
    • action enum(opt, remove)
    • level enum(normalized, exact, lowercase)
    Name Type Default value Description
    language iso code - Language code
    resourceFile string -
    action enum(opt, remove) opt
    level enum(normalized, exact, lowercase) exact

    XML representation
    <SWList language="en" resourceFile="" action="opt" level="exact" ></SWList>
  • SemanticQueryAnalysisConfig com.exalead.mercury.mami.search.v20.SemanticQueryAnalysisConfig
    No documentation for this element.
    Parent objects
    • UQLConfig
    Attributes
    • resourceDir string
    • unusedWordPolicy enum(mandatory, optional, remove, as-is)
    • language string
    • queryNames string
    • htmlLogPath string
    • singleMatch boolean
    • overlappingMatches boolean
    Name Type Default value Description
    resourceDir string - URL for the directory containing the semantic extractor binaries (data://, file:// or resource://).
    unusedWordPolicy enum(mandatory, optional, remove, as-is) mandatory Unused query word policy
    language string - The query language for which the analyzer is activated.

    If null, it is activated whatever the language.
    queryNames string _default_ A list of comma-separated query names defining which parts of the query will be processed (default is _default_).
    htmlLogPath string - Path to a debug HTML file
    singleMatch boolean True If true, only the best match is selected for rewriting. If false, several rules may trigger simultaneously.
    overlappingMatches boolean True If true, allows rules to match on overlapping query parts. Only makes sense when singleMatch is false.

    Nested elements
    • SemanticProcessor*
    Type Wrapper Description
    SemanticProcessor* - A list of semantic processors to apply before the semantic extractor.
    XML representation
    <SemanticQueryAnalysisConfig resourceDir="" unusedWordPolicy="mandatory" queryNames="_default_" singleMatch="true" overlappingMatches="true" ></SemanticQueryAnalysisConfig>
  • VirtualFieldDefinition com.exalead.search.v30.VirtualFieldDefinition
    No documentation for this element.
    Parent objects
    • SearchLogic
    Attributes
    • name string
    • expr string
    Name Type Default value Description
    name string - Name of this virtual field
    expr string - Virtual field expression

    XML representation
    <VirtualFieldDefinition name="" expr="" ></VirtualFieldDefinition>
  • LocalOnlySearchTarget com.exalead.mercury.mami.search.v20.LocalOnlySearchTarget
    No documentation for this element.
    Attributes
    • targetName string
    • watchDogConnectTimeoutMS int
    • watchDogReadTimeoutMS int
    • nbVirtualPartitionPerSlice int
    • buildGroups string
    Name Type Default value Description
    targetName string - The name of this search target.
    This is the name to use in the search-api to reference a particular search target.
    Value must be a string matching '[A-Za-z0-9_]+'.
    watchDogConnectTimeoutMS int 1000 Global 'connect' timeout for connection to index slices maintained by this search target.

    This is an advanced parameter that should generally not be modified.
    watchDogReadTimeoutMS int 5000 Global 'read' timeout for connection to index slices maintained by this search target.

    This is an advanced parameter that should generally not be modified.
    nbVirtualPartitionPerSlice int 1 Splits every slice of the target into nbVirtualPartitionPerSlice virtual partitions. This emulates multi-slice behavior on a single slice, with each partition acting on a subpart of the physical slice.
    buildGroups string - Comma-separated list of build groups. All local instances of the slices of these build groups will be queried.

    XML representation
    <LocalOnlySearchTarget targetName="" watchDogConnectTimeoutMS="1000" watchDogReadTimeoutMS="5000" nbVirtualPartitionPerSlice="1" buildGroups="" ></LocalOnlySearchTarget>
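    The constraint on targetName ('[A-Za-z0-9_]+') can be checked before writing a configuration. The Python sketch below is a convenience check only. The function name is hypothetical, and it assumes that the pattern must match the whole value.

```python
import re

# Pattern quoted from the targetName description.
TARGET_NAME = re.compile(r"[A-Za-z0-9_]+")


def is_valid_target_name(name: str) -> bool:
    """Return True if the whole name matches the documented pattern."""
    return TARGET_NAME.fullmatch(name) is not None
```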
  • SimpleSearchTarget com.exalead.mercury.mami.search.v20.SimpleSearchTarget
    The search server evenly distributes search requests among all available index replicas. It also checks the health of all index replicas periodically, to know whether to ignore those that are not available.
    Attributes
    • targetName string
    • watchDogConnectTimeoutMS int
    • watchDogReadTimeoutMS int
    • nbVirtualPartitionPerSlice int
    Name Type Default value Description
    targetName string - The name of this search target.
    This is the name to use in the search-api to reference a particular search target.
    Value must be a string matching '[A-Za-z0-9_]+'.
    watchDogConnectTimeoutMS int 1000 Global 'connect' timeout for connection to index slices maintained by this search target.

    This is an advanced parameter that should generally not be modified.
    watchDogReadTimeoutMS int 5000 Global 'read' timeout for connection to index slices maintained by this search target.

    This is an advanced parameter that should generally not be modified.
    nbVirtualPartitionPerSlice int 1 Splits every slice of the target into nbVirtualPartitionPerSlice virtual partitions. This emulates multi-slice behavior on a single slice, with each partition acting on a subpart of the physical slice.

    Nested elements
    • TargetSlices*
    Type Wrapper Description
    TargetSlices* - Slice instances to query in this target
    XML representation
    <SimpleSearchTarget targetName="" watchDogConnectTimeoutMS="1000" watchDogReadTimeoutMS="5000" nbVirtualPartitionPerSlice="1" ></SimpleSearchTarget>
  • TargetSlices com.exalead.mercury.mami.search.v20.TargetSlices
    No documentation for this element.
    Parent objects
    • SimpleSearchTarget
    Attributes
    • buildGroups string
    • slices string
    • instances string
    • power int
    Name Type Default value Description
    buildGroups string - Comma-separated list of build groups. If null, all build groups are considered.
    slices string - Comma-separated list of index slices. If null, all slices are considered.
    instances string - Comma-separated list of slice instances. If null, all instances are considered.
    power int 1 The power ratio defines the share of inbound traffic to divert to a given instance (among all instances of a given slice).

    A slice instance with power=2 will receive twice as much traffic as a slice instance with power=1.

    If power=0, this slice instance is used as failover only if all other instances of this slice have failed.

    XML representation
    <TargetSlices buildGroups="" slices="" instances="" power="1" ></TargetSlices>
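    The effect of the "power" attribute can be illustrated with a little arithmetic. The Python sketch below computes the share of traffic each instance of a slice would receive. The function name and instance names are hypothetical, and it assumes that failover instances (power=0) share traffic evenly once all other instances have failed, which the description does not specify.

```python
def traffic_shares(powers, failed=frozenset()):
    """Map each live instance of one slice to its share of inbound traffic.

    powers: dict of instance name -> power value from the configuration.
    failed: names of instances currently considered unavailable.
    """
    alive = {name: p for name, p in powers.items() if name not in failed}
    active = {name: p for name, p in alive.items() if p > 0}
    if active:
        total = sum(active.values())
        return {name: p / total for name, p in active.items()}
    # Only power=0 instances remain: they are used as failover.
    # Assumption: failover instances split traffic evenly.
    return {name: 1 / len(alive) for name in alive}
```

    For example, with powers {"i0": 1, "i1": 2}, instance "i1" receives two thirds of the traffic, that is, twice as much as "i0".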
  • LegacySearchTarget (deprecated) com.exalead.mercury.mami.search.v20.LegacySearchTarget
    CloudView 5 compatible legacy search target. This target should generally not be used in new deployments.
    Attributes
    • targetName string
    • watchDogConnectTimeoutMS int
    • watchDogReadTimeoutMS int
    • nbVirtualPartitionPerSlice int
    Name Type Default value Description
    targetName string - The name of this search target.
    This is the name to use in the search-api to reference a particular search target.
    Value must be a string matching '[A-Za-z0-9_]+'.
    watchDogConnectTimeoutMS int 1000 Global 'connect' timeout for connection to index slices maintained by this search target.

    This is an advanced parameter that should generally not be modified.
    watchDogReadTimeoutMS int 5000 Global 'read' timeout for connection to index slices maintained by this search target.

    This is an advanced parameter that should generally not be modified.
    nbVirtualPartitionPerSlice int 1 Splits every slice of the target into nbVirtualPartitionPerSlice virtual partitions. This emulates multi-slice behavior on a single slice, with each partition acting on a subpart of the physical slice.

    Nested elements
    • ContentGroupConfig*
    Type Wrapper Description
    ContentGroupConfig* -
    XML representation
    <LegacySearchTarget targetName="" watchDogConnectTimeoutMS="1000" watchDogReadTimeoutMS="5000" nbVirtualPartitionPerSlice="1" ></LegacySearchTarget>
  • ContentGroupConfig com.exalead.mercury.mami.search.v20.ContentGroupConfig
    Content group configuration. A content group is a list of build groups.
    Parent objects
    • LegacySearchTarget
    Attributes
    • contentGroup string
    • type string
    Name Type Default value Description
    contentGroup string - Name of the content group
    type string - Type of the content group. Intended only as a reference for custom code in complex installations with multiple content groups.

    Nested elements
    • BuildGroupConfig*
    Type Wrapper Description
    BuildGroupConfig* -
    XML representation
    <ContentGroupConfig contentGroup="" type="" ></ContentGroupConfig>
  • BuildGroupConfig com.exalead.mercury.mami.search.v20.BuildGroupConfig
    Build group configuration.
    Parent objects
    • ContentGroupConfig
    Attributes
    • buildGroup string
    Name Type Default value Description
    buildGroup string - Name of the build group.

    Nested elements
    • BuildGroupException*
    Type Wrapper Description
    BuildGroupException* - List of build group exceptions.
    XML representation
    <BuildGroupConfig buildGroup="" ></BuildGroupConfig>
  • BuildGroupException com.exalead.mercury.mami.search.v20.BuildGroupException
    No documentation for this element.
    Parent objects
    • BuildGroupConfig
    Attributes
    • slice int
    • sliceInstance string
    • enabled boolean
    • power int
    Name Type Default value Description
    slice int - Slice number.
    sliceInstance string - Name of the instance
    enabled boolean False Enables search for this instance.
    power int 1 The power ratio defines the share of inbound traffic to divert to a given instance (among all instances of a given slice).

    A slice instance with power=2 will receive twice as much traffic as a slice instance with power=1.

    If power=0, this slice instance is used as failover only if all other instances of this slice have failed.

    XML representation
    <BuildGroupException slice="" sliceInstance="" enabled="false" power="1" ></BuildGroupException>
  • SearchAPIConfig com.exalead.mercury.mami.search.v20.SearchAPIConfig
    No documentation for this element.
    Attributes
    • name string
    • extraClassPath string
    Name Type Default value Description
    name string - Name of this API config
    extraClassPath string - Extra classpath for the web application.

    Comma- or semicolon-separated list of filenames or URLs pointing to directories or jar files.
    Directories should end with '/'.

    Used for JDBC for reporting and custom dependencies.

    Nested elements
    • Command*
    Type Wrapper Description
    Command* -
    XML representation
    <SearchAPIConfig name="" extraClassPath="" ></SearchAPIConfig>
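    The format of extraClassPath (entries separated by commas or semicolons, directories ending with '/') can be checked with a short Python sketch. The function name and the example paths are hypothetical. The sketch treats every entry that does not end with '/' as a jar file and does not handle entries that themselves contain a comma or semicolon.

```python
import re


def split_class_path(extra_class_path: str):
    """Split an extraClassPath value and classify each entry.

    Returns a list of (entry, kind) pairs, where kind is "directory" for
    entries ending with '/' and "jar" otherwise.
    """
    entries = [e.strip() for e in re.split(r"[,;]", extra_class_path) if e.strip()]
    return [(e, "directory" if e.endswith("/") else "jar") for e in entries]
```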
  • SearchCommand com.exalead.mercury.mami.search.v20.SearchCommand
    Search API command
    Parent objects
    • SearchAPIConfig
    Attributes
    • defaultLogic string
    • defaultTarget string
    • targetReporting string
    • customRunnerFactoryClassId string
    • maxConcurrentQueries int
    • maxWaitingQueries int
    • base string
    • enableSOAP boolean
    • isAliveSynchronous boolean
    • isAliveAsynchronousDelayS int
    Name Type Default value Description
    defaultLogic string - Default search logic (can be overridden using API parameters).
    defaultTarget string - Default search target (can be overridden using API parameters).
    targetReporting string - Target reporting to use for the search queries.
    customRunnerFactoryClassId string -
    maxConcurrentQueries int 0 The maximum number of concurrent queries processed by the search server. If this limit is reached, incoming queries will be queued. 0 means twice the number of cores of the search server.
    maxWaitingQueries int 200 When the maximum number of concurrent queries is reached, queries are put in a queue. This setting configures the maximum length of this queue. When this maximum is reached, incoming queries are rejected.
    base string /search-api HTTP path for this command
    enableSOAP boolean False
    isAliveSynchronous boolean True If true, the query is performed synchronously on each isAlive call. If false, it runs in a background thread and the cached result is returned.
    isAliveAsynchronousDelayS int 5 Elapsed time, in seconds, between two queries when the query is performed asynchronously.

    Nested elements
    • customRunnerFactoryParamsKeyValue*
    • isAliveQueriesStringValue*
    • SearchCache*
    Type Wrapper Description
    KeyValue* customRunnerFactoryParams
    StringValue* isAliveQueries
    SearchCache* - List of caches for this search command.

    Cache entries will be looked up in all caches, in the order in which they are given.

    Incoming queries will be inserted in the first cache which accepts them.

    This allows complex configurations where you warm up some queries and do not want them to be evicted by regular traffic:
    <TimeBasedSearchCache name="time" cacheQueries="false"> <WarmUpQuery args="q=#all" /> </TimeBasedSearchCache>
    <ConsistentSearchCache name="default" />
    <SearchCommand caches="time,default" />
    XML representation
    <SearchCommand defaultLogic="" defaultTarget="" targetReporting="" customRunnerFactoryClassId="" maxConcurrentQueries="0" maxWaitingQueries="200" base="/search-api" enableSOAP="false" isAliveSynchronous="true" isAliveAsynchronousDelayS="5" ></SearchCommand>
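    The cache ordering rules described for the SearchCache list (lookup in all caches in the given order, insertion into the first cache that accepts the query) can be modeled with the following Python sketch. The Cache class and the lookup and insert functions are hypothetical, and acceptance is reduced to the cacheQueries flag, which is an assumption.

```python
class Cache:
    """Minimal stand-in for a search cache; only what the sketch needs."""

    def __init__(self, name, cache_queries=True):
        self.name = name
        self.cache_queries = cache_queries  # mirrors the cacheQueries attribute
        self.entries = {}

    def accepts(self, key):
        # Assumption: a cache accepts incoming queries only if cacheQueries is true.
        return self.cache_queries


def lookup(caches, key):
    """Look the key up in every cache, in configuration order."""
    for cache in caches:
        if key in cache.entries:
            return cache.entries[key]
    return None


def insert(caches, key, result):
    """Store the result in the first cache that accepts it; return that cache's name."""
    for cache in caches:
        if cache.accepts(key):
            cache.entries[key] = result
            return cache.name
    return None
```

    With a "time" cache that has cacheQueries="false" listed before a "default" cache, warmed-up entries stay in "time" while regular traffic lands in "default".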
  • ConsistentSearchCache com.exalead.mercury.mami.search.v20.ConsistentSearchCache
    This cache is always fully evicted each time a new generation of the index is available.

    Warmup starts shortly afterwards, and is restarted for each new generation.
    Parent objects
    • SearchCommand
    Attributes
    • name string
    • warmupWorkRatio double
    • warmupThreads int
    • cacheQueries boolean
    • maxCachedQueries int
    • ignoreFromCacheKey string
    • warmupDelayS int
    Name Type Default value Description
    name string -
    warmupWorkRatio double 1.0 Maximum ratio of time each warm-up thread will work.
    warmupThreads int 2 Maximum number of threads for warmup
    cacheQueries boolean True Whether this cache handles caching of incoming queries.
    If this is false, only warmup queries will be cached.
    Incoming queries with the "cache=no" argument will never be cached.

    Incoming queries with the "cache=force:1" argument will override this setting.

    Incoming queries with the "cache=evict" argument will evict the query from all caches if possible.
    maxCachedQueries int 1000 Maximum number of queries to cache.

    Additional queries are evicted using an LRU algorithm.
    ignoreFromCacheKey string cache,user_id,userquery_id,usersession_id List of URL query parameters to ignore when constructing the cache key.

    Separate each parameter with a comma (",").
    warmupDelayS int 10 Delay, in seconds, after an import on one of the build group slices before the warmup starts.

    Nested elements
    • WarmUpQuery*
    Type Wrapper Description
    WarmUpQuery* - Queries that will be automatically cached
    XML representation
    <ConsistentSearchCache name="" warmupWorkRatio="1.0" warmupThreads="2" cacheQueries="true" maxCachedQueries="1000" ignoreFromCacheKey="cache,user_id,userquery_id,usersession_id" warmupDelayS="10" ></ConsistentSearchCache>
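    To make maxCachedQueries and ignoreFromCacheKey more concrete, here is a Python sketch of a cache key that drops the ignored URL parameters, together with a bounded LRU store. All names are hypothetical. Sorting the remaining parameters is an assumption made so that parameter order does not change the key, and the actual key construction in the product may differ.

```python
from collections import OrderedDict
from urllib.parse import parse_qsl, urlencode

# Default value of ignoreFromCacheKey, as documented.
IGNORED = "cache,user_id,userquery_id,usersession_id"


def cache_key(query_string, ignore=IGNORED):
    """Build a cache key from URL arguments, skipping the ignored parameters."""
    ignored = set(ignore.split(","))
    kept = [(k, v) for k, v in parse_qsl(query_string, keep_blank_values=True)
            if k not in ignored]
    return urlencode(sorted(kept))  # assumption: order-insensitive key


class LruCache:
    """Keeps at most max_cached_queries entries, evicting the least recently used."""

    def __init__(self, max_cached_queries=1000):
        self.max_cached_queries = max_cached_queries
        self.entries = OrderedDict()

    def get(self, key):
        if key not in self.entries:
            return None
        self.entries.move_to_end(key)  # mark as most recently used
        return self.entries[key]

    def put(self, key, value):
        self.entries[key] = value
        self.entries.move_to_end(key)
        while len(self.entries) > self.max_cached_queries:
            self.entries.popitem(last=False)  # drop the least recently used entry
```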
  • WarmUpQuery com.exalead.mercury.mami.search.v20.WarmUpQuery
    Query that will automatically be cached.
    Parent objects
    • ConsistentSearchCache
    • SearchCache
    • TimeBasedSearchCache
    Attributes
    • args string
    Name Type Default value Description
    args string - Arguments of the query to autocache, in URL form.
    For example, q=test%20stuff&logic=sl0

    XML representation
    <WarmUpQuery args="" ></WarmUpQuery>
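    The URL form expected by "args" can be produced with the Python standard library, as in the sketch below. The warmup_args function is a hypothetical helper. It percent-encodes spaces as %20, as in the documented example.

```python
from urllib.parse import quote, urlencode


def warmup_args(params):
    """Encode query parameters in URL form, using %20 for spaces."""
    return urlencode(params, quote_via=quote)
```

    For instance, warmup_args({"q": "test stuff", "logic": "sl0"}) yields the string shown in the example above. When the value is written by hand into an XML attribute, the "&" separator normally has to be escaped as "&amp;".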
  • TimeBasedSearchCache com.exalead.mercury.mami.search.v20.TimeBasedSearchCache
    The elements of this cache are evicted based on a time policy.

    They are not evicted each time a new generation of the index is available. Therefore, this cache can contain stale data.
    Warmup queries are refreshed before their cache expiry.
    Parent objects
    • SearchCommand
    Attributes
    • name string
    • warmupWorkRatio double
    • warmupThreads int
    • cacheQueries boolean
    • maxCachedQueries int
    • ignoreFromCacheKey string
    • expiryTimeS int
    Name Type Default value Description
    name string -
    warmupWorkRatio double 1.0 Maximum ratio of time each warm-up thread will work.
    warmupThreads int 2 Maximum number of threads for warmup
    cacheQueries boolean True Whether this cache handles caching of incoming queries.
    If this is false, only warmup queries will be cached.
    Incoming queries with the "cache=no" argument will never be cached.

    Incoming queries with the "cache=force:1" argument will override this setting.

    Incoming queries with the "cache=evict" argument will evict the query from all caches if possible.
    maxCachedQueries int 1000 Maximum number of queries to cache.

    Additional queries are evicted using an LRU algorithm.
    ignoreFromCacheKey string cache,user_id,userquery_id,usersession_id List of URL query parameters to ignore when constructing the cache key.

    Separate each parameter with a comma (",").
    expiryTimeS int 3600 Expiry time, in seconds, for queries in the cache.

    Nested elements
    • WarmUpQuery*
    Type Wrapper Description
    WarmUpQuery* - Queries that will be automatically cached
    XML representation
    <TimeBasedSearchCache name="" warmupWorkRatio="1.0" warmupThreads="2" cacheQueries="true" maxCachedQueries="1000" ignoreFromCacheKey="cache,user_id,userquery_id,usersession_id" expiryTimeS="3600" ></TimeBasedSearchCache>
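    The time-based eviction policy can be sketched in Python as follows. The class is hypothetical and assumes that expiryTimeS is expressed in seconds and counted from the moment an entry is stored. The clock parameter exists only to make the behavior easy to test.

```python
import time


class TimeBasedCache:
    """Entries expire expiry_time_s seconds after being stored."""

    def __init__(self, expiry_time_s=3600, clock=time.monotonic):
        self.expiry_time_s = expiry_time_s
        self.clock = clock
        self.entries = {}  # key -> (value, deadline)

    def put(self, key, value):
        self.entries[key] = (value, self.clock() + self.expiry_time_s)

    def get(self, key):
        item = self.entries.get(key)
        if item is None:
            return None
        value, deadline = item
        if self.clock() >= deadline:
            del self.entries[key]  # expired: evict on access
            return None
        return value
```

    Because entries are not tied to index generations in this model, a value stored just before a new generation is published is still returned until its deadline, which is the stale-data caveat mentioned above.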
  • QueryExpansionCommand com.exalead.mercury.mami.search.v20.QueryExpansionCommand
    A BaseSearchCommand used for query expansion.
    Parent objects
    • SearchAPIConfig
    Attributes
    • defaultLogic string
    • defaultTarget string
    • targetReporting string
    • customRunnerFactoryClassId string
    • maxConcurrentQueries int
    • maxWaitingQueries int
    • base string
    Name Type Default value Description
    defaultLogic string - Default search logic (can be overridden using API parameters).
    defaultTarget string - Default search target (can be overridden using API parameters).
    targetReporting string - Target reporting to use for the search queries.
    customRunnerFactoryClassId string -
    maxConcurrentQueries int 0 The maximum number of concurrent queries processed by the search server. If this limit is reached, incoming queries will be queued. 0 means twice the number of cores of the search server.
    maxWaitingQueries int 200 When the maximum number of concurrent queries is reached, queries are put in a queue. This setting configures the maximum length of this queue. When this maximum is reached, incoming queries are rejected.
    base string /search-api HTTP path for this command
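    The interplay between maxConcurrentQueries and maxWaitingQueries can be sketched as a small admission function. This is only an illustration of the rules described in the table; the function names and the "run" / "queue" / "reject" labels are hypothetical.

```python
import os


def effective_max_concurrent(max_concurrent_queries, cores=None):
    """Resolve maxConcurrentQueries: 0 means twice the number of cores."""
    if max_concurrent_queries == 0:
        cores = cores if cores is not None else (os.cpu_count() or 1)
        return 2 * cores
    return max_concurrent_queries


def admit(running, waiting, max_concurrent_queries=0, max_waiting_queries=200, cores=None):
    """Decide what happens to an incoming query.

    running: number of queries currently being processed
    waiting: number of queries currently in the queue
    """
    limit = effective_max_concurrent(max_concurrent_queries, cores)
    if running < limit:
        return "run"      # a processing slot is free
    if waiting < max_waiting_queries:
        return "queue"    # concurrency limit reached, queue has room
    return "reject"       # queue is full
```

    For example, on a hypothetical 8-core server with the default settings, the 17th simultaneous query is queued, and a query arriving while 200 others are already waiting is rejected.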

    Nested elements
    • customRunnerFactoryParamsKeyValue*
    Type Wrapper Description
    KeyValue* customRunnerFactoryParams
    XML representation
    <QueryExpansionCommand defaultLogic="" defaultTarget="" targetReporting="" customRunnerFactoryClassId="" maxConcurrentQueries="0" maxWaitingQueries="200" base="/search-api" ></QueryExpansionCommand>
  • SpellCheckCommand com.exalead.mercury.mami.search.v20.SpellCheckCommand
    Search API command
    Parent objects
    • SearchAPIConfig
    Attributes
    • defaultLogic string
    • defaultTarget string
    • targetReporting string
    • customRunnerFactoryClassId string
    • maxConcurrentQueries int
    • maxWaitingQueries int
    • base string
    Name Type Default value Description
    defaultLogic string - Default search logic (can be overridden using API parameters).
    defaultTarget string - Default search target (can be overridden using API parameters).
    targetReporting string - Target reporting to use for the search queries.
    customRunnerFactoryClassId string -
    maxConcurrentQueries int 0 The maximum number of concurrent queries processed by the search server. If this limit is reached, incoming queries are queued. 0 means twice the number of cores of the search server.
    maxWaitingQueries int 200 Maximum length of the queue that holds incoming queries once the maximum number of concurrent queries is reached. When the queue is full, incoming queries are rejected.
    base string /search-api HTTP path for this command

    Nested elements
    • customRunnerFactoryParamsKeyValue*
    Type Wrapper Description
    KeyValue* customRunnerFactoryParams
    XML representation
    <SpellCheckCommand defaultLogic="" defaultTarget="" targetReporting="" customRunnerFactoryClassId="" maxConcurrentQueries="0" maxWaitingQueries="200" base="/search-api" ></SpellCheckCommand>
  • SecurityCommand com.exalead.mercury.mami.search.v20.SecurityCommand
    Search API command
    Parent objects
    • SearchAPIConfig
    Attributes
    • base string
    Name Type Default value Description
    base string /search-api HTTP path for this command

    XML representation
    <SecurityCommand base="/search-api" ></SecurityCommand>
  • SuggestCommand com.exalead.mercury.mami.search.v20.SuggestCommand
    Search API command
    Parent objects
    • SearchAPIConfig
    Attributes
    • base string
    • targetReporting string
    • tokenizationConfig string
    • autocomplete boolean
    • recursive boolean
    Name Type Default value Description
    base string /search-api HTTP path for this command
    targetReporting string -
    tokenizationConfig string - Tokenization config name used to parse the query.
    autocomplete boolean False Appends suggest results to the last query word being typed in the search field to autocomplete it.

    It only works with a suggest built with the Subexpr matching or Substring matching build options.
    recursive boolean False Discards the leftmost word of the query progressively.

    It sends each new subquery to the suggest(s) until the maximum number of suggestions is reached, or until no words remain.

    For example, for a query "A B C", the suggest will be called 3 times, with "A B C", "B C", and "C".
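    The recursive behavior described above can be sketched as follows. The suggest argument stands for any callable that returns a list of suggestions for a string; it is a placeholder for illustration, not a CloudView API.

```python
def recursive_subqueries(query):
    """Return the query followed by the subqueries obtained by dropping leftmost words."""
    words = query.split()
    return [" ".join(words[i:]) for i in range(len(words))]


def recursive_suggest(query, suggest, max_suggestions=10):
    """Call `suggest` on each subquery until enough suggestions are collected."""
    results = []
    for subquery in recursive_subqueries(query):
        for suggestion in suggest(subquery):
            if suggestion not in results:
                results.append(suggestion)
            if len(results) >= max_suggestions:
                return results  # stop early: no further subquery is sent
    return results
```

    Calling recursive_subqueries("A B C") yields the three subqueries from the example: "A B C", "B C" and "C".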

    XML representation
    <SuggestCommand base="/search-api" targetReporting="" tokenizationConfig="tok0" autocomplete="false" recursive="false" ></SuggestCommand>
  • DictionaryCommand com.exalead.mercury.mami.search.v20.DictionaryCommand
    Search API command
    Parent objects
    • SearchAPIConfig
    Attributes
    • base string
    Name Type Default value Description
    base string /search-api HTTP path for this command

    XML representation
    <DictionaryCommand base="/search-api" ></DictionaryCommand>
  • GeoCommand com.exalead.mercury.mami.search.v20.GeoCommand
    Search API command
    Parent objects
    • SearchAPIConfig
    Attributes
    • base string
    Name Type Default value Description
    base string /search-api HTTP path for this command

    XML representation
    <GeoCommand base="/search-api" ></GeoCommand>
  • IsAliveCommand com.exalead.mercury.mami.search.v20.IsAliveCommand
    Search API command
    Parent objects
    • SearchAPIConfig
    Attributes
    • base string
    • queryToPerform string
    • targetQueryCommand string
    • performQuerySynchronously boolean
    • asynchronousQueryDelayS int
    Name Type Default value Description
    base string /search-api HTTP path for this command
    queryToPerform string q=IsAliveQuery The isAlive command can optionally perform a query to check that all components are ready.

    To enable this behavior, enter the search parameters of the query here.

    For example, "q=test&logic=sl0".
    If this is null, the query is not performed.
    targetQueryCommand string /search-api Command path on which the query will be performed.
    performQuerySynchronously boolean False If true, the query is performed synchronously on each isAlive call. If false, it runs in a background thread and the cached result is returned.
    asynchronousQueryDelayS int 5 Delay, in seconds, between two queries when the query is performed asynchronously.

    XML representation
    <IsAliveCommand base="/search-api" queryToPerform="q=IsAliveQuery" targetQueryCommand="/search-api" performQuerySynchronously="false" asynchronousQueryDelayS="5" ></IsAliveCommand>
  • DocumentFetchCommand com.exalead.mercury.mami.search.v20.DocumentFetchCommand
    Abstract class for a fetch command.
    Parent objects
    • SearchAPIConfig
    Attributes
    • base string
    Name Type Default value Description
    base string /search-api HTTP path for this command

    XML representation
    <DocumentFetchCommand base="/search-api" ></DocumentFetchCommand>
  • PreviewCommand com.exalead.mercury.mami.search.v20.PreviewCommand
    Retrieves rich HTML preview of the documents.
    Parent objects
    • SearchAPIConfig
    Attributes
    • base string
    • defaultLogic string
    • defaultHighlightPrefixHandlers string
    • defaultHighlightFacetIds string
    • convertAddresses string
    Name Type Default value Description
    base string /search-api HTTP path for this command
    defaultLogic string -
    defaultHighlightPrefixHandlers string Comma-separated list of prefix handlers that target the preview for highlight.
    The default prefix handler is "text".
    defaultHighlightFacetIds string Comma-separated list of facets. Refinements on categories are highlighted in the preview if they have a parent category in this attribute.
    convertAddresses string - Semicolon-separated list of convert instance names or URLs to use.

    XML representation
    <PreviewCommand base="/search-api" defaultLogic="" defaultHighlightPrefixHandlers="" defaultHighlightFacetIds="" ></PreviewCommand>
  • ThumbnailCommand com.exalead.mercury.mami.search.v20.ThumbnailCommand
    Abstract class for a fetch command.
    Parent objects
    • SearchAPIConfig
    Attributes
    • base string
    • maxConcurrentComputations int
    • defaultWidth int
    • defaultHeight int
    • allowUnsecurePrecomputedThumbnails boolean
    • convertAddresses string
    Name Type Default value Description
    base string /search-api HTTP path for this command
    maxConcurrentComputations int 4 Maximum number of concurrent thumbnail computations.

    Thumbnail queries that do not require a computation (cache hits) are not considered.
    defaultWidth int 120 Width of the computed thumbnail, if not specified in the request.
    defaultHeight int 90 Height of the computed thumbnail, if not specified in the request.
    allowUnsecurePrecomputedThumbnails boolean False If true, a precomputed thumbnail (if any) is returned even when security is enforced on the fetch request.
    Security is not enforced on precomputed thumbnails: any logged-in user who knows the URL of a document could retrieve its precomputed thumbnail.
    convertAddresses string - Semicolon-separated list of convert instance names or URLs to use.

    Nested elements
    • ThumbnailsCache
    Type Wrapper Description
    ThumbnailsCache - Cache for the thumbnails computed on the fly
    XML representation
    <ThumbnailCommand base="/search-api" maxConcurrentComputations="4" defaultWidth="120" defaultHeight="90" allowUnsecurePrecomputedThumbnails="false" ></ThumbnailCommand>
  • LocalThumbnailsCache com.exalead.mercury.mami.search.v20.LocalThumbnailsCache
    The local thumbnails cache configuration.
    Parent objects
    • ThumbnailCommand
    Attributes
    • secure boolean
    • maxHeight int
    • maxWidth int
    • memoryEntries int
    • diskEntries int
    • timeToLiveS int
    Name Type Default value Description
    secure boolean True If true, thumbnails are only cached and retrieved if enforce_security is set to false for the request.

    If false, thumbnails are always cached and retrieved.

    Warning: any logged-in user who knows the URL of a document could then retrieve a thumbnail for it.
    maxHeight int 120 Maximum height of thumbnails for them to be cacheable
    maxWidth int 120 Maximum width of thumbnails for them to be cacheable
    memoryEntries int 20 Number of thumbnails to keep in memory cache
    diskEntries int 2000 Number of thumbnails to keep in disk cache
    timeToLiveS int 7200 Time period in seconds during which a thumbnail remains in cache.

    Note that when there is a cache, an Expires HTTP header with the same time period is also sent for the thumbnails.
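    The conditions under which a thumbnail may go through the local cache can be summarized in a short sketch. The class and function names are hypothetical, and treating maxWidth and maxHeight as inclusive bounds is an assumption.

```python
from dataclasses import dataclass


@dataclass
class LocalThumbnailsCacheConfig:
    """Mirror of the documented attributes and defaults (hypothetical helper class)."""
    secure: bool = True
    max_height: int = 120
    max_width: int = 120
    time_to_live_s: int = 7200


def can_use_cache(cfg, width, height, enforce_security):
    """Return True if a thumbnail of this size may be cached and retrieved."""
    if cfg.secure and enforce_security:
        # With secure=true, the cache is bypassed for requests that enforce security.
        return False
    # Only thumbnails within the configured dimensions are cacheable
    # (inclusive bounds are an assumption of this sketch).
    return width <= cfg.max_width and height <= cfg.max_height
```

    With the defaults, a 120x90 thumbnail is cacheable only for requests that do not enforce security; setting secure to false removes that restriction at the cost described in the warning above.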

    XML representation
    <LocalThumbnailsCache secure="true" maxHeight="120" maxWidth="120" memoryEntries="20" diskEntries="2000" timeToLiveS="7200" ></LocalThumbnailsCache>
  • SuggestConfig com.exalead.mercury.mami.search.v20.SuggestConfig
    List of Suggest and SuggestDispatcher
    Attributes
    • version long
    Name Type Default value Description
    version long - -

    Nested elements
    • Suggest*
    • SuggestDispatcher*
    Type Wrapper Description
    Suggest* - List of Suggest.
    SuggestDispatcher* - List of SuggestDispatcher.
    XML representation
    <SuggestConfig version="" ></SuggestConfig>
  • QueryReportingSuggest com.exalead.mercury.mami.search.v20.QueryReportingSuggest
    Takes the user queries stored for query reporting to build a suggest automatically.
    Parent objects
    • SuggestConfig
    Attributes
    • suggestName string
    • whiteListURL string
    • blackListURL string
    • discardComplexQuery boolean
    • applicationId string
    Name Type Default value Description
    suggestName string - The mandatory suggest name. Must match [A-z0-9_]+
    whiteListURL string - URL to a suggest-allow list resource
    blackListURL string - URL to a suggest-block list resource
    discardComplexQuery boolean False Discards queries containing prefix handlers and operators.
    applicationId string - Retrieves logs for a specific application.

    Nested elements
    • SuggestBuildConfig
    • SuggestBuildTrigger
    • SuggestQueryConfig
    Type Wrapper Description
    SuggestBuildConfig - Build options
    SuggestBuildTrigger - Optional build trigger
    SuggestQueryConfig -
    XML representation
    <QueryReportingSuggest suggestName="" whiteListURL="" blackListURL="" discardComplexQuery="false" applicationId="" ></QueryReportingSuggest>
  • SuggestBuildConfig com.exalead.mercury.mami.search.v20.SuggestBuildConfig
    Suggest build options
    Parent objects
    • BuildableSuggest
    • CSVIndexFieldSuggest
    • CategoryPathSuggest
    • CategorySuggest
    • CategoryTitleSuggest
    • DynamicFieldSuggest
    • EnoviaSecureDynamicFieldSuggest
    • IndexFieldSuggest
    • QueryReportingSuggest
    • RelatedTermsSuggest
    • SearchBasedSuggest
    Attributes
    • subExpr boolean
    • subString boolean
    • splitSentence boolean
    • splitNGrams int
    • computePermutation boolean
    • maxEntryLength int
    • maxSuggestions int
    • nbUrlsPerEntry int
    • tokenizationConfig string
    • sanitizeEntries boolean
    • buildAfterIndexImport boolean
    • secure boolean
    • securityModel enum(default, enovia)
    Name Type Default value Description
    subExpr boolean False Finds matches on every start of word.

    For example, "first test" will be returned both for "fir" and for "tes".
    subString boolean False Finds matches on every letter.

    For example, "first test" will be returned for "fir", for "rs", for "es", ...
    splitSentence boolean False Breaks up a suggest entry into several entries, and performs matches independently on the chunks.

    If the entry is multi-sentence, an entry will be created for each sentence.
    splitNGrams int 0 Breaks up a suggest entry into several entries, and performs matches independently on the chunks.

    A sliding window of ngrams of a given size will be created, and an entry will be created for each step of the window.

    For example, "a b c d e f" with a split on 4-grams will give entries "a b c d", "b c d e" and "c d e f".
    0 means no splitting.

    This action is performed after the sentence split if the Sentence split option is selected.
    computePermutation boolean False Computes all permutations for an entry and adds them as separate entries.

    This action is performed after the sentence split if the Sentence split option is selected.
    Note that entries longer than 8 words are not permuted, for performance reasons (n words would generate n! entries).
    maxEntryLength int 50 Maximum number of characters in a suggest entry.

    This is a security measure to prevent overly-long entries.

    They are automatically truncated after the specified length.
    0 means no limit.
    maxSuggestions int 10 The maximum number of suggestions that can be shown to the user for a given input string.

    You cannot change this dynamically.
    nbUrlsPerEntry int 3 Defines the maximum number of URLs to be stored for each entry.
    tokenizationConfig string - Tokenization Config to use.
    sanitizeEntries boolean False Strips the entry of punctuation, and encloses any UQL operator between quotes.
    buildAfterIndexImport boolean False Triggers a build automatically after the index refreshes.
    secure
    (UI: Enable security)
    boolean False Makes use of documents and users' security tokens to restrict suggestions.
    securityModel
    (UI: Security model)
    enum(default, enovia) default Enables the enovia-specific handling of security tokens.
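    The build-time transformations splitNGrams, computePermutation and maxEntryLength can be illustrated with a minimal Python sketch. The function names are hypothetical, whitespace tokenization is a simplification, and keeping an entry unchanged when it has fewer words than the n-gram size is an assumption.

```python
from itertools import permutations


def split_ngrams(entry, n):
    """Slide a window of n words over the entry; 0 means no splitting."""
    words = entry.split()
    if n <= 0 or len(words) <= n:
        return [entry]  # assumption: short entries are kept as they are
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


def compute_permutations(entry, max_words=8):
    """Return every word permutation of the entry, unless it exceeds max_words."""
    words = entry.split()
    if len(words) > max_words:
        return [entry]  # too long: n! entries would be generated
    return [" ".join(p) for p in permutations(words)]


def truncate(entry, max_entry_length=50):
    """Cut the entry after max_entry_length characters; 0 means no limit."""
    return entry if max_entry_length == 0 else entry[:max_entry_length]
```

    For instance, split_ngrams("a b c d e f", 4) produces the three entries given in the splitNGrams description, and compute_permutations("first test") produces "first test" and "test first".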

    XML representation
    <SuggestBuildConfig subExpr="false" subString="false" splitSentence="false" splitNGrams="0" computePermutation="false" maxEntryLength="50" maxSuggestions="10" nbUrlsPerEntry="3" tokenizationConfig="tok0" sanitizeEntries="false" buildAfterIndexImport="false" secure="false" securityModel="default" ></SuggestBuildConfig>
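    To make the difference between the subExpr and subString options concrete, here is a small sketch of the matching rules as described in the table above. The function name is hypothetical; case-insensitive comparison and plain prefix matching when neither option is set are assumptions.

```python
def entry_matches(entry, typed, sub_expr=False, sub_string=False):
    """Return True if the typed text matches the suggest entry."""
    entry_l, typed_l = entry.lower(), typed.lower()
    if sub_string:
        # subString: match anywhere in the entry, on every letter.
        return typed_l in entry_l
    if sub_expr:
        # subExpr: match at the start of any word of the entry.
        words = entry_l.split()
        return any(" ".join(words[i:]).startswith(typed_l) for i in range(len(words)))
    # Neither option: match only at the start of the entry (assumption).
    return entry_l.startswith(typed_l)
```

    With subExpr, "first test" matches both "fir" and "tes" but not "rs"; with subString, it also matches "rs" and "es".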
  • SimpleSuggestBuildTrigger com.exalead.mercury.mami.search.v20.SimpleSuggestBuildTrigger
    Triggers a suggest build at a given time interval
    Parent objects
    • BuildableSuggest
    • CSVIndexFieldSuggest
    • CategoryPathSuggest
    • CategorySuggest
    • CategoryTitleSuggest
    • DynamicFieldSuggest
    • EnoviaSecureDynamicFieldSuggest
    • IndexFieldSuggest
    • QueryReportingSuggest
    • RelatedTermsSuggest
    • SearchBasedSuggest
    Attributes
    • startTimestamp long
    • endTimestamp long
    • repeatInterval long
    Name Type Default value Description
    startTimestamp long - Unix timestamp
    endTimestamp long - Unix timestamp
    repeatInterval long - Repeat interval in milliseconds
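    As an illustration only, the sketch below lists the first build times produced by such a trigger. It assumes that startTimestamp and endTimestamp are Unix timestamps in seconds and that endTimestamp is inclusive; neither point is stated in this reference, and the function name is hypothetical.

```python
def fire_times_ms(start_timestamp_s, end_timestamp_s, repeat_interval_ms, limit=5):
    """Return up to `limit` fire times, expressed in milliseconds since the epoch."""
    if repeat_interval_ms <= 0:
        raise ValueError("repeat_interval_ms must be positive")
    end_ms = end_timestamp_s * 1000  # assumption: timestamps are given in seconds
    times = []
    t = start_timestamp_s * 1000
    while t <= end_ms and len(times) < limit:
        times.append(t)
        t += repeat_interval_ms  # repeatInterval is documented in milliseconds
    return times
```

    For example, an hourly rebuild corresponds to a repeatInterval of 3600000.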

    XML representation
    <SimpleSuggestBuildTrigger startTimestamp="" endTimestamp="" repeatInterval="" ></SimpleSuggestBuildTrigger>
  • CronSuggestBuildTrigger com.exalead.mercury.mami.search.v20.CronSuggestBuildTrigger
    Triggers a suggest build based on a Quartz Cron expression
    Parent objects
    • BuildableSuggest
    • CSVIndexFieldSuggest
    • CategoryPathSuggest
    • CategorySuggest
    • CategoryTitleSuggest
    • DynamicFieldSuggest
    • EnoviaSecureDynamicFieldSuggest
    • IndexFieldSuggest
    • QueryReportingSuggest
    • RelatedTermsSuggest
    • SearchBasedSuggest
    Attributes
    • startTimestamp long
    • endTimestamp long
    • cronExpression string
    Name Type Default value Description
    startTimestamp long - Unix timestamp
    endTimestamp long - Unix timestamp
    cronExpression string - Quartz Cron expression: seconds minutes hours day_of_month month day_of_week [year]
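    The field order of a Quartz Cron expression can be made explicit with a small helper that labels each field. This sketch only splits the expression and does not validate field values; the function name is hypothetical.

```python
QUARTZ_FIELDS = ["seconds", "minutes", "hours", "day_of_month",
                 "month", "day_of_week", "year"]


def split_quartz_expression(expression):
    """Map each field of a Quartz Cron expression to its name (year is optional)."""
    parts = expression.split()
    if len(parts) not in (6, 7):
        raise ValueError("a Quartz Cron expression has 6 or 7 fields")
    return dict(zip(QUARTZ_FIELDS, parts))
```

    For example, "0 0 2 * * ?" is the Quartz expression for a trigger that fires every day at 02:00; the "?" in the day_of_week field means no specific value.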

    XML representation
    <CronSuggestBuildTrigger startTimestamp="" endTimestamp="" cronExpression="" ></CronSuggestBuildTrigger>
  • SuggestQueryConfig com.exalead.mercury.mami.search.v20.SuggestQueryConfig
    Defines default suggest-time options
    Parent objects
    • BuildableSuggest
    • CSVIndexFieldSuggest
    • CategoryPathSuggest
    • CategorySuggest
    • CategoryTitleSuggest
    • DynamicFieldSuggest
    • EnoviaSecureDynamicFieldSuggest
    • IndexFieldSuggest
    • QueryReportingSuggest
    • RelatedTermsSuggest
    • SearchBasedSuggest
    • StaticResourceSuggest
    • StaticXMLSuggest
    • Suggest
    Attributes
    • distance int
    • minLengthForDistance1 int
    • minLengthForDistance2 int
    • autocomplete boolean
    • recursive boolean
    • secureSuggestTimeoutMS int
    Name Type Default value Description
    distance int 2 Allows approximate matching. The higher the distance the more approximate the match.
    0 for exact match.
    minLengthForDistance1 int 3 Only searches for distance 1 fuzzy matches if the original word in the query is at least N characters long.

    This avoids too much approximation on very short words.

    The default value is 3.
    minLengthForDistance2 int 6 Only searches for distance 2 fuzzy matches if the original word in the query is at least N characters long.

    This avoids too much approximation on very short words.

    The default value is 6.
    autocomplete boolean False Only works with a suggest built with the subString or subExpr options.

    If set, the query is not replaced by the suggest results. Instead, the results are appended to the query being typed in order to autocomplete it.
    recursive boolean False Discards the leftmost word of the query progressively. Each new subquery is sent to the suggest(s) until maxSuggestions is reached, or until no words remain.

    For example, for a query "A B C", the suggests will be called 3 times, with "A B C", "B C", and "C".
    secureSuggestTimeoutMS int 200 Aborts matching in a secure suggest when this time limit is reached.
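    The way distance, minLengthForDistance1 and minLengthForDistance2 combine can be sketched as a function that returns the edit distance allowed for a given query word. The function name is hypothetical and the logic is a reading of the descriptions above, not the actual implementation.

```python
def allowed_distance(word, distance=2, min_length_for_distance1=3,
                     min_length_for_distance2=6):
    """Return the maximum fuzzy-matching distance allowed for this word."""
    length = len(word)
    if distance >= 2 and length >= min_length_for_distance2:
        return 2  # long enough for distance 2 matches
    if distance >= 1 and length >= min_length_for_distance1:
        return 1  # long enough for distance 1 matches only
    return 0      # exact match only
```

    With the defaults, a 2-letter word is matched exactly, words of 3 to 5 letters tolerate a distance of 1, and words of 6 letters or more tolerate a distance of 2.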

    XML representation
    <SuggestQueryConfig distance="2" minLengthForDistance1="3" minLengthForDistance2="6" autocomplete="false" recursive="false" secureSuggestTimeoutMS="200" ></SuggestQueryConfig>
  • IndexFieldSuggest com.exalead.mercury.mami.search.v20.IndexFieldSuggest
    Takes the value of an index field to build a suggest.
    Parent objects
    • SuggestConfig
    Attributes
    • logic string
    • target string
    • indexField string
    • query string
    • searchServerInstance string
    • suggestName string
    • whiteListURL string
    • blackListURL string
    Name Type Default value Description
    logic string - The search logic to use
    target string - The search target to use
    indexField string - The index field to retrieve
    query string #all The query: may be used as a filter for the exported index data
    searchServerInstance string - The search server instance to target
    suggestName string - The mandatory suggest name. Must match [A-z0-9_]+
    whiteListURL string - URL to a suggest-allow list resource
    blackListURL string - URL to a suggest-block list resource

    Nested elements
    • SuggestBuildConfig
    • SuggestBuildTrigger
    • SuggestQueryConfig
    Type Wrapper Description
    SuggestBuildConfig - Build options
    SuggestBuildTrigger - Optional build trigger
    SuggestQueryConfig -
    XML representation
    <IndexFieldSuggest logic="" target="" indexField="" query="#all" searchServerInstance="" suggestName="" whiteListURL="" blackListURL="" ></IndexFieldSuggest>
  • DynamicFieldSuggest com.exalead.mercury.mami.search.v20.DynamicFieldSuggest
    Takes the value from a subset of a dynamic index field to build a suggest.
    Parent objects
    • SuggestConfig
    Attributes
    • logic string
    • target string
    • indexField string
    • query string
    • searchServerInstance string
    • suggestName string
    • whiteListURL string
    • blackListURL string
    • mode enum(all, exact, prefix, suffix, regexp)
    • pattern string
    • keepContext boolean
    Name Type Default value Description
    logic string - The search logic to use
    target string - The search target to use
    indexField string - The index field to retrieve
    query string #all The query: may be used as a filter for the exported index data
    searchServerInstance string - The search server instance to target
    suggestName string - The mandatory suggest name. Must match [A-z0-9_]+
    whiteListURL string - URL to a suggest-allow list resource
    blackListURL string - URL to a suggest-block list resource
    mode enum(all, exact, prefix, suffix, regexp) all Matching mode. If 'all' is selected, all the content of the index field will be used.
    pattern string - The string to look for.
    keepContext boolean True When set to True, the context is kept in suggest entry.

    Nested elements
    • SuggestBuildConfig
    • SuggestBuildTrigger
    • SuggestQueryConfig
    Type Wrapper Description
    SuggestBuildConfig - Build options
    SuggestBuildTrigger - Optional build trigger
    SuggestQueryConfig -
    XML representation
    <DynamicFieldSuggest logic="" target="" indexField="" query="#all" searchServerInstance="" suggestName="" whiteListURL="" blackListURL="" mode="all" pattern="" keepContext="true" ></DynamicFieldSuggest>
  • EnoviaSecureDynamicFieldSuggest com.exalead.mercury.mami.search.v20.EnoviaSecureDynamicFieldSuggest
    Retrieves enovia-specific tokens from the index and uses the value of an index field and the associated security tokens to build a suggest.
    Parent objects
    • SuggestConfig
    Attributes
    • mode enum(all, exact, prefix, suffix, regexp)
    • pattern string
    • keepContext boolean
    • logic string
    • target string
    • indexField string
    • query string
    • searchServerInstance string
    • suggestName string
    • whiteListURL string
    • blackListURL string
    • mxentryQuery string
    • securityTokensMeta string
    Name Type Default value Description
    mode enum(all, exact, prefix, suffix, regexp) all Matching mode. If 'all' is selected, all the content of the index field will be used.
    pattern string - The string to look for.
    keepContext boolean True When set to True, the context is kept in suggest entry.
    logic string - The search logic to use
    target string - The search target to use
    indexField string - The index field to retrieve
    query string #all The query: may be used as a filter for the exported index data
    searchServerInstance string - The search server instance to target
    suggestName string - The mandatory suggest name. Must match [A-z0-9_]+
    whiteListURL string - URL to a suggest-allow list resource
    blackListURL string - URL to a suggest-block list resource
    mxentryQuery string availability:aclfts This query retrieves mxov, mxentry and mxpass.
    securityTokensMeta string mxov Meta associated to each suggest entry used as base for security token computing.

    Nested elements
    • SuggestBuildConfig
    • SuggestBuildTrigger
    • SuggestQueryConfig
    Type Wrapper Description
    SuggestBuildConfig - Build options
    SuggestBuildTrigger - Optional build trigger
    SuggestQueryConfig -
    XML representation
    <EnoviaSecureDynamicFieldSuggest mode="all" pattern="" keepContext="true" logic="" target="" indexField="" query="#all" searchServerInstance="" suggestName="" whiteListURL="" blackListURL="" mxentryQuery="availability:aclfts" securityTokensMeta="mxov" ></EnoviaSecureDynamicFieldSuggest>
  • RelatedTermsSuggest com.exalead.mercury.mami.search.v20.RelatedTermsSuggest
    Takes the value of the 'keyword' index field.
    Parent objects
    • SuggestConfig
    Attributes
    • logic string
    • target string
    • indexField string
    • query string
    • searchServerInstance string
    • suggestName string
    • whiteListURL string
    • blackListURL string
    • dictionaryName string
    Name Type Default value Description
    logic string - The search logic to use
    target string - The search target to use
    indexField string - The index field to retrieve
    query string #all The query: may be used as a filter for the exported index data
    searchServerInstance string - The search server instance to target
    suggestName string - The mandatory suggest name. Must match [A-z0-9_]+
    whiteListURL string - URL to a suggest-allow list resource
    blackListURL string - URL to a suggest-block list resource
    dictionaryName string dict0

    Nested elements
    • SuggestBuildConfig
    • SuggestBuildTrigger
    • SuggestQueryConfig
    Type Wrapper Description
    SuggestBuildConfig - Build options
    SuggestBuildTrigger - Optional build trigger
    SuggestQueryConfig -
    XML representation
    <RelatedTermsSuggest logic="" target="" indexField="" query="#all" searchServerInstance="" suggestName="" whiteListURL="" blackListURL="" dictionaryName="dict0" ></RelatedTermsSuggest>
  • CSVIndexFieldSuggest com.exalead.mercury.mami.search.v20.CSVIndexFieldSuggest
    Takes one value from a multi-encoded CSV field.
    Parent objects
    • SuggestConfig
    Attributes
    • logic string
    • target string
    • indexField string
    • query string
    • searchServerInstance string
    • suggestName string
    • whiteListURL string
    • blackListURL string
    • meta string
    Name Type Default value Description
    logic string - The search logic to use
    target string - The search target to use
    indexField string - The index field to retrieve
    query string #all The query: may be used as a filter for the exported index data
    searchServerInstance string - The search server instance to target
    suggestName string - The mandatory suggest name. Must match [A-z0-9_]+
    whiteListURL string - URL to a suggest-allow list resource
    blackListURL string - URL to a suggest-block list resource
    meta string - The CSV encoded meta to retrieve

    Nested elements
    • SuggestBuildConfig
    • SuggestBuildTrigger
    • SuggestQueryConfig
    Type Wrapper Description
    SuggestBuildConfig - Build options
    SuggestBuildTrigger - Optional build trigger
    SuggestQueryConfig -
    XML representation
    <CSVIndexFieldSuggest logic="" target="" indexField="" query="#all" searchServerInstance="" suggestName="" whiteListURL="" blackListURL="" meta="" ></CSVIndexFieldSuggest>
  • CategoryTitleSuggest com.exalead.mercury.mami.search.v20.CategoryTitleSuggest
    Takes category titles of one category index field.
    Parent objects
    • SuggestConfig
    Attributes
    • root string
    • rootForm enum(exact, lowercase, normalized)
    • logic string
    • target string
    • indexField string
    • query string
    • searchServerInstance string
    • suggestName string
    • whiteListURL string
    • blackListURL string
    Name Type Default value Description
    root string - Path to retrieve
    rootForm enum(exact, lowercase, normalized) normalized Form of the category root.
    logic string - The search logic to use
    target string - The search target to use
    indexField string - The index field to retrieve
    query string #all The query: may be used as a filter for the exported index data
    searchServerInstance string - The search server instance to target
    suggestName string - The mandatory suggest name. Must match [A-z0-9_]+
    whiteListURL string - URL to a suggest-allow list resource
    blackListURL string - URL to a suggest-block list resource

    Nested elements
    • SuggestBuildConfig
    • SuggestBuildTrigger
    • SuggestQueryConfig
    Type Wrapper Description
    SuggestBuildConfig - Build options
    SuggestBuildTrigger - Optional build trigger
    SuggestQueryConfig -
    XML representation
    <CategoryTitleSuggest root="" rootForm="normalized" logic="" target="" indexField="" query="#all" searchServerInstance="" suggestName="" whiteListURL="" blackListURL="" ></CategoryTitleSuggest>
  • CategoryPathSuggest com.exalead.mercury.mami.search.v20.CategoryPathSuggest
    Takes category paths of one category index field.
    Parent objects
    • SuggestConfig
    Attributes
    • root string
    • rootForm enum(exact, lowercase, normalized)
    • logic string
    • target string
    • indexField string
    • query string
    • searchServerInstance string
    • suggestName string
    • whiteListURL string
    • blackListURL string
    • removeRoot boolean
    Name Type Default value Description
    root string - Path to retrieve
    rootForm enum(exact, lowercase, normalized) normalized Form of the category root.
    logic string - The search logic to use
    target string - The search target to use
    indexField string - The index field to retrieve
    query string #all The query: may be used as a filter for the exported index data
    searchServerInstance string - The search server instance to target
    suggestName string - The mandatory suggest name. Must match [A-z0-9_]+
    whiteListURL string - URL to a suggest-allow list resource
    blackListURL string - URL to a suggest-block list resource
    removeRoot boolean False Removes the category root from results.

    Nested elements
    • SuggestBuildConfig
    • SuggestBuildTrigger
    • SuggestQueryConfig
    Type Wrapper Description
    SuggestBuildConfig - Build options
    SuggestBuildTrigger - Optional build trigger
    SuggestQueryConfig -
    XML representation
    <CategoryPathSuggest root="" rootForm="normalized" logic="" target="" indexField="" query="#all" searchServerInstance="" suggestName="" whiteListURL="" blackListURL="" removeRoot="false" ></CategoryPathSuggest>
  • StaticXMLSuggest com.exalead.mercury.mami.search.v20.StaticXMLSuggest
    Takes a suggest XML file as parameter and compiles it.

    This suggest cannot be scheduled.

    Build options are read directly from the suggest XML file. Entries are compiled as found in the file.
    Parent objects
    • SuggestConfig
    Attributes
    • suggestName string
    • whiteListURL string
    • blackListURL string
    • url string
    • tokenizationConfig string
    Name Type Default value Description
    suggestName string - The mandatory suggest name. Must match [A-z0-9_]+
    whiteListURL string - URL to a suggest-allow list resource
    blackListURL string - URL to a suggest-block list resource
    url string - URL to the Suggest XML file.
    tokenizationConfig string - Tokenization Config to use.

    Nested elements
    • SuggestQueryConfig
    Type Wrapper Description
    SuggestQueryConfig -
    XML representation
    <StaticXMLSuggest suggestName="" whiteListURL="" blackListURL="" url="" tokenizationConfig="tok0" ></StaticXMLSuggest>
  • StaticResourceSuggest com.exalead.mercury.mami.search.v20.StaticResourceSuggest
    Takes an already compiled resource as parameter.

    This suggest can neither be scheduled nor built.
    Parent objects
    • SuggestConfig
    Attributes
    • suggestName string
    • whiteListURL string
    • blackListURL string
    • url string
    • tokenizationConfig string
    Name Type Default value Description
    suggestName string - The mandatory suggest name. Must match [A-z0-9_]+
    whiteListURL string - URL to a suggest-allow list resource
    blackListURL string - URL to a suggest-block list resource
    url string - URL to the directory where the compiled suggest data is stored.
    tokenizationConfig string - Tokenization Config to use.

    Nested elements
    • SuggestQueryConfig
    Type Wrapper Description
    SuggestQueryConfig -
    XML representation
    <StaticResourceSuggest suggestName="" whiteListURL="" blackListURL="" url="" tokenizationConfig="tok0" ></StaticResourceSuggest>
  • SuggestDispatcher com.exalead.mercury.mami.search.v20.SuggestDispatcher
    A SuggestDispatcher associates prefix handlers to Suggests.

    It may be used by the SuggestCommand to dispatch a relevant part of a query to the right suggest.

    For example, given a query "title: test", it will send "test" to the Suggest attached to the "title" prefix handler.
    Parent objects
    • SuggestConfig
    Attributes
    • name string
    • matchWholeQuery boolean
    • useWithUnknownPrefix boolean
    • addQuotes boolean
    • displayPrefixInSuggest boolean
    • maxNbResults int
    • boostVariety boolean
    Name Type Default value Description
    name string - The mandatory suggest dispatcher name. Must match [A-z0-9_]+
    matchWholeQuery boolean False Sends the whole query to the default suggest if the cursor is outside a prefix handler.

    If unselected, it sends a subquery delimited by surrounding prefixes.
    useWithUnknownPrefix boolean False Sends the query to the default suggest if the cursor is within an undefined prefix handler.

    If unselected, undefined prefix handlers are ignored and there will be no suggestions.
    addQuotes boolean True Adds quotes where necessary so that the whole suggestion is included in the prefix handler.
    displayPrefixInSuggest boolean False When activated, the suggest is prefixed by the corresponding prefix handler if defined.
    maxNbResults int 0 Max number of suggest entries returned by this suggest dispatcher (0 returns all available entries).
    boostVariety boolean False Retrieves the best matches for each suggest, within the maximum number of suggestions. Note that this mode does not always return the best global results.

    Nested elements
    • designedForLogics StringValue*
    • PrefixHandlerSuggestPair*
    Type Wrapper Description
    StringValue* designedForLogics SearchLogic names. Prefix handler names are checked against these logics.
    PrefixHandlerSuggestPair* - Association of a prefix handler and a suggest
    XML representation
    <SuggestDispatcher name="" matchWholeQuery="false" useWithUnknownPrefix="false" addQuotes="true" displayPrefixInSuggest="false" maxNbResults="0" boostVariety="false" ></SuggestDispatcher>
  • PrefixHandlerSuggestPair com.exalead.mercury.mami.search.v20.PrefixHandlerSuggestPair
    Maps a prefix handler to a suggest dictionary. You can map as many pairs as needed.
    Parent objects
    • SuggestDispatcher
    Attributes
    • prefixHandler string
    • suggestName string
    • maxNbResults int
    • useInDefaultSuggest boolean
    Name Type Default value Description
    prefixHandler string - A prefix handler name or alias
    suggestName string - An existing suggest name
    maxNbResults int 0 Max number of suggest entries returned by this suggest
    useInDefaultSuggest boolean False Select Default to specify the suggest dictionary to use by default for a specific prefix handler.

    Note that you must specify at least one default suggest to use the following options:
    • Match whole query
    • Use default suggest for non configured prefix
    • Add prefix handler to suggestion

    XML representation
    <PrefixHandlerSuggestPair prefixHandler="" suggestName="" maxNbResults="0" useInDefaultSuggest="false" ></PrefixHandlerSuggestPair>
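    As a sketch of how a dispatcher and its pairs fit together, the fragment below nests two PrefixHandlerSuggestPair elements in a SuggestDispatcher, so that a query such as "title: test" would send "test" to the suggest attached to the "title" prefix handler. The names main_dispatcher, title_suggest, and author_suggest, and the "author" prefix handler, are hypothetical. The referenced suggests and prefix handlers are assumed to be defined elsewhere in the configuration, and attributes that are not shown are assumed to keep their documented defaults.

```xml
<!-- Hypothetical names throughout; only attributes documented above are used. -->
<SuggestDispatcher name="main_dispatcher" matchWholeQuery="true" useWithUnknownPrefix="false" addQuotes="true" displayPrefixInSuggest="true" maxNbResults="10">
  <!-- Marked as default: the reference requires at least one default suggest
       before matchWholeQuery or displayPrefixInSuggest can be used. -->
  <PrefixHandlerSuggestPair prefixHandler="title" suggestName="title_suggest" maxNbResults="5" useInDefaultSuggest="true" />
  <!-- A second pair, not used as a default suggest. -->
  <PrefixHandlerSuggestPair prefixHandler="author" suggestName="author_suggest" maxNbResults="5" useInDefaultSuggest="false" />
</SuggestDispatcher>
```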
  • LoggingConfig com.exalead.mercury.mami.master.v10.LoggingConfig
    Data model
    General Logging configuration, for generic process logs.
    Specific systems (crawler, query) have their own log policies.
    Attributes
    • version long
    • enableTimeBasedLogRotation boolean
    • logRotationCronExpression string
    • logRotationMaxSizeKB long
    • logPurgeMaxSizeKB long
    • logExpirationDays long
    • logInSyslog boolean
    Name Type Default value Description
    version long - -
    enableTimeBasedLogRotation boolean False Rotates process log files each day at midnight.
    logRotationCronExpression string - Custom Quartz Cron expression to trigger log rotation. Only taken into account if enableTimeBasedLogRotation is false.
    logRotationMaxSizeKB long - If set, process log files are rotated when bigger than the specified number of kilobytes.
    logPurgeMaxSizeKB long - If set, process log files are purged (oldest first) when their total size exceeds the specified number of kilobytes.
    logExpirationDays long - If set, process log files are purged when older than the specified number of days.
    logInSyslog boolean False If set, logs are also copied to syslog with type=daemon, ident=cloudview, message="AppenderName: logMessage".
    This produces lines such as the following in /var/log/daemon.log: "Mar 19 12:10:47 madpc125 cloudview: index.clustermanager: Building slice list for st0"

    Nested elements
    • GlobalLoggerConfig
    • LoggingLevel*
    Type Wrapper Description
    GlobalLoggerConfig -
    LoggingLevel* -
    XML representation
    <LoggingConfig version="" enableTimeBasedLogRotation="false" logRotationCronExpression="" logRotationMaxSizeKB="" logPurgeMaxSizeKB="" logExpirationDays="" logInSyslog="false" ></LoggingConfig>
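    For illustration, the fragment below combines the rotation and purge attributes described above. All values are examples only: the Quartz Cron expression "0 0 3 * * ?" fires every day at 03:00, 51200 KB is 50 MB, and 1048576 KB is 1 GB. The version attribute is omitted here on the assumption that it is managed by the product.

```xml
<!-- Example values only. Daily midnight rotation is disabled, so the custom
     Cron expression is taken into account (see logRotationCronExpression). -->
<LoggingConfig enableTimeBasedLogRotation="false" logRotationCronExpression="0 0 3 * * ?" logRotationMaxSizeKB="51200" logPurgeMaxSizeKB="1048576" logExpirationDays="30" logInSyslog="false">
</LoggingConfig>
```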
  • GlobalLoggerConfig com.exalead.mercury.mami.master.v10.GlobalLoggerConfig
    Global logger module configuration.
    Parent objects
    • LoggingConfig
    Attributes
    • maxSize int
    • enableGlobalLogEntryStore boolean
    Name Type Default value Description
    maxSize int 100 Max size of the RAM buffer that stores latest messages.
    enableGlobalLogEntryStore boolean False Keeps all logs (in this case, an additional structure is created on disk and GetGlobalLogEntryList can be called).

    XML representation
    <GlobalLoggerConfig maxSize="100" enableGlobalLogEntryStore="false" ></GlobalLoggerConfig>
  • LoggingLevel com.exalead.mercury.mami.master.v10.LoggingLevel
    Logging level configuration.
    Parent objects
    • LoggingConfig
    Attributes
    • level string
    • host string
    • install string
    • process string
    • logger string
    Name Type Default value Description
    level string - Logging level. The following values are available: off, fatal, error, warning, info, notice, debug, and trace.
    host string - Host for which this logging level should be applied.

    If null, the configuration applies to every host of the cluster.
    install string - Instance for which this logging level should be applied.

    If null, the configuration applies to every instance of the cluster.
    process string - Process for which this logging level should be applied.

    If null, the configuration applies to every process of the selected host (or every host if host is null).
    logger string - Logger for which this logging level should be applied.

    If null, the configuration applies to every logger.

    XML representation
    <LoggingLevel level="" host="" install="" process="" logger="" ></LoggingLevel>
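    The scoping rules above can be sketched with two LoggingLevel elements nested in a LoggingConfig: one that applies everywhere, and one narrowed to a single host and logger. The host name madpc125 and the logger name index.clustermanager are borrowed from the syslog example in the LoggingConfig description. Treating an omitted attribute as null, and expecting the narrower entry to win over the general one, are both assumptions that this reference does not confirm.

```xml
<LoggingConfig>
  <!-- No host, install, process, or logger: assumed to apply cluster-wide. -->
  <LoggingLevel level="info" />
  <!-- Narrowed to one host and one logger; process is left unset so that it
       covers every process of that host. -->
  <LoggingLevel level="debug" host="madpc125" logger="index.clustermanager" />
</LoggingConfig>
```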
  • SetLoggingLevel com.exalead.mercury.mami.master.v10.SetLoggingLevel
    Dynamically updates the logging level of a given process.
    Parent objects
    • LoggingConfig
    Attributes
    • level string
    • host string
    • install string
    • process string
    • logger string
    Name Type Default value Description
    level string - Logging level. The following values are available: off, fatal, error, warning, info, notice, debug, and trace.
    host string - Host for which this logging level should be applied.

    If null, the configuration applies to every host of the cluster.
    install string - Instance for which this logging level should be applied.

    If null, the configuration applies to every instance of the cluster.
    process string - Process for which this logging level should be applied.

    If null, the configuration applies to every process of the selected host (or every host if host is null).
    logger string - Logger for which this logging level should be applied.

    If null, the configuration applies to every logger.

    XML representation
    <SetLoggingLevel level="" host="" install="" process="" logger="" ></SetLoggingLevel>
  • ReportingConfig com.exalead.cloudview.reporting.v10.ReportingConfig
    No documentation for this element.
    Attributes
    • httpProxyHost string
    • httpProxyPort int
    • httpProxyLogin string
    • httpProxyPassword string
    Name Type Default value Description
    httpProxyHost string - Address of the HTTP proxy used to send usage data reports and diagnostics to Exalead.
    httpProxyPort int - Port of the HTTP proxy used to send usage data reports and diagnostics to Exalead.
    httpProxyLogin string - Login for the HTTP proxy used to send usage data reports and diagnostics to Exalead.
    httpProxyPassword string - Password for the HTTP proxy used to send usage data reports and diagnostics to Exalead.

    Nested elements
    • CustomReportField*
    • NotificationsConfig
    • PerformanceMonitoringConfig
    • SearchDynamicReportingConfig
    • SearchStaticReportingConfig
    • ServiceReportingConfig
    • UsageDataReportConfig
    Type Wrapper Description
    CustomReportField* -
    NotificationsConfig -
    PerformanceMonitoringConfig -
    SearchDynamicReportingConfig -
    SearchStaticReportingConfig -
    ServiceReportingConfig -
    UsageDataReportConfig -
    XML representation
    <ReportingConfig httpProxyHost="" httpProxyPort="" httpProxyLogin="" httpProxyPassword="" ></ReportingConfig>
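    A minimal sketch of the proxy attributes, assuming an authenticated proxy. The host name, port, and credentials are placeholders, not values taken from the product.

```xml
<!-- Placeholder proxy settings; replace with the values of your own proxy. -->
<ReportingConfig httpProxyHost="proxy.example.com" httpProxyPort="3128" httpProxyLogin="cv_proxy_user" httpProxyPassword="changeme">
</ReportingConfig>
```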
  • HttpHeaderField com.exalead.cloudview.reporting.v10.HttpHeaderField
    No documentation for this element.
    Parent objects
    • ReportingConfig
    Attributes
    • fieldName string
    • headerName string
    Name Type Default value Description
    fieldName string -
    headerName string -

    XML representation
    <HttpHeaderField fieldName="" headerName="" ></HttpHeaderField>
  • HttpParamField com.exalead.cloudview.reporting.v10.HttpParamField
    No documentation for this element.
    Parent objects
    • ReportingConfig
    Attributes
    • fieldName string
    • paramName string
    Name Type Default value Description
    fieldName string -
    paramName string -

    XML representation
    <HttpParamField fieldName="" paramName="" ></HttpParamField>
  • NotificationsConfig com.exalead.cloudview.reporting.v10.NotificationsConfig
    No documentation for this element.
    Parent objects
    • ReportingConfig
    Attributes
    • enabled boolean
    • licenseNotificationEnabled boolean
    • processesNotificationEnabled boolean
    • indexingNotificationEnabled boolean
    • searchNotificationEnabled boolean
    • tokenNotificatioEnabled boolean
    • tokenNotificationThreshold double
    • sender string
    • recipient string
    • subjectPrefix string
    Name Type Default value Description
    enabled boolean True
    licenseNotificationEnabled boolean True
    processesNotificationEnabled boolean True
    indexingNotificationEnabled boolean True
    searchNotificationEnabled boolean True
    tokenNotificatioEnabled boolean True
    tokenNotificationThreshold double 0.8
    sender string -
    recipient string -
    subjectPrefix string [CloudView]

    Nested elements
    • SMTPServer
    Type Wrapper Description
    SMTPServer -
    XML representation
    <NotificationsConfig enabled="true" licenseNotificationEnabled="true" processesNotificationEnabled="true" indexingNotificationEnabled="true" searchNotificationEnabled="true" tokenNotificatioEnabled="true" tokenNotificationThreshold="0.8" subjectPrefix="[CloudView] " ></NotificationsConfig>
  • SMTPServer com.exalead.cloudview.reporting.v10.SMTPServer
    SMTP server used to send reporting emails for your CloudView instance.
    Parent objects
    • NotificationsConfig
    Attributes
    • username string
    • password string
    • host string
    • port int
    • tls boolean
    Name Type Default value Description
    username string - Username used to connect to the SMTP server.
    password string - Password associated with the username.
    host string localhost Hostname of the SMTP server.
    port int 25 Port of the targeted SMTP server.
    tls boolean False Enables the TLS (Transport Layer Security) protocol for this server.

    XML representation
    <SMTPServer username="" password="" host="localhost" port="25" tls="false" ></SMTPServer>
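    As an illustration, the fragment below nests an SMTPServer in a NotificationsConfig. The addresses, host name, and credentials are placeholders. Port 587 is the conventional mail submission port, but whether it matches the behavior of the tls option in this product is an assumption. The individual notification switches are left out and assumed to keep their documented default (True).

```xml
<!-- Placeholder sender, recipient, and SMTP settings. -->
<NotificationsConfig enabled="true" tokenNotificationThreshold="0.9" sender="cloudview@example.com" recipient="ops@example.com" subjectPrefix="[CloudView] ">
  <SMTPServer username="cloudview" password="changeme" host="smtp.example.com" port="587" tls="true" />
</NotificationsConfig>
```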
  • PerformanceMonitoringConfig com.exalead.cloudview.reporting.v10.PerformanceMonitoringConfig
    Monitoring-Console configuration
    Parent objects
    • ReportingConfig
    Attributes
    • monitoringIntervalS int
    • timeout int
    • collectorThreads int
    • enabled boolean
    • rrddir string
    • configdir string
    • tmpdir string
    • libspath string
    • autocreate boolean
    • onlyForInstallName string
    Name Type Default value Description
    monitoringIntervalS int 60 Interval in seconds between two monitoring probes.
    Value must be greater than or equal to 4.
    timeout int 20 Timeout value in seconds for network activity
    collectorThreads int 6
    enabled boolean True Enables monitoring
    rrddir string ${dataDir}/perfmonitoring/probe Directory where probe files (RRD) are stored.
    configdir string ${dataDir}/perfmonitoring/config Directory parsed to find XML configuration files.
    tmpdir string ${dataDir}/perfmonitoring/tmp Temporary directory used internally.
    libspath string ${installDir}/extrajava/jrds/ List of JAR files or directories containing probes.
    autocreate boolean True Enables the auto-creation of configdir and rrddir.
    onlyForInstallName string If set, restricts monitoring to a specific install name.

    Nested elements
    • monitoredPageCaches StringValue*
    • MonitoredHost*
    • MonitoredIndexField*
    • MonitoredIndexFieldConfig
    • PerformanceMonitoringTimer*
    Type Wrapper Description
    StringValue* monitoredPageCaches List of PageCache to monitor.
    MonitoredHost* - Additional host probes
    MonitoredIndexField* - List of index fields to monitor.
    MonitoredIndexFieldConfig - Global index field monitoring configuration.
    PerformanceMonitoringTimer* - Additional timers
    XML representation
    <PerformanceMonitoringConfig monitoringIntervalS="60" timeout="20" collectorThreads="6" enabled="true" rrddir="${dataDir}/perfmonitoring/probe" configdir="${dataDir}/perfmonitoring/config" tmpdir="${dataDir}/perfmonitoring/tmp" libspath="${installDir}/extrajava/jrds/" autocreate="true" onlyForInstallName="" ></PerformanceMonitoringConfig>
  • MonitoredHost com.exalead.cloudview.reporting.v10.MonitoredHost
    No documentation for this element.
    Parent objects
    • PerformanceMonitoringConfig
    Attributes
    • name string
    Name Type Default value Description
    name string -

    Nested elements
    • MonitoredHostConnection*
    • MonitoredHostProbe*
    Type Wrapper Description
    MonitoredHostConnection* -
    MonitoredHostProbe* -
    XML representation
    <MonitoredHost name="" ></MonitoredHost>
  • MonitoredHostConnection com.exalead.cloudview.reporting.v10.MonitoredHostConnection
    How to connect to a monitored host.
    Parent objects
    • MonitoredHost
    Attributes
    • type string
    • name string
    Name Type Default value Description
    type string - Connection type. Must not be empty.
    name string - Connection name. Must not be empty.

    Nested elements
    • MonitoredHostElementArg*
    Type Wrapper Description
    MonitoredHostElementArg* - List of arguments for connection. Currently unused.
    XML representation
    <MonitoredHostConnection type="" name="" ></MonitoredHostConnection>
  • MonitoredHostElementArg com.exalead.cloudview.reporting.v10.MonitoredHostElementArg
    No documentation for this element.
    Parent objects
    • MonitoredHostConnection
    • MonitoredHostProbe
    • argList MonitoredHostProbe
    Attributes
    • type string
    • value string
    Name Type Default value Description
    type string -
    value string -

    XML representation
    <MonitoredHostElementArg type="" value="" ></MonitoredHostElementArg>
  • MonitoredHostProbe com.exalead.cloudview.reporting.v10.MonitoredHostProbe
    A monitoring probe
    Parent objects
    • MonitoredHost
    Attributes
    • type string
    • label string
    • connection string
    Name Type Default value Description
    type string - The probe type
    label string - The descriptive label of the probe
    connection string - The name of the MonitoredHostConnection used by the probe.

    Nested elements
    • argList MonitoredHostElementArg*
    • MonitoredHostElementArg*
    Type Wrapper Description
    MonitoredHostElementArg* argList The list of MonitoredHostElementArg (one per argument of the probe)
    MonitoredHostElementArg* - The list of MonitoredHostElementArg (one per argument of the probe).
    XML representation
    <MonitoredHostProbe type="" label="" connection="" ></MonitoredHostProbe>
  • MonitoredIndexField com.exalead.cloudview.reporting.v10.MonitoredIndexField
    No documentation for this element.
    Parent objects
    • PerformanceMonitoringConfig
    Attributes
    • name string
    Name Type Default value Description
    name string -

    XML representation
    <MonitoredIndexField name="" ></MonitoredIndexField>
  • MonitoredIndexFieldConfig com.exalead.cloudview.reporting.v10.MonitoredIndexFieldConfig
    No documentation for this element.
    Parent objects
    • PerformanceMonitoringConfig
    Attributes
    • monitorAllFacetFields boolean
    Name Type Default value Description
    monitorAllFacetFields boolean True If true, monitors all the Value and Category fields.

    XML representation
    <MonitoredIndexFieldConfig monitorAllFacetFields="true" ></MonitoredIndexFieldConfig>
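    The two index field elements can be combined as in the sketch below, which turns off the monitoring of all facet fields and lists two fields explicitly. The field names are hypothetical, and the idea that an explicit MonitoredIndexField list takes over when monitorAllFacetFields is false is an assumption rather than documented behavior.

```xml
<PerformanceMonitoringConfig monitoringIntervalS="60" timeout="20" enabled="true">
  <!-- Do not monitor every Value and Category field automatically. -->
  <MonitoredIndexFieldConfig monitorAllFacetFields="false" />
  <!-- Hypothetical index field names. -->
  <MonitoredIndexField name="author" />
  <MonitoredIndexField name="file_extension" />
</PerformanceMonitoringConfig>
```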
  • PerformanceMonitoringTimer com.exalead.cloudview.reporting.v10.PerformanceMonitoringTimer
    Timer configuration. A timer is identified by a name and is composed of one or several probe types.
    Parent objects
    • PerformanceMonitoringConfig
    Attributes
    • monitoringIntervalS int
    • timeout int
    • collectorThreads int
    • name string
    Name Type Default value Description
    monitoringIntervalS int 60 Interval in seconds between two monitoring probes.
    Value must be greater than or equal to 4.
    timeout int 20 Timeout value in seconds for network activity
    collectorThreads int 6
    name string - Identifier name of the timer

    Nested elements
    • probeTypes StringValue*
    Type Wrapper Description
    StringValue* probeTypes List of probe types using this timer. Probe types that are not referenced by any timer use the default one.
    XML representation
    <PerformanceMonitoringTimer monitoringIntervalS="60" timeout="20" collectorThreads="6" name="" ></PerformanceMonitoringTimer>
  • SearchDynamicReportingConfig com.exalead.cloudview.reporting.v10.SearchDynamicReportingConfig
    Dynamic Reporting configuration
    Parent objects
    • ReportingConfig
    Attributes
    • enabled boolean
    Name Type Default value Description
    enabled boolean False Enables automatic report generation.

    Nested elements
    • SearchDynamicReportingConfigNames
    Type Wrapper Description
    SearchDynamicReportingConfigNames - Generated reporting configuration element names.
    XML representation
    <SearchDynamicReportingConfig enabled="false" ></SearchDynamicReportingConfig>
  • SearchDynamicReportingConfigNames com.exalead.cloudview.reporting.v10.SearchDynamicReportingConfigNames
    Contains names of configuration elements used by the dynamic reporting
    Parent objects
    • SearchDynamicReportingConfig
    Attributes
    • hostName string
    • installName string
    • papiPort int
    • reportingConnector string
    • reportingBuildGroup string
    • reportingDataModel string
    • reportingTaskQueue string
    • reportingIndexBuilder string
    • reportingSearchTarget string
    • reportingSearchLogic string
    Name Type Default value Description
    hostName string -
    installName string -
    papiPort int -
    reportingConnector string -
    reportingBuildGroup string -
    reportingDataModel string -
    reportingTaskQueue string -
    reportingIndexBuilder string -
    reportingSearchTarget string -
    reportingSearchLogic string -

    XML representation
    <SearchDynamicReportingConfigNames hostName="" installName="" papiPort="" reportingConnector="" reportingBuildGroup="" reportingDataModel="" reportingTaskQueue="" reportingIndexBuilder="" reportingSearchTarget="" reportingSearchLogic="" ></SearchDynamicReportingConfigNames>
  • SearchStaticReportingConfig com.exalead.cloudview.reporting.v10.SearchStaticReportingConfig
    Static Reporting configuration
    Parent objects
    • ReportingConfig
    Attributes
    • enabled boolean
    • reportCron string
    • reportApplicationCron string
    • topLimit int
    • maxVersionsToKeep int
    Name Type Default value Description
    enabled boolean True Enables automatic report generation.
    reportCron string - Quartz Cron expression triggering global static report generation.
    reportApplicationCron string 0 10 0 * * ? Quartz Cron expression triggering static report generation on applications.

    The default setting is every day at 00:10.

    Reports are generated for each deployed application.
    topLimit int 100 Maximum number of rows to retrieve while computing ordered results (for example, top queries).
    maxVersionsToKeep int 30 Maximum number of resource versions to keep in the StaticReportList. 0 means no limit is enforced, whereas 1 will keep only the most recent version.

    Nested elements
    • ApplicationReportScheduling*
    Type Wrapper Description
    ApplicationReportScheduling* - List of application reports scheduling.

    Each application can have a scheduled search static reporting.

    Each application report scheduling is defined by the application id and a Quartz Cron expression.
    XML representation
    <SearchStaticReportingConfig enabled="true" reportCron="" reportApplicationCron="0 10 0 * * ?" topLimit="100" maxVersionsToKeep="30" ></SearchStaticReportingConfig>
  • ApplicationReportScheduling com.exalead.cloudview.reporting.v10.ApplicationReportScheduling
    Associates a Quartz Cron expression with a specific application to trigger its report generation.
    Parent objects
    • SearchStaticReportingConfig
    Attributes
    • cron string
    • applicationId string
    Name Type Default value Description
    cron string -
    applicationId string -

    XML representation
    <ApplicationReportScheduling cron="" applicationId="" ></ApplicationReportScheduling>
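    A sketch of per-application scheduling, assuming two deployed applications with the hypothetical ids "default" and "intranet". The Quartz Cron expression "0 30 2 ? * MON" fires every Monday at 02:30, and "0 0 4 * * ?" fires every day at 04:00. How these entries interact with reportApplicationCron is not stated in this reference, so the default value is simply kept here.

```xml
<SearchStaticReportingConfig enabled="true" reportApplicationCron="0 10 0 * * ?" topLimit="100" maxVersionsToKeep="30">
  <!-- Hypothetical application ids; one schedule per application. -->
  <ApplicationReportScheduling cron="0 30 2 ? * MON" applicationId="default" />
  <ApplicationReportScheduling cron="0 0 4 * * ?" applicationId="intranet" />
</SearchStaticReportingConfig>
```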
  • ServiceReportingConfig com.exalead.cloudview.reporting.v10.ServiceReportingConfig
    No documentation for this element.
    Nested elements
    • ReporterConfig*
    Type Wrapper Description
    ReporterConfig* -
    XML representation
    <ServiceReportingConfig ></ServiceReportingConfig>
  • ReporterConfig com.exalead.cloudview.reporting.v10.ReporterConfig
    No documentation for this element.
    Parent objects
    • ServiceReportingConfig
    Attributes
    • name string
    Name Type Default value Description
    name string -

    Nested elements
    • ReportingPublisher*
    Type Wrapper Description
    ReportingPublisher* -
    XML representation
    <ReporterConfig name="" ></ReporterConfig>
  • PAPIReportingPublisher com.exalead.cloudview.reporting.v10.PAPIReportingPublisher
    Base class for PAPI Publishers.
    Parent objects
    • ReporterConfig
    Attributes
    • synchronous boolean
    • closeTimeoutMS int
    • batchTimeoutMS int
    • minBatchSize int
    • maxBatchSize int
    • maxQueueSize int
    • publishedFields string
    • host string
    • port int
    • connectorName string
    Name Type Default value Description
    synchronous boolean False
    closeTimeoutMS int 0
    batchTimeoutMS int 1000
    minBatchSize int 50
    maxBatchSize int 250
    maxQueueSize int 250
    publishedFields string -
    host string - Hostname of the reporting publisher
    port int - Port of the reporting publisher
    connectorName string - Name of the reporting publisher

    XML representation
    <PAPIReportingPublisher synchronous="false" closeTimeoutMS="0" batchTimeoutMS="1000" minBatchSize="50" maxBatchSize="250" maxQueueSize="250" publishedFields="" host="" port="" connectorName="" ></PAPIReportingPublisher>
  • CSVReportingPublisher com.exalead.cloudview.reporting.v10.CSVReportingPublisher
    Base class for CSV Publishers.
    Parent objects
    • ReporterConfig
    Attributes
    • synchronous boolean
    • closeTimeoutMS int
    • batchTimeoutMS int
    • minBatchSize int
    • maxBatchSize int
    • maxQueueSize int
    • publishedFields string
    • file string
    • maxFileSizeMB int
    • rotateEveryNMonths int
    • rotateEveryNDays int
    • rotateEveryNHours int
    • maxSizeToKeepMB int
    • maxFileToKeep int
    • maxDayToKeep int
    Name Type Default value Description
    synchronous boolean False
    closeTimeoutMS int 0
    batchTimeoutMS int 1000
    minBatchSize int 50
    maxBatchSize int 250
    maxQueueSize int 250
    publishedFields string -
    file string -
    maxFileSizeMB int 50 0 means no rotation is triggered by the file size.
    rotateEveryNMonths int - 0 means no rotation is triggered by the file age in months.
    rotateEveryNDays int 1 0 means no rotation is triggered by the file age in days.
    rotateEveryNHours int - 0 means no rotation is triggered by the file age in hours.
    maxSizeToKeepMB int 2500 Maximum total volume of CSV reporting files, in megabytes.
    The oldest files are discarded at rotation time. 0 means no limit is enforced.
    maxFileToKeep int 50 Maximum number of CSV reporting files to keep.
    The oldest files are discarded at rotation time. 0 means no limit is enforced, whereas 1 discards all rotated files.
    maxDayToKeep int 30 Maximum age, in days, of CSV reporting files to keep.
    The oldest files are discarded at rotation time. 0 means no limit is enforced, whereas 1 keeps only today's files.

    XML representation
    <CSVReportingPublisher synchronous="false" closeTimeoutMS="0" batchTimeoutMS="1000" minBatchSize="50" maxBatchSize="250" maxQueueSize="250" publishedFields="" file="" maxFileSizeMB="50" rotateEveryNMonths="" rotateEveryNDays="1" rotateEveryNHours="" maxSizeToKeepMB="2500" maxFileToKeep="50" maxDayToKeep="30" ></CSVReportingPublisher>
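    To show how the rotation and retention attributes combine, the sketch below rotates the CSV file daily or at 100 MB, and keeps at most 14 files, 14 days, or 1000 MB of rotated data. The reporter name and file path are hypothetical. Placing the publisher directly inside ReporterConfig, and reading the file attribute as the output path, are assumptions based on the parent and attribute names. The batching attributes are omitted and assumed to keep their defaults.

```xml
<!-- Hypothetical reporter name and output path. -->
<ReporterConfig name="search_csv">
  <CSVReportingPublisher file="/data/reporting/search.csv" maxFileSizeMB="100" rotateEveryNDays="1" maxSizeToKeepMB="1000" maxFileToKeep="14" maxDayToKeep="14" />
</ReporterConfig>
```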
  • JDBCReportingPublisher com.exalead.cloudview.reporting.v10.JDBCReportingPublisher
    Base class for JDBC Publishers.
    Parent objects
    • ReporterConfig
    Attributes
    • synchronous boolean
    • closeTimeoutMS int
    • batchTimeoutMS int
    • minBatchSize int
    • maxBatchSize int
    • maxQueueSize int
    • publishedFields string
    • driver string
    • connectionString string
    • login string
    • password string
    • table string
    • insertQuery string
    Name Type Default value Description
    synchronous boolean False
    closeTimeoutMS int 0
    batchTimeoutMS int 1000
    minBatchSize int 50
    maxBatchSize int 250
    maxQueueSize int 250
    publishedFields string -
    driver string - JDBC driver fully qualified Java class name.
    connectionString string - JDBC connection string. This is vendor specific.
    login string - Login for the database connection.
    password string - Password for the database connection.
    table string - Table where data must be inserted.
    insertQuery string - Query used to insert data in the table.

    XML representation
    <JDBCReportingPublisher synchronous="false" closeTimeoutMS="0" batchTimeoutMS="1000" minBatchSize="50" maxBatchSize="250" maxQueueSize="250" publishedFields="" driver="" connectionString="" login="" password="" table="" insertQuery="" ></JDBCReportingPublisher>
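    A hedged sketch of a JDBC publisher that targets PostgreSQL. The driver class org.postgresql.Driver and the jdbc:postgresql:// URL form are the standard ones for the PostgreSQL JDBC driver, while the host, database, table, and credentials are placeholders. The insertQuery value is left empty because this reference does not describe its syntax.

```xml
<!-- Placeholder database coordinates; insertQuery is intentionally not filled in. -->
<ReporterConfig name="search_jdbc">
  <JDBCReportingPublisher driver="org.postgresql.Driver" connectionString="jdbc:postgresql://db.example.com:5432/reporting" login="cv_reporting" password="changeme" table="search_reports" insertQuery="" />
</ReporterConfig>
```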
  • ReportingStorePublisher com.exalead.cloudview.reporting.v10.ReportingStorePublisher
    Base class for PAPI Publishers.
    Parent objects
    • ReporterConfig
    Attributes
    • synchronous boolean
    • closeTimeoutMS int
    • batchTimeoutMS int
    • minBatchSize int
    • maxBatchSize int
    • maxQueueSize int
    • publishedFields string
    • schema string
    • rotationCron string
    • maxRowCount long
    Name Type Default value Description
    synchronous boolean False
    closeTimeoutMS int 0
    batchTimeoutMS int 1000
    minBatchSize int 50
    maxBatchSize int 250
    maxQueueSize int 250
    publishedFields string -
    schema string -
    rotationCron string 0 0 0 * * ? Quartz Cron expression to force a rotation.

    A rotation is also triggered every time a collection is queried.
    maxRowCount long 100000 Maximum number of rows in the database.

    When this size is reached, the oldest records are discarded. 0 means there is no limit to the database size.

    XML representation
    <ReportingStorePublisher synchronous="false" closeTimeoutMS="0" batchTimeoutMS="1000" minBatchSize="50" maxBatchSize="250" maxQueueSize="250" publishedFields="" schema="" rotationCron="0 0 0 * * ?" maxRowCount="100000" ></ReportingStorePublisher>
  • UsageDataReportConfig com.exalead.cloudview.reporting.v10.UsageDataReportConfig
    Configuration of the Usage Data Reporting (UDR). UDR allows 3DS EXALEAD to improve its products thanks to anonymous information on their use.
    Parent objects
    • ReportingConfig
    Attributes
    • enabled boolean
    Name Type Default value Description
    enabled boolean False Should usage data be sent regularly to 3DS EXALEAD?

    XML representation
    <UsageDataReportConfig enabled="false" ></UsageDataReportConfig>
  • DeploymentConfig exa.bee.deploy.v10.DeploymentConfig
    Deployment configuration.
    Defines the default PortRange used by hosts.

    Defines the list of Host elements that are part of the cluster configuration.

    Defines the list of virtual services defined for this installation.
    Attributes
    • version long
    • description string
    • hostAgentPort int
    Name Type Default value Description
    version long - -
    description string - A descriptive name for this deployment, shown in the Admin-UI for identification purposes.
    hostAgentPort int - Default port for the hostagent if not defined in each host

    Nested elements
    • Host*
    • PortRange
    • VirtualService*
    Type Wrapper Description
    Host* - -
    PortRange - -
    VirtualService* - -
    XML representation
    <DeploymentConfig version="" description="" hostAgentPort="" ></DeploymentConfig>
  • Host exa.bee.deploy.v10.Host
    Defines a PortRange which overrides the one defined at DeploymentConfig level. Note that if this host's port range has a null start port, the start port of the default port range is used. The same applies to the last port.

    Defines the set of roles implemented by this host. See Management API reference guide for a complete list of available roles.
    Parent objects
    • DeploymentConfig
    Attributes
    • hostname string
    • install string
    • architecture string
    • hostAgentPort int
    Name Type Default value Description
    hostname string - Network name of the physical host. The host must be reachable by all other hosts of the CloudView install using this name.
    install string - Name of this CloudView cluster host on the physical host.
    It is possible to deploy several CloudView installs in the same CloudView cluster on the same physical host.
    architecture string - System architecture of the host. Optional.
    hostAgentPort int - Fixed host agent port (optional). Overrides the hostAgentPort set in DeploymentConfig.

    Nested elements
    • PortRange
    • Role*
    Type Wrapper Description
    PortRange - Range of TCP ports used for this host
    Role* - CloudView roles hosted by this host
    XML representation
    <Host hostname="" install="" architecture="" hostAgentPort="" ></Host>
  • PortRange exa.bee.deploy.v10.PortRange
    Port range attributed to the product
    Parent objects
    • DeploymentConfig
    • Host
    Attributes
    • startPort int
    • lastPort int
    Name Type Default value Description
    startPort int - First TCP port
    lastPort int - Last TCP port

    XML representation
    <PortRange startPort="" lastPort="" ></PortRange>
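    The override rule described for Host can be illustrated with a minimal sketch. Host names, install names and port numbers below are hypothetical, attributes not relevant to the example are omitted, the order of nested elements is an assumption, and namespace declarations are left out as in the XML representations of this reference.

    ```xml
    <DeploymentConfig description="Example deployment" hostAgentPort="10011">
      <Host hostname="host1.example.com" install="cvdefault">
        <!-- Only startPort is set here, so lastPort would fall back to
             the deployment-level value (10100) -->
        <PortRange startPort="10050" />
      </Host>
      <!-- No host-level PortRange: the deployment-level range applies -->
      <Host hostname="host2.example.com" install="cvdefault" hostAgentPort="10012" />
      <!-- Deployment-wide default port range -->
      <PortRange startPort="10000" lastPort="10100" />
    </DeploymentConfig>
    ```

    Under these assumptions, host1 would use ports 10050 to 10100, while host2 would use the full default range and a host agent port of 10012 instead of 10011.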
  • Role exa.bee.deploy.v10.Role
    A Role in the CloudView process. The list of valid Roles can be found in the CloudView deployment guide
    Parent objects
    • Host
    Attributes
    • name string
    Name Type Default value Description
    name string - -

    Nested elements
    • RoleAttribute*
    Type Wrapper Description
    RoleAttribute* - -
    XML representation
    <Role name="" ></Role>
  • RoleAttribute exa.bee.deploy.v10.RoleAttribute
    Parameter of a role. The list of valid role attributes depends on the Role.
    Parent objects
    • Role
    Attributes
    • name string
    • value string
    Name Type Default value Description
    name string - -
    value string - -

    XML representation
    <RoleAttribute name="" value="" ></RoleAttribute>
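    As a sketch of how Role and RoleAttribute nest inside a Host: this reference mentions a "Process" role carrying a "process" attribute (and a "virtualGroup" attribute), so the example uses those names. The host name, install name, process name and group name are hypothetical, and the valid roles and attributes should be checked against the guides cited above.

    ```xml
    <Host hostname="host1.example.com" install="cvdefault">
      <Role name="Process">
        <!-- Name of a process defined in a ProcessConfigList (hypothetical) -->
        <RoleAttribute name="process" value="exampleprocess" />
        <!-- Optional: group name matched by a VirtualServiceGroupRef (hypothetical) -->
        <RoleAttribute name="virtualGroup" value="exampleGroup" />
      </Role>
    </Host>
    ```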
  • VirtualService exa.bee.deploy.v10.VirtualService
    Virtual services are internal high availability points. Not used in general CloudView deployments.
    Defines a single access point to a pool of services. VirtualServiceRef can be used to override the default behavior of VirtualServiceGroupRef.
    Parent objects
    • DeploymentConfig
    Attributes
    • name string
    Name Type Default value Description
    name string - Name of the virtual service (used in Directory.dispatch(...)).

    Nested elements
    • VirtualServiceGroupRef*
    • VirtualServiceRef*
    Type Wrapper Description
    VirtualServiceGroupRef* - -
    VirtualServiceRef* - -
    XML representation
    <VirtualService name="" ></VirtualService>
  • VirtualServiceGroupRef exa.bee.deploy.v10.VirtualServiceGroupRef
    Internal advanced configuration.
    Reference to a group of services.
    The set of services referenced by a group is built from the "Process" roles whose virtualGroup attribute matches this group's name.
    Default behaviour is simple round-robin among all the services.
    Parent objects
    • VirtualService
    Attributes
    • name string
    • service string
    Name Type Default value Description
    name string - -
    service string - Name of the service which is virtualized

    XML representation
    <VirtualServiceGroupRef name="" service="" ></VirtualServiceGroupRef>
  • VirtualServiceRef exa.bee.deploy.v10.VirtualServiceRef
    Internal advanced configuration. Reference to a concrete service implementation.
    Parent objects
    • VirtualService
    Attributes
    • name string
    • enabled boolean
    • power int
    • maxConnection int
    Name Type Default value Description
    name string - Fully qualified path to the service implementation.
    enabled boolean true Use this to easily enable or disable this service reference.
    power int 1 -
    maxConnection int 100 -

    XML representation
    <VirtualServiceRef name="" enabled="true" power="1" maxConnection="100" ></VirtualServiceRef>
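    A minimal sketch of a virtual service that combines a group reference with a per-service override. All names and the service path are placeholders, and the exact semantics of power and maxConnection are not documented here, so they are left at their defaults.

    ```xml
    <VirtualService name="exampleVirtualService">
      <!-- Pool built from the "Process" roles whose virtualGroup
           attribute is "exampleGroup"; round-robin by default -->
      <VirtualServiceGroupRef name="exampleGroup" service="exampleService" />
      <!-- Override for one concrete service: temporarily disabled.
           The path is a placeholder, not a real service path. -->
      <VirtualServiceRef name="/example/path/to/service" enabled="false" />
    </VirtualService>
    ```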
  • CloudviewDeploymentInternalConfig com.exalead.mercury.mami.deploy.v10.CloudviewDeploymentInternalConfig
    Advanced internal settings.
    Attributes
    • enableServiceList boolean
    • version long
    • collapseIndexSlices boolean
    • collapseSearchServers boolean
    • collapseMaster boolean
    • debug boolean
    Name Type Default value Description
    enableServiceList boolean true Enable services listing on Java processes
    version long - -
    collapseIndexSlices boolean True Merges the processes for all slices (in a given index replica) into only one process.
    collapseSearchServers boolean False Merges searchserver processes in the gateway process.
    collapseMaster boolean False Merges the Exascript master process into the hostagent.
    debug boolean False Activate debugger access on java process.

    Nested elements
    • globalBeegistryKeysKeyValue*
    • ProcessInternalConfig*
    • SearchInternalParameters
    • ServerCiphers
    • ServerCryptographicProtocols
    • TickInternalParameters
    Type Wrapper Description
    KeyValue* globalBeegistryKeys Additional global beegistry keys
    ProcessInternalConfig* - Configuration sets for the process. Configurations are matched in a first-match fashion.
    For example, given <ProcessInternalConfig name="foobar" runtime="java" /> followed by <ProcessInternalConfig runtime="java" />, the second one is used for all java processes except foobar.
    SearchInternalParameters -
    ServerCiphers - Defines the cipher suite by a sequence of include and exclude statements.
    ServerCryptographicProtocols - Set the protocol that will be supported by servers (SSL, TLS..., see javax.net.ssl.SSLEngine)
    TickInternalParameters -
    XML representation
    <CloudviewDeploymentInternalConfig enableServiceList="true" version="" collapseIndexSlices="true" collapseSearchServers="false" collapseMaster="false" debug="false" ></CloudviewDeploymentInternalConfig>
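    The first-match rule for ProcessInternalConfig can be sketched as follows. The process name "foobar" comes from the description above; the httpMaxThreads values are hypothetical and only serve to tell the two entries apart.

    ```xml
    <CloudviewDeploymentInternalConfig>
      <!-- Most specific entry first: matches only the java process "foobar" -->
      <ProcessInternalConfig name="foobar" runtime="java" httpMaxThreads="100" />
      <!-- Generic entry: matches every other java process -->
      <ProcessInternalConfig runtime="java" httpMaxThreads="250" />
    </CloudviewDeploymentInternalConfig>
    ```

    Because the first matching entry wins, swapping the two entries would make the generic one match "foobar" as well, and the specific entry would never be used.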
  • ProcessInternalConfig exa.bee.deploy.v10.ProcessInternalConfig
    Internal configuration for a set of processes.
    Parent objects
    • CloudviewDeploymentInternalConfig
    Attributes
    • name string
    • hostname string
    • installName string
    • runtime string
    • exaBinary string
    • httpMinThreads int
    • httpMaxThreads int
    • httpMaxStopTime int
    • httpMaxGETQuerySize int
    • httpMaxFormPOSTQuerySize int
    • httpMaxIdleTimeMS int
    • httpAcceptorThreads int
    • httpAcceptQueueSize int
    Name Type Default value Description
    name string - Name of the process on which this applies. Applies to all processes if null
    hostname string - Name of the host on which this applies. Applies to all hosts if null
    installName string - Name of the install on which this applies. Applies to all installs if null
    runtime string - Runtime on which this applies. One of exa, java, dotnet. Applies to all runtimes if null
    exaBinary string - EXA binary to use in case of an EXA process.
    httpMinThreads int 5 Minimum number of alive threads for servicing HTTP queries
    httpMaxThreads int 500 Maximum number of alive threads for servicing HTTP queries
    httpMaxStopTime int 100 Maximum total time(s) allowed for current queries to be treated before really stopping the server
    httpMaxGETQuerySize int 32768 Default max size in bytes of HTTP GET queries on this process.
    Note that this might be overridden by specific settings (like the SearchAPI max size)
    httpMaxFormPOSTQuerySize int 10000000 Default max size in bytes of HTTP POST queries on this process.
    Note that this might be overridden by specific settings (like the SearchAPI max size)
    httpMaxIdleTimeMS int 86400000 Maximum timeout for an HTTP connection, either while waiting for new data, or while processing a query
    httpAcceptorThreads int 1 Number of threads waiting for new HTTP connections
    httpAcceptQueueSize int 1200 Maximum size for the TCP accept queue

    Nested elements
    • paramsKeyValue*
    • argsStringValue*
    • envOverrideKeyValue*
    • envAppendKeyValue*
    • envPrependKeyValue*
    Type Wrapper Description
    KeyValue* params Various advanced process params:
    • stopTimeout: Time, in milliseconds, to wait for the process to stop before being killed
    • startTimeout: Time, in milliseconds, to wait for the process to start before being killed
    • startDelay: Time, in milliseconds, to delay the start of this process after the others
    • killIfNotResponding: (true/false) Whether the process should be killed if it fails to answer pings
    StringValue* args Command line arguments. If empty, default arguments are used.

    List of possible params for java processes:
    • -Dcv.showServletStacks=false : do not print java stacks when web app fails

    For linux:
    • Overriding ulimits -Dulimit.NAME=XXXX
      with NAME one of 'core', 'cpu', 'data', 'fsize', 'nofile', 'stack', 'as', 'rss', 'nproc', 'memlock', 'locks', 'sigpending', 'msgqueue', 'nice', 'rtprio', 'rttime'

      See linux ulimits doc for precise documentation

      XXXX is a number expressed in bytes for sizes
    • Overriding malloc options -DMALLOPT_NAME=YYYY
      with MALLOPT_NAME one of 'mmap_threshold', 'mmap_max', 'mxfast', 'top_pad', 'trim_threshold'
      YYYY is a number expressed in bytes
      See linux mallopt documentation for more details.
    KeyValue* envOverride Environment variables to set/replace for this process
    KeyValue* envAppend Additional content to append at the end of environment variables (for PATH-like variables)
    KeyValue* envPrepend Additional content to prepend at the beginning of environment variables (for PATH-like variables)
    XML representation
    <ProcessInternalConfig name="" hostname="" installName="" runtime="" exaBinary="" httpMinThreads="5" httpMaxThreads="500" httpMaxStopTime="100" httpMaxGETQuerySize="32768" httpMaxFormPOSTQuerySize="10000000" httpMaxIdleTimeMS="86400000" httpAcceptorThreads="1" httpAcceptQueueSize="1200" ></ProcessInternalConfig>
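    A sketch of the wrapped nested elements. It assumes that each wrapper (params, args, envOverride) encloses its children, that KeyValue takes key and value attributes, and that StringValue takes a value attribute; none of this is spelled out in this section. The process name, environment variable and numeric values are hypothetical.

    ```xml
    <ProcessInternalConfig name="exampleprocess" runtime="java">
      <params>
        <!-- Wait up to 60000 ms for the process to stop before killing it -->
        <KeyValue key="stopTimeout" value="60000" />
        <KeyValue key="killIfNotResponding" value="true" />
      </params>
      <args>
        <!-- Do not print java stacks when a web app fails -->
        <StringValue value="-Dcv.showServletStacks=false" />
        <!-- Linux only: override the 'nofile' ulimit -->
        <StringValue value="-Dulimit.nofile=65536" />
      </args>
      <envOverride>
        <!-- Hypothetical environment variable set for this process -->
        <KeyValue key="EXAMPLE_VAR" value="1" />
      </envOverride>
    </ProcessInternalConfig>
    ```

    Since default arguments are only used when args is empty, a real configuration may need to repeat the default arguments alongside any additions.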
  • SearchInternalParameters com.exalead.mercury.mami.deploy.v10.SearchInternalParameters
    Search internal parameters to tune performance. Warning: Experts only, this can have a huge performance impact.
    Parent objects
    • CloudviewDeploymentInternalConfig
    Attributes
    • fullHitsContextMaxLifeSpan int
    • rankedSearchFullHitsRecyclingMaxSize int
    • fullHitMetasRecyclingMaxSize int
    • messagesAllocatorMaxSizeMB int
    • messagesAllocatorMaxLifeSpan int
    • workerThreadMaxQueries int
    • runnerMaxLifeSpan int
    • runnerMaxQueries int
    • runnerKeepAtLeastWhenIdle int
    • runnerKeepAtLeastWhenInBurst int
    • runnerDestroyIdleAfter int
    Name Type Default value Description
    fullHitsContextMaxLifeSpan int 1000 When fullhits and their associated metas are released, they are put back into a memory pool for reuse. A full free is performed each 'fullHitsContextMaxLifeSpan' queries.
    rankedSearchFullHitsRecyclingMaxSize int 500 When fullhits are released, they are put back into a memory pool for reuse. A full free is performed once the pool has allocated more than 'rankedSearchFullHitsRecyclingMaxSize' fullhits.
    fullHitMetasRecyclingMaxSize int 50000 When fullhit metas are released, they are put back into a memory pool for reuse. A full free is performed once the pool has allocated more than 'fullHitMetasRecyclingMaxSize' metas.
    messagesAllocatorMaxSizeMB int 2000 A full reinitialization of the FLEA messages allocator will be performed once the pool of FLEA messages has allocated more than 'messagesAllocatorMaxSizeMB' bytes.
    messagesAllocatorMaxLifeSpan int 1000 A full reinitialization of the FLEA messages allocator will be performed once 'messagesAllocatorMaxLifeSpan' queries have been done.
    workerThreadMaxQueries int 100 A worker (query container at slice level) is entirely destroyed once a worker thread has served 'workerThreadMaxQueries' queries.
    runnerMaxLifeSpan int 300 A runner (query container at searchserver level) is entirely destroyed once a runner has served queries during 'runnerMaxLifeSpan' seconds.
    runnerMaxQueries int 1000 A runner is entirely destroyed once a runner has served 'runnerMaxQueries' queries.
    runnerKeepAtLeastWhenIdle int 0 Keep at least 'runnerKeepAtLeastWhenIdle' runner(s) ready to serve queries even without traffic.
    runnerKeepAtLeastWhenInBurst int 5 During a peak of activity, keeps at least 'runnerKeepAtLeastWhenInBurst' live runners, regardless of whether 'runnerMaxQueries' or 'runnerMaxLifeSpan' has been reached.
    runnerDestroyIdleAfter int 60 Destroy idle runners after 'runnerDestroyIdleAfter' seconds of inactivity.

    XML representation
    <SearchInternalParameters fullHitsContextMaxLifeSpan="1000" rankedSearchFullHitsRecyclingMaxSize="500" fullHitMetasRecyclingMaxSize="50000" messagesAllocatorMaxSizeMB="2000" messagesAllocatorMaxLifeSpan="1000" workerThreadMaxQueries="100" runnerMaxLifeSpan="300" runnerMaxQueries="1000" runnerKeepAtLeastWhenIdle="0" runnerKeepAtLeastWhenInBurst="5" runnerDestroyIdleAfter="60" ></SearchInternalParameters>
  • ServerCiphers exa.bee.deploy.v10.ServerCiphers
    Defines the cipher suite by a sequence of include and exclude statements.
    When no cipher suite is included, all supported cipher suites are accepted.
    Nested elements
    • Exclude*
    • Include*
    Type Wrapper Description
    Exclude* - Names of the ciphers that cannot be used for HTTPS requests.
    Include* - Names of the ciphers that can be used for HTTPS requests. If empty, all ciphers can be used.
    XML representation
    <ServerCiphers ></ServerCiphers>
  • Exclude exa.bee.deploy.v10.Exclude
    Name of a cipher suite that cannot be used for HTTPS requests.
    Parent objects
    • ServerCiphers
    Attributes
    • name string
    Name Type Default value Description
    name string -

    XML representation
    <Exclude name="" ></Exclude>
  • Include exa.bee.deploy.v10.Include
    Name of a cipher suite that can be used for HTTPS requests.
    Parent objects
    • ServerCiphers
    Attributes
    • name string
    Name Type Default value Description
    name string -

    XML representation
    <Include name="" ></Include>
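    A minimal sketch of a restrictive cipher configuration. The suite names are standard JSSE cipher suite names chosen for illustration; whether a given suite is available depends on the Java runtime in use.

    ```xml
    <ServerCiphers>
      <!-- With at least one Include, only the listed suites are accepted -->
      <Include name="TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256" />
      <Include name="TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384" />
    </ServerCiphers>
    ```

    Conversely, a ServerCiphers element containing only Exclude entries would keep every supported suite except the excluded ones, since all suites are accepted when none is included.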
  • ServerCryptographicProtocols exa.bee.deploy.v10.ServerCryptographicProtocols
    Sets the protocols that will be supported by servers (SSL, TLS, ...; see javax.net.ssl.SSLEngine).
    Nested elements
    • ExcludeProtocol*
    • IncludeProtocol*
    Type Wrapper Description
    ExcludeProtocol* - Name of a cryptographic protocol (as defined in javax.net.ssl.SSLEngine) to exclude
    IncludeProtocol* - Name of a cryptographic protocol (as defined in javax.net.ssl.SSLEngine) to include
    XML representation
    <ServerCryptographicProtocols ></ServerCryptographicProtocols>
  • ExcludeProtocol exa.bee.deploy.v10.ExcludeProtocol
    Name of a cryptographic protocol (as defined in javax.net.ssl.SSLEngine) to exclude
    Parent objects
    • ServerCryptographicProtocols
    Attributes
    • name string
    Name Type Default value Description
    name string -

    XML representation
    <ExcludeProtocol name="" ></ExcludeProtocol>
  • IncludeProtocol exa.bee.deploy.v10.IncludeProtocol
    Name of a cryptographic protocol (as defined in javax.net.ssl.SSLEngine) to include
    Parent objects
    • ServerCryptographicProtocols
    Attributes
    • name string
    Name Type Default value Description
    name string -

    XML representation
    <IncludeProtocol name="" ></IncludeProtocol>
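    A minimal sketch of a protocol restriction. "SSLv3" and "TLSv1.2" are standard JSSE protocol names used for illustration; how include and exclude entries combine when both are present is not detailed in this section.

    ```xml
    <ServerCryptographicProtocols>
      <!-- Explicitly refuse the legacy SSLv3 protocol -->
      <ExcludeProtocol name="SSLv3" />
      <!-- Explicitly allow TLS 1.2 -->
      <IncludeProtocol name="TLSv1.2" />
    </ServerCryptographicProtocols>
    ```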
  • TickInternalParameters com.exalead.mercury.mami.deploy.v10.TickInternalParameters
    Tick message parameters to tune performance. Warning: Experts only, this can have a huge performance impact.
    Parent objects
    • CloudviewDeploymentInternalConfig
    Attributes
    • bufferRecyclingMaxSize int
    • bufferRecyclingMaxLifeSpan int
    Name Type Default value Description
    bufferRecyclingMaxSize int 1000 When a tick message is released, it is put back into a memory pool for reuse. The attached payload is kept if its size is smaller than bufferRecyclingMaxSize, and freed if larger.
    bufferRecyclingMaxLifeSpan int 64 To avoid a tick message pool full of messages with a large payload (==bufferRecyclingMaxSize), the payload is released every time the message has been used bufferRecyclingMaxLifeSpan times.

    XML representation
    <TickInternalParameters bufferRecyclingMaxSize="1000" bufferRecyclingMaxLifeSpan="64" ></TickInternalParameters>
  • ProcessConfigList exa.bee.deploy.v10.ProcessConfigList
    Alternative way for defining a service in a cloudview product.
    Attributes
    • version long
    Name Type Default value Description
    version long - -

    Nested elements
    • paramsKeyValue*
    • ProcessConfig*
    Type Wrapper Description
    KeyValue* params Parameters added to the global beegistry
    ProcessConfig* - -
    XML representation
    <ProcessConfigList version="" ></ProcessConfigList>
  • SimpleJavaProcessConfig exa.bee.deploy.v10.SimpleJavaProcessConfig
    Simple java process definition which only exposes a set of web applications.
    Parent objects
    • ProcessConfigList
    Attributes
    • name string
    Name Type Default value Description
    name string - Process name. Referenced by the attribute "process" of a "Process" role.

    Nested elements
    • JavaWebApp*
    Type Wrapper Description
    JavaWebApp* - -
    XML representation
    <SimpleJavaProcessConfig name="" ></SimpleJavaProcessConfig>
  • JavaWebApp exa.bee.deploy.v10.JavaWebApp
    A Java Web application, to be embedded in a custom process.
    A Webapp can optionally expose some services which will be accessible by other parts of the CloudView instance through the Directory mechanism
    Parent objects
    • ComplexJavaProcessConfig
    • JavaServer
    • SimpleJavaProcessConfig
    Attributes
    • name string
    • url string
    • webAppLoaderClassId string
    • extraClassPath string
    • isGlobal boolean
    Name Type Default value Description
    name string - Name of this webapp
    url string - Webapp location (WAR file or directory)
    webAppLoaderClassId string - Custom code for overloading the default webapp classloader
    extraClassPath string - Elements to add to the webapp class path, in addition to the default CloudView libraries. extraClassPath is a comma-separated list of elements.
    Each element must be one of:
    • extrajava:folder/*.jar : Adds all jar files in $KIT/extrajava/folder and $DATA/extrajava/folder
    • extrajava:folder/**/*.jar : Recursively adds all jar files in $KIT/extrajava/folder and $DATA/extrajava/folder
    • Absolute path to a JAR file
    • Absolute path to a class folder
    isGlobal boolean true Global applications are deployed under the context path "/name".
    Non-global applications are deployed under "/name/hostname-installname/processname".
    Note that global applications cannot expose any service.

    Nested elements
    • initParamsKeyValue*
    • JavaWebAppService*
    Type Wrapper Description
    KeyValue* initParams Webapp init parameters
    JavaWebAppService* - -
    XML representation
    <JavaWebApp name="" url="" webAppLoaderClassId="" extraClassPath="" isGlobal="true" ></JavaWebApp>
  • JavaWebAppService exa.bee.deploy.v10.JavaWebAppService
    Describe the hosted Java WebApp service.
    Parent objects
    • JavaWebApp
    Attributes
    • name string
    • schemaName string
    Name Type Default value Description
    name string - Name of the service (relative path) in the web app.
    schemaName string - Optional schema name when published in the directory

    XML representation
    <JavaWebAppService name="" schemaName="" ></JavaWebAppService>
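    A sketch of a simple Java process exposing one non-global web application with a service. All names, paths and the init parameter are hypothetical; the form of the url value and the initParams layout (a wrapper enclosing KeyValue elements with key and value attributes) are assumptions.

    ```xml
    <ProcessConfigList>
      <SimpleJavaProcessConfig name="exampleprocess">
        <!-- isGlobal="false" because global applications cannot expose services -->
        <JavaWebApp name="exampleapp" url="/path/to/exampleapp.war"
                    extraClassPath="/opt/example/lib/example.jar,/opt/example/classes"
                    isGlobal="false">
          <initParams>
            <KeyValue key="exampleParam" value="exampleValue" />
          </initParams>
          <!-- Service path relative to the web app -->
          <JavaWebAppService name="exampleservice" />
        </JavaWebApp>
      </SimpleJavaProcessConfig>
    </ProcessConfigList>
    ```

    Following the context path pattern given above, this application would be deployed under "/exampleapp/hostname-installname/exampleprocess" on each host whose "Process" role references "exampleprocess".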
  • ComplexJavaProcessConfig exa.bee.deploy.v10.ComplexJavaProcessConfig
    Base class for describing processes which can be hosted in a cloudview installation.
    Parent objects
    • ProcessConfigList
    Attributes
    • name string
    Name Type Default value Description
    name string - Process name. Referenced by the attribute "process" of a "Process" role.

    Nested elements
    • paramsKeyValue*
    • ConfigFile*
    • JavaServer*
    • JavaService*
    • JavaWebApp*
    Type Wrapper Description
    KeyValue* params Parameters added to the process local configuration registry.
    ConfigFile* - -
    JavaServer* - -
    JavaService* - -
    JavaWebApp* - -
    XML representation
    <ComplexJavaProcessConfig name="" ></ComplexJavaProcessConfig>
  • ConfigFile exa.bee.deploy.v10.ConfigFile
    No documentation for this element.
    Parent objects
    • ComplexExaProcessConfig
    • ComplexJavaProcessConfig
    Attributes
    • url string
    • extraClassPath string
    Name Type Default value Description
    url string - -
    extraClassPath string - Extra class path elements to add to the context class loader when loading this configuration file. Follows the JavaWebApp.extraClassPath syntax.
    This class path will be used to load the Java objects that might be contained in this configuration file

    XML representation
    <ConfigFile url="" extraClassPath="" ></ConfigFile>
  • JavaServer exa.bee.deploy.v10.JavaServer
    A named server containing services and/or web apps.
    Parent objects
    • ComplexJavaProcessConfig
    Attributes
    • name string
    • protocol string
    Name Type Default value Description
    name string - Server name
    protocol string - Server protocol

    Nested elements
    • JavaService*
    • JavaWebApp*
    Type Wrapper Description
    JavaService* - List of services deployed in this server
    JavaWebApp* - List of web apps deployed in this server
    XML representation
    <JavaServer name="" protocol="" ></JavaServer>
  • JavaService exa.bee.deploy.v10.JavaService
    No documentation for this element.
    Parent objects
    • ComplexJavaProcessConfig
    • JavaServer
    Attributes
    • name string
    • schemaName string
    • beeKey string
    • type string
    • publicName string
    • extraClassPath string
    Name Type Default value Description
    name string - Service name. The exposed full name is automatically prefixed.
    schemaName string - Optional schema name used when published in the directory
    beeKey string - service implementation
    type string - service type
    publicName string - Optional public name. If null, name + host + processName is used
    extraClassPath string - Extra class path elements to add to the context class loader when performing queries on this service. Follows the JavaWebApp.extraClassPath syntax.

    XML representation
    <JavaService name="" schemaName="" beeKey="" type="" publicName="" extraClassPath="" ></JavaService>
  • SimpleDotNetProcessConfig exa.bee.deploy.v10.SimpleDotNetProcessConfig
    Describe the hosted .NET process configuration.
    Parent objects
    • ProcessConfigList
    Attributes
    • name string
    Name Type Default value Description
    name string - Process name. Referenced by the attribute "process" of a "Process" role.

    XML representation
    <SimpleDotNetProcessConfig name="" ></SimpleDotNetProcessConfig>
  • ComplexExaProcessConfig exa.bee.deploy.v10.ComplexExaProcessConfig
    Base class for describing processes which can be hosted in a cloudview installation.
    Parent objects
    • ProcessConfigList
    Attributes
    • name string
    Name Type Default value Description
    name string - Process name. Referenced by the attribute "process" of a "Process" role.

    Nested elements
    • paramsKeyValue*
    • ConfigFile*
    • ExaServer*
    • ExaService*
    Type Wrapper Description
    KeyValue* params Parameters added to the process local configuration registry.
    ConfigFile* - -
    ExaServer* - -
    ExaService* - -
    XML representation
    <ComplexExaProcessConfig name="" ></ComplexExaProcessConfig>
  • ExaServer exa.bee.deploy.v10.ExaServer
    No documentation for this element.
    Parent objects
    • ComplexExaProcessConfig
    Attributes
    • name string
    • protocol string
    Name Type Default value Description
    name string - -
    protocol string - -

    Nested elements
    • ExaService*
    Type Wrapper Description
    ExaService* - -
    XML representation
    <ExaServer name="" protocol="" ></ExaServer>
  • ExaService exa.bee.deploy.v10.ExaService
    No documentation for this element.
    Parent objects
    • ComplexExaProcessConfig
    • ExaServer
    Attributes
    • name string
    • schemaName string
    • beeKey string
    • type string
    • publicName string
    Name Type Default value Description
    name string - -
    schemaName string - -
    beeKey string - -
    type string - -
    publicName string - Optional public name. If null, name + host + processName is used

    XML representation
    <ExaService name="" schemaName="" beeKey="" type="" publicName="" ></ExaService>
  • LinguisticConfig com.exalead.linguistic.v10.LinguisticConfig
    No documentation for this element.
    Attributes
    • version long
    Name Type Default value Description
    version long - -

    Nested elements
    • TokenizationConfig*
    Type Wrapper Description
    TokenizationConfig* -
    XML representation
    <LinguisticConfig version="" ></LinguisticConfig>
  • TokenizationConfig com.exalead.linguistic.v10.TokenizationConfig
    How to tokenize documents, i.e. split the input strings into tokens. Tokens are usually words.
    Parent objects
    • LinguisticConfig
    Attributes
    • name string
    Name Type Default value Description
    name string - The tokenization configuration's name.

    Nested elements
    • FormIndexingConfig
    • NormalizerConfig
    • Tokenizer*
    Type Wrapper Description
    FormIndexingConfig - How to index different word forms (exact, normalized, lemmatized) for each language.
    NormalizerConfig - How to normalize words.
    Tokenizer* - List of enabled tokenizers. Tokenizer choice depends on the document's language.
    XML representation
    <TokenizationConfig name="" ></TokenizationConfig>
  • FormIndexingConfig com.exalead.linguistic.v10.FormIndexingConfig
    No documentation for this element.
    Nested elements
    • Form*
    Type Wrapper Description
    Form* -
    XML representation
    <FormIndexingConfig ></FormIndexingConfig>
  • Form com.exalead.linguistic.v10.Form
    No documentation for this element.
    Parent objects
    • FormIndexingConfig
    Attributes
    • tag string
    • indexKind short
    • trustLevel int
    Name Type Default value Description
    tag string -
    indexKind short -
    trustLevel int 100

    XML representation
    <Form tag="" indexKind="" trustLevel="100" ></Form>
  • NormalizerConfig com.exalead.linguistic.v10.NormalizerConfig
    No documentation for this element.
    Parent objects
    • TokenizationConfig
    Attributes
    • transliteration boolean
    • useNormalizationExceptions boolean
    • normalizationExceptionsDir string
    • disableBasisTechNormalizerForLanguages string
    • enableBasisTechNormalizerForLanguages string
    • useGermanExceptions boolean
    Name Type Default value Description
    transliteration boolean True When normalizing, convert some characters to their latin equivalent
    useNormalizationExceptions boolean True Uses the normalization exceptions, even if the resource path is not set (in that case, the default resource is used).
    normalizationExceptionsDir string - The path to the normalization-exceptions resource directory.
    Use the default resource if no path is set.
    disableBasisTechNormalizerForLanguages string - Removes BasisTech's processor "Encoding and Character Normalizer" for these languages.
    enableBasisTechNormalizerForLanguages string ja Adds BasisTech's processor "Encoding and Character Normalizer" only for these languages.
    useGermanExceptions boolean False

    Nested elements
    • NormalizerCharOverride*
    • NormalizerIndexLower*
    Type Wrapper Description
    NormalizerCharOverride* -
    NormalizerIndexLower* -
    XML representation
    <NormalizerConfig transliteration="true" useNormalizationExceptions="true" normalizationExceptionsDir="" enableBasisTechNormalizerForLanguages="ja" useGermanExceptions="false" ></NormalizerConfig>
  • NormalizerCharOverride com.exalead.linguistic.v10.NormalizerCharOverride
    Introduces custom normalization rule
    Parent objects
    • NormalizerConfig
    Attributes
    • chr string
    • replace string
    • language iso code
    Name Type Default value Description
    chr string -
    replace string -
    language iso code -

    XML representation
    <NormalizerCharOverride chr="" replace="" language="en" ></NormalizerCharOverride>
  • NormalizerIndexLower com.exalead.linguistic.v10.NormalizerIndexLower
    The list of words to index in lowercase instead of normalized form.
    Parent objects
    • NormalizerConfig
    Attributes
    • word string
    • language iso code
    Name Type Default value Description
    word string -
    language iso code -

    XML representation
    <NormalizerIndexLower word="" language="en" ></NormalizerIndexLower>
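    A sketch of a normalizer with one custom character rule and one lowercase-only word. The configuration name, the character rule and the word are hypothetical examples, and other nested elements of TokenizationConfig are omitted.

    ```xml
    <TokenizationConfig name="exampleTokenization">
      <NormalizerConfig transliteration="true" useNormalizationExceptions="true">
        <!-- Hypothetical custom rule: replace "ß" with "ss" for German -->
        <NormalizerCharOverride chr="ß" replace="ss" language="de" />
        <!-- Index this word in lowercase instead of its normalized form -->
        <NormalizerIndexLower word="résumé" language="en" />
      </NormalizerConfig>
    </TokenizationConfig>
    ```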
  • StandardTokenizer com.exalead.linguistic.v10.StandardTokenizer
    Sets a specific configuration for the standard tokenizer. If this object is not in the Semantic processors list, standard tokenization is used.
    Parent objects
    • TokenizationConfig
    Attributes
    • language iso code
    • concatAlphaNum boolean
    • concatNumAlpha boolean
    Name Type Default value Description
    language iso code - The language handled by this tokenizer. This can be null for all unhandled languages.
    concatAlphaNum boolean True Concat alpha and num chars
    concatNumAlpha boolean True Concat num and alpha chars

    Nested elements
    • charOverridesStandardTokenizerOverride*
    • patternOverridesStandardTokenizerOverride*
    • TokenizerPlugin*
    Type Wrapper Description
    StandardTokenizerOverride* charOverrides Set of rules that allows the forcing of a type for specific character or for a character set defined by a unicode property.
    StandardTokenizerOverride* patternOverrides Set of rules that allows the forcing of specific regexp type.
    TokenizerPlugin* -
    XML representation
    <StandardTokenizer language="en" concatAlphaNum="true" concatNumAlpha="true" ></StandardTokenizer>
  • BasisTechTokenizationCompatibility com.exalead.linguistic.v10.BasisTechTokenizationCompatibility
    No documentation for this element.
    Parent objects
    • StandardTokenizer
    Attributes
    • languages string
    Name Type Default value Description
    languages string en,de,fr,sv,es,it,nl,pt,no,fi,da,bg,ca,cs,el,hr,hu,pl,ru,sk,sl,sr Postprocesses BasisTech's analyzer output in order to generate a tokenization as close as possible to that of this standard tokenizer for these languages.

    XML representation
    <BasisTechTokenizationCompatibility languages="en,de,fr,sv,es,it,nl,pt,no,fi,da,bg,ca,cs,el,hr,hu,pl,ru,sk,sl,sr" ></BasisTechTokenizationCompatibility>
  • GermanDisagglutiner com.exalead.linguistic.v10.GermanDisagglutiner
    This class performs disagglutinations for German.
    Use with a Standard tokenizer.
    Parent objects
    • StandardTokenizer
    Attributes
    • resourceDir string
    Name Type Default value Description
    resourceDir string - URL for the resource (data://, file:// or resource://). If null, the product's standard resource is used.

    XML representation
    <GermanDisagglutiner resourceDir="" ></GermanDisagglutiner>
  • DutchDisagglutiner com.exalead.linguistic.v10.DutchDisagglutiner
    This class performs disagglutinations for Dutch.
    Use with a Standard tokenizer.
    Parent objects
    • StandardTokenizer
    Attributes
    • resourceDir string
    Name Type Default value Description
    resourceDir string - URL for the resource (data://, file:// or resource://). If null, the product's standard resource is used.

    XML representation
    <DutchDisagglutiner resourceDir="" ></DutchDisagglutiner>
  • NorwegianDisagglutiner com.exalead.linguistic.v10.NorwegianDisagglutiner
    This class performs disagglutinations for Norwegian.
    Use with a Standard tokenizer.
    Parent objects
    • StandardTokenizer
    Attributes
    • resourceDir string
    Name Type Default value Description
    resourceDir string - URL for the resource (data://, file:// or resource://). If null, the product's standard resource is used.

    XML representation
    <NorwegianDisagglutiner resourceDir="" ></NorwegianDisagglutiner>
  • ChineseWordFinder com.exalead.linguistic.v10.ChineseWordFinder
    This class performs word detection for Chinese.
    Use with a Standard tokenizer.
    Parent objects
    • StandardTokenizer
    Attributes
    • resourceDir string
    • addSimplified boolean
    • addNGrams boolean
    • minNGramLength int
    • maxNGramLength int
    Name Type Default value Description
    resourceDir string - URL of the resource (data://, file:// or resource://). If null, the product's standard resource is used.
    addSimplified boolean False Annotates each token with its simplified Chinese transliteration, when applicable.
    addNGrams boolean False Annotates with NGrams
    minNGramLength int 1 Minimum length of NGrams (1: monograms, 2: bigrams...)
    maxNGramLength int 2 Maximum length of NGrams (1: monograms, 2: bigrams...)

    XML representation
    <ChineseWordFinder resourceDir="" addSimplified="false" addNGrams="false" minNGramLength="1" maxNGramLength="2" ></ChineseWordFinder>
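    Example (a minimal sketch, not taken from a shipped configuration): enabling monogram and bigram annotation for Chinese. The enclosing StandardTokenizer element is shown without its own attributes, which are documented in its own entry; only the ChineseWordFinder attributes come from the table above.

    ```xml
    <StandardTokenizer>
      <!-- addNGrams turns the annotation on; min/max repeat the documented defaults (1 and 2) -->
      <ChineseWordFinder addNGrams="true" minNGramLength="1" maxNGramLength="2" ></ChineseWordFinder>
    </StandardTokenizer>
    ```

    Leaving resourceDir unset keeps the product's standard resource.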
  • JapaneseWordFinder com.exalead.linguistic.v10.JapaneseWordFinder
    Performs word detection for Japanese.
    Use with a Standard tokenizer.
    Parent objects
    • StandardTokenizer
    Attributes
    • resourceDir string
    • processRomaji boolean
    • addRomaji boolean
    • addCharset boolean
    • addNGrams boolean
    • minNGramLength int
    • maxNGramLength int
    • addTransliterations boolean
    • addPartOfSpeech boolean
    Name Type Default value Description
    resourceDir string - URL of the resource (data://, file:// or resource://). If null, the product's standard resource is used.
    processRomaji boolean False Processes romaji tokens
    addRomaji boolean False Annotates each token with its romaji transliteration, when applicable.
    addCharset boolean False Annotates with charset
    addNGrams boolean False Annotates with NGrams
    minNGramLength int 1 Minimum length of NGrams (1: monograms, 2: bigrams...)
    maxNGramLength int 2 Maximum length of NGrams (1: monograms, 2: bigrams...)
    addTransliterations boolean False Annotates with transliterations (kanji, katakana, hiragana, romaji).
    addPartOfSpeech boolean False Annotates with part of speech.

    XML representation
    <JapaneseWordFinder resourceDir="" processRomaji="false" addRomaji="false" addCharset="false" addNGrams="false" minNGramLength="1" maxNGramLength="2" addTransliterations="false" addPartOfSpeech="false" ></JapaneseWordFinder>
  • StandardTokenizerOverride com.exalead.linguistic.v10.StandardTokenizerOverride
    No documentation for this element.
    Parent objects
    • charOverrides StandardTokenizer
    • patternOverrides StandardTokenizer
    Attributes
    • type enum(token, separator, sentence, ignore, punct)
    • toOverride string
    Name Type Default value Description
    type enum(token, separator, sentence, ignore, punct) token Possible values: "token", "separator", "sentence" (will break related terms extraction, named entities, ...), "ignore" or "punct" (treated as a separator and also as an entity separator for semantic extractors).
    toOverride string -

    XML representation
    <StandardTokenizerOverride type="token" toOverride="" ></StandardTokenizerOverride>
  • StandardTokenizerPatternOverride com.exalead.linguistic.v10.StandardTokenizerPatternOverride
    No documentation for this element.
    Parent objects
    • charOverrides StandardTokenizer
    • patternOverrides StandardTokenizer
    Attributes
    • type enum(token, separator, sentence, ignore, punct)
    • toOverride string
    • separated boolean
    Name Type Default value Description
    type enum(token, separator, sentence, ignore, punct) token Possible values: "token", "separator", "sentence" (will break related terms extraction, named entities, ...), "ignore" or "punct" (treated as a separator and also as an entity separator for semantic extractors).
    toOverride string -
    separated boolean True Pattern must be separated to match.

    XML representation
    <StandardTokenizerPatternOverride type="token" toOverride="" separated="true" ></StandardTokenizerPatternOverride>
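    Example (a sketch under assumptions): the two override elements nested under the charOverrides and patternOverrides wrappers listed in "Parent objects". Both elements list both wrappers as parents, so the pairing shown here is an assumption. The toOverride values ("_" and "C++") are hypothetical; this reference does not document the expected syntax of toOverride, so verify it before use.

    ```xml
    <StandardTokenizer>
      <charOverrides>
        <!-- hypothetical value: treat the underscore as a token character -->
        <StandardTokenizerOverride type="token" toOverride="_" ></StandardTokenizerOverride>
      </charOverrides>
      <patternOverrides>
        <!-- hypothetical value: keep "C++" as a token; separated="true" is the documented default -->
        <StandardTokenizerPatternOverride type="token" toOverride="C++" separated="true" ></StandardTokenizerPatternOverride>
      </patternOverrides>
    </StandardTokenizer>
    ```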
  • ChineseTokenizer com.exalead.linguistic.v10.ChineseTokenizer
    When set in the configuration, tokenizes Chinese documents.
    Parent objects
    • TokenizationConfig
    Attributes
    • addSimplified boolean
    • resourceDir string
    Name Type Default value Description
    addSimplified boolean False Annotates each token with its simplified Chinese transliteration, when applicable.
    resourceDir string - URL of the resource (data://, file:// or resource://). If null, the product's standard resource is used.

    XML representation
    <ChineseTokenizer addSimplified="false" resourceDir="" ></ChineseTokenizer>
  • JapaneseTokenizer com.exalead.linguistic.v10.JapaneseTokenizer
    When set in the configuration, tokenizes Japanese documents.
    Parent objects
    • TokenizationConfig
    Attributes
    • addRomanji boolean
    • addMorphology boolean
    • resourceDir string
    • favor enum(precision, recall)
    Name Type Default value Description
    addRomanji boolean False Annotates each token with its romaji transliteration, when applicable.
    addMorphology boolean False Annotates each token with its part of speech, when applicable.
    resourceDir string - URL of the resource (data://, file:// or resource://). If null, the product's standard resource is used.
    favor enum(precision, recall) recall Favoring precision decreases the number of irrelevant documents returned by queries; favoring recall decreases the number of missing documents.

    XML representation
    <JapaneseTokenizer addRomanji="false" addMorphology="false" resourceDir="" favor="recall" ></JapaneseTokenizer>
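    Example (a minimal sketch): a Japanese tokenizer tuned for precision, with romaji annotation enabled. The attribute is spelled addRomanji, as in the table above. The enclosing TokenizationConfig is shown without its own attributes, which are documented in its own entry.

    ```xml
    <TokenizationConfig>
      <!-- favor="precision" replaces the default "recall" -->
      <JapaneseTokenizer addRomanji="true" addMorphology="false" favor="precision" ></JapaneseTokenizer>
    </TokenizationConfig>
    ```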
  • BasisTechTokenizer com.exalead.linguistic.v10.BasisTechTokenizer
    No documentation for this element.
    Parent objects
    • TokenizationConfig
    Attributes
    • language iso code
    Name Type Default value Description
    language iso code - The language handled by this tokenizer. This can be null for all unhandled languages.

    XML representation
    <BasisTechTokenizer language="en" ></BasisTechTokenizer>
  • CustomTokenizer com.exalead.linguistic.v10.CustomTokenizer
    No documentation for this element.
    Parent objects
    • TokenizationConfig
    Attributes
    • classId string
    Name Type Default value Description
    classId string - The class identifier of the custom processor; this class must implement CustomTokenizerInterface.

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* -
    XML representation
    <CustomTokenizer classId="" ></CustomTokenizer>
  • DictionaryConfig com.exalead.dictionary.v10.DictionaryConfig
    Defines what a dictionary must contain.
    Attributes
    • name string
    Name Type Default value Description
    name string -

    Nested elements
    • BuildPolicies
    • CompactPolicies
    • QueryConfig*
    • ResourceConfig*
    Type Wrapper Description
    BuildPolicies - The build policies used to trigger a build of a dictionary.
    CompactPolicies - The compact policies used to trigger dictionary compaction.
    QueryConfig* -
    ResourceConfig* - Resources that should be built for this dictionary, along with their configuration.
    XML representation
    <DictionaryConfig name="" ></DictionaryConfig>
  • BuildPolicies com.exalead.dictionary.v10.BuildPolicies
    No documentation for this element.
    Parent objects
    • DictionaryConfig
    Attributes
    • disjunctives boolean
    Name Type Default value Description
    disjunctives boolean True

    Nested elements
    • BuildPolicy*
    Type Wrapper Description
    BuildPolicy* - Specifies the build policies.
    XML representation
    <BuildPolicies disjunctives="true" ></BuildPolicies>
  • CompactCountBuildPolicy com.exalead.dictionary.v10.CompactCountBuildPolicy
    Triggers a build when a given number of compacts have been done
    Parent objects
    • BuildPolicies
    Attributes
    • countThreshold int
    Name Type Default value Description
    countThreshold int 1 Number of compacts that will trigger a build.

    XML representation
    <CompactCountBuildPolicy countThreshold="1" ></CompactCountBuildPolicy>
  • PeriodicBuildPolicy com.exalead.dictionary.v10.PeriodicBuildPolicy
    Triggers a build after 'delayInS' seconds have elapsed.
    Parent objects
    • BuildPolicies
    Attributes
    • delayInS long
    Name Type Default value Description
    delayInS long 600 Time in seconds between two builds.

    XML representation
    <PeriodicBuildPolicy delayInS="600" ></PeriodicBuildPolicy>
  • CompactPolicies com.exalead.dictionary.v10.CompactPolicies
    No documentation for this element.
    Parent objects
    • DictionaryConfig
    Attributes
    • disjunctives boolean
    Name Type Default value Description
    disjunctives boolean True

    Nested elements
    • CompactFilter*
    • CompactPolicy*
    Type Wrapper Description
    CompactFilter* - Filtering rules for data imported from the stream to be compacted, for example FrequencyCompactFilter.
    CompactPolicy* - Specifies the compact policies.
    XML representation
    <CompactPolicies disjunctives="true" ></CompactPolicies>
  • FrequencyCompactFilter com.exalead.dictionary.v10.FrequencyCompactFilter
    Filters resources during compaction according to the given frequency and language.
    Parent objects
    • CompactPolicies
    Attributes
    • minFrequency long
    • language iso code
    Name Type Default value Description
    minFrequency long 1 Discards entries with frequency below this threshold
    language iso code - Language represented by its ISO 639-1 code, e.g. fr, en.

    XML representation
    <FrequencyCompactFilter minFrequency="1" language="en" ></FrequencyCompactFilter>
  • ImportCountCompactPolicy com.exalead.dictionary.v10.ImportCountCompactPolicy
    Triggers a compact when a given number of imports have been done
    Parent objects
    • CompactPolicies
    Attributes
    • countThreshold int
    Name Type Default value Description
    countThreshold int 1 Number of imports that will trigger a compact.

    XML representation
    <ImportCountCompactPolicy countThreshold="1" ></ImportCountCompactPolicy>
  • PeriodicCompactPolicy com.exalead.dictionary.v10.PeriodicCompactPolicy
    Triggers a compact after 'delayInS' seconds have elapsed.
    Parent objects
    • CompactPolicies
    Attributes
    • delayInS long
    Name Type Default value Description
    delayInS long 600 Time in seconds between two compacts.

    XML representation
    <PeriodicCompactPolicy delayInS="600" ></PeriodicCompactPolicy>
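    Example (a sketch, assuming the policy and filter elements nest directly under BuildPolicies and CompactPolicies as their "Parent objects" entries indicate): a dictionary that compacts after every 10 imports, discards English entries seen fewer than 2 times, and rebuilds either hourly or after 5 compacts. The name "dict0" and all thresholds are illustrative values, not recommendations. The disjunctives attribute has no description in this reference, so it is left at its documented default.

    ```xml
    <DictionaryConfig name="dict0">
      <BuildPolicies disjunctives="true">
        <!-- 3600 seconds = 1 hour between two builds -->
        <PeriodicBuildPolicy delayInS="3600" ></PeriodicBuildPolicy>
        <CompactCountBuildPolicy countThreshold="5" ></CompactCountBuildPolicy>
      </BuildPolicies>
      <CompactPolicies disjunctives="true">
        <!-- discards English entries with a frequency below 2 -->
        <FrequencyCompactFilter minFrequency="2" language="en" ></FrequencyCompactFilter>
        <ImportCountCompactPolicy countThreshold="10" ></ImportCountCompactPolicy>
      </CompactPolicies>
    </DictionaryConfig>
    ```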
  • SpellcheckQueryConfig com.exalead.dictionary.v10.SpellcheckQueryConfig
    No documentation for this element.
    Parent objects
    • DictionaryConfig
    Attributes
    • tokenizationConfig string
    • preAllocatedPipelines int
    Name Type Default value Description
    tokenizationConfig string -
    preAllocatedPipelines int -

    Nested elements
    • blacklist BWList*
    • whitelist BWList*
    Type Wrapper Description
    BWList* blacklist
    BWList* whitelist
    XML representation
    <SpellcheckQueryConfig tokenizationConfig="tok0" preAllocatedPipelines="" ></SpellcheckQueryConfig>
  • BWList com.exalead.dictionary.v10.BWList
    No documentation for this element.
    Parent objects
    • blacklist ApproxQueryConfig
    • blacklist ExpansionQueryConfig
    • blacklist NGramFrequencyQueryConfig
    • blacklist PhoneticQueryConfig
    • blacklist QueryConfig
    • blacklist RelatedTermsQueryConfig
    • blacklist SpellcheckQueryConfig
    • blacklist WordFrequencyQueryConfig
    • whitelist ApproxQueryConfig
    • whitelist ExpansionQueryConfig
    • whitelist NGramFrequencyQueryConfig
    • whitelist PhoneticQueryConfig
    • whitelist QueryConfig
    • whitelist RelatedTermsQueryConfig
    • whitelist SpellcheckQueryConfig
    • whitelist WordFrequencyQueryConfig
    Attributes
    • resourceURL string
    • name string
    • defaultList boolean
    Name Type Default value Description
    resourceURL string -
    name string -
    defaultList boolean False Select the list to be used by default

    XML representation
    <BWList resourceURL="" name="" defaultList="false" ></BWList>
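    Example (a sketch under assumptions): a BWList placed in the blacklist wrapper of a SpellcheckQueryConfig, as the "Parent objects" list above indicates. The list name and the resourceURL value are hypothetical placeholders; this reference does not document the expected URL form.

    ```xml
    <SpellcheckQueryConfig tokenizationConfig="tok0">
      <blacklist>
        <!-- hypothetical name and URL; defaultList="true" selects this list by default -->
        <BWList resourceURL="resource:///hypothetical/spellcheck-blacklist" name="default" defaultList="true" ></BWList>
      </blacklist>
    </SpellcheckQueryConfig>
    ```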
  • RelatedTermsQueryConfig com.exalead.dictionary.v10.RelatedTermsQueryConfig
    Related-terms synthesis configuration. At query-time, the searchserver will ask the dictionary to perform the related-terms synthesis, according to their display forms and corpus frequency.
    Parent objects
    • DictionaryConfig
    Attributes
    • tokenizationConfig string
    • preAllocatedPipelines int
    Name Type Default value Description
    tokenizationConfig string -
    preAllocatedPipelines int -

    Nested elements
    • blacklist BWList*
    • whitelist BWList*
    Type Wrapper Description
    BWList* blacklist
    BWList* whitelist
    XML representation
    <RelatedTermsQueryConfig tokenizationConfig="tok0" preAllocatedPipelines="" ></RelatedTermsQueryConfig>
  • ExpansionQueryConfig com.exalead.dictionary.v10.ExpansionQueryConfig
    No documentation for this element.
    Parent objects
    • DictionaryConfig
    Attributes
    • tokenizationConfig string
    • preAllocatedPipelines int
    Name Type Default value Description
    tokenizationConfig string -
    preAllocatedPipelines int -

    Nested elements
    • blacklist BWList*
    • whitelist BWList*
    Type Wrapper Description
    BWList* blacklist
    BWList* whitelist
    XML representation
    <ExpansionQueryConfig tokenizationConfig="tok0" preAllocatedPipelines="" ></ExpansionQueryConfig>
  • WordFrequencyQueryConfig com.exalead.dictionary.v10.WordFrequencyQueryConfig
    Enables word frequency use at query-time. It can be used for things like linguistic expansion.
    Parent objects
    • DictionaryConfig
    Attributes
    • tokenizationConfig string
    • preAllocatedPipelines int
    Name Type Default value Description
    tokenizationConfig string -
    preAllocatedPipelines int -

    Nested elements
    • blacklist BWList*
    • whitelist BWList*
    Type Wrapper Description
    BWList* blacklist
    BWList* whitelist
    XML representation
    <WordFrequencyQueryConfig tokenizationConfig="tok0" preAllocatedPipelines="" ></WordFrequencyQueryConfig>
  • NGramFrequencyQueryConfig com.exalead.dictionary.v10.NGramFrequencyQueryConfig
    Enables ngram frequency use at query-time.
    Parent objects
    • DictionaryConfig
    Attributes
    • tokenizationConfig string
    • preAllocatedPipelines int
    Name Type Default value Description
    tokenizationConfig string -
    preAllocatedPipelines int -

    Nested elements
    • blacklist BWList*
    • whitelist BWList*
    Type Wrapper Description
    BWList* blacklist
    BWList* whitelist
    XML representation
    <NGramFrequencyQueryConfig tokenizationConfig="tok0" preAllocatedPipelines="" ></NGramFrequencyQueryConfig>
  • ApproxQueryConfig com.exalead.dictionary.v10.ApproxQueryConfig
    No documentation for this element.
    Parent objects
    • DictionaryConfig
    Attributes
    • tokenizationConfig string
    • preAllocatedPipelines int
    Name Type Default value Description
    tokenizationConfig string -
    preAllocatedPipelines int -

    Nested elements
    • blacklist BWList*
    • whitelist BWList*
    Type Wrapper Description
    BWList* blacklist
    BWList* whitelist
    XML representation
    <ApproxQueryConfig tokenizationConfig="tok0" preAllocatedPipelines="" ></ApproxQueryConfig>
  • PhoneticQueryConfig com.exalead.dictionary.v10.PhoneticQueryConfig
    Enables phonetic search by looking in the dictionary for words matching the phonetic form of the query. The phonetic query config requires a tokenization config. It will work only if a PhonemesResourceConfig exists, in order to build the phonetic forms of the words at build-time.
    Parent objects
    • DictionaryConfig
    Attributes
    • tokenizationConfig string
    • preAllocatedPipelines int
    Name Type Default value Description
    tokenizationConfig string -
    preAllocatedPipelines int -

    Nested elements
    • blacklist BWList*
    • whitelist BWList*
    Type Wrapper Description
    BWList* blacklist
    BWList* whitelist
    XML representation
    <PhoneticQueryConfig tokenizationConfig="tok0" preAllocatedPipelines="" ></PhoneticQueryConfig>
  • WordsResourceConfig com.exalead.dictionary.v10.WordsResourceConfig
    Specifies whether a word dictionary is stored for this dictionary.
    The word dictionary is used for relevance purposes and term expansion.
    Nested elements
    • LanguageConfig*
    Type Wrapper Description
    LanguageConfig* -
    XML representation
    <WordsResourceConfig ></WordsResourceConfig>
  • LanguageConfig com.exalead.dictionary.v10.LanguageConfig
    No documentation for this element.
    Parent objects
    • CustomResourceConfig
    • NGramsResourceConfig
    • PhonemesResourceConfig
    • RelatedTermsResourceConfig
    • ResourceConfig
    • WordsResourceConfig
    Attributes
    • language iso code
    • maxTerm long
    Name Type Default value Description
    language iso code - Language represented by its ISO 639-1 code, e.g. fr, en.
    maxTerm long 1000000 Maximum number of terms kept

    Nested elements
    • Filter*
    Type Wrapper Description
    Filter* - Filtering rules for data imported from the stream to build this resource, for example RegexpFilter, FrequencyFilter or CustomFilter.
    XML representation
    <LanguageConfig language="en" maxTerm="1000000" ></LanguageConfig>
  • RegexpFilter com.exalead.dictionary.v10.RegexpFilter
    Discards entries matching this regular expression
    Parent objects
    • LanguageConfig
    Attributes
    • regexp string
    Name Type Default value Description
    regexp string - Discards entries matching this regular expression

    XML representation
    <RegexpFilter regexp="" ></RegexpFilter>
  • FrequencyFilter com.exalead.dictionary.v10.FrequencyFilter
    No documentation for this element.
    Parent objects
    • LanguageConfig
    Attributes
    • minFrequency long
    Name Type Default value Description
    minFrequency long 1 Discards entries with frequency below this threshold

    XML representation
    <FrequencyFilter minFrequency="1" ></FrequencyFilter>
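    Example (a sketch under assumptions): an English word resource that keeps at most 500000 terms, drops purely numeric entries, and drops entries seen fewer than 3 times. The regular expression is illustrative; this reference does not state which regexp dialect is used or whether the match is anchored, so test it before relying on it.

    ```xml
    <WordsResourceConfig>
      <LanguageConfig language="en" maxTerm="500000">
        <!-- illustrative pattern: entries made only of digits -->
        <RegexpFilter regexp="^[0-9]+$" ></RegexpFilter>
        <!-- discards entries with a frequency below 3 -->
        <FrequencyFilter minFrequency="3" ></FrequencyFilter>
      </LanguageConfig>
    </WordsResourceConfig>
    ```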
  • PhonemesResourceConfig com.exalead.dictionary.v10.PhonemesResourceConfig
    Specifies whether a phonetic resource is built for this dictionary.
    Used for relevance purposes and term expansion.
    Required for phonetic term expansion.
    Nested elements
    • LanguageConfig*
    Type Wrapper Description
    LanguageConfig* -
    XML representation
    <PhonemesResourceConfig ></PhonemesResourceConfig>
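    Example (a sketch under assumptions): the PhoneticQueryConfig entry states that phonetic search works only if a PhonemesResourceConfig exists, so the two are typically declared in the same dictionary. The dictionary name "dict0" is hypothetical, and the element order shown here is an assumption.

    ```xml
    <DictionaryConfig name="dict0">
      <!-- query side: requires a tokenization config -->
      <PhoneticQueryConfig tokenizationConfig="tok0" ></PhoneticQueryConfig>
      <!-- build side: produces the phonetic forms at build-time -->
      <PhonemesResourceConfig>
        <LanguageConfig language="en" maxTerm="1000000" ></LanguageConfig>
      </PhonemesResourceConfig>
    </DictionaryConfig>
    ```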
  • RelatedTermsResourceConfig com.exalead.dictionary.v10.RelatedTermsResourceConfig
    Specifies whether a related terms resource is built for this dictionary.
    A related terms semantic processor must also be defined in the pipeline.
    Parent objects
    • DictionaryConfig
    Attributes
    • overallMaxTerm long
    Name Type Default value Description
    overallMaxTerm long 100000000 Overall maximum number of terms kept regardless of the language

    Nested elements
    • LanguageConfig*
    Type Wrapper Description
    LanguageConfig* -
    XML representation
    <RelatedTermsResourceConfig overallMaxTerm="100000000" ></RelatedTermsResourceConfig>
  • NGramsResourceConfig com.exalead.dictionary.v10.NGramsResourceConfig
    Specifies whether an ngrams resource is built for this dictionary.
    Used to provide accurate spell checking.
    Nested elements
    • LanguageConfig*
    Type Wrapper Description
    LanguageConfig* -
    XML representation
    <NGramsResourceConfig ></NGramsResourceConfig>
  • CustomResourceConfig com.exalead.dictionary.v10.CustomResourceConfig
    No documentation for this element.
    Nested elements
    • LanguageConfig*
    Type Wrapper Description
    LanguageConfig* -
    XML representation
    <CustomResourceConfig ></CustomResourceConfig>
  • ResourceManagerConfig com.exalead.mercury.mami.resources.v10.ResourceManagerConfig
    Configuration of the resource manager.

    The resource manager supervises resources and provides methods to fetch, convert and build them.
    Attributes
    • version long
    • resourceDir string
    Name Type Default value Description
    version long - -
    resourceDir string data:///build/resources Path to master copy of all resources

    Nested elements
    • ResourceGroup*
    Type Wrapper Description
    ResourceGroup* -
    XML representation
    <ResourceManagerConfig version="" resourceDir="data:///build/resources" ></ResourceManagerConfig>
  • ResourceGroup com.exalead.mercury.mami.resources.v10.ResourceGroup
    A ResourceGroup is a container for one or more resources.
    Each time a ResourceGroup is published, all resources are compiled and published atomically.
    Parent objects
    • ResourceManagerConfig
    Attributes
    • name string
    • roles string
    • waitOnSync boolean
    Name Type Default value Description
    name string - Resource group name
    roles string - The list of roles on which the resource should be published (comma-separated).
    waitOnSync boolean False

    Nested elements
    • Resource*
    Type Wrapper Description
    Resource* -
    XML representation
    <ResourceGroup name="" roles="" waitOnSync="false" ></ResourceGroup>
  • Resource com.exalead.mercury.mami.resources.v10.Resource
    Base class for resources.

    A resource belongs to a group.
    Parent objects
    • ResourceGroup
    Attributes
    • name string
    • type string
    • description string
    • author string
    • applicationId string
    • converterClassId string
    Name Type Default value Description
    name string - Name of the resource.
    type string - Type of the resource.
    description string - Short description of the resource.
    author string - Author of the resource.
    applicationId string - Application Id of the resource.

    If defined, the resource can only be edited in this application.
    converterClassId string - Converter Class Id.

    If defined, it will be used instead of the default converter associated with the resource type.

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* - Configuration of the resource.
    XML representation
    <Resource name="" type="" description="" author="" applicationId="" converterClassId="" ></Resource>
  • SemanticResource com.exalead.mercury.mami.resources.v10.SemanticResource
    A Semantic resource is a resource requiring a tokenization config
    Parent objects
    • ResourceGroup
    Attributes
    • name string
    • type string
    • description string
    • author string
    • applicationId string
    • converterClassId string
    • tokenizationConfig string
    Name Type Default value Description
    name string - Name of the resource.
    type string - Type of the resource.
    description string - Short description of the resource.
    author string - Author of the resource.
    applicationId string - Application Id of the resource.

    If defined, the resource can only be edited in this application.
    converterClassId string - Converter Class Id.

    If defined, it will be used instead of the default converter associated with the resource type.
    tokenizationConfig string - Specify the tokenization config to use

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* - Configuration of the resource.
    XML representation
    <SemanticResource name="" type="" description="" author="" applicationId="" converterClassId="" tokenizationConfig="tok0" ></SemanticResource>
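    Example (a sketch with placeholder values): one ResourceGroup holding a plain Resource and a SemanticResource, so that both are compiled and published together when the group is published. The names, the role and the type values are hypothetical placeholders; valid resource types and role names are not listed in this section.

    ```xml
    <ResourceManagerConfig resourceDir="data:///build/resources">
      <ResourceGroup name="group0" roles="hypothetical-role" waitOnSync="false">
        <!-- placeholder type: replace with a resource type supported by the product -->
        <Resource name="res0" type="hypothetical-type" description="Sample resource" ></Resource>
        <!-- a SemanticResource additionally names the tokenization config it requires -->
        <SemanticResource name="sem0" type="hypothetical-type" tokenizationConfig="tok0" ></SemanticResource>
      </ResourceGroup>
    </ResourceManagerConfig>
    ```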
  • ProductSecurity com.exalead.mercury.mami.master.v10.ProductSecurity
    Defines the product security.
    Specifies how product components are exposed to the rest of the world.

    Also defines a set of identity provider configurations. Identity providers can be used to define authentication domain and to provide common authentication tools for different services (for example, admin UI and api console).
    Attributes
    • version long
    • login string
    • password string
    • secureInternalConnections boolean
    Name Type Default value Description
    version long - -
    login string - Root login.

    Can be used to connect to every secured UI. Also used for securing inter process communications when required.
    password string - Root password.

    Encrypted using product RSA encryption key.
    secureInternalConnections boolean False Basic authentication using product login - password.

    Nested elements
    • trustedHost StringValue*
    • IdentityProviderConfig*
    • MAMISecurity
    • PushAPISecurity
    • SearchAPISecurity
    Type Wrapper Description
    StringValue* trustedHost Hostnames allowed for redirection.
    "*" means any characters. For example, *.exalead.com would allow all hosts in the exalead.com domain.
    IdentityProviderConfig* -
    MAMISecurity -
    PushAPISecurity -
    SearchAPISecurity -
    XML representation
    <ProductSecurity version="" login="" password="" secureInternalConnections="false" ></ProductSecurity>
  • IdentityProviderConfig com.exalead.mercury.mami.master.v10.IdentityProviderConfig
    Definition of an identity provider.
    Used, for example, by the admin ui components to define a common login API.
    Parent objects
    • ProductSecurity
    Attributes
    • name string
    • filterFactoryClassId string
    • securitySource string
    • sessionInactivityTimeoutS long
    Name Type Default value Description
    name string - Identity provider name.
    filterFactoryClassId string - Optional classId of a specific FilterFactory to use instead of the default one.

    It is used to create a specific SecurityFilter to change the authentication method.
    securitySource string - Associated security source.
    sessionInactivityTimeoutS long 21600 Sessions time out after this many seconds of inactivity. Use -1 for no timeout.

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* -
    XML representation
    <IdentityProviderConfig name="" filterFactoryClassId="" securitySource="" sessionInactivityTimeoutS="21600" ></IdentityProviderConfig>
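    Example (a minimal sketch): the default of 21600 corresponds to 6 hours (21600 / 3600), assuming the value is expressed in seconds as the S suffix suggests. The fragment below shortens the timeout to 30 minutes (30 x 60 = 1800). The provider name and the security source name "ldap0" are hypothetical.

    ```xml
    <!-- 1800 = 30 minutes; -1 would disable the timeout -->
    <IdentityProviderConfig name="idp0" securitySource="ldap0" sessionInactivityTimeoutS="1800" ></IdentityProviderConfig>
    ```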
  • MAMISecurity com.exalead.mercury.mami.master.v10.MAMISecurity
    Management APIs security (gateway).
    Parent objects
    • ProductSecurity
    Attributes
    • login string
    • password string
    • useHttps boolean
    • serverCertificate string
    Name Type Default value Description
    login string - Login.

    Set to null to disable authentication.
    password string - Password.

    Encrypted using product RSA encryption key.
    useHttps boolean False Secures connection with HTTPS.
    serverCertificate string - When using HTTPS, name of a certificate inside the security folder of the data directory. When not provided, the default certificate is used.

    Nested elements
    • IPRule*
    Type Wrapper Description
    IPRule* -
    XML representation
    <MAMISecurity login="" password="" useHttps="false" serverCertificate="" ></MAMISecurity>
  • IPRule (deprecated) com.exalead.mercury.mami.master.v10.IPRule
    IP Rule.
    Parent objects
    • MAMISecurity
    • SearchAPISecurity
    Attributes
    • allow boolean
    • addr string
    • mask string
    Name Type Default value Description
    allow boolean False Allows the IP address access.
    addr string - Address.
    mask string - Mask.

    XML representation
    <IPRule allow="false" addr="" mask="" ></IPRule>
  • PushAPISecurity com.exalead.mercury.mami.master.v10.PushAPISecurity
    PushAPI Security

    Warning: unlike MAMI Security and Search API Security, the login and password are configured at the connector config level.
    Parent objects
    • ProductSecurity
    Attributes
    • serverCertificate string
    • useHttps boolean
    Name Type Default value Description
    serverCertificate string - When using HTTPS, name of a certificate inside the security folder of the data directory. When not provided, the default certificate is used.
    useHttps boolean False Secures connection with HTTPS.

    XML representation
    <PushAPISecurity serverCertificate="" useHttps="false" ></PushAPISecurity>
  • SearchAPISecurity com.exalead.mercury.mami.master.v10.SearchAPISecurity
    Class used for search API security (search, fetch).

    Defines whether the connection to the search API should be authenticated and/or secured.
    Parent objects
    • ProductSecurity
    Attributes
    • login string
    • password string
    • useHttps boolean
    • serverCertificate string
    Name Type Default value Description
    login string - Login.

    Set to null to disable authentication.
    password string - Password.

    Encrypted using product RSA encryption key.
    useHttps boolean False Secures connection with HTTPS.
    serverCertificate string - When using HTTPS, name of a certificate inside the security folder of the data directory. When not provided, the default certificate is used.

    Nested elements
    • IPRule*
    Type Wrapper Description
    IPRule* -
    XML representation
    <SearchAPISecurity login="" password="" useHttps="false" serverCertificate="" ></SearchAPISecurity>
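    Example (a sketch with placeholder credentials): HTTPS enabled on the management, push and search APIs. The logins and the certificate name "gateway.pem" are hypothetical, and ENCRYPTED_VALUE stands for a password already encrypted with the product RSA key. Where serverCertificate is omitted, the default certificate applies.

    ```xml
    <ProductSecurity login="admin" password="ENCRYPTED_VALUE" secureInternalConnections="true">
      <!-- hypothetical certificate file located in the security folder of the data directory -->
      <MAMISecurity login="mami" password="ENCRYPTED_VALUE" useHttps="true" serverCertificate="gateway.pem" ></MAMISecurity>
      <!-- no login here: PushAPI credentials are set at the connector config level -->
      <PushAPISecurity useHttps="true" ></PushAPISecurity>
      <SearchAPISecurity login="search" password="ENCRYPTED_VALUE" useHttps="true" ></SearchAPISecurity>
    </ProductSecurity>
    ```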
  • SecuritySourceList com.exalead.mercury.mami.master.v10.SecuritySourceList
    List of security sources
    Attributes
    • version long
    • strictConfig boolean
    Name Type Default value Description
    version long - -
    strictConfig boolean False Stops deployment if a security source failed to initialize.

    Nested elements
    • SecuritySourceBase*
    Type Wrapper Description
    SecuritySourceBase* -
    XML representation
    <SecuritySourceList version="" strictConfig="false" ></SecuritySourceList>
  • MetaSecuritySource com.exalead.mercury.mami.master.v10.MetaSecuritySource
    Meta security source configuration
    Parent objects
    • SecuritySourceList
    Attributes
    • name string
    • deploy boolean
    • type enum(First, Merge, No Authentication)
    • forcedTokens string
    Name Type Default value Description
    name string - Security source name
    deploy boolean True Will this security source be deployed?
    type enum(First, Merge, No Authentication) First Type of action when an authentication succeeds
    Value can be null or one of
    • First
    • Merge
    • No Authentication
    forcedTokens string An optional comma-separated list of tokens which will be appended to all authenticated users.

    Nested elements
    • AuthenticationSource*
    • AuthorizationSource*
    Type Wrapper Description
    AuthenticationSource* - List of sources on which authentication will be performed
    AuthorizationSource* - List of sources for which tokens will be retrieved if an authentication has succeeded.
    XML representation
    <MetaSecuritySource name="" deploy="true" type="First" forcedTokens="" ></MetaSecuritySource>
  • AuthenticationSource com.exalead.mercury.mami.master.v10.AuthenticationSource
    No documentation for this element.
    Parent objects
    • MetaSecuritySource
    Attributes
    • name string
    • rewrittenLogin string
    Name Type Default value Description
    name string - The name of the security source.
    rewrittenLogin string Can be used to rewrite the login. '$login' will be replaced by the user's login. For example, OFFICE\$login

    XML representation
    <AuthenticationSource name="" ></AuthenticationSource>
  • AuthorizationSource com.exalead.mercury.mami.master.v10.AuthorizationSource
    No documentation for this element.
    Parent objects
    • MetaSecuritySource
    Attributes
    • name string
    • rewrittenLogin string
    • passwordIsMandatory boolean
    Name Type Default value Description
    name string - The name of the security source.
    rewrittenLogin string Can be used to rewrite the login. '$login' will be replaced by the user's login. For example, OFFICE\$login
    passwordIsMandatory boolean False Forces the password to match

    XML representation
    <AuthorizationSource name="" passwordIsMandatory="false" ></AuthorizationSource>
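    Example (a sketch under assumptions): a meta security source that authenticates against one underlying source and retrieves tokens from the same source, rewriting the login with the OFFICE\$login pattern given above. The source names "meta0" and "ldap0" and the forced token "everybody" are hypothetical.

    ```xml
    <MetaSecuritySource name="meta0" deploy="true" type="First" forcedTokens="everybody">
      <!-- $login is replaced by the user's login, giving OFFICE\<login> -->
      <AuthenticationSource name="ldap0" rewrittenLogin="OFFICE\$login" ></AuthenticationSource>
      <AuthorizationSource name="ldap0" rewrittenLogin="OFFICE\$login" passwordIsMandatory="false" ></AuthorizationSource>
    </MetaSecuritySource>
    ```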
  • SecuritySource com.exalead.mercury.mami.master.v10.SecuritySource
    Security source configuration
    Parent objects
    • SecuritySourceList
    Attributes
    • name string
    • deploy boolean
    • classId string
    • customClassId string
    • singleInstance boolean
    Name Type Default value Description
    name string - Security source name
    deploy boolean True Will this security source be deployed?
    classId string - Security source type
    customClassId string - Customized security source type
    singleInstance boolean False Will this source be deployed on each security command?

    Nested elements
    • configKeyValue*
    Type Wrapper Description
    KeyValue* config
    XML representation
    <SecuritySource name="" deploy="true" classId="" customClassId="" singleInstance="false" ></SecuritySource>
  • RemoteHttpSource com.exalead.mercury.mami.master.v10.RemoteHttpSource
    Remote Http security source configuration
    Parent objects
    • SecuritySourceList
    Attributes
    • name string
    • deploy boolean
    • service string
    • isAlivePath string
    • maxRetries int
    Name Type Default value Description
    name string - Security source name
    deploy boolean True Will this security source be deployed?
    service string Service path on the remote security source
    isAlivePath string - The path to determine if the service is available on hosts.

    Not used if null.
    maxRetries int - Number of retries before skipping a host

    Nested elements
    • RemoteHttpConfig*
    Type Wrapper Description
    RemoteHttpConfig* - Hosts on which authentication will be performed
    XML representation
    <RemoteHttpSource name="" deploy="true" service="" isAlivePath="" maxRetries="" ></RemoteHttpSource>
  • RemoteHttpConfig com.exalead.mercury.mami.master.v10.RemoteHttpConfig
    Security source config.
    The list of URIs used by RemoteHttpSource
    Parent objects
    • RemoteHttpSource
    Attributes
    • protocol enum(http, https)
    • host string
    • port int
    • power int
    Name Type Default value Description
    protocol enum(http, https) http Protocol of the remote security source host
    host string - Hostname of the remote security source
    port int 80 Port of the remote security source
    power int 1 Priority of this host

    XML representation
    <RemoteHttpConfig protocol="http" host="" port="80" power="1" ></RemoteHttpConfig>
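    A RemoteHttpSource with two hosts might be declared as in the sketch below. The host names, service path, isAlive path, and retry count are hypothetical values chosen for illustration. This reference only describes power as the priority of the host, so how two different power values are ordered is not shown here.

    ```xml
    <!-- Hypothetical hosts and paths; only element and attribute names come from this reference. -->
    <RemoteHttpSource name="remote_security" deploy="true" service="/security" isAlivePath="/isAlive" maxRetries="3">
      <!-- One RemoteHttpConfig per host on which authentication will be performed -->
      <RemoteHttpConfig protocol="https" host="sec1.example.com" port="443" power="1" />
      <RemoteHttpConfig protocol="https" host="sec2.example.com" port="443" power="1" />
    </RemoteHttpSource>
    ```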
  • AlertingConfig com.exalead.cloudview.alerting.v10.AlertingConfig
    No documentation for this element.
    Attributes
    • version long
    • dbConnectionString string
    • dbUsername string
    • dbPassword string
    Name Type Default value Description
    version long - -
    dbConnectionString string jdbc:sqlite:${dataDir}/alerting/db.sql JDBC connection string for the alerting database.
    dbUsername string -
    dbPassword string -

    XML representation
    <AlertingConfig version="" dbConnectionString="jdbc:sqlite:${dataDir}/alerting/db.sql" dbUsername="" dbPassword="" ></AlertingConfig>
  • ApplicationAlertingConfig com.exalead.cloudview.alerting.v10.ApplicationAlertingConfig
    No documentation for this element.
    Attributes
    • lastModifiedDate long
    • humanLastModifiedDate long
    Name Type Default value Description
    lastModifiedDate long -
    humanLastModifiedDate long 0

    Nested elements
    • AlertGroup*
    Type Wrapper Description
    AlertGroup* -
    XML representation
    <ApplicationAlertingConfig lastModifiedDate="" humanLastModifiedDate="0" ></ApplicationAlertingConfig>
  • AlertGroup com.exalead.cloudview.alerting.v10.AlertGroup
    Defines the scheduling and publication rules that apply to a set of scheduled alerts
    Parent objects
    • ApplicationAlertingConfig
    Attributes
    • name string
    • description string
    • securitySource string
    • application string
    • useAsDefault boolean
    • publishDocumentMetas boolean
    • storeAlertOccurences boolean
    • realTime boolean
    • autoBuild boolean
    • storeResults boolean
    • incremental boolean
    • groupAlertsByUser boolean
    Name Type Default value Description
    name string -
    description string -
    securitySource string - Name of the security source to check tokens against when executing user alerts
    application string - Mashup application for this alert group
    useAsDefault boolean False Automatically selects this alert group in the "Save as alert" Mashup Widget.
    publishDocumentMetas boolean True When enabled, document metas are also published on match. It may be useful to disable this option to reduce gateway heap usage.
    storeAlertOccurences boolean True If set, each matching alert will be stored in the underlying alert_occurence table.
    realTime boolean False Matches alerts in this alert group as soon as the document is pushed (real-time alerting). If false, matches alerts by executing scheduled queries (scheduled alerting).
    Real-time alerting offers very low latency alerting, but with fewer features.
    For details, see the CloudView Administration Guide.
    autoBuild boolean True Automatically rebuilds the real-time alerting matchers each time an alert is added, deleted, or updated.
    Autobuild should be disabled for large alerting groups (thousands of alerts).
    storeResults boolean False Stores the results of each alert. Required for using the real-time alerts displayer; calculating the difference since the last alert trigger; and ensuring transaction consistency if a publication fails.
    incremental boolean False Includes a timestamp criterion with each alert request so that the alert only returns documents modified or added since the last alert trigger.
    Only applies to scheduled alerting groups, since real-time alerting groups are always incremental.
    groupAlertsByUser boolean False Groups all alerts for a user in a single publication. Only applies to scheduled alerting groups, since real-time alerting publications are never grouped.

    Nested elements
    • AlertPublisher*
    • IncrementalModeConfig
    • Scheduling
    Type Wrapper Description
    AlertPublisher* - List of alert publisher configurations.
    IncrementalModeConfig - Advanced configuration for the incrementality of scheduled alerting groups.
    Scheduling - Defines the trigger rule for scheduled queries.
    Only applies to scheduled alerting groups.
    XML representation
    <AlertGroup name="" description="" securitySource="" application="" useAsDefault="false" publishDocumentMetas="true" storeAlertOccurences="true" realTime="false" autoBuild="true" storeResults="false" incremental="false" groupAlertsByUser="false" ></AlertGroup>
  • CustomPublisher com.exalead.cloudview.alerting.v10.CustomPublisher
    Publishes push-mode alerts
    Parent objects
    • AlertGroup
    Attributes
    • name string
    • classId string
    Name Type Default value Description
    name string -
    classId string - Publisher type

    Nested elements
    • configKeyValue*
    • KeyValue
    Type Wrapper Description
    KeyValue* config
    KeyValue - Defined for XSD compatibility, do not use
    XML representation
    <CustomPublisher name="" classId="" ></CustomPublisher>
  • SimpleHTTPPublisher com.exalead.cloudview.alerting.v10.SimpleHTTPPublisher
    Pushes the alerts to a remote HTTP service.
    Each alert occurrence triggers an HTTP POST request.
    This publisher can only be used for scheduled alert groups.
    Parent objects
    • AlertGroup
    Attributes
    • name string
    • address string
    • format string
    Name Type Default value Description
    name string -
    address string - Publication endpoint address
    format string atom Alert publishing format.
    The only supported format is 'atom'. This is the native atom format of the Mashup API. It can be deserialized in Java by using, for example, the AccessClient. Additional information is added on the root feed, with alerting-specific data.

    XML representation
    <SimpleHTTPPublisher name="" address="" format="atom" ></SimpleHTTPPublisher>
  • RealTimeHTTPPublisher com.exalead.cloudview.alerting.v10.RealTimeHTTPPublisher
    Pushes the alerts to a remote HTTP service.
    Each alert occurrence triggers an HTTP POST request.
    This publisher can only be used for real-time alert groups.
    Parent objects
    • AlertGroup
    Attributes
    • name string
    • address string
    • format string
    Name Type Default value Description
    name string -
    address string - Publication endpoint address
    format string json Alert publishing format. The only supported format is 'json'

    XML representation
    <RealTimeHTTPPublisher name="" address="" format="json" ></RealTimeHTTPPublisher>
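    The sketch below shows what a real-time alert group using this publisher could look like. The group name, security source, application name, and endpoint address are hypothetical. Attributes not shown are assumed to keep their default values.

    ```xml
    <!-- realTime="true": alerts are matched as soon as a document is pushed. -->
    <!-- storeResults="true" is required for the real-time alerts displayer. -->
    <AlertGroup name="realtime_alerts" description="Low-latency alerts" securitySource="corporate" application="default" realTime="true" autoBuild="true" storeResults="true">
      <!-- Hypothetical endpoint; 'json' is the only supported format for this publisher. -->
      <RealTimeHTTPPublisher name="http_json" address="http://alerts.example.com/receive" format="json" />
    </AlertGroup>
    ```

    For a group holding thousands of alerts, autoBuild would be set to false instead, as recommended in the AlertGroup description.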
  • ConsolePublisher com.exalead.cloudview.alerting.v10.ConsolePublisher
    Debug-only publisher that simply logs alerts
    Parent objects
    • AlertGroup
    Attributes
    • name string
    Name Type Default value Description
    name string -

    XML representation
    <ConsolePublisher name="" ></ConsolePublisher>
  • SimpleMailPublisher com.exalead.cloudview.alerting.v10.SimpleMailPublisher
    In the formats, the following entries are replaced: ${TOTAL_NB_RESULTS}, ${NB_ALERTS}, ${USER}, ${ALERT_NAME}, ${ALERT_DESCRIPTION}, ${ALERT_NB_RESULTS}, ${TITLE}, ${TEXT}, ${ID}
    Parent objects
    • AlertGroup
    Attributes
    • name string
    • fromAddress string
    • toAddressFormat string
    • titleFormat string
    • mailTemplateDirectory string
    • mailTemplate string
    • groupAlertsByUser boolean
    • feedToRender string
    Name Type Default value Description
    name string -
    fromAddress string - E-mail address that will appear as "From" in alerting mails.
    toAddressFormat string - Format string used to build the recipient e-mail address. ${USER} is automatically substituted with the login of the user.

    For example, if the format is "${USER}@mycompany.com", and "user1" sets up an alert, mails for this alert are sent to "user1@mycompany.com".
    titleFormat string ${NB_RESULTS} results for your ${NB_ALERTS} alert(s)
    mailTemplateDirectory string resource:///alerting/mail/templates
    mailTemplate string default.ftl
    groupAlertsByUser boolean True
    feedToRender string cloudview Name of the top-level feed to render.

    Nested elements
    • metaToExposeStringValue*
    • SMTPServer
    Type Wrapper Description
    StringValue* metaToExpose List of meta exposed in mail template.
    Metas "publicurl" and "lastmodifieddate" are exposed by default
    SMTPServer -
    XML representation
    <SimpleMailPublisher name="" fromAddress="" toAddressFormat="" titleFormat="${NB_RESULTS} results for your ${NB_ALERTS} alert(s)" mailTemplateDirectory="resource:///alerting/mail/templates" mailTemplate="default.ftl" groupAlertsByUser="true" feedToRender="cloudview" ></SimpleMailPublisher>
  • SMTPServer com.exalead.cloudview.alerting.v10.SMTPServer
    No documentation for this element.
    Parent objects
    • SimpleMailPublisher
    Attributes
    • host string
    • port int
    • username string
    • password string
    • tls boolean
    Name Type Default value Description
    host string localhost
    port int 25
    username string -
    password string -
    tls boolean False

    XML representation
    <SMTPServer host="localhost" port="25" username="" password="" tls="false" ></SMTPServer>
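    As a sketch of how SimpleMailPublisher and SMTPServer fit together, the fragment below reuses the "${USER}@mycompany.com" recipient format described above. The SMTP host, port, user name, and sender address are hypothetical, and the password is left empty on purpose.

    ```xml
    <!-- Hypothetical addresses and SMTP settings. titleFormat, mailTemplate, and feedToRender are the documented defaults. -->
    <SimpleMailPublisher name="mail" fromAddress="alerts@mycompany.com" toAddressFormat="${USER}@mycompany.com" titleFormat="${NB_RESULTS} results for your ${NB_ALERTS} alert(s)" mailTemplateDirectory="resource:///alerting/mail/templates" mailTemplate="default.ftl" groupAlertsByUser="true" feedToRender="cloudview">
      <SMTPServer host="smtp.mycompany.com" port="25" username="alerts" password="" tls="true" />
    </SimpleMailPublisher>
    ```

    With this configuration, an alert set up by "user1" would be mailed to "user1@mycompany.com".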
  • StandardIncrementalModeConfig com.exalead.cloudview.alerting.v10.StandardIncrementalModeConfig
    A simple implementation of incremental mode. Using this mode may cause some documents that have already been alerted to trigger another alert.
    When each document is analyzed, an analysis timestamp is attached to it. When the alert query is triggered, the criterion "analysis_timestamp > (PREVIOUS_ALERT_TRIGGER_TIMESTAMP - OFFSET)" is added to the user query.
    PREVIOUS_ALERT_TRIGGER_TIMESTAMP is then set to the current timestamp.
    The OFFSET is required to account for the time delta between the moment a document is analyzed and the moment it is available for queries.
    Parent objects
    • AlertGroup
    Attributes
    • analysisTimestampIndexField string
    • timestampRestrictionPageParameter string
    • timestampOffsetS long
    Name Type Default value Description
    analysisTimestampIndexField string analysisdate
    timestampRestrictionPageParameter string cloudview.eq.date_restriction Name of the Mashup-API page parameter to send the ELLQL query for timestamp restriction
    timestampOffsetS long 900 Negative offset in seconds on the previous trigger timestamp, to account for the time delta between the moment a document is analyzed and the moment it is available for queries.

    XML representation
    <StandardIncrementalModeConfig analysisTimestampIndexField="analysisdate" timestampRestrictionPageParameter="cloudview.eq.date_restriction" timestampOffsetS="900" ></StandardIncrementalModeConfig>
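    To make the criterion concrete, take hypothetical timestamps expressed in seconds: if the previous trigger ran at 10000 and the offset is 900, the next alert query is restricted to analysis_timestamp > 9100. A document analyzed at 9500 that was already returned by the previous run matches again, which is why this mode can alert twice on the same document. The sketch below raises the offset to 30 minutes, an arbitrary value for an index where analyzed documents take longer to become searchable.

    ```xml
    <!-- timestampOffsetS="1800" is an illustrative value (30 minutes); the default is 900. -->
    <StandardIncrementalModeConfig analysisTimestampIndexField="analysisdate" timestampRestrictionPageParameter="cloudview.eq.date_restriction" timestampOffsetS="1800" />
    ```

    A larger offset lowers the risk of missing documents at the cost of more repeated alerts.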
  • FixedTimeScheduling com.exalead.cloudview.alerting.v10.FixedTimeScheduling
    Sequentially executes all alerts at fixed times, according to cron rules.
    Use this option when alerts need to be sent at fixed times, and when there are few alerts (in the low 1000s).
    Because of the sequential processing, this option generates a significant load when alerts are triggered.
    Parent objects
    • AlertGroup
    Attributes
    • cronExpression string
    • workerThreads int
    Name Type Default value Description
    cronExpression string -
    workerThreads int 1

    XML representation
    <FixedTimeScheduling cronExpression="" workerThreads="1" ></FixedTimeScheduling>
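    Putting the scheduled-alerting elements together, a scheduled group could look like the sketch below. All names and the endpoint address are hypothetical. The cron expression assumes Quartz syntax (seconds, minutes, hours, day of month, month, day of week), which this section does not state explicitly; "0 0 8 * * ?" would then mean every day at 08:00.

    ```xml
    <!-- realTime="false": alerts are matched by executing scheduled queries. -->
    <AlertGroup name="daily_digest" description="Daily scheduled alerts" securitySource="corporate" application="default" realTime="false" incremental="true" groupAlertsByUser="true">
      <!-- SimpleHTTPPublisher can only be used for scheduled alert groups. -->
      <SimpleHTTPPublisher name="http_atom" address="http://alerts.example.com/digest" format="atom" />
      <StandardIncrementalModeConfig analysisTimestampIndexField="analysisdate" timestampRestrictionPageParameter="cloudview.eq.date_restriction" timestampOffsetS="900" />
      <!-- Assumed Quartz cron syntax: every day at 08:00. -->
      <FixedTimeScheduling cronExpression="0 0 8 * * ?" workerThreads="1" />
    </AlertGroup>
    ```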
  • ConsolidationConfig com.exalead.mercury.mami.consolidation.v10.ConsolidationConfig
    Consolidation Server configuration. The Consolidation Server collects documents from a push API.
    The document goes through the following transformation steps before being sent to the Indexing Server:
    1. Transformation processors
    2. Aggregation processors
    Attributes
    • name string
    • nbThreads int
    • maxNativeMemoryConsumptionThreshold enum(disabled, enabled, auto)
    • maxNativeMemoryConsumptionInMB int
    Name Type Default value Description
    name string - Identifies this consolidation configuration uniquely
    nbThreads int 4 Number of threads to allocate during the transformation and aggregation phases.
    maxNativeMemoryConsumptionThreshold enum(disabled, enabled, auto) enabled When memory consumption reaches the specified value, transformation stops and the transformed documents are committed. Transformation then starts again.
    • Enabled: Commits when the memory usage reaches the threshold value specified (by default, 2048 MB).
    • Auto: Commits when the memory usage reaches 2048 MB.
    maxNativeMemoryConsumptionInMB int 2048 The maximum amount of native memory that the transformation service can allocate. Reaching this limit triggers a commit.

    Nested elements
    • CommitTriggerConditionsCommitTriggerCondition*
    • AggregationTriggerConditionsCommitTriggerCondition*
    • AdvancedConfig
    • AggregationForwardProcessorConfigList
    • AggregationProcessorConfigList
    • TransformationProcessorConfigList
    Type Wrapper Description
    CommitTriggerCondition* CommitTriggerConditions
    CommitTriggerCondition* AggregationTriggerConditions
    AdvancedConfig -
    AggregationForwardProcessorConfigList -
    AggregationProcessorConfigList -
    TransformationProcessorConfigList -
    XML representation
    <ConsolidationConfig name="" nbThreads="4" maxNativeMemoryConsumptionThreshold="enabled" maxNativeMemoryConsumptionInMB="2048" ></ConsolidationConfig>
  • AdvancedConfig com.exalead.mercury.mami.consolidation.v10.AdvancedConfig
    No documentation for this element.
    Parent objects
    • ConsolidationConfig
    Attributes
    • sequentialCompaction boolean
    Name Type Default value Description
    sequentialCompaction boolean True

    Nested elements
    • AdvancedAggregationConfig
    • CompactPolicies
    • DocumentStoreWarnConfig
    Type Wrapper Description
    AdvancedAggregationConfig -
    CompactPolicies -
    DocumentStoreWarnConfig -
    XML representation
    <AdvancedConfig sequentialCompaction="true" ></AdvancedConfig>
  • AdvancedAggregationConfig com.exalead.mercury.mami.consolidation.v10.AdvancedAggregationConfig
    No documentation for this element.
    Parent objects
    • AdvancedConfig
    Attributes
    • impactQueueSize int
    • aggregationQueueSize int
    • forwardQueueSize int
    • aggregatedVerticesCacheSize int
    • vertexCacheSize int
    • impactDetectionResultCacheMaxFileSizeInBytes int
    • logCountImpactedVertices int
    • metaCountLogThreshold int
    • disableCheckpointForwarding boolean
    • countCheckpointAsATask boolean
    • progressionSaveThresholdInMS long
    Name Type Default value Description
    impactQueueSize int 8
    aggregationQueueSize int 8
    forwardQueueSize int 1600
    aggregatedVerticesCacheSize int 65536 Deprecated since V6R2016xR4, a dedup on impact detection results is now performed before running the aggregation
    vertexCacheSize int 65536
    impactDetectionResultCacheMaxFileSizeInBytes int 10485760
    logCountImpactedVertices int -1 Possible values:
    • -1: disabled (default value)
    • 0: log all
    • N: log only if the number of vertices impacted is greater than or equal to N
    metaCountLogThreshold int -1 Possible values:
    • -1: disabled (default value)
    • N: log only if the number of values for a meta is greater than or equal to N
    disableCheckpointForwarding boolean False
    countCheckpointAsATask boolean True
    progressionSaveThresholdInMS long 3600000

    XML representation
    <AdvancedAggregationConfig impactQueueSize="8" aggregationQueueSize="8" forwardQueueSize="1600" aggregatedVerticesCacheSize="65536" vertexCacheSize="65536" impactDetectionResultCacheMaxFileSizeInBytes="10485760" logCountImpactedVertices="-1" metaCountLogThreshold="-1" disableCheckpointForwarding="false" countCheckpointAsATask="true" progressionSaveThresholdInMS="3600000" ></AdvancedAggregationConfig>
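    As a sketch of where these advanced settings sit, the fragment below nests AdvancedAggregationConfig inside AdvancedConfig and ConsolidationConfig. The configuration name and the numeric values are hypothetical. Attributes not shown are assumed to keep their default values.

    ```xml
    <!-- Hypothetical name and values; commits are triggered when native memory reaches 4096 MB. -->
    <ConsolidationConfig name="cs0" nbThreads="8" maxNativeMemoryConsumptionThreshold="enabled" maxNativeMemoryConsumptionInMB="4096">
      <AdvancedConfig sequentialCompaction="true">
        <!-- Log only when at least 1000 vertices are impacted, or when a meta has at least 500 values. -->
        <AdvancedAggregationConfig logCountImpactedVertices="1000" metaCountLogThreshold="500" />
      </AdvancedConfig>
    </ConsolidationConfig>
    ```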
  • DocumentStoreWarnConfig com.exalead.mercury.mami.consolidation.v10.DocumentStoreWarnConfig
    No documentation for this element.
    Parent objects
    • AdvancedConfig
    Attributes
    • thresholdInBytes long
    • enableAtTransformation boolean
    • enableAtAggregation boolean
    Name Type Default value Description
    thresholdInBytes long 104857600
    enableAtTransformation boolean True
    enableAtAggregation boolean True

    XML representation
    <DocumentStoreWarnConfig thresholdInBytes="104857600" enableAtTransformation="true" enableAtAggregation="true" ></DocumentStoreWarnConfig>
  • AggregationForwardProcessorConfigList com.exalead.mercury.mami.consolidation.v10.AggregationForwardProcessorConfigList
    No documentation for this element.
    Nested elements
    • AggregationForwardProcessorConfig*
    Type Wrapper Description
    AggregationForwardProcessorConfig* -
    XML representation
    <AggregationForwardProcessorConfigList ></AggregationForwardProcessorConfigList>
  • AggregationForwardProcessorConfig com.exalead.mercury.mami.consolidation.v10.AggregationForwardProcessorConfig
    No documentation for this element.
    Parent objects
    • AggregationForwardProcessorConfigList
    Attributes
    • pushAPIServer string
    • documentTypes string
    • triggerIndexing boolean
    Name Type Default value Description
    pushAPIServer string - Specify the target Push API server
    documentTypes string - Comma-separated list of document types to forward. Leave empty to match all documents. Deprecated since V6R2015x.SP4, replaced by ForwardedDocumentTypeList
    triggerIndexing boolean True If set to true (default behavior), it will trigger an indexing job on your remote Push API server when the aggregation job is complete

    Nested elements
    • ForwardedDocumentTypeList
    Type Wrapper Description
    ForwardedDocumentTypeList -
    XML representation
    <AggregationForwardProcessorConfig pushAPIServer="" documentTypes="" triggerIndexing="true" ></AggregationForwardProcessorConfig>
  • ForwardedDocumentTypeList com.exalead.mercury.mami.consolidation.v10.ForwardedDocumentTypeList
    List of forwarded document types
    Nested elements
    • ForwardedDocumentType*
    Type Wrapper Description
    ForwardedDocumentType* -
    XML representation
    <ForwardedDocumentTypeList ></ForwardedDocumentTypeList>
  • ForwardedDocumentType com.exalead.mercury.mami.consolidation.v10.ForwardedDocumentType
    No documentation for this element.
    Parent objects
    • ForwardedDocumentTypeList
    Attributes
    • type string
    Name Type Default value Description
    type string -

    XML representation
    <ForwardedDocumentType type="" ></ForwardedDocumentType>
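    The forwarding elements above could be combined as in the following sketch. The Push API server name and the document types (person, company) are hypothetical. documentTypes is left empty because it is deprecated in favor of ForwardedDocumentTypeList.

    ```xml
    <AggregationForwardProcessorConfigList>
      <!-- triggerIndexing="true" (default): an indexing job is triggered on the remote server when aggregation completes. -->
      <AggregationForwardProcessorConfig pushAPIServer="remote_papi" documentTypes="" triggerIndexing="true">
        <ForwardedDocumentTypeList>
          <!-- Hypothetical document types -->
          <ForwardedDocumentType type="person" />
          <ForwardedDocumentType type="company" />
        </ForwardedDocumentTypeList>
      </AggregationForwardProcessorConfig>
    </AggregationForwardProcessorConfigList>
    ```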
  • AggregationProcessorConfigList com.exalead.mercury.mami.consolidation.v10.AggregationProcessorConfigList
    List of aggregation processors that yield aggregated documents
    Nested elements
    • CommonProcessorConfig*
    Type Wrapper Description
    CommonProcessorConfig* - Declare an aggregation processor to be used by the Consolidation Server.
    XML representation
    <AggregationProcessorConfigList ></AggregationProcessorConfigList>
  • TransformationProcessorConfig com.exalead.mercury.mami.consolidation.v10.TransformationProcessorConfig
    No documentation for this element.
    Parent objects
    • AggregationProcessorConfigList
    Attributes
    • source string
    • name string
    • description string
    • enabled boolean
    • mime string
    • perfMonitored boolean
    • context string
    • code string
    Name Type Default value Description
    source string -
    name string - Identifies this processor uniquely in the processing step
    description string - Description of this processor
    enabled boolean True
    mime string - Mime-type of the language used to define the processor. As of now, supported types are:
    • text/x-groovy
    • text/x-jar
    perfMonitored boolean False Show the processor statistics in the Monitoring Console.
    context string - Indicates the context in which the processing operation will occur
    • create/update
    • delete
    code string - Processor source code run on add/update operations.

    XML representation
    <TransformationProcessorConfig source="" name="" description="" enabled="true" mime="" perfMonitored="false" context="" code="" ></TransformationProcessorConfig>
  • CustomJavaTransformationProcessor com.exalead.mercury.mami.consolidation.v10.CustomJavaTransformationProcessor
    No documentation for this element.
    Parent objects
    • AggregationProcessorConfigList
    Attributes
    • source string
    • name string
    • description string
    • enabled boolean
    • mime string
    • perfMonitored boolean
    • context string
    • classId string
    • pluginName string
    Name Type Default value Description
    source string -
    name string - Identifies this processor uniquely in the processing step
    description string - Description of this processor
    enabled boolean True
    mime string - Mime-type of the language used to define the processor. As of now, supported types are:
    • text/x-groovy
    • text/x-jar
    perfMonitored boolean False Show the processor statistics in the Monitoring Console.
    context string - Indicates the context in which the processing operation will occur
    • create/update
    • delete
    classId string -
    pluginName string - Deprecated since V6R2016x.R2, not used by the product so not replaced.

    Nested elements
    • configKeyValue*
    Type Wrapper Description
    KeyValue* config
    XML representation
    <CustomJavaTransformationProcessor source="" name="" description="" enabled="true" mime="" perfMonitored="false" context="" classId="" pluginName="" ></CustomJavaTransformationProcessor>
  • AggregationProcessorConfig com.exalead.mercury.mami.consolidation.v10.AggregationProcessorConfig
    No documentation for this element.
    Parent objects
    • AggregationProcessorConfigList
    Attributes
    • name string
    • description string
    • enabled boolean
    • mime string
    • perfMonitored boolean
    • context string
    • code string
    Name Type Default value Description
    name string - Identifies this processor uniquely in the processing step
    description string - Description of this processor
    enabled boolean True
    mime string - Mime-type of the language used to define the processor. As of now, supported types are:
    • text/x-groovy
    • text/x-jar
    perfMonitored boolean False Show the processor statistics in the Monitoring Console.
    context string - Indicates the context in which the processing operation will occur
    • create/update
    • delete
    code string - Processor source code run on add/update operations.

    XML representation
    <AggregationProcessorConfig name="" description="" enabled="true" mime="" perfMonitored="false" context="" code="" ></AggregationProcessorConfig>
  • CustomJavaAggregationProcessor com.exalead.mercury.mami.consolidation.v10.CustomJavaAggregationProcessor
    No documentation for this element.
    Parent objects
    • AggregationProcessorConfigList
    Attributes
    • name string
    • description string
    • enabled boolean
    • mime string
    • perfMonitored boolean
    • context string
    • classId string
    • pluginName string
    Name Type Default value Description
    name string - Identifies this processor uniquely in the processing step
    description string - Description of this processor
    enabled boolean True
    mime string - Mime-type of the language used to define the processor. As of now, supported types are:
    • text/x-groovy
    • text/x-jar
    perfMonitored boolean False Show the processor statistics in the Monitoring Console.
    context string - Indicates the context in which the processing operation will occur
    • create/update
    • delete
    classId string -
    pluginName string - Deprecated since V6R2016x.R2, not used by the product so not replaced.

    Nested elements
    • configKeyValue*
    Type Wrapper Description
    KeyValue* config
    XML representation
    <CustomJavaAggregationProcessor name="" description="" enabled="true" mime="" perfMonitored="false" context="" classId="" pluginName="" ></CustomJavaAggregationProcessor>
  • TransformationProcessorConfigList com.exalead.mercury.mami.consolidation.v10.TransformationProcessorConfigList
    List of transformation processors that yield transformed documents
    Nested elements
    • CommonTransformationProcessorConfig*
    Type Wrapper Description
    CommonTransformationProcessorConfig* - Declare a transformation processor to be used by the Consolidation Server.
    XML representation
    <TransformationProcessorConfigList ></TransformationProcessorConfigList>
  • CustomConfigList com.exalead.mercury.mami.master.v10.CustomConfigList
    A list of custom configuration entries.
    Attributes
    • version long
    Name Type Default value Description
    version long - -

    Nested elements
    • CustomConfig*
    Type Wrapper Description
    CustomConfig* -
    XML representation
    <CustomConfigList version="" ></CustomConfigList>
  • CustomConfig com.exalead.mercury.mami.master.v10.CustomConfig
    Data model
    A custom configuration entry.
    Contains the custom configuration entries as a set of KeyValue elements.
    Parent objects
    • CustomConfigList
    Attributes
    • name string
    Name Type Default value Description
    name string - Name of the custom configuration entry.

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* -
    XML representation
    <CustomConfig name="" ></CustomConfig>
  • SchedulingConfig com.exalead.mercury.mami.master.v10.SchedulingConfig
    Scheduled jobs configuration. Allows jobs to be run periodically, at a certain date or time.
    Attributes
    • version long
    Name Type Default value Description
    version long - -

    Nested elements
    • CalendarConfig*
    • JobConfigGroup*
    • TriggerConfigGroup*
    Type Wrapper Description
    CalendarConfig* -
    JobConfigGroup* -
    TriggerConfigGroup* -
    XML representation
    <SchedulingConfig version="" ></SchedulingConfig>
  • CronCalendarConfig com.exalead.mercury.mami.master.v10.CronCalendarConfig
    Used to exclude a time range defined with a Quartz Cron expression.
    Parent objects
    • SchedulingConfig
    Attributes
    • name string
    • cronExpression string
    Name Type Default value Description
    name string - Name of the calendar.
    cronExpression string - The Quartz Cron expression.

    XML representation
    <CronCalendarConfig name="" cronExpression="" ></CronCalendarConfig>
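    For example, a calendar that excludes the night hours might be written as follows. The calendar name is hypothetical. In Quartz cron syntax (seconds, minutes, hours, day of month, month, day of week), the expression "* * 0-7 ? * *" matches every second from 00:00:00 to 07:59:59 each day, so that range is the one excluded.

    ```xml
    <!-- Excludes 00:00:00 to 07:59:59 every day. "no_night" is a hypothetical name. -->
    <CronCalendarConfig name="no_night" cronExpression="* * 0-7 ? * *" />
    ```

    A trigger then refers to this calendar through its calendarName attribute.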
  • DailyCalendarConfig com.exalead.mercury.mami.master.v10.DailyCalendarConfig
    Used to exclude a time range from a day.
    Parent objects
    • SchedulingConfig
    Attributes
    • name string
    • startTime string
    • endTime string
    Name Type Default value Description
    name string - Name of the calendar.
    startTime string - Time range start.
    endTime string - Time range end.

    XML representation
    <DailyCalendarConfig name="" startTime="" endTime="" ></DailyCalendarConfig>
  • MonthlyCalendarConfig com.exalead.mercury.mami.master.v10.MonthlyCalendarConfig
    Used to exclude a set of days from a month.

    Contains the range of days to exclude (1-31)
    Parent objects
    • SchedulingConfig
    Attributes
    • name string
    Name Type Default value Description
    name string - Name of the calendar.

    Nested elements
    • IntValue*
    Type Wrapper Description
    IntValue* -
    XML representation
    <MonthlyCalendarConfig name="" ></MonthlyCalendarConfig>
  • WeeklyCalendarConfig com.exalead.mercury.mami.master.v10.WeeklyCalendarConfig
    Used to exclude a set of days in the week.

    Contains the range of days to exclude (1-7).
    Parent objects
    • SchedulingConfig
    Attributes
    • name string
    Name Type Default value Description
    name string - Name of the calendar.

    Nested elements
    • IntValue*
    Type Wrapper Description
    IntValue* -
    XML representation
    <WeeklyCalendarConfig name="" ></WeeklyCalendarConfig>
  • JobConfigGroup com.exalead.mercury.mami.master.v10.JobConfigGroup
    Configuration for a group of jobs.
    Parent objects
    • SchedulingConfig
    Attributes
    • name string
    Name Type Default value Description
    name string - Group name.

    Nested elements
    • JobConfig*
    Type Wrapper Description
    JobConfig* -
    XML representation
    <JobConfigGroup name="" ></JobConfigGroup>
  • DispatchJobConfig com.exalead.mercury.mami.master.v10.DispatchJobConfig
    Generic job used to dispatch a message to a service
    Parent objects
    • JobConfigGroup
    Attributes
    • name string
    Name Type Default value Description
    name string - Job name.

    Nested elements
    • DispatchMessage
    Type Wrapper Description
    DispatchMessage -
    XML representation
    <DispatchJobConfig name="" ></DispatchJobConfig>
  • TriggerConfigGroup com.exalead.mercury.mami.master.v10.TriggerConfigGroup
    Configuration for a group of triggers
    Parent objects
    • SchedulingConfig
    Attributes
    • name string
    Name Type Default value Description
    name string - Group name.

    Nested elements
    • TriggerConfig*
    Type Wrapper Description
    TriggerConfig* -
    XML representation
    <TriggerConfigGroup name="" ></TriggerConfigGroup>
  • SimpleTriggerConfig com.exalead.mercury.mami.master.v10.SimpleTriggerConfig
    Simple trigger.
    The following values for misfireInstruction are available:
    • fire_now: immediately fires the trigger.
    • reschedule_next_with_existing_count: reschedules to the next execution with repeatCount left unchanged.
    • reschedule_next_with_remaining_count: reschedules to the next execution with the repeat count set to what it would be if it had not missed any firings.
    • reschedule_now_with_existing_repeat_count: reschedules now with repeatCount left unchanged.
    • reschedule_now_with_remaining_repeat_count: reschedules now with the repeat count set to what it would be if it had not missed any firings.

    If misfireInstruction is null and the trigger could not be fired, the following scheme applies:
    • If repeatCount is 0, the instruction is interpreted as fire_now.
    • If repeatCount is < 0, the instruction is interpreted as reschedule_next_with_remaining_count.
    • If repeatCount is > 0, the instruction is interpreted as reschedule_now_with_existing_repeat_count.
    Parent objects
    • TriggerConfigGroup
    Attributes
    • name string
    • startTime long
    • endTime long
    • jobGroupName string
    • jobName string
    • calendarName string
    • misfireInstruction string
    • repeatCount int
    • repeatInterval long
    Name Type Default value Description
    name string - Name of the trigger.
    startTime long - Start date for this trigger.
    endTime long - End date for this trigger.
    jobGroupName string - Job group for this trigger.
    jobName string - Job name for this trigger.
    calendarName string - Optional calendar name for this trigger (to exclude blocks of time). Only applies to simple triggers.
    misfireInstruction string - Action to execute when the trigger couldn't be fired. Refer to each trigger's documentation for the available misfire instructions.
    repeatCount int - Repeat count (if <0, no limit).
    repeatInterval long - Repeat interval in milliseconds.

    XML representation
    <SimpleTriggerConfig name="" startTime="" endTime="" jobGroupName="" jobName="" calendarName="" misfireInstruction="" repeatCount="" repeatInterval="" ></SimpleTriggerConfig>
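    A minimal sketch of a simple trigger that repeats without limit might look like the following. The trigger, group, and job names are hypothetical. Leaving out startTime, endTime, and calendarName assumes these attributes are optional, which this reference does not confirm.

    ```xml
    <TriggerConfigGroup name="example-triggers">
      <!-- repeatInterval is in milliseconds: 3600000 ms = 1 hour -->
      <!-- repeatCount below 0 means no limit on repeats -->
      <SimpleTriggerConfig name="hourly-example"
                           jobGroupName="example-jobs"
                           jobName="example-job"
                           misfireInstruction="reschedule_next_with_remaining_count"
                           repeatCount="-1"
                           repeatInterval="3600000" />
    </TriggerConfigGroup>
    ```

    Setting misfireInstruction explicitly here matches what the null fallback would select for a negative repeatCount, so the attribute mainly documents the intent.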
  • CronTriggerConfig com.exalead.mercury.mami.master.v10.CronTriggerConfig
    Cron-based trigger.
    The following values for misfireInstruction are available:
    • do_nothing: nothing is done when a trigger has been misfired.
    • fire_once_now: the trigger is immediately executed when it has been misfired.

    If misfireInstruction is null and the trigger could not be fired, the following scheme applies:
    • The instruction is interpreted as fire_once_now.
    Parent objects
    • TriggerConfigGroup
    Attributes
    • name string
    • startTime long
    • endTime long
    • jobGroupName string
    • jobName string
    • calendarName string
    • misfireInstruction string
    • cronExpression string
    Name Type Default value Description
    name string - Name of the trigger.
    startTime long - Start date for this trigger.
    endTime long - End date for this trigger.
    jobGroupName string - Job group for this trigger.
    jobName string - Job name for this trigger.
    calendarName string - Optional calendar name for this trigger (to exclude blocks of time). Only applies to simple triggers.
    misfireInstruction string - Action to execute when the trigger couldn't be fired. Refer to each trigger's documentation for the available misfire instructions.
    cronExpression string - Cron expression.

    XML representation
    <CronTriggerConfig name="" startTime="" endTime="" jobGroupName="" jobName="" calendarName="" misfireInstruction="" cronExpression="" ></CronTriggerConfig>
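    A minimal sketch of a cron-based trigger follows. This reference does not state which cron dialect cronExpression uses. The expression below assumes a Quartz-style format with a leading seconds field, and should be checked against the product documentation. All names are hypothetical placeholders.

    ```xml
    <TriggerConfigGroup name="example-triggers">
      <!-- Assumed Quartz-style fields: seconds minutes hours day-of-month month day-of-week -->
      <!-- Under that assumption, this fires every day at 02:00:00 -->
      <CronTriggerConfig name="daily-example"
                         jobGroupName="example-jobs"
                         jobName="example-job"
                         misfireInstruction="do_nothing"
                         cronExpression="0 0 2 * * ?" />
    </TriggerConfigGroup>
    ```

    Choosing do_nothing overrides the null fallback (fire_once_now), so a missed run is skipped rather than executed late.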
  • ValidatorConfig com.exalead.mercury.mami.master.v10.ValidatorConfig
    No documentation for this element.
    Nested elements
    • ValidatorGroup*
    Type Wrapper Description
    ValidatorGroup* -
    XML representation
    <ValidatorConfig ></ValidatorConfig>
  • ValidatorGroup com.exalead.mercury.mami.master.v10.ValidatorGroup
    List of Validators that have the same runtime or target the same configuration.

    Allows running specific validators at the same time.
    Parent objects
    • ValidatorConfig
    Attributes
    • name string
    • autorun boolean
    Name Type Default value Description
    name string - Name of this group of validators.
    autorun boolean True Default way to run all the group's validators.

    Defaults to true (meaning the group is run at apply configuration time).

    Nested elements
    • Validator*
    Type Wrapper Description
    Validator* - List of validators belonging to this group.
    XML representation
    <ValidatorGroup name="" autorun="true" ></ValidatorGroup>
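    As a rough sketch, a validator configuration with one group might be written as follows. The group name, validator names, and classId values are hypothetical, since this reference does not list the available validator classes.

    ```xml
    <ValidatorConfig>
      <ValidatorGroup name="example-group" autorun="true">
        <!-- Run at apply configuration time -->
        <Validator classId="com.example.FirstValidator" name="first-check" autorun="true" />
        <!-- Run only manually -->
        <Validator classId="com.example.SecondValidator" name="second-check" autorun="false" />
      </ValidatorGroup>
    </ValidatorConfig>
    ```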
  • Validator com.exalead.mercury.mami.master.v10.Validator
    No documentation for this element.
    Parent objects
    • ValidatorGroup
    Attributes
    • classId string
    • name string
    • autorun boolean
    Name Type Default value Description
    classId string -
    name string - Unique identifier of the validator.
    autorun boolean True True if the validator is to be run at apply configuration time. Otherwise, it is only run manually.

    XML representation
    <Validator classId="" name="" autorun="true" ></Validator>
  • CustomValidator com.exalead.mercury.mami.master.v10.CustomValidator
    No documentation for this element.
    Parent objects
    • ValidatorGroup
    Attributes
    • classId string
    • name string
    • autorun boolean
    Name Type Default value Description
    classId string -
    name string - Unique identifier of the validator.
    autorun boolean True True if the validator is to be run at apply configuration time. Otherwise, it is only run manually.

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* -
    XML representation
    <CustomValidator classId="" name="" autorun="true" ></CustomValidator>
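    A custom validator differs from a plain Validator in that it accepts nested KeyValue elements. A minimal sketch, assuming a hypothetical validator class and a hypothetical parameter name:

    ```xml
    <ValidatorGroup name="example-group" autorun="true">
      <CustomValidator classId="com.example.MyCustomValidator" name="custom-check" autorun="false">
        <!-- Hypothetical parameter passed to the custom validator -->
        <KeyValue key="threshold" value="10" />
      </CustomValidator>
    </ValidatorGroup>
    ```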
  • AdminUIConfig com.exalead.mercury.mami.adminui.v10.AdminUIConfig
    No documentation for this element.
    Attributes
    • version long
    • identityProvider string
    • useHttps boolean
    • serverCertificate string
    Name Type Default value Description
    version long - -
    identityProvider string -
    useHttps boolean False
    serverCertificate string -

    Nested elements
    • Role*
    Type Wrapper Description
    Role* -
    XML representation
    <AdminUIConfig version="" identityProvider="" useHttps="false" serverCertificate="" ></AdminUIConfig>
  • Role com.exalead.mercury.mami.adminui.v10.Role
    No documentation for this element.
    Parent objects
    • AdminUIConfig
    Attributes
    • name string
    Name Type Default value Description
    name string -

    Nested elements
    • Permission*
    Type Wrapper Description
    Permission* -
    XML representation
    <Role name="" ></Role>
  • Permission com.exalead.mercury.mami.adminui.v10.Permission
    No documentation for this element.
    Parent objects
    • Role
    Attributes
    • id string
    • permission string
    Name Type Default value Description
    id string -
    permission string -

    XML representation
    <Permission id="" permission="" ></Permission>
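    The nesting of AdminUIConfig, Role, and Permission can be sketched as below. Because none of these attributes is documented here, the role name and the id and permission values are purely hypothetical placeholders. The version and serverCertificate attributes are left out on the assumption that they are optional.

    ```xml
    <AdminUIConfig identityProvider="example-provider" useHttps="false">
      <Role name="example-role">
        <!-- Hypothetical id and permission values -->
        <Permission id="example-resource" permission="example-level" />
      </Role>
    </AdminUIConfig>
    ```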
  • KeyValue exa.bee.KeyValue
    No documentation for this element.
    Parent objects
    • AlphanumProperty
    • CSVMultiField
    • ConvertInternalSettings
    • ConvertJavaPluginSetting
    • ConvertTextExtractor
    • CustomConfig
    • CustomDocumentProcessor
    • CustomHitOperation
    • CustomMetaOperation
    • CustomPostFilter
    • CustomPostProcessingPipeline
    • CustomPrefixHandler
    • CustomProcess
    • CustomProcessor
    • CustomPublisher
    • CustomQueryExpansionModule
    • CustomSemanticProcessor
    • CustomTokenizer
    • CustomTransform
    • CustomValidator
    • DataProperty
    • EnumProperty
    • Feed
    • GeoProperty
    • IdentityProviderConfig
    • KeyValue
    • NumericalProperty
    • ObjectProperty
    • Post
    • Property
    • ReplaceValues
    • Resource
    • ScratchHook
    • SemanticResource
    • SemanticType
    • SetDefaultValue
    • PushAPIFilter Crawler
    • PushAPIFilter FeedFetcher
    • PushAPIFilter ICrawler
    • add InsertHitInfoFromFacets
    • config Connector
    • config CustomJavaAggregationProcessor
    • config CustomJavaTransformationProcessor
    • config CustomPublisher
    • config CustomPublisher
    • config SecuritySource
    • customRunnerFactoryParams BaseSearchCommand
    • customRunnerFactoryParams QueryExpansionCommand
    • customRunnerFactoryParams SearchCommand
    • customRunnerFactoryParams SpellCheckCommand
    • envAppend ProcessInternalConfig
    • envOverride ProcessInternalConfig
    • envPrepend ProcessInternalConfig
    • forcedMeta Connector
    • globalBeegistryKeys CloudviewDeploymentInternalConfig
    • initParams JavaWebApp
    • messageContent DispatchMessage
    • parameters AddParameters
    • parameters Fetcher
    • params ComplexExaProcessConfig
    • params ComplexJavaProcessConfig
    • params ProcessConfigList
    • params ProcessInternalConfig
    • replace InsertHitInfoFromFacets
    • scanModeConfig ConnectorScheduledScan
    • tags AlphanumProperty
    • tags DataProperty
    • tags EnumProperty
    • tags GeoProperty
    • tags NumericalProperty
    • tags ObjectProperty
    • tags Property
    Attributes
    • key string
    • value string
    • type string
    • description string
    Name Type Default value Description
    key string - The name of the key
    value string - -
    type string - -
    description string - -

    Nested elements
    • KeyValue*
    Type Wrapper Description
    KeyValue* - -
    XML representation
    <KeyValue key="" value="" type="" description="" ></KeyValue>
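    Since KeyValue lists itself among its parent objects, entries can be nested. A minimal sketch with hypothetical keys and values, omitting the undocumented type and description attributes:

    ```xml
    <KeyValue key="connection">
      <!-- Child entries nested under the parent key -->
      <KeyValue key="host" value="localhost" />
      <KeyValue key="port" value="8080" />
    </KeyValue>
    ```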
  • StringConstantValue exa.bee.StringConstantValue
    No documentation for this element.
    Parent objects
    • mimeTypes Crawler
    • mimeTypes FeedFetcher
    • mimeTypes ICrawler
    • sessionIdBlacklist Crawler
    • sessionIdBlacklist FeedFetcher
    • sessionIdBlacklist ICrawler
    Attributes
    • value string
    Name Type Default value Description
    value string - -

    XML representation
    <StringConstantValue value="" ></StringConstantValue>
  • StringValue exa.bee.StringValue
    No documentation for this element.
    Parent objects
    • ConvertInternalSettings
    • annotationsToCopy HTMLRelevantContentExtractor
    • args ProcessInternalConfig
    • classes HTMLCSSExtractor
    • classes HTMLCSSSelector
    • designedForLogics SuggestDispatcher
    • ids HTMLCSSExtractor
    • ids HTMLCSSSelector
    • idsAndClassesToIgnore HTMLRelevantContentExtractor
    • idsAndClassesToKeep HTMLRelevantContentExtractor
    • inputContexts ConcatValues
    • inputContexts ContentCleanup
    • inputContexts CoordinatesFormatter
    • inputContexts DebugProcessor
    • inputContexts LanguageDetector
    • inputContexts LanguageSetter
    • inputContexts MultiContextCSVEncoder
    • inputContexts MultiContextDocumentProcessor
    • inputContexts NumericalFormatter
    • inputContexts RemoteMOTAPIDocumentProcessor
    • inputContexts RemoveContexts
    • inputContexts StringHash
    • inputContexts StringHash32
    • inputContexts StringHash64
    • inputContexts StringTransform
    • inputContexts UTF8Checker
    • inputContexts ValueSelector
    • isAliveQueries SearchCommand
    • metaToExpose SimpleMailPublisher
    • mimes MimeCondition
    • monitoredPageCaches PerformanceMonitoringConfig
    • partSpecificContexts StandardPartsMerger
    • probeTypes PerformanceMonitoringTimer
    • targetInstances RemoteMOTAPIDocumentProcessor
    • trustedHost ProductSecurity
    • values SimilarStringToPart
    • values UniformRandomContextGenerator
    • values ZipfRandomContextGenerator
    Attributes
    • value string
    Name Type Default value Description
    value string - -

    XML representation
    <StringValue value="" ></StringValue>
  • IntValue exa.bee.IntValue
    No documentation for this element.
    Parent objects
    • MonthlyCalendarConfig
    • WeeklyCalendarConfig
    Attributes
    • value int
    Name Type Default value Description
    value int - -

    XML representation
    <IntValue value="" ></IntValue>
  • DispatchMessage exa.bee.DispatchMessage
    No documentation for this element.
    Parent objects
    • DispatchJobConfig
    Attributes
    • serviceName string
    • messageName string
    • outputFormat string
    Name Type Default value Description
    serviceName string - -
    messageName string - -
    outputFormat string - -

    Nested elements
    • messageContent KeyValue*
    Type Wrapper Description
    KeyValue* messageContent -
    XML representation
    <DispatchMessage serviceName="" messageName="" outputFormat="" ></DispatchMessage>