AndCondition |
|
AutoTileGeoCategorizer |
|
AutoTileGeoCategorizerNative |
|
AutoTileGeoV2Categorizer |
|
AutoTileGeoV2CategorizerNative |
|
BinaryContentCondition |
A condition that matches if the FIRST document part binary data matches the condition
|
BuildGroupCondition |
A condition that matches if the build group of the current job matches.
|
CGRDocumentProcessor |
This document processor retrieves the CGR part of the document if any and sends it to the convert process
to generate the corresponding octree.
|
ConcatValues |
|
ConcatValuesNative |
|
ContentCleanup |
|
ContentCleanupNative |
|
CoordinatesFormatter |
|
CoordinatesFormatterNative |
|
CopyContext |
|
CopyContextNative |
|
CSSExtractor |
|
CSSExtractorNative |
|
CSSSelector |
|
CSSSelectorNative |
|
CustomDirectiveCondition |
A condition that matches if the document has the specified directive name,
with an optional specific value.
|
CustomDocumentProcessor |
Deprecated |
DataModelClassCondition |
A condition that matches if the document it has the specified data model
class
|
DataModelClassResolver |
|
DateFormatter |
|
DateFormatterNative |
|
DebugCrashProcessor |
Debug crash processor, aimed to simulate exceptions, crashes and other native
errors.
|
DebugCrashProcessorNative |
Debug crash processor, aimed to simulate exceptions, crashes and other native
errors.
|
DebugProcessor |
|
DebugProcessorNative |
|
DetailedDocumentProcessor |
|
DiscardDocumentProcessor |
|
DocumentProcessorFactory |
|
DocumentProcessorFactoryCtx |
|
DocumentProcessorGroup |
|
DoubleToLong |
|
DoubleToLongNative |
|
FilenameMatchCondition |
A condition that matches if the FIRST document part filename matches the regexp
|
FixedRangeNumericalPartitioning |
|
FixedRangeNumericalPartitioningNative |
|
ForcedRangeNumericalPartitioning |
|
ForcedRangeNumericalPartitioningNative |
|
FormatCheckerDate |
|
FormatCheckerDateNative |
|
GeoBBoxProcessor |
|
GeoBBoxProcessorNative |
|
GeoCategorizer |
|
GeoCategorizerNative |
|
InferFileExtension |
|
InsertCurrentDate |
|
LanguageDetector |
|
LanguageDetectorNative |
|
LanguageSetter |
|
LanguageSetterNative |
|
MathDocumentProcessor |
|
MetaCondition |
A condition that matches if the value (or existence) of a context matches
|
MetaFinder |
|
MimeCondition |
A condition that matches if the FIRST document part mime is in the list
|
MIMEDetector |
|
MimeTypeSetter |
|
MultiContextCSVEncoder |
|
MultiContextCSVEncoderNative |
|
NativeDocumentProcessor<T extends NativeDocumentProcessorNative> |
|
NativeDocumentProcessorNative |
|
NativeExtractor |
|
NativeExtractor.Extractors |
|
NativeExtractorNative |
|
NativeMultiContextDocumentProcessor<T extends NativeMultiContextDocumentProcessorNative> |
|
NativeMultiContextDocumentProcessorNative |
|
NativeSingleContextDocumentProcessor<T extends NativeSingleContextDocumentProcessorNative> |
|
NativeSingleContextDocumentProcessorNative |
|
NewChunk |
|
NewChunkNative |
|
NGConvertExtractor |
Convert extractor.
|
NGConvertExtractor.BenchRunner |
|
NotCondition |
|
NumericalFormatter |
|
NumericalFormatterNative |
|
OrCondition |
|
PartExtractor |
|
PartExtractor.ExtractCtxBase |
|
PLMExpandDocumentProcessor |
|
PrecomputedThumbnailsDocumentProcessor |
Generates thumbnails and store them on disk using a DocumentCache .
|
PrintfValues |
|
PrintfValues.PrintfValuesParsed |
|
PrintfValuesNative |
|
ReadabilityProcessor |
|
ReadabilityProcessor.Builder |
The following example shows how to use this Builder to construct a ReadabilityProcessor instance:
ReadabilityProcessor = new ReadabilityProcessor.Builder()
.setNewContextName("readability:relevant")
.setIrrelevantChunkAnnotation("readability:irrelevant")
.setMinScore(10)
.setAnnotateOnly(true)
.create();
NOTE: Construction with the default configuration.
|
ReadabilityProcessorNative |
|
RemoveContexts |
|
RemoveContextsNative |
|
RenameContext |
|
RenameContextNative |
|
RenameUnmappedContexts |
|
RenameUnmappedContextsNative |
|
ReplaceContextNames |
|
ReplaceContextNamesNative |
|
ReplaceRegexp |
|
ReplaceRegexpNative |
|
ReplaceValues |
|
ReplaceValuesNative |
|
RTADocumentProcessor |
|
SemanticPipeDocumentProcessor |
|
SemanticPipeDocumentProcessorNative |
|
SetDefaultValue |
|
SimilarStringToPart |
|
SourceCondition |
A condition that matches if the source of the document matches
|
SplitValues |
|
SplitValuesNative |
|
StandardPartsMerger |
|
StandardPartsMergerNative |
|
StringHash |
|
StringHashNative |
|
StringTransform |
|
StringTransformNative |
|
UniformRandomContextGenerator |
|
UniformRandomContextGeneratorNative |
|
UnitsOfMeasurementNormalizer |
This processor normalizes measurement metas as 3cm
His role is to detect cm as centi-meters unit of measurement via a MOT pipeline, convert the value
in the index unit and add it in metas.
|
URLCodec |
|
URLCodecNative |
|
URLMatchCondition |
A condition that matches if the document URI matches the regexp
|
URLProcessor |
|
URLProcessorNative |
|
URLTransformer |
|
URLTransformerNative |
|
UTF8Checker |
|
UTF8CheckerNative |
Simply check that text passing through is valid UTF-8.
|
ValueSelector |
|
ValueSelectorNative |
|
ValuesToLong |
|
ValuesToLongNative |
|
VirtualRange |
|
WebTableProcessor |
|
WebTableProcessorNative |
|
WildcardIndexing |
|
WildcardIndexingNative |
|
XpathExtractor |
|
XpathFragmentExtractor |
|
ZipfRandomContextGenerator |
|
ZipfRandomContextGeneratorNative |
|