• Parses a context string as a regular URL (RFC 2396, "Uniform Resource Identifier") and transforms it according to the given URL pattern. A new DocumentChunk is created with the substitution. Pattern used to transform the URL (in the form <scheme>://<authority><path>?<query>#<fragment>):
◦ Characters other than '$' or '\' are kept as-is
◦ The '$' character and the '\' character must be escaped with a leading \
◦ The ${expression} form allows to compute a string expression based on URL components (see "Expression" below)
Expression used inside the enclosing ${}:
◦ url: Original URL
◦ scheme: Scheme name ("http", "https", "file", ...)
◦ authority: Authority (host:port or host) (may be empty)
◦ host: Hostname part of the authority (may be empty)
◦ port: Port number part of the authority (may be empty)
◦ userInfo: username:password field of the authority (may be empty)
◦ file: File starting with / and query string, if any
◦ pathurl: Normalized absolute path starting with /
◦ path: Normalized absolute path (may start with C:\ on Windows)
◦ query: Normalized query part starting with ? (may be empty)
◦ args: Query part without the leading ? (may be empty)
◦ fragment: Fragment part starting with #(may be empty)
◦ reference: Reference part ; i.e., fragment without the leading # (may be empty)
◦ arg:name: Query part argument identified by its name, unescaped (you must re-escape it using "urlencode:" when necessary)
◦ str:string: The final argument is not a variable name, but a string (only useful for clarity purpose)
◦ tolower:<i>expression</i>: Transform into lowercase (ONLY A-Z)
◦ toupper:<i>expression</i>: Transform into uppercase (ONLY a-z)
◦ urlencode:<i>expression</i> :URL encoding (%NN or +)
If dataModelState is "customized", you will find here the original document processor generated by the data model. Use this to easily revert to "auto" state from "customized". @IgnoreForValueConstructor
AcceptCondition
com.exalead.indexing.analysis.v10.AcceptCondition
Expresses the enablement condition of this DocumentProcessor.