Default Connectors
What's New?
About Connectors
Introduction
The Push API
Default and Optional Connectors
Creating a Standard Connector
Deploying Standard Connectors
Deploy a Connector Server
Define the Target Push API Server
Configure HTTPS
Deploying a Custom Connector
Controlling Connectors
Control Document Scans
Schedule Update Frequency
Setting up the Document Type Pushed by a Connector
Storing Documents in a Specific Data Model Class
Using Document Cache
Understand Typical Use Cases
Enable Document Cache
Change the Location of the Document Cache on the File System
Repush from Document Cache
Clear the Document Cache Entirely
Clear the Document Cache for a Specific Connector Only
Using Push API Filters
Defining an External Push API Server
Create a Custom Push API Server
Create a Source Linked to the External Instance
Scan Your Source and Check the External Instance
Using the Interconnector Service
Configure the Interconnector Server
Configure the File System Connector
Configure the JDBC Connector
Add an Interconnector Aggregation Processor
Scan Connectors
CSV Connector
Introducing the CSV Connector
Aggregate Column Values
Filter Columns
Customize Column Names
Configuring the CSV Connector
Configure the Connector
Check the CSV Config
Crawler Connector
Introducing the Crawler Connector
About the Crawler
Crawler Connector Architecture
Crawl Rules
Refresh
About Site Collapsing
URL Processing
URL Handling of Crawled Sites
About the Fetcher
About the HTTP Fetcher
Fetcher Authentication Process
Debug Authentication Problems
Configuring the Crawler
Configuring the Crawler
Define the Groups of URLs to Crawl
Define Crawl Rules
Specify the File Name Extension and MIME Types to Crawl
Sample Configurations for Different Use Cases
Configuring the Fetcher
Configure the Fetcher
Configure the Fetcher Headers
Crawl Secured Sites
Deploying the Crawler Connector
Specify the Crawler Server
Specify the Push API Server
Managing the Crawler Connector
Troubleshooting the Crawler Connector
Use the Crawler http.log
Dump the Crawler Repository
Performance Monitoring
Hardware Sizing
Determine the Hardware Requirements
Determine the Crawl Speed
Advanced Configuration
Crawl Rules
Crawl Rule Actions
How Priorities Work
Error Handling
Custom Configuration
JDBC Database Connector
Introducing the JDBC Database Connector
Prerequisites
Workflow
Installing JDBC Drivers
Installing Custom Code
Configuring the JDBC Database Connector
Connect to the JDBC Database
Specify the Query Parameters
Define the Fields to Crawl
Examples of JDBC Database Connector Configurations
Feed Fetcher Connector
Introducing the Feed Fetcher Connector
Configure the Connector
Files Connector
File Server Access and Security
Behavior
Limitations
Maximum Path Length
Local File Server Access
Remote File Server Access
FTP Server Access
HDFS Server Access
Security
Mount Drive at Service Startup
About the Files Connector Configuration
Filesystem Paths
MS Windows Specificity for HDFS File System
Exclude/Include Rules
Allowed Extensions
Search Nonindexed File Names
Configure the Files Connector
Basic Configuration
Advanced Configuration - Crawl HDFS Example
Maximize Performances
Maximize Crawl Throughput
Add a Metadata Compaction Preprocessor
Extending the Files Connector Through Plugins
Advanced Configuration Parameters
IMAP Connector
About IMAP Configuration
Subject Filters
Indexing Email Threads
Gmail
Subject Normalization
Configure the IMAP Connector
Configure the IMAP Server Global Configuration
Configure a User Account
Configure Folders
Check Connectivity
Folder Configuration Examples
Troubleshooting
Out of Memory Errors
Mails Do Not Group by Thread Subjects
LDAP Connector
About LDAP Configuration
Before You Start
LDAP Classes and Attributes
Add LDAP References
Classes to Index
Configure the LDAP Connector
Configure the Connection to the LDAP Server
Specify the LDAP Classes and Attributes
Parameter Descriptions
Global Configuration Parameters
Class Config Parameters
Attribute Parameters
Logs Connector
Logging Framework
log4j
Apache
Auto
Custom
Configure the Logs Connector
Configure the Connector
Check the Logs Connector Config
Use Case
Step 1: Analyze Log Levels
Step 2: Group Stack Traces to Refine Analysis
Managed Push API Connector
Introducing the Managed Push API Connector
Configuring the Managed Push API Connector
RScan Client Connector
Introducing the RScan Client Connector
Before You Start
RScan Global Architecture
Configuring the RScan Client Connector
Replay Connector
Introducing the Replay Connector
Configuring the Replay Connector
Deploy the Replay Server Role
Capture a Data Flow
XML Connectors
Introducing the XML Connector
Configuring the XML Simple Connector
Extract by XPath Method
Extracting by XML Elements Method
Extracting Using XSLT Method
Using the Split XML Documents Method
Property Descriptions
Configuring the XML Advanced Connector
About the Processor Pipeline
Configuring the PAPI Document Processor
Configuring the XML Element Processor
Configuring the XPath Processor
Configuring the XSLT Processor
Configuring the Tee Processor
Configuring the XML Attach Processor
Configuring the Child Split Processor
Configuring the XPath Split Processor
Configuring the Custom Processor
Developing a Custom XML Processor
Develop Regular and Split Processors
Compile and Deploy a Custom Processor
Installing Custom Code
Requirements
Install Custom Code
Default Connectors
Installing Custom Code
AC_ABOUT_CONNECTORS_ID
AC_DOCUMENT_TYPE_ID
AC_DOCUMENT_CACHE_ID
AC_PAPI_SERVERS_ID
AC_CONNECTOR_CSV_ID
AC_CONNECTOR_FEEDFETCHER_ID
AC_CONNECTOR_LOGS_ID
AC_CONNECTOR_MANAGEDPAPI_ID
AC_CONNECTOR_RSCAN_ID
AC_CONNECTOR_XML_ID
This site works best with JavaScript enabled