Database Collection
SearchBlox lets you index database table records easily, with results customizable using Faceted Search.
Creating a Database Collection
You can Create a Database Collection with the following steps:
- Log in to the Admin Console, go to the Collections tab, and click Create a New Collection or the "+" icon.
- Select Database Collection as the Collection Type.
- Enter a unique name (e.g., DatabaseCollection).
- Set Collection Access (Private or Public) and Encryption as needed.
- Choose the content language if it’s not English.
- Click Save to create the collection.

- After creating the Database Collection, you will be taken to the Database Settings tab.
Supported Databases
SearchBlox supports indexing from these databases:
- MySQL
- SQL Server
- Oracle
- PostgreSQL
- Other applications supported via Database Collection
| Supported Connectors | Supported Connectors | Supported Connectors | Supported Connectors | Supported Connectors | Supported Connectors | Supported Connectors | Supported Connectors |
|---|---|---|---|---|---|---|---|
| AAS | Azure DevOps | HCL Domino | Odoo | Sage 300 | Snapchat Ads | Zendesk | |
| Access | Azure Synapse | EnterpriseDB | HDFS | Office 365 | Sage 50 UK | Snowflake | Zoho Books |
| Act CRM | Azure Table Storage | Epicor ERP | Highrise | OneDrive | Sage Business Cloud Accounting | Spark SQL | Zoho Creator |
| Active Directory | BaseCamp | Exact Online | HubSpot | OneNote | Sage Intacct | Splunk | Zoho CRM |
| ActiveCampaign | BigCommerce | Excel | IBM Cloud Object Storage | Oracle Cloud Financials(ERP) | Salesforce | SSAS | Zoho Inventory |
| Act-On | Bing | Excel Online | IBM Cloud SQL Query | Oracle Eloqua | Salesforce Pardot | Streak | Zoho Projects |
| Acumatica | Bing Ads | Excel Services | IBM DB2 | Oracle HCM Cloud | Salesloft | Stripe | Zuora |
| ADLS | Box | Exchange | IBM Informix | Oracle oci | SAP BusinessObject DI | SugarCRM | |
| Adobe Analytics | Bugzilla | Oracle Sales Cloud | SAP Business One | SuiteCRM | |||
| Adobe Commerce | Bullhorn CRM | Facebook Ads | JDBC-ODBC Bridge | Oracle SCM | SAP ByDesign | SurveyMonkey | |
| ADP | Cassandra | FreshBooks | JIRA | Oracle Service Cloud | SAP Concur | Sybase | |
| Airtable | Certinia | FreshDesk | JIRA Service Desk | Outreach.io | SAP ERP | Sybase IQ | |
| AlloyDB | CDS | FTP | JSON | Parquet | SAP Fieldglass | Tableau CRM | |
| Amazon Athena | Cloudant | Gmail | Kintone | Paylocity | SAP Gateway | Tally | |
| Amazon DynamoDB | CockroachDB | Google Ads | LDAP | PayPal | SAP HANA | TaxJar | |
| Amazon Marketplace | Confluence | Google Ads Manager | SAP HANA XSA | Teradata | |||
| Amazon S3 | CosmosDB | Google Analytics | LinkedIn Ads | Pipedrive | SAP Hybris C4C | Trello | |
| Apache CouchDB | Couchbase | Google BigQuery | MailChimp | Power BI XMLA | SAP SuccessFactors | TSheets | |
| Apache HBase | CSV | Google Calender | MariaDB | Presto | SAS DataSets | Twilio | |
| Apache Hive | DocuSign | Google Cloud Storage | Marketo | QuickBase | SAS Xpt | Twitter Ads | |
| Apache Impala | DropBox | Google CM | MarkLogic | QuickBooks | SendGrid | Veeva Vault | |
| Apache Phoenix | Dynamics 365 Business Central | Google Contacts | Microsoft Planner | QuickBooks Online | ServiceNow | Wave Financial | |
| API | Databricks | Google Data Catalog | Microsoft Project | QuickBooks POS | SF Marketing Cloud | WooCommerce | |
| Asana | Dynamics 365 | Google Directory | Microsoft Teams | Raiser Edge NXT | SFTP | WordPress | |
| Authorize.Net | Dynamics CRM | Google Drive | Monday.com | Reckon | Sharepoint | Workday | |
| Autify | Dynamics GP | Google Search | MongoDB | Redis | ShipStation | xBase | |
| Avalara | Dynamics NAV | Google Spanner | MYOB | Redshift | Shopify | Xero | |
| Avro | eBay | Google Sheets | Neo4j | REST | SingleStore | Xero Workflow Max | |
| Azure Active Directory | eBay Analytics | GraphQL | NetSuite | RSS | Slack | XML | |
| Azure Data Catalog | Elasticsearch | GreenPlum | OData | Sage 200 | Smartsheet | YouTube Analytics |
Database Settings
-
Database settings must be configured for Database Collections.
-
Mandatory settings for databases like MySQL, Oracle, etc., include:
- Database Type
- Host IP/Name
- Host Port Number
- Database Name
- Database Username
- Database Password
- SQL Query
-
-
For External DB Connectors, the mandatory settings for a Database Collection are:
-
Database Type
-
Database URL String
-
SQL Query
-
SearchBlox provides default settings when a new collection is created, but you can also choose your preferred values.
-
The following table lists all the available settings for Database Collections.
| Field | Description |
|---|---|
| Database Type | The database to be indexed. SearchBlox supports MySQL, SQL Server, Oracle, PostgreSQL, and other drivers. The list of other drivers supported by SearchBlox is provided here: Supported DB Connectors |
| Database URL String | Specify the database URL string for the selected Supported DB Connectors |
| Host IP/name | Enter the IP address or name of the database host. (Default is localhost). |
| Host Port No. | Enter the port number of the database. |
| Database Name | Name of the database containing the tables to be indexed. |
| Database Username | Username for accessing the database. |
| Database Password | Password for the database user. |
| SQL Query | SQL query used to fetch the table data for indexing. Example: SELECT * FROM tablename. |
| Relevance - Remove Duplicates | Prevents indexing duplicate documents with the same content. Default is NO. |
| Relevance - Stemming | Treats inflected words as their root form (e.g., "running", "runs", "ran" → "run"). Default is YES. |
| Relevance - Spelling Suggestions | Provides spelling suggestions for the collection. Default is YES. |
| Keyword-in-Context Display | Shows search results with snippets from content where the search term appears. |
| SQL Settings - Fetch Size | Sets the maximum number of records fetched from the database per request. Default is 100. |
| Enable Detailed Log Settings | When debug mode is on, logs detailed indexing activity in index.log, including URL status, timestamps, status codes, and time taken. Default is NO. |
| Enable Content API | Allows the crawler to index document content that contains special characters. |
NOTE:
- While providing a
SQL QueryinDatabase Settingspage, make sure to provide the primary key asuidelse while reindexing/scheduled indexing duplicate documents gets added.- If you notice document missing while scheduled index/reindex, please increase the
SQL Settings-Fetch Size, which you will find underSettingspage as shown in the following screenshots.
After saving the Database settings, test the connection. The connection must be successful for indexing to work.

Schedule and Index
Set when and how often a collection should be indexed. SearchBlox supports these schedule options:
- Once
- Hourly
- Daily
- Every 48 Hours
- Every 96 Hours
- Weekly
- Monthly
The following operations can be performed in Database collections.
| Activity | Description |
|---|---|
| Enable Scheduler for Indexing | Turn this on to set the start date and how often indexing should run. |
| Save | Saves your scheduling settings for the collection. |
| View all Collection Schedules | Opens the Schedules page where you can see all scheduled collections. |

Data Fields Tab
The Data Fields tab lets you create custom fields for search and view the default fields in non-encrypted collections.
SearchBlox supports 4 types of Data Fields:
- Keyword
- Number
- Date
- Text
- After configuring Data Fields, you must clear and re-index the collection for changes to take effect.
To know more about Data Fields please refer to Data Fields Tab
Models
Embedding
- Provider specifies the embedding provider used to generate vector representations of documents.
- Model defines the embedding model used to convert document content into vectors for semantic search.
Reranker
- Provider specifies the reranker provider used for improving search result relevance.
- Model defines the reranker model used to re-score and reorder search results based on relevance.
LLM
-
Provider specifies the Large Language Model provider used for AI-powered features.
-
Model defines the LLM used for tasks such as document enrichment, summaries, and SmartFAQs.
-
These settings override global configurations and apply only to the current collection.
Best Practices
- Verify the type of application or database to be indexed and provide the appropriate database settings.
- If your SQL SELECT query doesn’t have a unique ID, create a view with a unique ID and use it in the query to fetch data.
- For multiple collections, schedule them so that only 2–3 collections index at the same time.
Viewing Search Results
- View the search results by searching for the records here: https://localhost:8443/search/index.html.
- After clicking the search results, the data will appear in a grid format.

If you want search results in the grid view, use SearchBlox's Database plugin. Learn more about Database Search
Updated 15 days ago


