Database Collection

SearchBlox lets you index database table records easily, with results customizable using Faceted Search.

Creating a Database Collection

You can Create a Database Collection with the following steps:

  • Log in to the Admin Console, go to the Collections tab, and click Create a New Collection or the "+" icon.
  • Select Database Collection as the Collection Type.
  • Enter a unique name (e.g., DatabaseCollection).
  • Set Collection Access (Private or Public) and Encryption as needed.
  • Choose the content language if it’s not English.
  • Click Save to create the collection.

  • After creating the Database Collection, you will be taken to the Database Settings tab.

Supported Databases

SearchBlox supports indexing from these databases:

  1. MySQL
  2. SQL Server
  3. Oracle
  4. PostgreSQL
  5. Other applications supported via Database Collection
Supported ConnectorsSupported ConnectorsSupported ConnectorsSupported ConnectorsSupported ConnectorsSupported ConnectorsSupported ConnectorsSupported Connectors
AASAzure DevOpsEmailHCL DominoOdooSage 300Snapchat AdsZendesk
AccessAzure SynapseEnterpriseDBHDFSOffice 365Sage 50 UKSnowflakeZoho Books
Act CRMAzure Table StorageEpicor ERPHighriseOneDriveSage Business Cloud AccountingSpark SQLZoho Creator
Active DirectoryBaseCampExact OnlineHubSpotOneNoteSage IntacctSplunkZoho CRM
ActiveCampaignBigCommerceExcelIBM Cloud Object StorageOracle Cloud Financials(ERP)SalesforceSSASZoho Inventory
Act-OnBingExcel OnlineIBM Cloud SQL QueryOracle EloquaSalesforce PardotStreakZoho Projects
AcumaticaBing AdsExcel ServicesIBM DB2Oracle HCM CloudSalesloftStripeZuora
ADLSBoxExchangeIBM InformixOracle ociSAP BusinessObject DISugarCRM
Adobe AnalyticsBugzillaFacebookInstagramOracle Sales CloudSAP Business OneSuiteCRM
Adobe CommerceBullhorn CRMFacebook AdsJDBC-ODBC BridgeOracle SCMSAP ByDesignSurveyMonkey
ADPCassandraFreshBooksJIRAOracle Service CloudSAP ConcurSybase
AirtableCertiniaFreshDeskJIRA Service DeskOutreach.ioSAP ERPSybase IQ
AlloyDBCDSFTPJSONParquetSAP FieldglassTableau CRM
Amazon AthenaCloudantGmailKintonePaylocitySAP GatewayTally
Amazon DynamoDBCockroachDBGoogle AdsLDAPPayPalSAP HANATaxJar
Amazon MarketplaceConfluenceGoogle Ads ManagerLinkedInPinterestSAP HANA XSATeradata
Amazon S3CosmosDBGoogle AnalyticsLinkedIn AdsPipedriveSAP Hybris C4CTrello
Apache CouchDBCouchbaseGoogle BigQueryMailChimpPower BI XMLASAP SuccessFactorsTSheets
Apache HBaseCSVGoogle CalenderMariaDBPrestoSAS DataSetsTwilio
Apache HiveDocuSignGoogle Cloud StorageMarketoQuickBaseSAS XptTwitter Ads
Apache ImpalaDropBoxGoogle CMMarkLogicQuickBooksSendGridVeeva Vault
Apache PhoenixDynamics 365 Business CentralGoogle ContactsMicrosoft PlannerQuickBooks OnlineServiceNowWave Financial
APIDatabricksGoogle Data CatalogMicrosoft ProjectQuickBooks POSSF Marketing CloudWooCommerce
AsanaDynamics 365Google DirectoryMicrosoft TeamsRaiser Edge NXTSFTPWordPress
Authorize.NetDynamics CRMGoogle DriveMonday.comReckonSharepointWorkday
AutifyDynamics GPGoogle SearchMongoDBRedisShipStationxBase
AvalaraDynamics NAVGoogle SpannerMYOBRedshiftShopifyXero
AvroeBayGoogle SheetsNeo4jRESTSingleStoreXero Workflow Max
Azure Active DirectoryeBay AnalyticsGraphQLNetSuiteRSSSlackXML
Azure Data CatalogElasticsearchGreenPlumODataSage 200SmartsheetYouTube Analytics

Database Settings

  • Database settings must be configured for Database Collections.

  • Mandatory settings for databases like MySQL, Oracle, etc., include:

    • Database Type
    • Host IP/Name
    • Host Port Number
    • Database Name
    • Database Username
    • Database Password
    • SQL Query
  • For External DB Connectors, the mandatory settings for a Database Collection are:

  • Database Type

  • Database URL String

  • SQL Query

  • SearchBlox provides default settings when a new collection is created, but you can also choose your preferred values.

  • The following table lists all the available settings for Database Collections.

FieldDescription
Database TypeThe database to be indexed. SearchBlox supports MySQL, SQL Server, Oracle, PostgreSQL, and other drivers.
The list of other drivers supported by SearchBlox is provided here: Supported DB Connectors
Database URL StringSpecify the database URL string for the selected Supported DB Connectors
Host IP/nameEnter the IP address or name of the database host. (Default is localhost).
Host Port No.Enter the port number of the database.
Database NameName of the database containing the tables to be indexed.
Database UsernameUsername for accessing the database.
Database PasswordPassword for the database user.
SQL QuerySQL query used to fetch the table data for indexing. Example: SELECT * FROM tablename.
Relevance - Remove DuplicatesPrevents indexing duplicate documents with the same content. Default is NO.
Relevance - StemmingTreats inflected words as their root form (e.g., "running", "runs", "ran" → "run"). Default is YES.
Relevance - Spelling SuggestionsProvides spelling suggestions for the collection. Default is YES.
Keyword-in-Context DisplayShows search results with snippets from content where the search term appears.
SQL Settings - Fetch SizeSets the maximum number of records fetched from the database per request. Default is 100.
Enable Detailed Log SettingsWhen debug mode is on, logs detailed indexing activity in index.log, including URL status, timestamps, status codes, and time taken. Default is NO.
Enable Content APIAllows the crawler to index document content that contains special characters.

🚧

NOTE:

  • While providing a SQL Query in Database Settings page, make sure to provide the primary key as uid else while reindexing/scheduled indexing duplicate documents gets added.
  • If you notice document missing while scheduled index/reindex, please increase the SQL Settings-Fetch Size, which you will find under Settings page as shown in the following screenshots.

After saving the Database settings, test the connection. The connection must be successful for indexing to work.

Schedule and Index

Set when and how often a collection should be indexed. SearchBlox supports these schedule options:

  • Once
  • Hourly
  • Daily
  • Every 48 Hours
  • Every 96 Hours
  • Weekly
  • Monthly

The following operations can be performed in Database collections.

ActivityDescription
Enable Scheduler for IndexingTurn this on to set the start date and how often indexing should run.
SaveSaves your scheduling settings for the collection.
View all Collection SchedulesOpens the Schedules page where you can see all scheduled collections.

Data Fields Tab

The Data Fields tab lets you create custom fields for search and view the default fields in non-encrypted collections.
SearchBlox supports 4 types of Data Fields:

  1. Keyword
  2. Number
  3. Date
  4. Text
  • After configuring Data Fields, you must clear and re-index the collection for changes to take effect.

To know more about Data Fields please refer to Data Fields Tab



Models

Embedding

  • Provider specifies the embedding provider used to generate vector representations of documents.
  • Model defines the embedding model used to convert document content into vectors for semantic search.

Reranker

  • Provider specifies the reranker provider used for improving search result relevance.
  • Model defines the reranker model used to re-score and reorder search results based on relevance.

LLM

  • Provider specifies the Large Language Model provider used for AI-powered features.

  • Model defines the LLM used for tasks such as document enrichment, summaries, and SmartFAQs.

  • These settings override global configurations and apply only to the current collection.

👍

Best Practices

  • Verify the type of application or database to be indexed and provide the appropriate database settings.
  • If your SQL SELECT query doesn’t have a unique ID, create a view with a unique ID and use it in the query to fetch data.
  • For multiple collections, schedule them so that only 2–3 collections index at the same time.

Viewing Search Results

If you want search results in the grid view, use SearchBlox's Database plugin. Learn more about Database Search