SharePoint Server Collection

SearchBlox includes a crawler to index documents from on-premises SharePoint, including top-level sites and their subsites.

Creating SharePoint Server Collection

You can create a SharePoint Server Collection by following these steps:


  • After logging in to the Admin Console, go to the Collections tab and click "Create a New Collection" or the "+" icon.
  • Select SharePoint Server Collection as the Collection Type.
  • Enter a unique name for the collection (e.g., SharePointServer).
  • Enable/Disable RAG for ChatBot and Hybrid RAG search.
  • Choose Private/Public Collection Access and Collection Encryption as needed.
  • Select the content language (if other than English).
  • Click Save to create the collection.
  • Once created, you will be taken to the Authentication tab.

Settings Tab

  • Configure settings related to SharePoint Server Authentication.

Field | Description
SharePointURL | URL of the SharePoint site. Example: http://sbdemo/sites/test
Username | Name of the user who has access to the SharePoint site
Password | Password of the user who has access to the SharePoint site
Domain | Name of the domain where the SharePoint Server is configured
  • Choose the settings for Generate Using LLM and Hybrid Search.

Settings | Description
Title | Generates concise and relevant titles for the indexed documents using LLM.
Description | Generates the description for indexed documents using LLM.
Topic | Generates relevant topics for indexed documents using LLM based on document's content.
Auto Relevance | Enable/Disable Hybrid Search for automatic relevance ranking
  • Click Save, then click Test Connection.
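
Before entering the connection values in the Admin Console, it can help to sanity-check them. The following is a minimal Python sketch and is not part of SearchBlox: the `SharePointConnection` class, the `validate` function, and the sample account `svc_search` in domain `CORP` are hypothetical, and the checks (an http or https URL, non-empty fields) are assumptions about what a plausible configuration looks like.

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass
class SharePointConnection:
    """Hypothetical container mirroring the four authentication fields."""
    sharepoint_url: str
    username: str
    password: str
    domain: str

    def qualified_user(self) -> str:
        # Conventional Windows DOMAIN\username form. This is an assumption:
        # the Admin Console takes domain and username as separate fields.
        return f"{self.domain}\\{self.username}"


def validate(conn: SharePointConnection) -> list:
    """Return a list of problems; an empty list means the values look plausible."""
    problems = []
    parsed = urlparse(conn.sharepoint_url)
    if parsed.scheme not in ("http", "https"):
        problems.append("SharePointURL must start with http:// or https://")
    if not parsed.netloc:
        problems.append("SharePointURL is missing a host name")
    for field_name in ("username", "password", "domain"):
        if not getattr(conn, field_name).strip():
            problems.append(f"{field_name} must not be empty")
    return problems


conn = SharePointConnection("http://sbdemo/sites/test", "svc_search", "secret", "CORP")
print(validate(conn))         # []
print(conn.qualified_user())  # CORP\svc_search
```

A check like this only catches malformed input. Whether the account can actually reach the site is still confirmed by Test Connection.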

Sites Tab

After successful authentication, you will be taken to the Sites Tab to see the list of Sites from your SharePoint account.

Configure Sites

  • Select the Sites to be indexed (ensure the user has access).
  • The tab shows the total number of Sites in the organization.
  • It also shows the selected Sites; you can deselect them if needed.
  • Click Save after selecting or deselecting Sites.

Schedule and Index

A SharePoint Server collection should be indexed only for the Sites that the user can access. You can set the start date/time and frequency for indexing. SearchBlox supports the following schedule frequencies:

  • Once
  • Hourly
  • Daily
  • Every 48 Hours
  • Every 96 Hours
  • Weekly
  • Monthly

The following operations can be performed in SharePoint Server collections:

Activity | Description
Enable Scheduler for Indexing | Once enabled, you can set the Start Date and Frequency.
Schedule | For each collection, indexing can be scheduled based on the above options.
View all Collection Schedules | Redirects to the Schedules section, where all the Collection Schedules are listed.
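
To illustrate how a start date/time and a frequency translate into run times, here is a minimal Python sketch. It is not SearchBlox's scheduler: `next_run` and `INTERVALS` are hypothetical names, and the treatment of "Monthly" (same day of the following month, clamped to the month's last day) is an assumption.

```python
import calendar
from datetime import datetime, timedelta

# Fixed-interval frequencies from the list above. "Once" and "Monthly"
# are handled separately because they are not a fixed number of hours.
INTERVALS = {
    "Hourly": timedelta(hours=1),
    "Daily": timedelta(days=1),
    "Every 48 Hours": timedelta(hours=48),
    "Every 96 Hours": timedelta(hours=96),
    "Weekly": timedelta(weeks=1),
}


def next_run(last_run: datetime, frequency: str):
    """Return the run that follows last_run, or None when no further run is due."""
    if frequency == "Once":
        return None
    if frequency == "Monthly":
        # Assumption: same day of the following month, clamped to month end.
        year = last_run.year + last_run.month // 12
        month = last_run.month % 12 + 1
        day = min(last_run.day, calendar.monthrange(year, month)[1])
        return last_run.replace(year=year, month=month, day=day)
    return last_run + INTERVALS[frequency]


start = datetime(2025, 1, 31, 9, 0)
print(next_run(start, "Every 48 Hours"))  # 2025-02-02 09:00:00
print(next_run(start, "Monthly"))         # 2025-02-28 09:00:00
```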

Manage Documents Tab

  • Using the Manage Documents tab, you can perform the following operations:

    1. Filter
    2. View content
    3. View metadata
    4. Refresh
    5. Delete
  • To delete a file from your collection, enter the file path and click "Delete".

  • To check the status of an indexed file, click "View Metadata".

Data Fields Tab

Using the Data Fields tab, you can create custom fields for search, and you can also view the default data fields available for a non-encrypted collection. SearchBlox supports the following four types of data fields:

  1. Keyword
  2. Number
  3. Date
  4. Text
  • After configuring the data fields, the collection must be cleared and re-indexed for the changes to take effect.

To learn more about data fields, refer to Data Fields Tab.
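
As a rough illustration of what the four data field types imply for values, the Python sketch below converts raw metadata strings to a matching Python type. It does not reflect how SearchBlox stores or indexes fields: `CONVERTERS` and `coerce` are hypothetical, and ISO 8601 date input is an assumption.

```python
from datetime import datetime

# The four data field types listed above, mapped to hypothetical converters.
CONVERTERS = {
    "Keyword": lambda v: v.strip(),   # treated here as one exact string value
    "Number": float,
    "Date": datetime.fromisoformat,   # assumes ISO 8601 input, e.g. 2025-01-31
    "Text": str,
}


def coerce(field_type: str, raw: str):
    """Convert a raw metadata string to the Python type for field_type."""
    if field_type not in CONVERTERS:
        raise ValueError(f"Unsupported data field type: {field_type}")
    return CONVERTERS[field_type](raw)


print(coerce("Number", "42"))        # 42.0
print(coerce("Date", "2025-01-31"))  # 2025-01-31 00:00:00
```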


Models

Embedding

  • Provider specifies the embedding provider used to generate vector representations of documents.
  • Model defines the embedding model used to convert document content into vectors for semantic search.

Reranker

  • Provider specifies the reranker provider used for improving search result relevance.
  • Model defines the reranker model used to re-score and reorder search results based on relevance.

LLM

  • Provider specifies the Large Language Model provider used for AI-powered features.

  • Model defines the LLM used for tasks such as document enrichment, summaries, and SmartFAQs.

  • These settings override global configurations and apply only to the current collection.
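
The override behavior can be pictured as a per-section merge in which collection-level values win. The Python sketch below is illustrative only: the provider and model names are placeholders, `effective_models` is a hypothetical function, and falling back to the global value when a collection leaves a setting unset is an assumption.

```python
# Placeholder global configuration; the names are not real providers or models.
GLOBAL_MODELS = {
    "embedding": {"provider": "global-embed-provider", "model": "global-embed-model"},
    "reranker": {"provider": "global-rerank-provider", "model": "global-rerank-model"},
    "llm": {"provider": "global-llm-provider", "model": "global-llm-model"},
}


def effective_models(global_cfg: dict, collection_cfg: dict) -> dict:
    """Merge per section: collection values win, unset ones fall back to global."""
    merged = {}
    for section, defaults in global_cfg.items():
        merged[section] = {**defaults, **collection_cfg.get(section, {})}
    return merged


# This collection overrides only the LLM model.
collection = {"llm": {"model": "collection-llm-model"}}
result = effective_models(GLOBAL_MODELS, collection)
print(result["llm"])        # {'provider': 'global-llm-provider', 'model': 'collection-llm-model'}
print(result["embedding"])  # {'provider': 'global-embed-provider', 'model': 'global-embed-model'}
```

The merge builds new dictionaries, so the global configuration is left unchanged and other collections are unaffected.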