Collections

Logical grouping of documents that share common indexing rules and storage paths.

In SearchBlox, a collection is the main way to organize and manage documents. It groups related documents together so they can be indexed based on specific rules and file paths. These rules control how documents are crawled, processed, and added to the search index. By organizing documents into collections, SearchBlox makes it easier and faster to run search queries on a well-structured set of content.

There are several different types of collections that can be created in SearchBlox:

Creating a Collection

  • After logging in to the Admin Console, click the Create tab in the header panel and select the Collection icon.
  • Choose the type of collection you want to create.


  • Collection Name – Enter a clear and unique name to identify the collection. This name cannot be changed after the collection is created.
  • RAG Option – Enable or disable RAG. When enabled, the system breaks content into paragraphs, converts them into vector embeddings, and allows Hybrid RAG search.
  • Access Type – Choose Private or Public access and configure Collection Encryption if required. Use Private for sensitive data and Public for general or non-sensitive content.
  • Language Selection – Select the language of the content if it is not English. Choosing the correct language helps improve search accuracy and relevance.
  • Save – Click Save to create the collection.

📘

Note

  • Private collections can only be viewed by users who sign in with secure search authentication. They are meant for sensitive or restricted information.
  • Public collections can be viewed by anyone, even without signing in. They are meant for general or open information.

Collection Dashboard Items

The Collections page displays the following headers:

  • ID (Collection ID number)
  • Type (Collection Type)
  • RAG (Retrieval Augmented Generation)
  • KG (Knowledge Graph)
  • Collection Name (Unique Collection Name)
  • Status (Indexing or Ready)
  • Last Updated (Date and time the index was last updated)
  • Documents (Number of documents currently in the index)
  • Queries (Number of queries that each collection has processed)
  • Language (Language used in the indexed data)
  • Hybrid (Hybrid Search Type)
  • Actions (Possible actions that can be performed such as index, refresh, clear, configure collection, search, delete and clone)

What’s Next