Google Drive Collection
SearchBlox includes a crawler to index the documents of a Google Drive account. Google Drive Collection can be created by following the steps given below.
Prerequisites
- Create a Project in Google Account to get KEY File, ServiceAccount ID, Application Name and ServiceAccount User.
- Provide the Required Permissions to ServiceAccount User to access the files/folders of Google Drive.
- Share the files/folders with ServiceAccount User.
- Guidelines to create an ServiceAccount
Creating Google Drive Collection
You can Create a Google Drive Collection with the following steps:
-
After logging in to the Admin Console, select the Collections tab and click on Create a New
Collection or "+" icon. -
Choose Google Drive Collection as Collection Type.
-
Enter a unique name for the collection (for example, Drive).
-
Enable/Disable RAG, enable for ChatBot and Hybrid RAG search.
-
Choose Private/Public Collection Access and Collection Encryption as per the requirements.
-
Choose the language of the content (if the language is other than English).
-
Click Save to create the collection.
-
Once the Google Drive collection is created you will be taken to the Authentication tab.
data:image/s3,"s3://crabby-images/b5f2d/b5f2dcbf82c94e735f3f912cb639b50d3f058f4b" alt=""
Settings Tab
Field | Description |
---|---|
KEY File | The File generated by ServiceAccount of Google Drive, while adding a KEY. KEY-FILE can be of the format JSON or PKCS12. |
ServiceAccount ID | ServiceAccount ID is the 'email' of the created ServiceAccount. |
Application Name | Name of the Application, for the created ServiceAccount. |
ServiceAccount User | Email of the User with the permissions to access the files of Google Drive, default is the value of the ServiceAccount ID. |
data:image/s3,"s3://crabby-images/9ed1a/9ed1ab2d0c19f5ad3f342d2f1b0b30ed3a9cbc5b" alt=""
- Upload the KEY File and provide ServiceAccount ID, Application Name and ServiceAccount User, you can find those values in ServiceAccount of Google Cloud Console.
data:image/s3,"s3://crabby-images/fb62b/fb62bd277bdb8b50132051a631cb76fd03b1b725" alt=""
- Choose the settings for
Generate Using LLM
andHybrid Search
.
Settings | Description |
---|---|
Title | Generates concise and relevant titles for the indexed documents using LLM. |
Description | Generates the description for indexed documents using LLM. |
Topic | Generates relevant topics for indexed documents using LLM based on document's content. |
Auto Relevance | Enable/Disable Hybrid Search for automatic relevance ranking |
- Click on
Save
button and Click onTest Connection
.
Schedule and Index
Google Drive collection should be indexed only on shared files/folders. Sets the frequency and the start date/time for indexing a collection. Schedule Frequency supported in SearchBlox is as follows:
- Once
- Hourly
- Daily
- Every 48 Hours
- Every 96 Hours
- Weekly
- Monthly
data:image/s3,"s3://crabby-images/eb6d1/eb6d14a2b1ee252aa91527f5538d485b37c9f794" alt=""
The following operation can be performed in Google Drive collections
Activity | Description |
---|---|
Enable Scheduler for Indexing | Once enabled, you can set the Start Date and Frequency |
Schedule | For each collection, indexing can be scheduled based on the above options. |
View all Collection Schedules | Redirects to the Schedules section, where all the Collection Schedules are listed. |
Manage Documents Tab
-
Using Manage Documents tab we can do the following operations:
- Filter
- View content
- View metadata
- Refresh
- Delete
-
To delete a file from your collection, enter the file path and click "Delete".
-
To see the status of an indexed file, click "View Metadata".
data:image/s3,"s3://crabby-images/d38d2/d38d2823f27f72d3936703b9fa9f754ecb9b873c" alt=""
Data Fields Tab
Using Data Fields tab we can create custom fields for search and we can see the Default Data Fields with non-encrypted collection. SearchBlox supports 4 types of Data Fields as listed below:
Keyword
Number
Date
Text
- Once the Data fields are configured, collection must be cleared and re-indexed to take effect.
To know more about Data Fields please refer to Data Fields Tab
Updated 3 months ago