Before using Google Site Connector, install SearchBlox successfully, then create a Custom Collection.
Steps to create client_secret.json and configure Google Site
- Go to https://console.developers.google.com/.
- Log in with the account username and password.
- Select a project if it already exists, or create a new project.
- To create a new project, you will be asked to fill in the project name and agree to terms of service.
- Next, click on the Products and Services list (menu) near Google APIs logo on left side.
- Choose API Manager and then click on credentials on the left side menu.
- In the resulting right pane, click on Create Credentials and select 'OAuth client ID' from the list.
- Configure the consent screen then choose the application type.
- Click on 'Configure consent screen' above the application type options.
- In the landing OAuth consent screen, the email ID field is chosen by default. Fill in the product name field and click on Save. Other fields in this page are optional.
11.Now choose “Other Application”
- Download Json and rename it to client_secret.json
- Place it in the folder where the exe file is available
- All the files related to the connector should be available in the same folder that is, all files should be extracted into the same folder.
- Create a data folder on your drive where the files would be temporarily stored and mention in yml files.
Contact [email protected] to request the download link for SearchBlox Google Site connector. In Windows, the connector would be installed in the C drive.
- Download the SearchBlox Google Site connector. Extract the downloaded zip to a folder.
- Unzip the archive under *C:*
- Configure the googlesites.yml file which includes Google Site properties and SearchBlox properties as listed here:
SearchBlox API Key
Data Folder where the data needs to be stored. Make sure it has write permission.
The name of the custom collection in SearchBlox.
File formats to exclude.
Google mime types to be excluded
Fetch interval between each hits (in seconds)
size of file that can be excluded in index
site name specified in google sites
- The content details of googlesites.yml are provided here:
#SearchBlox URL url: http://localhost:8080/searchblox/rest/v2/api/ #SearchBlox API Key api-key: C6D418861BAD66A46A7CC96B70CEADF9 #Data Folder where the data needs to be stored Make sure it has write permission data-directory: C:\CONNECTORS\googlesite\data #The name of the collection colname: site #The Excluded formats wont be indexed exclude-formats: [.war,.zip,.tar.gz] #The Excluded Google mime types wont be indexed exclude-google-mime: [application/vnd.google-apps.video,application/vnd.google-apps.map,application/vnd.google-apps.unknown] #Fetch interval between each hits (in seconds) fetch-inverval: 1 #Exclude size above in MB exclude-size: 2 #website domain domain: searchblox.com #site name siteName: 
- Start running the googlesites.exe file for Windows
- First time you would get a URL in the console, copy that URL and post it in browser
- Allow SearchBlox to access your account
Copy the string generated and paste it in console
- Once giving enter the files would get indexed
- Please note that this is a one time process, running the connector next time the crawler will start automatically.
Updated about a month ago