XML Data Source

XML Data Source is used to index the content of XML files by parsing all the information within the tags available in the XML page/file. Header tags will be considered as meta tags in SearchBlox and body information will be taken as SearchBlox document’s description/content.

Configuring SearchBlox

Before using XML Data Source, install SearchBlox successfully and create a Custom Collection.

512

Configuration details of XML Data Source

.
Accessing Connector UI

  • The XML file should have data in the following format for the connector to work that is, the XML file should have the record tags within which the URL to be indexed should be specified along with other meta data content within metadata tags.
<?xml version="1.0"?>
<gsafeed>
	<header>
		<datasource>test</datasource>
		<feedtype>metadata-and-url</feedtype>
	</header>
	<group>
		<record url="https://www.example.com/ethical_hacking/ethical_hacking_tutorial.pdf">
			<metadata>
				<meta content="FOU" name="category" />
				<meta content="2018" name="applicable-reports" />
				<meta content="2018" name="display-evaluators" />
				<meta content="1d" name="criteria-2018-1" />
				<meta content="2018" name="show-on-www" />
			</metadata>
		</record>
		<record url="https://www.example.com/cprogramming/cprogramming_tutorial.pdf">
			<metadata>
				<meta content="SSV" name="category" />
				<meta content="2014" name="applicable-reports" />
				<meta content="2014" name="display-evaluators" />
				<meta content="3c" name="criteria-2014-3" />
			</metadata>
		</record>
		<record url="https://www.example.com/android/android_tutorial.pdf">
			<metadata>
				<meta content="SSV" name="category" />
				<meta content="2014" name="applicable-reports" />
				<meta content="3c" name="criteria-2014-3" />
			</metadata>
		</record>
    		<record url="https://www.searchblox.com/">
			<metadata>
				<meta content="SSV" name="category" />
				<meta content="2014" name="applicable-reports" />
				<meta content="3c" name="criteria-2014-3" />
			</metadata>
		</record>
    	</group>
</gsafeed>

🚧

Note:

In Linux, make sure that necessary permissions have been provided to the folder /opt by using the CHMOD command for writing log files and executing jar files.

api-keySearchBlox API Key
colnameThe name of the custom collection in SearchBlox.
urlSearchBlox URL
data-directoryData Folder along with filename of XML from where the data needs to be fetched
log-file-maxSizeMegabytes after which new file is created
log-file-maxBackupNumber of backups after which log file should be deleted
log-file-maxAgeNumber of days after which log files should be deleted