Content Manager Features

FeatureDescription
Open Graph Meta TagContent managers can use the og:image HTML tag in web pages to display thumbnail images for documents on the search page.
Selective HTML indexingSpecific content can be excluded from indexing using tags such as ..., ..., or ....
Meta RobotsMeta robots tags can control whether a page should be indexed or crawled, allowing flexible indexing rules.
Robots.txtThe crawler follows the rules defined in the robots.txt file, which take the highest priority over other web collection settings.
CanonicalCanonical URLs can be indexed by enabling the canonical setting in the web collection. Eg of canonical tag: <link href="https://www.example.com" rel="canonical">
SitemapSearchBlox supports standard XML sitemaps, including sitemaps referenced in robots.txt.
Custom Date HeaderCustom date header indexing can be configured if the .htaccess file contains a header in the required date format. Example: Header set SearchBlox-Last-modified "Wed, 01 Jan 2000 12:00:01 GMT".
HTML Parser - Document DescriptionThe description setting in Web Collection allows the HTML parser to read document descriptions from HTML tags such as META, H1, H2, H3, H4, H5, and H6. Ensure valid tags exist on the webpage if this feature is used.
Selective LastModified DateDocuments can be indexed using Default Header Date, Custom LastModified Date (lastmodified or last-modified), or Custom Header Date.
Search Results DisplayDefault SearchBlox fields such as title, description, keywords, and lastmodified can be controlled from web pages. After re-indexing the collection, updated content will appear in the search page.
SmartFAQsAllows easy creation and management of FAQs using the SearchBlox AI model.
Custom Meta FieldsCustom meta tags can be added to web pages and indexed within web collections to enhance search functionality. To know more: Custom Data Fields