Stopwords

Stop-words are defined as non-information-bearing words. Words such as the, and, a, is, and on are on the stop-word list. When indexing and searching, these words will be ignored. Each language has its own stop-word list. In order to save disk space, or to speed up search results and provide a better search relevance, most search engines do not consider these common words.

Stopwords are based on the language used while creating a collection. The files for stopwords are available at <SEARCHBLOX_INSTALLATION_PATH>/webapps/searchblox/stopwords

If you need to change or modify the default stopwords, you can edit the XML file using the following steps (using English as an example):

  • Go to <SEARCHBLOX_INSTALLATION_PATH>/webapps/searchblox/stopwords/English_en.xml
  • Add or remove stopwords.
  • Stop SearchBlox.
  • Start SearchBlox.
  • Clear and reindex the collection.

🚧

Important Note:

Please note that the collection must be cleared and reindexed for the changes to take effect.


What’s Next