Languages
Set a collection’s language in SearchBlox by selecting it in the drop-down menu during creation.
Introduction
This page lists all the languages SearchBlox supports for indexing and search. Setting the correct language for a collection ensures that search results are more accurate and relevant for your users — especially for languages that have unique grammar rules, word structures, or common words that should be ignored during search.
SearchBlox currently supports 38 languages across European, Asian, Middle Eastern, and South Asian scripts.
How Language Settings Work
When you create a collection in SearchBlox, you select a language for that collection. Here's what happens behind the scenes:
- Stopwords are applied automatically — A stopwords file for the selected language is used during indexing. Stopwords are common words (like "the", "is", "and" in English) that don't add meaning to a search query and are filtered out to improve result accuracy.
- Indexing is optimized for the language — SearchBlox adjusts how it reads and processes the content based on the language's structure, improving both indexing quality and search relevance.
- Each collection has its own language — If your organization has content in multiple languages, you can create separate collections for each language and search across all of them simultaneously.
Note: The language is set at the time of collection creation. To change the language of an existing collection, you will need to re-create the collection and re-index the content.
How to Set a Collection's Language
- Go to the Create a New Collection page in the SearchBlox Admin Console.
- In the Collection Language field, use the drop-down menu to select the language that matches your content.
- Complete the rest of the collection setup and save. The stopwords file for the selected language will be applied automatically during indexing.
Supported Languages
| Language Name | Language Code |
|---|---|
| Arabic | ar |
| Bengali | bn |
| Czech | cz |
| Danish | da |
| German | de |
| Greek | el |
| English | en |
| Spanish | es |
| Estonian | et |
| Finnish | fi |
| French | fr |
| Gujarati | gu |
| Hindi | hi |
| Hungarian | hu |
| Italian | it |
| Hebrew | iw |
| Japanese | ja |
| Kannada | ka |
| Korean | kr |
| Lithuanian | lt |
| Latvian | lv |
| Malayalam | mal |
| Dutch | nl |
| Norwegian | no |
| Polish | pl |
| Portuguese | pt |
| Romanian | ro |
| Russian | ru |
| Slovak | sk |
| Slovenian | sl |
| Swedish | sv |
| Tamil | ta |
| Telugu | te |
| Thai | th |
| Turkish | tr |
| Vietnamese | vn |
| Chinese (Simplified) | zhcn |
| Chinese (Traditional) | zhtw |
By selecting the appropriate language, the indexing process is optimized for that language’s characteristics, improving the overall search experience for users.
Updated 14 days ago
