Meilisearch is compatible with datasets in any language. Additionally, it features optimized support for languages using whitespace to separate words, Chinese, Hebrew, Japanese, Korean, and Thai.
Meilisearch is multilingual, featuring optimized support for:
Any language that uses whitespace to separate words
While we have employees from all over the world at Meilisearch, we don’t speak every language. We rely almost entirely on feedback from external contributors to understand how our engine is performing across different languages.If you’d like to request optimized support for a language, please upvote the related discussion in our product repository or open a new one if it doesn’t exist.If you’d like to help by developing a tokenizer pipeline yourself: first of all, thank you! We recommend that you take a look at the tokenizer contribution guide before making a PR.
What do you mean when you say Meilisearch offers optimized support for a language?
Optimized support for a language means Meilisearch has implemented internal processes specifically tailored to parsing that language, leading to more relevant results.
Does Meilisearch plan to support additional languages in the future?
Yes, we definitely do. The more feedback we get from native speakers, the easier it is for us to understand how to improve performance for those languages. Similarly, the more requests we get to improve support for a specific language, the more likely we are to devote resources to that project.