[Feature Request]: Update BM25 Retriever to Fully Support bm25s Non-ASCII and UTF-8 Options · Issue #17461 · run-llama/llama_index
Feature Description

The current implementation of BM25Retriever in LlamaIndex relies on an older version of bm25s (0.2.3 or 0.2.4), which does not support non-ASCII tokenization or UTF-8 encoding. ...
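
To illustrate why ASCII-only tokenization matters for retrieval, here is a minimal sketch using only Python's standard `re` module. It does not call bm25s or LlamaIndex; the two patterns and the `tokenize` helper are hypothetical names chosen for this example and are not taken from either library. The sketch only shows the general failure mode: an ASCII-restricted pattern splits accented words and drops text in other scripts entirely, so a BM25 index built on such tokens would have nothing to match against.

```python
import re

# Hypothetical patterns for illustration only; these are assumptions,
# not the patterns used by bm25s or LlamaIndex.
ASCII_WORD = re.compile(r"[a-z0-9]{2,}")   # ASCII letters and digits only
UNICODE_WORD = re.compile(r"\w{2,}")       # Unicode-aware word characters


def tokenize(text, pattern):
    """Lowercase the text and return every match of the given pattern."""
    return pattern.findall(text.lower())


samples = [
    "naïve café search",   # Latin script with accented characters
    "検索 エンジン",         # Japanese
    "поиск документов",    # Russian
]

ascii_tokens = [tokenize(s, ASCII_WORD) for s in samples]
unicode_tokens = [tokenize(s, UNICODE_WORD) for s in samples]

for text, a, u in zip(samples, ascii_tokens, unicode_tokens):
    print(f"{text!r}\n  ascii-only:    {a}\n  unicode-aware: {u}")
```

With the ASCII-only pattern, the accented words are broken into fragments and the Japanese and Russian samples yield no tokens at all, while the Unicode-aware pattern keeps each word whole.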