Skip to content

Instantly share code, notes, and snippets.

@sany2k8
Created July 30, 2025 15:55
Show Gist options
  • Save sany2k8/076468ba1c2e9d9fd69febec5a4e4a55 to your computer and use it in GitHub Desktop.
Save sany2k8/076468ba1c2e9d9fd69febec5a4e4a55 to your computer and use it in GitHub Desktop.
Feature TF-IDF BM25
Full Form Term Frequency – Inverse Document Frequency Best Matching 25
Default in Elasticsearch ❌ (before v5.0) ✅ (v5.0 and later)
Term Frequency Handling Linear Saturated (diminishing returns)
Document Length Normalization Minimal Tunable and robust
Tunable Parameters No Yes (k1, b)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment