Massive Legal Embedding Benchmark (MLEB)

@misc{mleb2025,
  title={The Massive Legal Embedding Benchmark},
  author={Isaacus},
  year={2025},
  eprint={2510.19365},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}

Open evaluation code and raw results: github.com/isaacus-dev/mleb. Introductory post: Introducing MLEB.

Ready to build with the world’s best legal embedding model?

Read our docs, explore our models, and join our platform.

Start building Join platform Explore models

Datasets, paper, and citation

Legal RAG Bench

Bar Exam QA

SCALR

ECHR Retrieval

Singaporean Judicial Keywords

GDPR Holdings Retrieval

Australian Tax Guidance Retrieval

Irish Legislative Summaries

UK Legislative Long Titles

Contractual Clause Retrieval

License TL;DR Retrieval

Consumer Contracts QA

Ready to build with the world’s best legal embedding model?