Models Leaderboard Hardware Evals Train Rentals API Docs

Language

Eval Suites

Community benchmark suites for evaluating local LLM quality. Submit results via the API.

All Official LM-Eval runs Custom server-side coding knowledge math reasoning truthfulness writing

No eval suites yet

Approved suites will appear here. Submit one via the API.