Local model benchmarks, measured in Doha

Local LLM Leaderboard

We benchmark open-weight models running entirely on local hardware, so no data leaves the machine. Throughput, first-token latency, footprint, and quality are compared side by side.

Coming soon

We're finalising our benchmarking methodology and first set of results. Check back shortly, or get in touch to suggest a model or hardware target.

Throughput

Tokens generated per second under sustained load.
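
As a rough illustration of the throughput metric, the sketch below counts the tokens a generator yields and divides by the elapsed wall-clock time. The methodology is still being finalised, so this is only an assumption about how such a figure could be computed; `measure_throughput` and its `generate` argument are hypothetical names, not part of any published harness.

```python
import time
from typing import Callable, Iterable


def measure_throughput(
    generate: Callable[[], Iterable[str]],
    clock: Callable[[], float] = time.perf_counter,
) -> float:
    """Return tokens per second for one full generation run.

    `generate` is a hypothetical callable that starts a generation and
    yields tokens one at a time as the model produces them.
    """
    start = clock()
    token_count = sum(1 for _ in generate())  # drain the stream, counting tokens
    elapsed = clock() - start
    if elapsed <= 0:
        raise ValueError("elapsed time must be positive")
    return token_count / elapsed
```

A long generation, or an average over several runs, gives a steadier figure than a single short prompt, which is one way to read "sustained load".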

First-token latency

Time to first token after prompt submission.
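
One way to picture the first-token latency measurement is to start a timer at prompt submission and stop it when the first token arrives. The sketch below assumes a streaming interface; `time_to_first_token` and `submit` are hypothetical names used for illustration only.

```python
import time
from typing import Callable, Iterator


def time_to_first_token(
    submit: Callable[[], Iterator[str]],
    clock: Callable[[], float] = time.perf_counter,
) -> float:
    """Return seconds from prompt submission until the first token arrives.

    `submit` is a hypothetical callable that sends the prompt and returns
    an iterator over the streamed tokens.
    """
    start = clock()
    stream = submit()
    first = next(stream, None)  # blocks until the first token is produced
    if first is None:
        raise ValueError("the model produced no tokens")
    return clock() - start
```

Timing starts before `submit` is called, so prompt processing is included in the result, matching the "after prompt submission" wording above.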

Footprint

Parameter count, quantization, and on-disk and in-memory size.
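
To show how parameter count and quantization relate to size, the sketch below uses the standard lower-bound estimate: parameters times bits per weight, divided by eight to get bytes. It is an approximation under stated assumptions, not the leaderboard's reporting method. Real quantized files add overhead for scales and metadata, and runtime memory also includes the KV cache and activations. `weight_size_gb` is a hypothetical helper name.

```python
def weight_size_gb(num_parameters: int, bits_per_weight: float) -> float:
    """Estimate the size of the model weights in decimal gigabytes.

    Lower bound only: ignores quantization scales, file metadata,
    and any runtime memory such as the KV cache.
    """
    if num_parameters < 0 or bits_per_weight <= 0:
        raise ValueError("parameters must be non-negative and bits positive")
    size_bytes = num_parameters * bits_per_weight / 8  # 8 bits per byte
    return size_bytes / 1e9  # decimal GB (10^9 bytes)
```

For example, a 7-billion-parameter model comes to about 14 GB at 16 bits per weight and about 3.5 GB at 4 bits per weight by this estimate.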

Quality

Task scoring across reasoning, coding, and bilingual EN/AR prompts.