Leaderboard
On-device LLM performance rankings powered by Glicko-2
Galaxy S25+
AndroidRank
#30
Rating
1,889
±17 RD
Win Rate
87.5%
Conservative Rating
1,855
TG Rating
1,914
PP Rating
1,837
Matches
955
Record
836W – 119L
Models Tested
| Model | TG Median (tok/s) | PP Median (tok/s) | TG Best | PP Best | Runs |
|---|---|---|---|---|---|
| SmolLM2-1.7B-Instruct-Q8_0 | 29.01 | 104.99 | 29.01 | 104.99 | 1 |
| Qwen3-0.6B.fp16 | 26.92 | 41.49 | 26.92 | 41.49 | 1 |
| EXAONE-4.0-1.2B-Q8_0 | 26.17 | 55.60 | 26.17 | 55.60 | 1 |
| gemma-2-2b-it-Q6_K | 17.65 | 34.38 | 17.65 | 34.38 | 1 |
| Gemmasutra-Mini-2B-v1-Q6_K | 16.03 | 44.44 | 17.99 | 47.77 | 2 |
| Phi-3.5-mini-instruct.Q4_K_M | 13.84 | 19.89 | 17.24 | 30.44 | 4 |
| qwen2.5-3b-instruct-q5_k_m | 12.73 | 26.72 | 16.93 | 61.26 | 7 |
| Llama-3.2-3B-Instruct-Q6_K | 10.74 | 40.15 | 10.74 | 40.15 | 1 |
| Qwen3.5-0.8B-Q8_0 | 10.53 | 388.73 | 10.53 | 388.73 | 1 |
Head-to-Head Record
1–50 of 253 rows
1 / 6
Performance by App Version
ImprovedRegressed