Leaderboard
On-device LLM performance rankings powered by Glicko-2
Xiaomi 14 (Android)

| Metric | Value |
|---|---|
| Rank | #46 |
| Rating | 1,805 (±16 RD) |
| Win Rate | 79.4% |
| Conservative Rating | 1,773 |
| TG Rating | 1,830 |
| PP Rating | 1,744 |
| Matches | 1,053 |
| Record | 836W – 217L |
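The summary figures can be cross-checked against each other. The sketch below recomputes the match count and win rate from the record, and reproduces the conservative rating under the assumption that it equals the rating minus two rating deviations. The page does not state that formula; it is inferred only because 1,805 − 2 × 16 = 1,773 matches the displayed value.

```python
# Values copied from the device summary.
rating = 1805
rd = 16
wins, losses = 836, 217

# Matches and win rate follow directly from the win/loss record.
matches = wins + losses
win_rate = wins / matches

# Assumption (not confirmed by the page): conservative = rating - 2 * RD.
conservative = rating - 2 * rd

print(matches)             # 1053
print(f"{win_rate:.1%}")   # 79.4%
print(conservative)        # 1773
```

All three results agree with the numbers shown in the summary, which suggests the displayed win rate is rounded to one decimal place.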
Models Tested
| Model | TG Median (tok/s) | PP Median (tok/s) | TG Best (tok/s) | PP Best (tok/s) | Runs |
|---|---|---|---|---|---|
| Qwen2-0.5B-Instruct-Q8_0 | 50.46 | 322.02 | 50.46 | 322.02 | 1 |
| tinyllama-1.1b-chat-v1.0.Q2_K | 34.60 | 50.84 | 34.60 | 50.84 | 1 |
| hunyuan-0.5b-instruct-q8_0 | 32.42 | 148.86 | 32.42 | 148.86 | 1 |
| llama-3.2-1b-instruct-q8_0 | 25.01 | 133.08 | 25.19 | 133.20 | 2 |
| qwen2.5-1.5b-instruct-q8_0 | 19.47 | 78.79 | 19.47 | 78.79 | 1 |
| DeepSeek-R1-Distill-Qwen-1.5B-IQ4_NL | 18.53 | 76.45 | 18.53 | 76.45 | 1 |
| DeepSeek-R1-Distill-Qwen-1.5B-IQ2_M | 17.51 | 27.16 | 18.08 | 27.83 | 2 |
| Phi-4-mini-instruct-Q4_K_M | 13.92 | 23.12 | 13.92 | 23.12 | 1 |
| gemma-3-4B-it-QAT-Q4_0 | 12.96 | 63.88 | 12.96 | 63.88 | 1 |
| Phi-3.5-mini-instruct.Q4_K_M | 12.71 | 21.56 | 12.71 | 21.56 | 1 |
| gemma-2-2b-it-Q6_K | 12.52 | 26.96 | 12.52 | 26.96 | 1 |
| gemma-3-4b-it-Q4_K_M | 12.13 | 26.03 | 12.13 | 26.03 | 1 |
| qwen2.5-3b-instruct-q5_k_m | 11.98 | 19.60 | 12.31 | 19.84 | 2 |
| Llama-3.2-3B-Instruct-Q6_K | 11.35 | 20.37 | 11.98 | 22.04 | 5 |
| Llama-3.2-3B-Instruct-uncensored-Q8_0 | 11.14 | 36.38 | 11.14 | 36.38 | 1 |
| DeepSeek-R1-Distill-Qwen-1.5B-Q5_K_M | 9.38 | 21.66 | 9.38 | 21.66 | 1 |
| DeepSeek-R1-Distill-Qwen-1.5B-UD-IQ4_XS | 8.02 | 23.94 | 11.13 | 25.97 | 2 |
| BgGPT-Gemma-2-2B-IT-v1.0.Q4_K_S | 7.44 | 20.24 | 7.44 | 20.24 | 1 |
| DeepSeek-R1-Distill-Qwen-7B-Q4_K_M | 6.43 | 11.14 | 6.43 | 11.14 | 1 |
| Qwen2.5-VL-3B-Instruct-bf16-q4_k | 5.88 | 19.60 | 5.88 | 19.60 | 1 |
| MiMo-7B-RL-Q4_K_M | 5.71 | 9.89 | 5.71 | 9.89 | 1 |
| Qwen2.5-7B-Instruct.Q5_K_M | 4.83 | 6.68 | 4.83 | 6.68 | 1 |
| Qwen3.5-0.8B-Q4_K_M | 3.70 | 115.33 | 3.70 | 115.33 | 1 |
| Meta-Llama-3.1-8B-Instruct-IQ3_XS | 3.22 | 4.15 | 3.22 | 4.15 | 1 |
| DeepSeek-R1-Distill-Qwen-7B-Q6_K | 2.19 | 5.99 | 2.19 | 5.99 | 1 |
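As a rough illustration of how the Median, Best, and Runs columns relate, the sketch below summarizes a list of per-run throughput measurements. The `summarize` helper and the two-run input values are hypothetical: the individual runs are not shown on the page, and the second list was chosen only so that the output matches the llama-3.2-1b-instruct-q8_0 TG figures. The use of a plain statistical median and a maximum is an assumption about how the page aggregates runs.

```python
from statistics import median

def summarize(runs):
    """Return (median, best, run count) for a list of tok/s measurements."""
    return round(median(runs), 2), max(runs), len(runs)

# With a single run, median and best coincide, as in every Runs = 1 row.
print(summarize([50.46]))

# Hypothetical per-run values (not from the page) for a two-run model.
print(summarize([24.83, 25.19]))
```

With an even number of runs the median is the mean of the two middle values, which is why a two-run row can show a median below its best.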
Head-to-Head Record
Performance by App Version