Leaderboard
On-device LLM performance rankings powered by Glicko-2
iPhone 13 Pro Max
iOSRank
#24
Rating
1,901
±14 RD
Win Rate
88.6%
Conservative Rating
1,872
TG Rating
1,786
PP Rating
1,930
Matches
1,284
Record
1138W – 146L
Models Tested
| Model | TG Median (tok/s) | PP Median (tok/s) | TG Best | PP Best | Runs |
|---|---|---|---|---|---|
| yi-ai-4b-chinese-it-v1-q6_k | 1359.46 | 42173.88 | 1359.46 | 42173.88 | 1 |
| SmolLM2-135M-Instruct-Q8_0 | 103.55 | 3206.11 | 113.77 | 3399.06 | 3 |
| google_gemma-3-270m-it-qat-Q8_0 | 63.45 | 226.54 | 63.45 | 226.54 | 1 |
| gemma-3-270m-it-F16 | 53.89 | 245.96 | 56.06 | 250.27 | 2 |
| google_gemma-3-1b-it-qat-IQ3_M | 27.87 | 574.15 | 34.01 | 670.99 | 2 |
| DeepSeek-R1-Distill-Qwen-1.5B-Q4_0 | 27.61 | 427.80 | 27.61 | 427.80 | 1 |
| gemma-3-1b-it.Q5_K_M | 25.40 | 42.28 | 25.40 | 42.28 | 1 |
| gemma-3-1b-it.Q8_0 | 23.52 | 748.28 | 23.52 | 748.28 | 1 |
| llama-3.2-1b-instruct-q8_0 | 22.99 | 54.23 | 23.28 | 638.08 | 3 |
| DeepSeek-R1-Distill-Qwen-1.5B-IQ4_NL | 18.36 | 26.88 | 18.36 | 26.88 | 1 |
| qwen2.5-1.5b-instruct-q8_0 | 16.83 | 235.24 | 17.06 | 433.74 | 2 |
| DeepSeek-R1-Distill-Qwen-1.5B-Q4_K_M | 15.30 | 30.10 | 15.30 | 30.10 | 1 |
| gemma-3-1b-it.fp16 | 13.64 | 779.07 | 13.64 | 779.07 | 1 |
| Qwen3.5-2B-IQ4_NL | 12.71 | 241.94 | 12.71 | 241.94 | 1 |
| DeepSeek-R1-Distill-Qwen-1.5B-IQ2_M | 12.32 | 15.51 | 12.32 | 15.51 | 1 |
| qwen2.5-3b-instruct-q5_k_m | 11.73 | 165.96 | 12.33 | 172.98 | 5 |
| Gemmasutra-Mini-2B-v1-Q6_K | 10.91 | 187.00 | 13.07 | 232.52 | 2 |
| Phi-3.5-mini-instruct.Q4_K_M | 9.72 | 133.43 | 9.72 | 133.43 | 1 |
| Llama-3.2-3B-Instruct-Q6_K | 9.00 | 140.96 | 9.48 | 177.03 | 2 |
| Llama-3.2-3B-Instruct-Q4_0 | 7.81 | 13.95 | 7.81 | 13.95 | 1 |
| gemma-2-2b-it-Q6_K | 7.80 | 13.65 | 12.26 | 200.41 | 5 |
| Qwen3-4B-Instruct-2507.Q2_K | 6.99 | 8.89 | 6.99 | 8.89 | 1 |
| Dolphin3.0-Llama3.2-3B-Q6_K | 6.67 | 120.25 | 6.67 | 120.25 | 1 |
| Qwen3-4B.Q3_K_L | 5.90 | 7.31 | 5.90 | 7.31 | 1 |
| gemma-3-4b-it.Q6_K | 3.05 | 39.79 | 3.05 | 39.79 | 1 |
Head-to-Head Record
1–50 of 324 rows
1 / 7
Performance by App Version
ImprovedRegressed