Leaderboard

On-device LLM performance rankings powered by Glicko-2

iPhone 14 Pro Max

iOS

Rank: #21
Rating: 1,921 (±16 RD)
Win Rate: 90.6%
Conservative Rating: 1,889
TG Rating: 1,888
PP Rating: 1,929
Matches: 1,086
Record: 984W – 102L
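
The summary figures above are internally consistent, and a short sketch can make the relationships explicit. The win rate follows directly from the record, and the conservative rating matches the rating minus two rating deviations. That multiplier of 2 is an assumption inferred from the displayed numbers; the page does not state the formula. The function names are hypothetical and not part of any leaderboard API.

```python
def conservative_rating(rating: float, rd: float, k: float = 2.0) -> float:
    """Pessimistic skill estimate: rating minus k rating deviations.

    k = 2.0 is an assumption inferred from the displayed figures.
    """
    return rating - k * rd


def win_rate(wins: int, losses: int) -> float:
    """Fraction of matches won; 0.0 when no matches were played."""
    total = wins + losses
    return wins / total if total else 0.0


# Values copied from the stats shown on this page.
rating, rd = 1921, 16
wins, losses = 984, 102

print(conservative_rating(rating, rd))   # 1889.0
print(wins + losses)                     # 1086
print(f"{win_rate(wins, losses):.1%}")   # 90.6%
```

Subtracting a multiple of the rating deviation penalizes entries with few matches, so a well-tested device is not outranked by one with a high but uncertain rating.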

Models Tested

| Model | TG Median (tok/s) | PP Median (tok/s) | TG Best (tok/s) | PP Best (tok/s) | Runs |
|---|---|---|---|---|---|
| DeepSeek-R1-Distill-Qwen-7B-IQ2_M | 394.30 | 23733.36 | 785.98 | 47463.46 | 2 |
| tinygemma3-Q8_0 | 143.71 | 22221.25 | 143.71 | 22221.25 | 1 |
| MiniCPM4-0.5B.Q2_K | 81.95 | 109.51 | 81.95 | 109.51 | 1 |
| qwen2.5-0.5b-instruct-q8_0 | 62.00 | 1360.96 | 62.63 | 1377.18 | 2 |
| Qwen2.5-Coder-1.5B-Instruct-Q4_K_M | 36.57 | 422.05 | 36.57 | 422.05 | 1 |
| Qwen3-1.7B-Q4_K_M | 35.34 | 373.89 | 35.34 | 373.89 | 1 |
| DeepSeek-R1-Distill-Qwen-1.5B-Q4_K_M | 32.10 | 380.14 | 32.10 | 380.14 | 1 |
| llama-3.2-1b-instruct-q8_0 | 31.90 | 63.94 | 33.26 | 581.02 | 4 |
| Qwen3-1.7B-Q5_K_M | 31.51 | 347.44 | 31.51 | 347.44 | 1 |
| qwen2.5-1.5b-instruct-q3_k_m | 30.81 | 385.69 | 30.81 | 385.69 | 1 |
| DeepSeek-R1-Distill-Qwen-1.5B-Q4_K_M | 28.77 | 43.20 | 28.77 | 43.20 | 1 |
| Qwen2.5-1.5B-Instruct.Q8_0 | 25.66 | 44.74 | 25.66 | 44.74 | 1 |
| qwen2.5-1.5b-instruct-q8_0 | 24.46 | 275.09 | 25.68 | 428.59 | 3 |
| gemma-3-1b-it-Q8_0 | 23.70 | 746.83 | 23.70 | 746.83 | 1 |
| Qwen3-1.7B.Q6_K | 21.59 | 280.60 | 21.59 | 280.60 | 1 |
| Qwen3-1.7B-Q6_K | 19.77 | 28.45 | 19.77 | 28.45 | 1 |
| DeepSeek-R1-Distill-Qwen-1.5B-Fully-Uncensored.i1-Q6_K | 19.53 | 255.03 | 19.53 | 255.03 | 1 |
| gemma-3-1b-it.fp16 | 19.25 | 774.16 | 19.25 | 774.16 | 1 |
| DeepSeek-R1-Distill-Qwen-1.5B-Q8_0 | 15.17 | 101.92 | 19.58 | 164.35 | 2 |
| gemma-3-1b-it.Q6_K | 15.09 | 36.87 | 15.09 | 36.87 | 1 |
| Llama-3.2-3B-Instruct.Q6_K | 14.97 | 199.47 | 14.97 | 199.47 | 1 |
| Qwen3.5-0.8B-BF16 | 14.10 | 403.19 | 14.10 | 403.19 | 1 |
| DeepSeek-R1-Distill-Qwen-1.5B-IQ2_M | 13.82 | 16.03 | 13.82 | 16.03 | 1 |
| Qwen3-4B-Q4_K_M | 13.64 | 140.74 | 13.64 | 140.74 | 1 |
| Qwen3.5-2B-Q3_K_M | 13.09 | 231.41 | 13.09 | 231.41 | 1 |
| Qwen3-4B-Thinking-2507-Q4_K_M | 12.75 | 117.21 | 12.75 | 117.21 | 1 |
| gemma-2-2b-it-Q6_K | 12.30 | 18.19 | 19.44 | 266.96 | 6 |
| DeepSeek-R1-Distill-Qwen-1.5B-Q3_K_M | 12.08 | 201.29 | 12.08 | 201.29 | 1 |
| gemma-3-4b-it-Q4_K_M | 11.92 | 152.38 | 11.92 | 152.38 | 1 |
| Phi-4-mini-instruct-Q4_K_M | 11.56 | 128.43 | 11.56 | 128.43 | 1 |
| Llama-3.2-3B-Instruct.Q5_K_M | 10.13 | 96.09 | 10.13 | 96.09 | 1 |
| qwen2.5-3b-instruct-q5_k_m | 10.04 | 14.20 | 15.20 | 173.69 | 8 |
| DeepSeek-R1-Distill-Qwen-1.5B-Q5_K_M | 9.98 | 157.67 | 9.98 | 157.67 | 1 |
| Llama-3.2-3B-Instruct-Q6_K | 8.52 | 125.46 | 15.64 | 174.59 | 3 |
| Qwen3-4B-Instruct-2507-UD-Q4_K_XL | 7.36 | 11.46 | 16.11 | 147.00 | 4 |
| Qwen3-4B-Thinking-2507-UD-Q4_K_XL | 7.31 | 91.50 | 7.31 | 91.50 | 1 |
| Qwen3-4B-Instruct-2507-IQ4_NL | 6.84 | 10.45 | 6.84 | 10.45 | 1 |
| gemma-3-4b-it-Q4_K_M | 6.53 | 21.17 | 8.43 | 82.64 | 3 |
| Qwen3.5-4B-UD-Q2_K_XL | 5.87 | 93.52 | 5.87 | 93.52 | 1 |
| DeepSeek-R1-Distill-Qwen-7B-Uncensored.i1-IQ1_S | 5.50 | 61.04 | 5.50 | 61.04 | 1 |
| Llama-3.2-3B-Instruct-IQ3_M | 4.84 | 6.23 | 4.84 | 6.23 | 1 |
| Qwen3-4B.Q4_K_M | 4.82 | 10.56 | 4.82 | 10.56 | 1 |
| Phi-3.5-mini-instruct.Q4_K_M | 2.39 | 12.03 | 2.39 | 12.03 | 1 |

Head-to-Head Record

Performance by App Version

