Leaderboard

On-device LLM performance rankings powered by Glicko-2

iPhone 16e

iOS

Rank

#16

Rating

1,939

±16 RD

Win Rate

92.3%

Conservative Rating

1,907

TG Rating

1,902

PP Rating

1,927

Matches

1,032

Record

953W – 79L

Models Tested

ModelTG Median (tok/s)PP Median (tok/s)TG BestPP BestRuns
Qwen3-0.6B-abliterated-TIES.IQ4_XS68.24869.1368.24869.131
gemma-3-1b-it-abliterated-q4_k_m42.73647.0942.73647.091
DeepSeek-R1-Distill-Qwen-1.5B-Abliterated-dpo.IQ4_XS36.18363.4736.18363.471
llama-3.2-1b-instruct-q8_031.4669.1438.56551.823
Qwen3-1.7B.Q4_K_M28.0344.5228.0344.521
gemma-3-1b-it-BF1620.34349.3620.34349.361
gemma-2-2b-it-Q6_K19.00240.2820.31245.983
Phi-3.5-mini-instruct.Q4_K_M15.68114.2515.68114.251
qwen2.5-3b-instruct-q5_k_m12.12138.0812.12138.081
Phi-4-mini-instruct.Q8_010.51122.0510.96127.562
Llama-3.2-3B-Instruct-Q6_K9.9282.2013.60156.812
Phi-3.5-mini-instruct_Uncensored-Q6_K_L9.13112.249.13112.241
Qwen3.5-4B.Q4_K_M8.36102.008.36102.001
gemma-3-4b-it.Q4_K_S7.0314.057.0314.051
gemma-3-4b-it.Q8_07.0054.447.6992.552
DeepSeek-R1-Distill-Qwen-7B-Q3_K_L6.9445.916.9445.911
Qwen3-4B-Q6_K6.069.746.069.741

Head-to-Head Record

Performance by App Version

ImprovedRegressed

Compare With