Leaderboard

On-device LLM performance rankings powered by Glicko-2

Galaxy S23

Android

Rank

#95

Rating

1,643

±14 RD

Win Rate

63.8%

Conservative Rating

1,615

TG Rating

1,661

PP Rating

1,531

Matches

1,367

Record

872W – 495L

Models Tested

ModelTG Median (tok/s)PP Median (tok/s)TG BestPP BestRuns
gemma-3-270m-it-UD-Q6_K_XL29.92295.7929.92295.791
Qwen_Qwen3-0.6B-Q4_K_M24.4687.3424.4687.341
Qwen3-0.6B.Q4_K_M18.7159.7018.7159.701
Qwen3-0.6B.Q5_K_M16.5754.7116.5754.711
Qwen3-0.6B.Q2_K15.7557.4815.7557.481
llama-3.2-1b-instruct-q8_015.1965.8020.4481.568
qwen2.5-coder-1.5b-q8_014.0744.5914.0744.591
Qwen3-0.6B.Q6_K12.9728.8416.1052.142
gemma-3-1b-it.Q3_K_L12.6162.5112.6162.511
qwen2.5-1.5b-instruct-q8_012.5440.8113.8249.246
DeepSeek-R1-Distill-Qwen-1.5B-Q5_K_M12.5229.4012.5531.392
SmolLM2-1.7B-Instruct-Q8_012.4637.1112.4637.111
Qwen3-0.6B.Q3_K_M12.2650.1712.2650.171
gemma-3-1b-it-Q4_011.94224.8711.94224.871
Qwen3-1.7B-Q3_K_S11.0625.1111.0625.111
Qwen3-1.7B.Q2_K10.4120.9310.4120.931
Qwen3-0.6B.fp1610.0825.9610.0825.961
Llama-3.2-1B-Instruct.IQ1_M9.9722.309.9722.301
Hermes-3-Llama-3.2-3B.Q4_K_M8.6915.148.6915.141
Qwen3-4B.Q4_K_M8.2812.538.2812.531
DeepSeek-R1-Distill-Qwen-1.5B-IQ4_NL8.1932.668.1932.661
gemma-3-4b-it-uncensored.i1-Q4_K_M7.4214.917.4214.911
Phi-4-mini-instruct.Q2_K7.2111.167.2111.161
qwen2.5-3b-instruct-q5_k_m7.0012.297.9515.124
gemma-2-2b-it-Q6_K6.9216.8810.2721.376
Llama-3.2-3B-Instruct-Q6_K5.3812.037.3315.456
DeepSeek-R1-Distill-Qwen-1.5B-f165.357.195.357.191
Gemmasutra-Mini-2B-v1-Q6_K5.1910.595.1910.591
DeepSeek-R1-Distill-Qwen-1.5B-Fully-Uncensored.f164.836.895.077.022
gemma-2-2b-it-IQ3_M4.557.824.557.821
Phi-3.5-mini-instruct.Q4_K_M3.9010.247.8612.346

Head-to-Head Record

Performance by App Version

ImprovedRegressed

Compare With