Leaderboard

On-device LLM performance rankings powered by Glicko-2

Find X8

Android

Rank

#59

Rating

1,775

±21 RD

Win Rate

76.6%

Conservative Rating

1,734

TG Rating

1,810

PP Rating

1,744

Matches

620

Record

475W – 145L

Models Tested

ModelTG Median (tok/s)PP Median (tok/s)TG BestPP BestRuns
SmolLM2-135M-Instruct-Q8_0105.58673.03114.09774.132
granite-3.1-1b-a400m-instruct-Q8_048.97154.4248.97154.421
granite-3.1-3b-a800m-instruct-IQ4_XS34.8669.6334.8669.631
OLMoE-1B-7B-0924-Instruct-IQ4_XS30.7061.5330.7061.531
granite-3.1-3b-a800m-instruct-Q8_028.0677.1528.0677.151
llama-3.2-1b-instruct-q8_024.92128.8927.25153.134
DeepSeek-R1-ReDistill-Qwen-1.5B-v1.0-Q8_019.7185.1619.7185.161
SmallThinker-3B-Preview-Q8_012.0347.6212.0347.621
Qwen2.5-3B-Instruct-Q8_011.6348.2011.6348.201
qwen2.5-3b-instruct-q5_k_m10.9618.1510.9618.151
Phi-3.5-mini-instruct.Q4_K_M10.6117.2210.6117.221
Marco-o1-Q4_K_S8.4316.238.4316.231
Mistral-7B-Instruct-v0.3.IQ4_XS8.4012.188.4012.181
Qwen2.5-7B-Instruct-Q4_K_S7.7114.978.2516.204
Qwen3-4B-Q8_07.5232.507.5232.501
DeepSeek-R1-Distill-Qwen-7B-Q4_K_M6.3310.936.3310.931
Qwen2.5-Coder-7B-Instruct-Q8_05.3620.935.3620.931
Qwen2-7B-Instruct.IQ2_XS5.256.425.256.421
gemma-2-9b-it-Q4_K_M5.029.275.029.271
Mistral-Nemo-Instruct-2407-IQ2_M2.463.002.803.362

Head-to-Head Record

Performance by App Version

ImprovedRegressed

Compare With