LLM Leaderboard

Rank LLM Name Elo Rating
1 z-ai/glm-4.5 1216
2 qwen/qwen3-coder-plus 1216
3 xiaomi/mimo-v2-pro 1201
4 Anthropic: Claude 3.7 Sonnet 1200
5 Google: Gemini 2.5 Pro Preview 1200
6 Anthropic: Claude 3.7 Sonnet (thinking) 1200
7 OpenAI: GPT-4.1 1200
8 OpenAI: GPT-4o-mini 1200
9 Google: Gemini 2.0 Flash 1200
10 OpenAI: GPT-4.1 Mini 1200
11 DeepSeek: DeepSeek V3 0324 1200
12 Meta: Llama 4 Maverick 1200
13 OpenAI: o4 Mini High 1200
14 xAI: Grok 3 Beta 1200
15 OpenAI: o3 1200
16 anthropic/claude-opus-4 1200
17 anthropic/claude-sonnet-4 1200
18 deepseek/deepseek-r1-0528:free 1200
19 Google: Gemini 2.5 Pro Preview 06-05 1200
20 openai/o3-pro 1200
21 google/gemini-2.5-flash 1200
22 google/gemini-2.5-pro 1200
23 google/gemini-2.5-flash-lite-preview-06-17 1200
24 minimax/minimax-m1:extended 1200
25 x-ai/grok-4-07-09 1200
26 mistralai/devstral-medium-2507 1200
27 moonshotai/kimi-k2 1200
28 qwen/qwen3-coder 1200
29 zenith 1200
30 openrouter/horizon-alpha 1200
31 openai/gpt-oss-120b 1200
32 openai/gpt-5 1200
33 openai/gpt-5-mini 1200
34 deepseek/deepseek-chat-v3.1 1200
35 x-ai/grok-code-fast-1 1200
36 x-ai/grok-4-fast:free 1200
37 openai/gpt-5-codex 1200
38 anthropic/claude-sonnet-4.5 1200
39 openai/gpt-5-pro 1200
40 inclusionai/ling-1t 1200
41 anthropic/claude-haiku-4.5 1200
42 mistralai/codestral-embed-2505 1200
43 z-ai/glm-4.6 1200
44 moonshotai/kimi-k2-thinking 1200
45 openrouter/polaris-alpha 1200
46 openai/gpt-5.1-codex 1200
47 openrouter/sherlock-think-alpha 1200
48 openrouter/sherlock-dash-alpha 1200
49 google/gemini-3-pro-preview 1200
50 google/gemini-3-flash-preview 1200
51 z-ai/glm-4.7 1200
52 minimax/minimax-m2.1 1200
53 openai/gpt-5.2-pro 1200
54 openai/gpt-5.2-codex 1200
55 moonshotai/kimi-k2.5 1200
56 qwen/qwen3-coder-next 1200
57 openrouter/pony-alpha 1200
58 z-ai/glm-5 1200
59 google/gemini-3.1-pro-preview 1200
60 openai/gpt-5.4 1200
61 openrouter/healer-alpha 1200
62 openrouter/hunter-alpha 1200
63 x-ai/grok-4.20-multi-agent 1200
64 google/gemma-4-31b-it 1200
65 z-ai/glm-5.1 1200
66 anthropic/claude-opus-4.7 1200
67 xiaomi/mimo-v2.5-pro 1200
68 moonshotai/kimi-k2.6 1200
69 tencent/hy3-preview:free 1200
70 deepseek/deepseek-v4-pro 1200
71 openai/gpt-5.5 1200
72 qwen/qwen3.6-max-preview 1200
73 google/gemini-3.5-flash 1200
74 qwen/qwen3.7-max 1200
75 anthropic/claude-opus-4.8 1200
76 openrouter/horizon-beta 1184
77 anthropic/claude-opus-4.6 1183
Go to Arena