LMSYS Chatbot Arena Leaderboard Over Time

Explore historical LMSYS Chatbot Arena ELO rankings, compare top LLMs, replay the top 10 by date, and see which AI labs held the leaderboard. The included dataset snapshot runs through 2025-08-29.

Latest dataset leader

Gemini 2.5 pro

1,456

Aug 29, 2025

Tracked models

306

Models found in the included LMSYS dataset.

Snapshots

124

From Jan 9, 2024 to Aug 29, 2025.

Selected leader

Gemini 2.5 pro

1,456

Aug 29, 2025

LMSYS Chatbot Arena leaderboard over time

Move through historical snapshots and compare top ELO trajectories.

[Chart: ELO trajectories of the top models over time; selected snapshot Aug 29, 2025. Series shown: gemini-2.5-pro, gpt-5-high, claude-opus-4-1-20250805-thinking-16k, o3-2025-04-16, chatgpt-4o-latest-20250326, gpt-4.5-preview-2025-02-27, gpt-5-old, claude-opus-4-1-20250805.]

Top 10 on Aug 29, 2025

Rank, model, company, and ELO for the selected snapshot.

Rank	Model	Company	ELO
#1	gemini-2.5-pro	Google	1,456
#2	gpt-5-high	OpenAI	1,447
#3	claude-opus-4-1-20250805-thinking-16k	Anthropic	1,447
#4	o3-2025-04-16	OpenAI	1,444
#5	chatgpt-4o-latest-20250326	OpenAI	1,443
#6	gpt-4.5-preview-2025-02-27	OpenAI	1,439
#7	gpt-5-old	OpenAI	1,439
#8	claude-opus-4-1-20250805	Anthropic	1,436
#9	gpt-5-chat	OpenAI	1,426
#10	qwen-max-2025-08-15	Alibaba	1,425

Find a model in the dataset

Search model names included in the historical CSV.

alpaca-13b

amazon-nova-experimental-chat-05-14

amazon-nova-lite-v1.0

amazon-nova-micro-v1.0

amazon-nova-pro-v1.0

athene-70b

athene-70b-0725

athene-v2-chat

bard-jan-24-gemini-pro

c4ai-aya-expanse-32b

c4ai-aya-expanse-8b

chatglm-6b

Company leaderboard trajectory

Best model per company on each date, using the same historical snapshots.

[Chart: best ELO per company over time. Series shown: Google, OpenAI, Anthropic, Alibaba.]
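The "best model per company on each date" selection can be sketched roughly as below. This is a minimal illustration, not the page's actual code: the function name `best_per_company` and the row layout (snapshot date, model, company, ELO) are assumptions, and the sample rows are taken from the Aug 29, 2025 top-10 table.

```python
from typing import Dict, List, Tuple

Row = Tuple[str, str, str, int]  # (snapshot date, model, company, ELO); assumed layout


def best_per_company(rows: List[Row]) -> Dict[Tuple[str, str], Tuple[str, int]]:
    """Map (snapshot date, company) to the (model, ELO) with the highest ELO."""
    best: Dict[Tuple[str, str], Tuple[str, int]] = {}
    for snapshot, model, company, elo in rows:
        key = (snapshot, company)
        # Keep the first model seen unless a later one has a strictly higher ELO.
        if key not in best or elo > best[key][1]:
            best[key] = (model, elo)
    return best


# Sample rows from the Aug 29, 2025 snapshot shown above.
rows = [
    ("2025-08-29", "gemini-2.5-pro", "Google", 1456),
    ("2025-08-29", "gpt-5-high", "OpenAI", 1447),
    ("2025-08-29", "claude-opus-4-1-20250805-thinking-16k", "Anthropic", 1447),
    ("2025-08-29", "o3-2025-04-16", "OpenAI", 1444),
    ("2025-08-29", "claude-opus-4-1-20250805", "Anthropic", 1436),
]
result = best_per_company(rows)
print(result[("2025-08-29", "OpenAI")])  # ('gpt-5-high', 1447)
```

Plotting one line per company then only needs the ELO value from each (date, company) entry.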

Longest model lead

Calendar days at rank #1 in the snapshot range.

Model	Days at #1
gpt-4o-2024-05-13	78
o1-preview	58
gemini-2.5-pro	50
o1-2024-12-17	42
gemini-2.5-pro-preview-05-06	29
gpt-4-1106-preview	28
gpt-4-turbo-2024-04-09	28
gemini-2.5-pro-exp-03-25	24
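One plausible way to count calendar days at rank #1 is sketched below. It assumes that the leader of each snapshot holds the top spot until the next snapshot, and that the final snapshot contributes no days; the page does not state its exact rule, so treat this as an illustration. The function name `days_at_top` and the sample data are hypothetical.

```python
from collections import defaultdict
from datetime import date
from typing import Dict, List, Tuple


def days_at_top(leaders: List[Tuple[date, str]]) -> Dict[str, int]:
    """Sum the calendar days each model led, given (snapshot date, leader) pairs
    sorted by date. Assumption: a leader holds #1 until the next snapshot."""
    totals: Dict[str, int] = defaultdict(int)
    for (day, name), (next_day, _) in zip(leaders, leaders[1:]):
        totals[name] += (next_day - day).days
    return dict(totals)


# Hypothetical snapshots, not taken from the dataset.
sample = [
    (date(2024, 1, 1), "model-a"),
    (date(2024, 1, 8), "model-a"),
    (date(2024, 1, 15), "model-b"),
    (date(2024, 1, 20), "model-b"),
]
print(days_at_top(sample))  # {'model-a': 14, 'model-b': 5}
```

The same function works for the company ranking if each leader name is first replaced by its company.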

Longest company lead

Which AI labs held the top spot the longest.

Company	Days at #1
OpenAI	324
Google	141
Anthropic	16

Biggest leader jumps

Largest ELO gains by the #1 model between snapshots.

Model	ELO gain	Date	From
gemini-2.5-pro-preview-05-06	+62.3	May 19, 2025	gemini-2.5-pro-preview-05-06
gpt-4o-2024-05-13	+31.6	May 15, 2024	gpt-4-turbo-2024-04-09
gpt-4.5-preview-2025-02-27	+28.6	Mar 3, 2025	chatgpt-4o-latest-20250129
gemini-2.5-pro-exp-03-25	+27.0	Mar 25, 2025	gpt-4.5-preview-2025-02-27
o1-preview	+25.7	Sep 17, 2024	chatgpt-4o-latest-20240903
gemini-2.5-pro-preview-06-05	+24.4	Jun 5, 2025	gemini-2.5-pro-preview-05-06
gpt-5-high	+22.8	Aug 8, 2025	gemini-2.5-pro
chatgpt-4o-latest-20250129	+17.6	Feb 11, 2025	o1-2024-12-17
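A jump of this kind can be read as the difference between the #1 rating in one snapshot and the #1 rating in the previous snapshot. The sketch below follows that reading, which is an assumption about how the page computes the figure; the function name `leader_jumps` and the sample values are hypothetical.

```python
from typing import List, Tuple

Snapshot = Tuple[str, str, float]  # (ISO date, leader model, leader ELO); assumed layout


def leader_jumps(snapshots: List[Snapshot]) -> List[Tuple[float, str, str, str]]:
    """Return (gain, date, leader, previous leader) for every snapshot where the
    #1 rating rose, largest gain first. Input must be sorted by date."""
    jumps = []
    for (_, prev_leader, prev_elo), (day, leader, elo) in zip(snapshots, snapshots[1:]):
        gain = round(elo - prev_elo, 1)
        if gain > 0:
            jumps.append((gain, day, leader, prev_leader))
    return sorted(jumps, reverse=True)


# Hypothetical values, not taken from the dataset.
history = [
    ("2025-01-01", "model-a", 1400.0),
    ("2025-01-08", "model-b", 1420.5),
    ("2025-01-15", "model-b", 1425.0),
]
for gain, day, leader, previous in leader_jumps(history):
    print(f"{leader} +{gain} on {day} from {previous}")
```

Under this reading a row can list the same model as both leader and previous leader, which happens when the top model's own rating rises between two snapshots.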

This page uses the included historical LMSYS Chatbot Arena demo dataset, not a live API. For the official live ranking, visit the LM Arena leaderboard.

Is this LMSYS Chatbot Arena leaderboard live?

No. This tool uses the included historical demo dataset and labels the latest available snapshot date. Use the official LM Arena leaderboard for live rankings.

What is ELO in Chatbot Arena?

ELO is a rating system used to estimate relative model strength from pairwise human preference battles. Higher ELO means stronger performance in the arena dataset.
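To show how a rating gap translates into an expected win rate, here is a minimal sketch of the textbook Elo formulas. It is not necessarily the procedure used to produce the ratings in this dataset; the scale of 400 and the K-factor of 32 are conventional defaults, assumed here for illustration.

```python
from typing import Tuple


def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A is preferred over B under the standard Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update(rating_a: float, rating_b: float, score_a: float, k: float = 32.0) -> Tuple[float, float]:
    """Update both ratings after one battle.
    score_a is 1.0 if A wins, 0.0 if B wins, 0.5 for a tie. k is an assumed K-factor."""
    expected_a = expected_score(rating_a, rating_b)
    delta = k * (score_a - expected_a)
    # Whatever A gains, B loses, so the sum of the two ratings is unchanged.
    return rating_a + delta, rating_b - delta


# Ratings of the top two models in the Aug 29, 2025 snapshot.
p = expected_score(1456, 1447)
print(f"{p:.3f}")  # a value slightly above 0.5
```

A 9-point gap at the top of the table therefore corresponds to an expected preference rate only slightly above one half.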

Why do some model lines appear only part of the time?

The chart focuses on models that were in the daily top 10. A model appears only on dates where it was present in that top-10 slice.
