The world's leading AI models, ranked
Aggregated live from Artificial Analysis and OpenRouter. Track every frontier model: intelligence, coding, math, speed and cost, all in one place.
| Rank | Model | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) Anthropic | 65.8 | 59.9 | 76.5 | 0 | $20 | ||||
| 2 | Claude Opus 4.8 (Adaptive Reasoning, Max Effort) Anthropic | 62.3 | 55.7 | 74.3 | 71 | $10 | ||||
| 3 | GPT-5.5 (xhigh) OpenAI | 61.9 | 54.8 | 74.9 | 64 | $11.25 | ||||
| 4 | Claude Opus 4.7 (Adaptive Reasoning, Max Effort) Anthropic | 60.6 | 53.5 | 73.6 | 60 | $10 | ||||
| 5 | GPT-5.4 (xhigh) OpenAI | 58.4 | 51.4 | 71.1 | 161 | $5.63 | ||||
| 6 | GLM-5.2 (max) Zhipu AIOpen weights | 57.3 | 51.1 | 68.8 | 112 | $2.15 | ||||
| 7 | Gemini 3.5 Flash GoogleT🖼🎬📄🔊 | 57.2 | 50.2 | 70.1 | 240 | $3.38 | 1.0M | |||
| 8 | Gemini 3.1 Pro Preview Google🔊📄🖼T🎬 | 54.4 | 46.5 | 68.8 | 140 | $4.5 | 1.0M | |||
| 9 | GPT-5.2 (xhigh) OpenAI | 54.4 | 42.2 | 99 | 88 | $4.81 | ||||
| 10 | Qwen3.7 Max AlibabaOpen weightsT | 53.1 | 46 | 66 | 202 | $3.75 | 1M |
Frequently asked questions
+ What is the best AI model right now?
By the DataCore composite score (intelligence, coding, math), Claude Fable 5 (Adaptive Reasoning, Max Effort, Opus 4.8 Fallback) currently leads our ranking of 529 AI models from 57 providers, updated continuously.
+ Which AI model is the cheapest?
Several models is among the lowest-cost frontier options. You can sort and filter the leaderboard by price.
+ What is the fastest LLM?
Mercury 2 reaches the highest output speed (~1192 tokens/sec). Speed and latency are measured by Artificial Analysis.
+ What is an AI model "time horizon"?
It's METR's metric: the length of task (in human time) a model can complete with a 50% success rate. A longer time horizon means the model can handle longer, more complex agentic tasks.
+ What is the best open-source LLM?
The board tracks 222 open-weight models such as DeepSeek, Qwen, Llama and GLM. Use the "Open weights" filter to compare them head-to-head.
+ Where does the data come from and how often is it updated?
Aggregated from Artificial Analysis (intelligence, coding, math, speed, price), METR (time horizon) and OpenRouter (catalog, live pricing). Data refreshes continuously (≈10-minute cache).