Updated June 11, 2026

AI Power Rankings

Independent · Frontier AI lab index

Evaluators

Each ranking is the panel mean across four frontier evaluator models. Each model is given the same rubric and the same set of fifteen labs, and is required to justify its own scores in writing. The places where the panel disagrees are usually the most interesting story.
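As a rough illustration of the panel mean described above, here is a minimal Python sketch. It assumes an unweighted average, which the page does not confirm; the evaluator keys, the score values, and the `panel_mean` helper are all hypothetical and are not taken from any real run.

```python
from statistics import mean

# Hypothetical scores for a single lab, one per evaluator.
# These numbers are illustrative only, not real panel data.
scores_by_evaluator = {
    "evaluator_a": 8.0,
    "evaluator_b": 7.0,
    "evaluator_c": 9.0,
    "evaluator_d": 6.0,
}


def panel_mean(scores):
    """Unweighted mean of one lab's scores across the panel (assumed)."""
    return mean(scores.values())


lab_score = panel_mean(scores_by_evaluator)
print(lab_score)
```

An unweighted mean treats every evaluator equally. If the index weights evaluators differently, the helper would need a weights argument.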

Latest run: 2026-06-11 · 4 of 8 models

Panel

Claude Fable 5
2026-06-11
claude-fable-5
Reasoning: extended thinking · Web search: off

Gemini 3.1 Pro
2026-06-11
gemini-3.1-pro-2026-05
Reasoning: standard · Web search: on

GPT-5.5 Pro
2026-06-11
unknown
Reasoning: standard · Web search: on

Grok 4.3 Expert
2026-06-11
grok-4.3-expert
Reasoning: deep research · Web search: on

Inter-evaluator disagreement

Average spread (max − min) across the ten areas
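One possible reading of this metric, sketched in Python: for each area, take the highest evaluator score minus the lowest, then average those spreads over all areas. The area names, the score values, and the `spread` and `average_spread` helpers are hypothetical, and the page may aggregate differently (for example, per lab before averaging).

```python
from statistics import mean

# Hypothetical data: area name -> scores from the four evaluators.
# Only two areas are shown for brevity; values are illustrative only.
scores_by_area = {
    "area_1": [8.0, 7.0, 9.0, 6.0],
    "area_2": [5.0, 5.0, 6.0, 5.0],
}


def spread(values):
    """Disagreement within one area: highest score minus lowest."""
    return max(values) - min(values)


def average_spread(by_area):
    """Mean of the per-area spreads (assumed aggregation)."""
    return mean([spread(v) for v in by_area.values()])


avg = average_spread(scores_by_area)
print(avg)
```

A larger value means the evaluators diverge more on average. A value of zero would mean every evaluator gave identical scores in every area.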