The AI Productivity Index
APEX measures how well AI models perform real-world, economically valuable work.

The APEX Leaderboard: assessing frontier models for economic value

This APEX leaderboard presents the global ranking of models based on their averaged results across all task domains. It shows which AI models are best overall at performing the kinds of professional work that drive the economy.

Model

Score

GPT 5

GPT 5

64%

Grok 4

Grok 4

61%

Gemini 2.5 Flash

Gemini 2.5 Flash

60%

Gemini 2.5 Pro

Gemini 2.5 Pro

60%

o3 Pro

o3 Pro

60%

Domains in APEX

Consulting Associate
Analyzes industries, evaluates markets, and builds strategic or financial models to guide client decisions. Work often includes preparing presentations, drafting reports, and synthesizing research into actionable recommendations.
Advised by Dominic Barton—former McKinsey Global Managing Director and Canadian Ambassador to China.
Experts from McKinsey, BCG, Deloitte, Accenture, EY
gpt-5
gpt-5
65%
grok-4
grok-4
63%
gemini-2-5-pro
gemini-2-5-pro
63%
Investment Banking Analyst
Builds financial models, values companies, and prepares pitch materials for potential deals. Responsibilities include conducting industry research, supporting transaction execution, and producing client-ready presentations under tight deadlines.
Experts from Goldman Sachs, Morgan Stanley, JPMorgan, Barclays, UBS, Bank of America, Evercore
gpt-5
gpt-5
60%
deepseek-r1
deepseek-r1
58%
gemini-2-5-pro
gemini-2-5-pro
58%
Big Law Associate
Drafts and reviews contracts, conducts legal research, and advises clients on regulatory and transactional matters. Collaborates with partners on litigation, mergers and acquisitions, and compliance while managing heavy workloads across cases.
Advised by Cass Sunstein—Harvard law professor, former White House Regulatory Administrator, and top-cited legal scholar.
Experts from Latham & Watkins, Skadden, Cravath
gpt-5
gpt-5
71%
claude-sonnet-4.5
claude-sonnet-4.5
70%
o3
o3
68%
General Practitioner (MD)
Diagnoses and treats a wide range of patient conditions, from acute illnesses to chronic diseases. Reviews medical histories, orders and interprets tests, prescribes treatments, and provides preventative care and ongoing patient guidance.
Advised by Eric Topol—Cardiologist, geneticist, and founder of the Scripps Research Translational Institute, leading voice in digital and precision medicine.
Experts from University of Pennsylvania, Northwestern, Cornell, Brigham & Women’s, Mount Sinai
claude-sonnet-4.5
claude-sonnet-4.5
62%
gpt-5
gpt-5
62%
grok-4
grok-4
59%