The AI Productivity Index
APEX measures how well AI models perform real-world, economically valuable work.

The APEX Leaderboard

The APEX Leaderboard assesses how well frontier AI models perform the professional work that drives the economy. Each reported score is a model's average performance across four roles: Investment Banking Analyst, Management Consulting Associate, Big Law Associate, and General Practitioner (MD).

Model              Score
GPT 5              64%
Grok 4             61%
Gemini 2.5 Flash   60%
Gemini 2.5 Pro     60%
o3 Pro             60%

Domains in APEX

Consulting Associate
Analyzes industries, evaluates markets, and builds strategic or financial models to guide client decisions. Work often includes preparing presentations, drafting reports, and synthesizing research into actionable recommendations.
Advised by Dominic Barton—former McKinsey Global Managing Director and Canadian Ambassador to China.
Experts from McKinsey, BCG, Deloitte, Accenture, EY
gpt-5            65%
grok-4           63%
gemini-2-5-pro   63%
Investment Banking Analyst
Builds financial models, values companies, and prepares pitch materials for potential deals. Responsibilities include conducting industry research, supporting transaction execution, and producing client-ready presentations under tight deadlines.
Experts from Goldman Sachs, Morgan Stanley, JPMorgan, Barclays, UBS, Bank of America, Evercore
gpt-5            60%
deepseek-r1      58%
gemini-2-5-pro   58%
Big Law Associate
Drafts and reviews contracts, conducts legal research, and advises clients on regulatory and transactional matters. Collaborates with partners on litigation, mergers and acquisitions, and compliance while managing heavy workloads across cases.
Advised by Cass Sunstein—Harvard law professor, former White House Regulatory Administrator, and top-cited legal scholar.
Experts from Latham & Watkins, Skadden, Cravath
gpt-5               71%
claude-sonnet-4.5   70%
o3                  68%
General Practitioner (MD)
Diagnoses and treats a wide range of patient conditions, from acute illnesses to chronic diseases. Reviews medical histories, orders and interprets tests, prescribes treatments, and provides preventative care and ongoing patient guidance.
Advised by Eric Topol, cardiologist, geneticist, founder of the Scripps Research Translational Institute, and a leading voice in digital and precision medicine.
Experts from University of Pennsylvania, Northwestern, Cornell, Brigham & Women’s, Mount Sinai
claude-sonnet-4.5   62%
gpt-5               62%
grok-4              59%
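
The leaderboard score is described as an average across the four roles. As a rough illustration, the sketch below averages the per-role figures shown for GPT 5 in the domain sections above. It assumes an unweighted mean and uses the rounded percentages displayed on this page; the page does not state the exact weighting or the unrounded inputs, and the function name overall_score is hypothetical.

```python
from statistics import mean

# Rounded per-role scores for GPT 5 as displayed on this page (percent).
gpt5_role_scores = {
    "Management Consulting Associate": 65,
    "Investment Banking Analyst": 60,
    "Big Law Associate": 71,
    "General Practitioner (MD)": 62,
}


def overall_score(role_scores):
    """Unweighted mean of per-role scores (an assumed aggregation)."""
    return mean(role_scores.values())


overall = overall_score(gpt5_role_scores)
print(f"{overall:.1f}%")
```

The result is close to, but not identical to, the 64% reported for GPT 5 on the leaderboard. The gap is plausibly explained by rounding in the displayed per-role figures, though that is an assumption rather than something the page confirms.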