Model
Score
GPT 5
64%
64%
Grok 4
61%
61%
Gemini 2.5 Flash
60%
60%
Gemini 2.5 Pro
60%
60%
o3 Pro
60%
60%
The APEX Leaderboard assesses the performance of frontier AI models at performing professional work that drives the economy. The reported scores are their average performance across four roles: Investment Banking Analyst, Management Consulting Associate, Big Law Associate, and General Practitioner (MD).
Model
Score
GPT 5
64%
64%
Grok 4
61%
61%
Gemini 2.5 Flash
60%
60%
Gemini 2.5 Pro
60%
60%
o3 Pro
60%
60%