Off-the-shelf data

Diverse datasets spanning professional, coding, research, agentic, and multimodal tasks.

Talk to our team

50K+ tasks

1M+ domain experts

30+ covered domains

Get to training in days, not months

No waiting: the same datasets used by the world's top AI labs are ready to license immediately.

Immediately available

Every dataset is pre-built, peer-reviewed, and ready to license. Sample tasks delivered same day. No custom pipeline, no 6-month wait.

Created by domain experts

Every task is written by a domain expert, such as a PhD researcher, practicing lawyer, or senior engineer, and never crowdsourced.

Quality assured

Each task is created, vetted, and peer-reviewed by domain experts, with automated quality checks to ensure a useful training signal.

Expert-written and graded

Every dataset is built from tasks created and evaluated by our expert network.

APEX Agents

Mercor's AI Productivity Index for Agents. Expert-built tasks run inside high-fidelity enterprise app clones, testing whether agents can navigate hundreds of files, hold context, and finish long-horizon professional work.

Tool use, Long horizon, Reasoning, Information retrieval, Multiple modalities, Agentic, Professional services

2,620 tasks across 8 domains

Domains

Finance: 830
Consulting: 590
Law: 530
Medicine: 250
Education: 100
Data Science: 200
Accounting: 40
HR: 80

Featured Datasets

ACE

Mercor's AI Consumer Index. The first benchmark for everyday consumer tasks across shopping, food, gaming, and DIY, penalizing models that hallucinate prices, specs, or links instead of verifying them.

Web search, Instruction following, Reasoning, Consumer

1,200 tasks across 4 domains

APEX v1

The AI Productivity Index. Rubric-graded tasks test whether models can do the real knowledge work of investment bankers, consultants, lawyers, and physicians.

Long form inputs, Information processing, Reasoning, Professional services

920 tasks across 5 domains

BrowseComp

A web-browsing benchmark for realistic, end-to-end search. Tasks test whether models can strategize, navigate authoritative sources, and synthesize grounded answers to questions that general knowledge alone can't answer.

Web search, Reasoning, Instruction following, Multiple domains

3,558 tasks

Many more datasets are available. Reach out to our team to learn more.

Ready to train on data that actually moves models?

Sample tasks from any dataset, delivered same day.
