A framework for measuring, improving, and safely deploying enterprise agents using verifier-backed judgments over agent trajectories and outputs.