Inlay

We’ve backfilled FrontierMath: Tiers 1–4 (v2) scores for a selection of notable models, including recent Claude Opus models. You can find these on our website. We will add scores for Claude Fable 5 and GPT Pro models shortly. epoch.ai/frontiermat...

FrontierMath Tiers 1-4 is an AI benchmark of hundreds of unpublished and extremely challenging math problems.