//
sign in
Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
+ new component
Post
We’ve backfilled FrontierMath: Tiers 1–4 (v2) scores for a selection of notable models, including recent Claude Opus models. You can find these on our website. We will add scores for Claude Fable 5 and GPT Pro models shortly. epoch.ai/frontiermat...
15h
epoch.ai
FrontierMath Tiers 1-4 is an AI benchmark of hundreds of unpublished and extremely challenging math problems.
FrontierMath: LLM Benchmark for Advanced AI Math Reasoning
Epoch AI