FrontierMath: Tiers 1–4 is now approaching saturation. We believe the future of math benchmarking lies in open problems drawn from real research, like those we’ve collected in FrontierMath: Open Problems.
epoch.ai/frontiermat...
A collection of unsolved mathematical problems designed to test AI systems' ability to advance human mathematical knowledge.