Inlay

//

Post

AI scores a 'C-' ...

1d

Alan Pater

The second batch of “First Proof” problems is meant to evaluate AI’s usefulness for research-level math. The best model got six or seven of the 10 questions right

1d

The second batch of “First Proof” problems is meant to evaluate AI’s usefulness for research-level math. The best model got six or seven of the 10 questions right

AI scores a ‘C–’ on its hardest math test yet

www.scientificamerican.com

Scientific American