Post
🧵1/ 🚨 New paper: A Sober Look at Progress in Language Model Reasoning. We re-evaluate recent SFT and RL models for mathematical reasoning and find most gains vanish under rigorous, multi-seed, standardized evaluation. 📊 bethgelab.github.io/sober-reason... 📄 arxiv.org/abs/2504.07086
Apr 10, 2025
Andreas Hochlehnert