Our work on Asynchronous RLHF was accepted to #ICLR2025! (I was so excited to announce it, I forgot to say I was excited.) Used by @ai2.bsky.social for OLMo-2 32B 🔥 New results show ~70% speedups for LLM + RL math and reasoning 🧠 🧵 below, or hear my DLCT talk online on March 28!