Post
Scaling LLM Reasoning with EGGROLL 🥚🧠📝
Fine-tuning RWKV-7 language models with 🥚 outperforms GRPO on Countdown and GSM8K ❗
On the Countdown task, 🥚 significantly outperformed GRPO, reaching 35% validation accuracy versus GRPO's 23% ❗
6mo