Inlay

//

ProfilePosts

Loading...

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models by @sagnikmukherjee.bsky.social, Lifan Yuan, @dilekh.bsky.social, Hao Peng Read more here: arxiv.org/abs/2505.11711 x.com/saagnikkk/st...

8mo

arxiv.org

Reinforcement learning (RL) yields substantial improvements in large language models (LLMs) downstream task performance and alignment with human values. Surprisingly, such large gains result from upda...

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

Thrilled to announce our new survey that explores the exciting possibilities and troubling risks of computational persuasion in the era of LLMs 🤖💬 📄Arxiv: arxiv.org/pdf/2505.07775 💻 GitHub: github.com/beyzabozdag/...

May 13, 2025

[5/5] Persuasion research is still playing catch-up, promising great advancements!✨ Thank you to my amazing co-authors! @shuhaib.bsky.social @xiaocheng-yang.bsky.social @HyeonjeongHa @ziruicheng.bsky.social @EsinDurmus @JiaxuanYou @HengJi @gokhantur.bsky.social @dilekh.bsky.social