//
sign in
Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
+ new component
Post
Our framework shows, both theoretically and empirically, that online MARL self-improvement can reach a new frontier for safety alignment of LMs. Check out more details at: šŸ“šššš©šžš«: arxiv.org/abs/2506.07468 šŸ“š‚šØššž: github.com/mickelliu/s...
Jun 12, 2025
Mickel Liu