Our framework shows, both theoretically and empirically, that online multi-agent reinforcement learning (MARL) self-improvement can reach a new frontier for the safety alignment of LMs. More details:
📍 Paper: arxiv.org/abs/2506.07468
📍 Code: github.com/mickelliu/s...