Post
ManagerBench was accepted to #ICLR2026🎉 Check it out⬇️
4mo
Adi Simhi
ManagerBench was accepted to ICLR! @iclr-conf.bsky.social #ICLR2026 LLMs are still either unsafe or completely harm-avoidant, even when the harm affects furniture 🛋️ Check out our benchmark, online or in Rio 🇧🇷
4mo
Martin Tutek
🤔 What happens when LLM agents choose between achieving their goals and avoiding harm to humans in realistic management scenarios? Are LLMs pragmatic, or do they prefer to avoid human harm? 🚀 New paper out: ManagerBench: Evaluating the Safety-Pragmatism Trade-off in Autonomous LLMs 🚀🧵
8mo
Martin Tutek