Inlay

//

Post

ManagerBench was accepted to #ICLR2026🎉 Check it out⬇️

4mo

Adi Simhi

ManagerBench was accepted to ICLR! @iclr-conf.bsky.social #ICLR2026 LLMs are still either unsafe, or completely harm avoidant - even when the harm affects furniture 🛋️ Check out our benchmark, online or in Rio 🇧🇷

4mo

Martin Tutek

🤔What happens when LLM agents choose between achieving their goals and avoiding harm to humans in realistic management scenarios? Are LLMs pragmatic or prefer to avoid human harm? 🚀 New paper out: ManagerBench: Evaluating the Safety-Pragmatism Trade-off in Autonomous LLMs🚀🧵

8mo

Martin Tutek