now accepted at ICLR! 🐺🥳🐺
arxiv.org/abs/2506.20666
Tomer Ullman
NEW on our #DeeperLearning blog
People balance being kind vs. being honest — and #LLMs should too.
New research shows training choices often favor informativeness over kindness, but prompting can induce sycophancy.
Read more: bit.ly/3Wqrtxl
People’s actions and words are the result of a balance of different goals. The authors use a leading cognitive model of this value trade-off in polite speech to systematically examine […]