Excited about our new work measuring multi-turn persuasion in AI-human interactions and how to simulate human persuadability!
Our new short piece in TiCS on intuitive theories of truth: how people judge whether statements could be true, whether statements are true, and whether to assert them as true. A great collab with @keremoktar.bsky.social
@ihandleyminer.bsky.social @kevinzollman.com @lianeleeyoung.bsky.social
New paper challenges how we think about Theory of Mind. What if we model others as executing simple behavioral scripts rather than reasoning about complex mental states? Our algorithm, ROTE (Representing Others' Trajectories as Executables), treats behavior prediction as program synthesis.
Really excited to have the opportunity to give a talk on this work @cogscisociety.bsky.social !!! last year was a blast can’t wait to go back to Rio in July 🇧🇷
HUGE thanks to my collaborators for the support @aydanhuang265.bsky.social @EricYe29011995 @natashajaques.bsky.social @maxkw.bsky.social 🙏
Can't wait to present this work @iclr-conf.bsky.social this year!!! Looking forward to hearing everyone's thoughts on the paper and learning more about peoples' research!
Thanks again to my collaborators for all of their help on this project!
Max Kleiman-Weiner
Max Kleiman-Weiner
Max Kleiman-Weiner
Max Kleiman-Weiner
LLMs can shift people's beliefs.
But most persuasion studies only check beliefs before and after a conversation.
We built PersuasionTrace to measure beliefs turn by turn, so we can study how belief updates actually unfold.
Forget modeling every belief and goal! What if we represented people as following simple scripts instead (i.e "cross the crosswalk")?
Our new paper shows AI which models others’ minds as Python code 💻 can quickly and accurately predict human behavior!
shorturl.at/siUYI%F0%9F%...
Task diversity is supposedly key to generalization in RL. But what does it do to continual RL, where agents face one new task distribution after another?
We find that past a point, more diversity actually inhibits continual reinforcement learning 🧵