Excited about our new work measuring multi-turn persuasion in AI-human interactions and how to simulate human persuadability!
LLMs can shift people's beliefs.
But most persuasion studies only check beliefs before and after a conversation.
We built PersuasionTrace to measure beliefs turn by turn, so we can study how belief updates actually unfold.
Task diversity is supposedly key to generalization in RL. But what does it do to continual RL, where agents face one new task distribution after another?
We find that past a point, more diversity actually inhibits continual reinforcement learning 🧵
Really excited to have the opportunity to give a talk on this work @cogscisociety.bsky.social !!! last year was a blast can’t wait to go back to Rio in July 🇧🇷
HUGE thanks to my collaborators for the support @aydanhuang265.bsky.social @EricYe29011995 @natashajaques.bsky.social @maxkw.bsky.social 🙏
Our new short piece in TiCS on intuitive theories of truth: how people judge whether statements could be true, whether statements are true, and whether to assert them as true. A great collab with @keremoktar.bsky.social
@ihandleyminer.bsky.social @kevinzollman.com @lianeleeyoung.bsky.social
Can't wait to present this work @iclr-conf.bsky.social this year!!! Looking forward to hearing everyone's thoughts on the paper and learning more about peoples' research!
Thanks again to my collaborators for all of their help on this project!
🤔💭What even is reasoning? It's time to answer the hard questions!
We built the first unified taxonomy of 28 cognitive elements underlying reasoning
Spoiler—LLMs commonly employ sequential reasoning, rarely self-awareness, and often fail to use correct reasoning structures🧠
Forget modeling every belief and goal! What if we represented people as following simple scripts instead (i.e "cross the crosswalk")?
Our new paper shows AI which models others’ minds as Python code 💻 can quickly and accurately predict human behavior!
shorturl.at/siUYI%F0%9F%...
arXiv: arxiv.org/abs/2510.01272
New paper challenges how we think about Theory of Mind. What if we model others as executing simple behavioral scripts rather than reasoning about complex mental states? Our algorithm, ROTE (Representing Others' Trajectories as Executables), treats behavior prediction as program synthesis.
Video
Accurate prediction of human behavior is essential for robust and safe human-AI collaboration. However, existing approaches for modeling people are often data-hungry and brittle because they either ma...