arXiv: arxiv.org/abs/2510.01272

Excited about our new work measuring multi-turn persuasion in AI-human interactions and how to simulate human persuadability!

Our new short piece in TiCS on intuitive theories of truth: how people judge whether statements could be true, whether statements are true, and whether to assert them as true. A great collab with @keremoktar.bsky.social @ihandleyminer.bsky.social @kevinzollman.com @lianeleeyoung.bsky.social

New paper challenges how we think about Theory of Mind. What if we model others as executing simple behavioral scripts rather than reasoning about complex mental states? Our algorithm, ROTE (Representing Others' Trajectories as Executables), treats behavior prediction as program synthesis.

16d

3mo

8mo

Really excited to have the opportunity to give a talk on this work @cogscisociety.bsky.social!!! Last year was a blast, and I can’t wait to go back to Rio in July 🇧🇷 HUGE thanks to my collaborators for the support @aydanhuang265.bsky.social @EricYe29011995 @natashajaques.bsky.social @maxkw.bsky.social 🙏

2mo

Can't wait to present this work @iclr-conf.bsky.social this year!!! Looking forward to hearing everyone's thoughts on the paper and learning more about people's research! Thanks again to my collaborators for all of their help on this project!

Max Kleiman-Weiner

LLMs can shift people's beliefs. But most persuasion studies only check beliefs before and after a conversation. We built PersuasionTrace to measure beliefs turn by turn, so we can study how belief updates actually unfold.

Forget modeling every belief and goal! What if we represented people as following simple scripts instead (e.g., "cross the crosswalk")? Our new paper shows that AI that models others’ minds as Python code 💻 can quickly and accurately predict human behavior! shorturl.at/siUYI%F0%9F%...

4mo

16d

Task diversity is supposedly key to generalization in RL. But what does it do to continual RL, where agents face one new task distribution after another? We find that past a point, more diversity actually inhibits continual reinforcement learning 🧵

8mo

19d

Kunal Jha

🤔💭 What even is reasoning? It's time to answer the hard questions! We built the first unified taxonomy of 28 cognitive elements underlying reasoning. Spoiler: LLMs commonly employ sequential reasoning, rarely self-awareness, and often fail to use correct reasoning structures 🧠

6mo

Jared Moore