In multi-turn conversations, LLMs tend to repeat the same kinds of things over and over. The wording may differ, but we found them to be the *same discourse moves*!
Introducing @hongli-zhan.bsky.social’s new work: novel discourse-level diversity rewards in post-training:
Jessy Li
New paper! 🏁 Last one from my PhD at UT Austin.
LLMs sound empathic but repeat the same discourse moves turn after turn — at 2x the rate of humans.
We built MINT🌿, the first RL framework for discourse move diversity in empathic dialogue. +25% empathy, −26% repetition.
📄 arxiv.org/abs/2604.11742