Inlay

//

ProfilePosts

Have that eerie feeling of déjà vu when reading model-generated text 👀, but can’t pinpoint the specific words or phrases 👀? ✨We introduce QUDsim, to quantify discourse similarities beyond lexical, syntactic, and content overlap.

Check out our paper for more results and analysis! 📝 arxiv.org/abs/2504.09373 🐙 github.com/AlliteraryAl... This was a fun collaboration with @yatingwu.bsky.social @asher-zheng.bsky.social @manyawadhwa.bsky.social @gregdnlp.bsky.social @jessyjli.bsky.social

The “LLM vibe” is real even when the actual content is different. Across several genres from creative writing to obituaries, different LLMs generate homogenous discourse compared to humans.

Apr 21, 2025

QUDsim assigns a similarity score between two documents. It works by considering to what extent one document answers another's QUDs, and vice versa. Segment alignments between the texts can also be derived.

As large language models become increasingly capable at various writing tasks, their weakness at generating unique and creative content becomes a major liability. Although LLMs have the ability to gen...

arxiv.org

QUDsim: Quantifying Discourse Similarities in LLM-Generated Text

Apr 21, 2025

Ramya Namuduri