Inlay

Profile

Do you want to know what information LLMs prioritize in text synthesis tasks? Here's a short 🧵 about our new paper, led by Jan Trienes: an interpretable framework for salience analysis in LLMs. First of all, information salience is a fuzzy concept. So how can we even measure it? (1/6)

Check out our paper for more results and analysis! 📝 arxiv.org/abs/2504.09373 🐙 github.com/AlliteraryAl... This was a fun collaboration with @yatingwu.bsky.social @asher-zheng.bsky.social @manyawadhwa.bsky.social @gregdnlp.bsky.social @jessyjli.bsky.social

We at UT Linguistics are hiring for 🔥 2 faculty positions in Computational Linguistics! Assistant or Associate professors, deadline Dec 1. UT has a super vibrant comp ling & #nlp community!! Apply here 👉 apply.interfolio.com/158280

Feb 21, 2025

Apr 21, 2025

Nov 19, 2024

[3/4] Do VLMs actually ground in the figure? Fine-tuning Qwen3.5-9B on MQUD makes generated questions more grounded in the figure and more specific to the paper’s scientific content.

1mo

Ramya Namuduri

Yating Wu

Jessy Li

I did a starter pack of ML/AI people at @utaustin.bsky.social Please distribute and feel free to self nominate! go.bsky.app/QLQznZg

✨New paper✨ Linguistic evaluations of LLMs often implicitly assume that language is generated by symbolic rules. In a new position paper, @adelegoldberg.bsky.social, @kmahowald.bsky.social and I argue that languages are not Lego sets, and evaluations should reflect this! arxiv.org/pdf/2502.13195

1k+ downloads each on the MINT empathy models since release 🔥 Encouraging to see the interest in our work! tl;dr: In multi-turn empathic dialogue, LLMs reuse the same discourse moves far more often than humans do; MINT uses RL to diversify them. Give it a try!👇 huggingface.co/hongli-zhan/...

Nov 22, 2024

Feb 20, 2025

1mo

Atlas Wang

Leonie Weissweiler

Hongli Zhan

As large language models become increasingly capable at various writing tasks, their weakness at generating unique and creative content becomes a major liability. Although LLMs have the ability to gen...

arxiv.org

QUDsim: Quantifying Discourse Similarities in LLM-Generated Text

at://did:plc:vz2my7lhvw254yf43nom4otr/app.bsky.graph.starterpack/3lbjnpo4jc32a

New paper! 🏁 Last one from my PhD at UT Austin. LLMs sound empathic but repeat the same discourse moves turn after turn — at 2x the rate of humans. We built MINT🌿, the first RL framework for discourse move diversity in empathic dialogue. +25% empathy, −26% repetition. 📄 arxiv.org/abs/2604.11742

1mo

Hongli Zhan

What does a scientific figure make you wonder? 📊 We introduce MQUD: multimodal Questions Under Discussion for scientific figures. With 1,250 author-annotated questions over 245 figures from 56 papers, MQUD asks what scientific question a figure raises in context.

1mo

Yating Wu

[2/4] These questions often require reasoning across the figure and paper text: Why does this curve shift? What comparison is scientifically meaningful? What claim is this figure supporting? 🔍

1mo

Yating Wu

[4/4] Paper: arxiv.org/abs/2604.23733 Project page: lingchensanwen.github.io/multimodal-q... Dataset: huggingface.co/datasets/lin... w/ William Rudman, @venkatasg.net @alexdimakis.bsky.social, @jessyjli.bsky.social

1mo

Yating Wu