Do you want to know what information LLMs prioritize in text synthesis tasks? Here's a short 🧵 about our new paper, led by Jan Trienes: an interpretable framework for salience analysis in LLMs.
First of all, information salience is a fuzzy concept. So how can we even measure it? (1/6)
Check out our paper for more results and analysis!
📝 arxiv.org/abs/2504.09373
🐙 github.com/AlliteraryAl...
This was a fun collaboration with @yatingwu.bsky.social @asher-zheng.bsky.social @manyawadhwa.bsky.social @gregdnlp.bsky.social @jessyjli.bsky.social
We at UT Linguistics are hiring for 🔥 2 faculty positions in Computational Linguistics! Assistant or Associate professors, deadline Dec 1.
UT has a super vibrant comp ling & #nlp community!!
Apply here 👉 apply.interfolio.com/158280
[3/4] Do VLMs actually ground in the figure?
Fine-tuning Qwen3.5-9B on MQUD makes generated questions more grounded in the figure and more specific to the paper’s scientific content.
Ramya Namuduri
Yating Wu
Jessy Li
Jessy Li
I did a starter pack of ML/AI people at @utaustin.bsky.social Please distribute and feel free to self nominate!
go.bsky.app/QLQznZg
✨New paper✨
Linguistic evaluations of LLMs often implicitly assume that language is generated by symbolic rules.
In a new position paper, @adelegoldberg.bsky.social, @kmahowald.bsky.social and I argue that languages are not Lego sets, and evaluations should reflect this!
arxiv.org/pdf/2502.13195
1k+ downloads each on the MINT empathy models since release 🔥 Encouraging to see the interest in our work!
tl;dr: In multi-turn empathic dialogue, LLMs reuse the same discourse moves far more often than humans do; MINT uses RL to diversify them. Give it a try!👇
huggingface.co/hongli-zhan/...
Atlas Wang
Leonie Weissweiler
Hongli Zhan
As large language models become increasingly capable at various writing tasks, their weakness at generating unique and creative content becomes a major liability. Although LLMs have the ability to gen...
New paper! 🏁 Last one from my PhD at UT Austin.
LLMs sound empathic but repeat the same discourse moves turn after turn — at 2x the rate of humans.
We built MINT🌿, the first RL framework for discourse move diversity in empathic dialogue. +25% empathy, −26% repetition.
📄 arxiv.org/abs/2604.11742
Hongli Zhan
What does a scientific figure make you wonder? 📊
We introduce MQUD: multimodal Questions Under Discussion for scientific figures.
With 1,250 author-annotated questions over 245 figures from 56 papers, MQUD asks what scientific question a figure raises in context.
Yating Wu
[2/4] These questions often require reasoning across the figure and paper text:
Why does this curve shift?
What comparison is scientifically meaningful?
What claim is this figure supporting? 🔍