//
sign in
Profile
by @danabra.mov
Profile
by @dansshadow.bsky.social
Profile
by @jimpick.com
AviHandle
by @danabra.mov
AviHandle
by @dansshadow.bsky.social
AviHandle
by @katherine.computer
EventsList
by @katherine.computer
ProfileHeader
by @dansshadow.bsky.social
ProfileHeader
by @danabra.mov
ProfileMedia
by @danabra.mov
ProfilePlays
by @danabra.mov
ProfilePosts
by @danabra.mov
ProfilePosts
by @dansshadow.bsky.social
ProfileReplies
by @danabra.mov
Record
by @atsui.org
Skircle
by @danabra.mov
StreamPlacePlaylist
by @katherine.computer
+ new component
Profile
Loading...









Loading...
Do you want to know what information LLMs prioritize in text synthesis tasks? Here's a short 🧵 about our new paper, led by Jan Trienes: an interpretable framework for salience analysis in LLMs. First of all, information salience is a fuzzy concept. So how can we even measure it? (1/6)
Check out our paper for more results and analysis! 📝 arxiv.org/abs/2504.09373 🐙 github.com/AlliteraryAl... This was a fun collaboration with @yatingwu.bsky.social @asher-zheng.bsky.social @manyawadhwa.bsky.social @gregdnlp.bsky.social @jessyjli.bsky.social
We at UT Linguistics are hiring for 🔥 2 faculty positions in Computational Linguistics! Assistant or Associate professors, deadline Dec 1. UT has a super vibrant comp ling & #nlp community!! Apply here 👉 apply.interfolio.com/158280
Feb 21, 2025
Apr 21, 2025
Nov 19, 2024
[3/4] Do VLMs actually ground in the figure? Fine-tuning Qwen3.5-9B on MQUD makes generated questions more grounded in the figure and more specific to the paper’s scientific content.
1mo
Ramya Namuduri
Yating Wu
Jessy Li
Jessy Li
I did a starter pack of ML/AI people at @utaustin.bsky.social Please distribute and feel free to self nominate! go.bsky.app/QLQznZg
✨New paper✨ Linguistic evaluations of LLMs often implicitly assume that language is generated by symbolic rules. In a new position paper, @adelegoldberg.bsky.social, @kmahowald.bsky.social and I argue that languages are not Lego sets, and evaluations should reflect this! arxiv.org/pdf/2502.13195
1k+ downloads each on the MINT empathy models since release 🔥 Encouraging to see the interest in our work! tl;dr: In multi-turn empathic dialogue, LLMs reuse the same discourse moves far more often than humans do; MINT uses RL to diversify them. Give it a try!👇 huggingface.co/hongli-zhan/...
Nov 22, 2024
Feb 20, 2025
1mo
Atlas Wang
Leonie Weissweiler
Hongli Zhan
As large language models become increasingly capable at various writing tasks, their weakness at generating unique and creative content becomes a major liability. Although LLMs have the ability to gen...
arxiv.org
QUDsim: Quantifying Discourse Similarities in LLM-Generated Text
at://did:plc:vz2my7lhvw254yf43nom4otr/app.bsky.graph.starterpack/3lbjnpo4jc32a
New paper! 🏁 Last one from my PhD at UT Austin. LLMs sound empathic but repeat the same discourse moves turn after turn — at 2x the rate of humans. We built MINT🌿, the first RL framework for discourse move diversity in empathic dialogue. +25% empathy, −26% repetition. 📄 arxiv.org/abs/2604.11742
1mo
Hongli Zhan
What does a scientific figure make you wonder? 📊 We introduce MQUD: multimodal Questions Under Discussion for scientific figures. With 1,250 author-annotated questions over 245 figures from 56 papers, MQUD asks what scientific question a figure raises in context.
1mo
Yating Wu
[2/4] These questions often require reasoning across the figure and paper text: Why does this curve shift? What comparison is scientifically meaningful? What claim is this figure supporting? 🔍
1mo
Yating Wu
[4/4] Paper: arxiv.org/abs/2604.23733 Project page: lingchensanwen.github.io/multimodal-q... Dataset: huggingface.co/datasets/lin... w/ William Rudman, @venkatasg.net @alexdimakis.bsky.social, @jessyjli.bsky.social
1mo
Yating Wu