//
sign in
Profile
by @danabra.mov
Profile
by @dansshadow.bsky.social
Profile
by @jimpick.com
AviHandle
by @danabra.mov
AviHandle
by @dansshadow.bsky.social
AviHandle
by @katherine.computer
EventsList
by @katherine.computer
ProfileHeader
by @dansshadow.bsky.social
ProfileHeader
by @danabra.mov
ProfileMedia
by @danabra.mov
ProfilePlays
by @danabra.mov
ProfilePosts
by @danabra.mov
ProfilePosts
by @dansshadow.bsky.social
ProfileReplies
by @danabra.mov
Record
by @atsui.org
Skircle
by @danabra.mov
StreamPlacePlaylist
by @katherine.computer
+ new component
Profile
Loading...




Loading...
Accepted at ACL main! Come chat about dialectal MT at our poster today at 4 pm. Also, check out this largely bug-free package for generating your own synthetic dialectal data: pypi.org/project/dial...
Are you tired of getting meh results from LLMs in your native language and resorting to English instead? We are too!
10mo
Frustrated with how most of the world’s low-resource languages have NO evaluation resources? 📢 Check out ChiKhaPo, a massively multilingual lexical comprehension and generation benchmark covering 2700+ languages. www.arxiv.org/abs/2510.16928
You have a budget to human-evaluate 100 inputs to your models, but your dataset is 10,000 inputs. Do not just pick 100 randomly!🙅 We can do better. "How to Select Datapoints for Efficient Human Evaluation of NLG Models?" shows how.🕵️ (random is still a devilishly good baseline)
Multimodal LLMs can read text in images, but why do they often perform worse than when the same text is given as tokens? Our work studies the modality gap of models perceiving text as pixels and shows how to close it. 📄 arxiv.org/abs/2603.09095 🧵👇 #NLProc #LLM #ComputerVision
6d
7mo
11mo
3mo
Niyati Bafna
Patricia Schmidtova
Vilém Zouhar
Kaiser Sun