//
sign in
Profile
by @danabra.mov
Profile
by @dansshadow.bsky.social
Profile
by @jimpick.com
AviHandle
by @danabra.mov
AviHandle
by @dansshadow.bsky.social
AviHandle
by @katherine.computer
EventsList
by @katherine.computer
ProfileHeader
by @dansshadow.bsky.social
ProfileHeader
by @danabra.mov
ProfileMedia
by @danabra.mov
ProfilePlays
by @danabra.mov
ProfilePosts
by @danabra.mov
ProfilePosts
by @dansshadow.bsky.social
ProfileReplies
by @danabra.mov
Record
by @atsui.org
Skircle
by @danabra.mov
StreamPlacePlaylist
by @katherine.computer
+ new component
Profile
Loading...
The first iteration of our workshop will be co-located with @colmweb.org 2025 in Montreal. https://wmdqs.org/
Workshop on Multilingual Data Quality Signals









Loading...
If you were able to join us, let us know about your experience: docs.google.com/forms/d/e/1F...
Then we had our second poster session for our paper submissions. The full papers are available on our website!
WMDQS is underway! Come join us in Room 520A at @colmweb.org! #COLM2025
Thank you everyone for coming to WMDQS (pronounced "whim ducks")!
David Adelani gave a keynote about text quality for low-resource languages.
We started with a keynote from @juliakreutzer.bsky.social about multilingual fine-tuning data!
We had our first poster session, hearing from some of our shared task participants!
We presented the results of our shared task! We received annotations for over 30,000 document representing over 60 languages. We also showed the results of our LangID dataset and system shared task tracks. Thank you everyone who participated!
After lunch, @sebnagel.bsky.social gave a keynote about the data collected by @commoncrawl.bsky.social!
8mo
8mo
8mo
8mo
8mo
Looking forward to tomorrow's #COLM2025 workshop on multilingual data quality! 🤩
8mo
8mo
8mo
8mo
8mo
Workshop on Multilingual Data Quality Signals
Workshop on Multilingual Data Quality Signals
Workshop on Multilingual Data Quality Signals
Workshop on Multilingual Data Quality Signals
Workshop on Multilingual Data Quality Signals
Workshop on Multilingual Data Quality Signals
Workshop on Multilingual Data Quality Signals
Workshop on Multilingual Data Quality Signals
Workshop on Multilingual Data Quality Signals
Julia Kreutzer
In collaboration with @commoncrawl.bsky.social, MLCommons, and @eleutherai.bsky.social, the first edition of WMDQS at @colmweb.org starts tomorrow in Room 520A! We have an updated schedule on our website, including a list of all accepted papers.
8mo
Workshop on Multilingual Data Quality Signals