"CLDF Meta" (meta.clld.org) links to data from over 800 CLDF datasets that have been released on Zenodo (e.g. WALS, APiCS and Grambank). There are links to data on over 9000 languoids, e.g. 98 entries on Ambulas (to take a random language). Great work by my colleague Johannes Englisch!
With big thanks to Alexey Koshevoy (@alexeykoshevoy.bsky.social), Marie Hallo (@marie-hl.bsky.social) and Rowan Hall Maudslay!
New post: Tagging my blog with BERTopic and LLMs.
I suspect as token costs increase, we'll see more blended traditional ML/LLM systems. And, in general, they work really well!
vickiboykis.com/2026/05/18/t...
Scientists are asking if it might be unconscious
Multiply university tuition fees for non-European international students by 15.
Trample on equal access to public services.
Call the program "Bienvenue en France" ("Welcome to France") 🇫🇷🥖
🔴 Sign the petition and get others to sign it, it takes 2 minutes:
petitions.assemblee-nationale.fr/initiatives/...
LLMs mean you still need a human in the loop, but in a different part
The Bienvenue en France strategy claims to boost France's international appeal by multiplying university tuition fees for non-European students by 15. If the exem...
🚨New paper alert!🚨 In our new paper with @oliviermorin.bsky.social, Rowan Hall, and Marie Halo, published in the Journal of Language Evolution, we explored the relationship between word length and ambiguity in a large cross-linguistic dataset. 1/6