Teaching a course this fall at @isawnyu.bsky.social on text analysis for historical languages—course description here: diyclassics.github.io/isaw-f2026-g.... If you are at NYU (or a consortium program) and are interested, let me know.
#TeachAncient #DigiClass
Made some LatinCy-based suggestions here—what else are people working with?
Announcing—LatinCy Lexicon v0.1, a refactored version of Whitaker's Words that uses LatinCy annotations to disambiguate words/meanings. Can be added as a custom component to any LatinCy pipeline. github.com/latincy/lati... #digiclass #nlproc
From Whitaker's release notes... "Permission is hereby freely given for any and all use of program and data." Amazingly open license, allowing us to build cool stuff for Latin. And thanks also to Martin Keegan for hosting the maintenance of Words since 2015, cf. mk270.github.io/whitakers-wo...
Also—hoping to implement genuine word sense disambiguation soon by combining this functionality with the LatinCy token vectors... getting there...
Very cool—please be in touch about where things could be improved, always looking to make the models/components better.
Graduate course at ISAW (Fall 2026) introducing computational methods for historical-language research: Python-based text analysis & NLP via word embeddings, transformer models, and large language mod...
Cool, follow up with the results here; looking forward to seeing what you find. Hope to do a good amount of multilingual work in the coming year (though tbh Latin~Greek to start).