Sketch Engine is a linguistic search engine and corpus query system with text analysis tools and corpora in 100+ languages. Concordance, n-grams, term extraction, and co-occurrences (Word Sketch) are only some of its features.
Sketch Engine
Explore how Malay words change through time with our latest corpus — ideal for discovering #neologisms and emerging language trends. ske.li/malay_trends
An example of Sketch Engine used outside the field of pure linguistics. This study in media discourse analysis will be published in @nature.com www.nature.com/articles/s41... #MediaRepresentation #discourseanalysis #corpuslinguistics
Our new Chinese corpus in Traditional Chinese (繁體字) is now available. It is part-of-speech tagged and partly annotated for topics and genres. A useful resource for research and language teaching. #corpuslinguistics #digitalhumanities
www.sketchengine.eu/zhtenten-chi...
We’ve published a new Chinese corpus in Simplified Chinese (简体字). It is part-of-speech tagged and partly annotated for topics and genres. A useful resource for research and language technology. #corpuslinguistics #linguistics #nlp
www.sketchengine.eu/zhtenten-chi...
We’ve published the Urdu Corpus 2021 in Sketch Engine, with 328 million words and topic and genre classification. Urdu is the 11th most spoken language worldwide (Ethnologue, 2025).
🔗 www.sketchengine.eu/urtenten-urd...
#corpuslinguistics #TextAnalysis #اردو
The new Latvian Corpus 2021 is now available in Sketch Engine. The corpus is enriched with part-of-speech tagging and lemmatization. Perfect for #corpuslinguistics, #digitalhumanities, #linguistics, #lexicography, and #nlp.
The Telugu Web 2021 corpus, with 100+ million words and part-of-speech tagging, is now available in Sketch Engine! #corpuslinguistics, #digitalhumanities, #linguistics
www.sketchengine.eu/tetenten-tel...
📚🔎 The Adam Kilgarriff Prize is open for applications. If you created a dictionary, corpus, or language tool, consider applying or sharing the opportunity.
#lexicography #NLP
kilgarriff.co.uk/prize/
460M+ words of 🇲🇹 Maltese language data now available in one corpus. A useful resource for research and #NLP on this unique Semitic language written in the Latin script. Special thanks to the University of Malta for making this possible.
www.sketchengine.eu/maltese-refe...
#corpuslinguistics
Registration is open for Lexicom 2026 in Palermo 🇮🇹! Since 2001, this workshop in #lexicography and #corpuslinguistics has welcomed 700+ participants worldwide. Join the community and take part in the next edition, 14–18 September 2026.
🔗 lexicom.courses/lexicom-2026...