Advancing Southeast Asian (SEA) NLP Research
https://seacrowd.github.io/
SEACrowd
Loading...
🌏 Applications are now open for the SEACrowd Apprentice Program 2026!
🗓️ Apply between Nov 17 – Dec 17, 2025 (UTC-12)
📅 Program runs Feb – Jun 2026
seacrowd.org/apprenticeship
★ Extensions to image generation and embedding models for SEA contexts
★ Evaluation across translated and region-specific multimodal benchmarks
This work is made possible by the amazing SEACrowd community, bringing together contributors across languages, cultures, and domains.
From Tagalog to Filipino
If you’ve spent time with Filipinos, you’ve probably heard the words “Tagalog” and “Filipino” used interchangeably. For many, the difference feels minor, but behind those two labels is a long history shaped by migration, colonization, nationalism, and modern technology.
We’re happy to share our latest work on SEA-VL Phase 2, continuing our effort to build language systems tailored for Southeast Asian languages and cultures.
★ SEA-adapted vision-language models (SEA-VLM) built on strong foundation models
★ Multilingual and culturally grounded instructiontuning data