Languages We Support
Native expertise in Bahasa Indonesia and major regional languages. Our annotators bring cultural context and linguistic precision to every project.
Bahasa Indonesia
270+ million speakers
The official language of Indonesia, spoken by over 270 million people. Our core expertise is high-quality annotation for Bahasa Indonesia across all domains.
Annotation Capabilities
- Full NLP annotation support
- Formal and informal register
- Technical and domain-specific content
- Social media and colloquial text
- News and editorial content
- Legal and government documents
Regional Languages
Indonesia is home to over 700 languages. We provide annotation services for the most widely spoken regional languages.
Javanese
Basa Jawa
82+ million speakers
The most widely spoken regional language in Indonesia, with a rich literary tradition and multiple speech levels.
Sundanese
Basa Sunda
42+ million speakers
Second-largest regional language with distinct vocabulary and grammatical features.
Minangkabau
Baso Minangkabau
5.5+ million speakers
Language of the Minangkabau people with significant influence on Indonesian vocabulary.
Batak
Hata Batak
3.5+ million speakers
A group of related languages, including Toba, Karo, and Simalungun.
Balinese
Basa Bali
3.3+ million speakers
Ancient language with a unique script and a strong connection to Hindu-Buddhist culture.
Acehnese
Bahsa Acèh
3.5+ million speakers
Language of the Aceh region, traditionally written in an Arabic-derived script, with distinct phonology.
Bugis
Basa Ugi
5+ million speakers
Major language of South Sulawesi with its own traditional script (Lontara).
Madurese
Bhâsa Madhurâ
7+ million speakers
Language of Madura Island and Madurese communities across East Java.