Languages We Support

Native expertise in Bahasa Indonesia and major regional languages. Our annotators bring cultural context and linguistic precision to every project.

Primary Language

Bahasa Indonesia

270+ million speakers

The official language of Indonesia, spoken by over 270 million people. Our core expertise lies in providing high-quality annotation for Bahasa Indonesia across all domains.

Annotation Capabilities

  • Full NLP annotation support
  • Formal and informal register
  • Technical and domain-specific content
  • Social media and colloquial text
  • News and editorial content
  • Legal and government documents

Regional Languages

Indonesia is home to over 700 languages. We provide annotation services for the most widely spoken regional languages.

Javanese

Basa Jawa

Central & East Java

82+ million speakers

The most widely spoken regional language in Indonesia with rich literary tradition and multiple speech levels.

Ngoko (informal)Krama (formal)Modern Javanese

Sundanese

Basa Sunda

West Java

42+ million speakers

Second-largest regional language with distinct vocabulary and grammatical features.

Kasar (informal)Lemes (polite)Written Sundanese

Minangkabau

Baso Minangkabau

West Sumatra

5.5+ million speakers

Language of the Minangkabau people with significant influence on Indonesian vocabulary.

Spoken dialectsWritten formModern usage

Batak

Hata Batak

North Sumatra

3.5+ million speakers

A group of related languages including Toba, Karo, and Simalungun dialects.

Toba BatakKaroSimalungun

Balinese

Basa Bali

Bali

3.3+ million speakers

Ancient language with unique script and strong connection to Hindu-Buddhist culture.

Spoken BalineseFormal registerCultural texts

Acehnese

Bahsa Acèh

Aceh

3.5+ million speakers

Language of the Aceh region with Arabic script influence and distinct phonology.

Modern AcehneseTraditional textsSpoken dialects

Bugis

Basa Ugi

South Sulawesi

5+ million speakers

Major language of South Sulawesi with its own traditional script (Lontara).

Spoken BugisLontara scriptModern usage

Madurese

Bhâsa Madhurâ

Madura Island

7+ million speakers

Language of Madura Island and Madurese communities across East Java.

Island dialectsMainland varietiesModern Madurese

Need Support for Other Languages?

We can source annotators for additional Indonesian regional languages and dialects. Contact us to discuss your specific language requirements.

Papuan LanguagesMakassareseBanjareseSasakAnd 700+ more