Supported Languages

Transcription: Batch, Real-Time
Deployments: All

This page lists the languages supported by Speechmatics. For more information on how to use them, refer to the guide on Accuracy and Language.

To automatically identify the language in an audio file, use our Language Identification feature.

To dynamically update your system with the latest languages and features offered by Speechmatics, use our Feature Discovery endpoint.

Languages

Speechmatics supports the following languages. Which of them you can use depends on the languages covered by your contract.

Speechmatics takes a global-first approach to languages. A single language pack aims to support many different accents and dialects. This simplifies language selection, because you do not need to know in advance which accent is spoken in your audio. With this approach, accuracy remains very high compared to accent-specific language packs.

Language | Language Code | Description
Automatic | auto | Automatically detect the language using our Language Identification feature.
Arabic | ar | Our global Arabic gives high-accuracy transcription across many different accents and dialects including (but not limited to) Modern Standard Arabic (MSA) and Arabic spoken in the Gulf, Egypt and the Levant.
Bashkir | ba
Basque | eu
Belarusian | be
Bengali | bn
Bulgarian | bg
Cantonese | yue
Catalan | ca
Croatian | hr
Czech | cs
Danish | da
Dutch | nl
English | en | Our global English gives high-accuracy transcription across many different accents including (but not limited to) English spoken in the United Kingdom, United States, Australia and New Zealand, and by non-native speakers.
Esperanto | eo
Estonian | et
Finnish | fi
French | fr | Our global French gives high-accuracy transcription across many different accents including (but not limited to) French spoken in France, Canada and Belgium.
Galician | gl
German | de | Our global German gives high-accuracy transcription across many different accents including (but not limited to) German spoken in Germany, Austria and Switzerland.
Greek | el
Hebrew | he
Hindi | hi
Hungarian | hu
Indonesian | id
Interlingua | ia
Irish | ga
Italian | it
Japanese | ja
Korean | ko
Latvian | lv
Lithuanian | lt
Malay | ms
Maltese | mt
Mandarin | cmn | Our global Mandarin can output Traditional or Simplified characters and gives high-accuracy transcription across many different accents including (but not limited to) Mandarin spoken in China, Taiwan, Singapore and Malaysia.
Marathi | mr
Mongolian | mn
Norwegian | no
Persian | fa
Polish | pl
Portuguese | pt | Our global Portuguese gives high-accuracy transcription across many different accents including (but not limited to) Portuguese spoken in Portugal and Brazil.
Romanian | ro
Russian | ru
Slovakian | sk
Slovenian | sl
Spanish | es | Our global Spanish gives high-accuracy transcription across many different accents including (but not limited to) Spanish spoken in Spain, the US, Mexico, Colombia, Argentina, Venezuela, Chile and Peru.
Spanish & English bilingual | es (with domain='bilingual-en') | Ideal when transcribing Spanish and English in the same media file or stream. Supports all accents and dialects listed under English and Spanish. Requires the domain config to be set.
Swahili | sw
Swedish | sv
Tamil | ta
Thai | th
Turkish | tr
Ukrainian | uk
Urdu | ur
Uyghur | ug
Vietnamese | vi
Welsh | cy | Welsh must be explicitly added to the expected languages list when using our Language Identification feature; otherwise an error is returned stating that the language is not supported for transcription.

Each language above is uniquely identified in API requests and responses by a two-letter code (ISO 639-1) or a three-letter code (ISO 639-3).
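As an illustration of how these codes might be used, the sketch below validates a language code against the table on this page before building a job configuration. The function name `build_job_config` is hypothetical, and the field names `type`, `transcription_config`, `language_identification_config` and `expected_languages` are assumptions about the request shape that this page does not confirm; check them against the API reference before relying on them.

```python
import json

# Language codes copied from the table on this page (56 codes, including "auto").
SUPPORTED_LANGUAGES = frozenset({
    "auto", "ar", "ba", "eu", "be", "bn", "bg", "yue", "ca", "hr",
    "cs", "da", "nl", "en", "eo", "et", "fi", "fr", "gl", "de",
    "el", "he", "hi", "hu", "id", "ia", "ga", "it", "ja", "ko",
    "lv", "lt", "ms", "mt", "cmn", "mr", "mn", "no", "fa", "pl",
    "pt", "ro", "ru", "sk", "sl", "es", "sw", "sv", "ta", "th",
    "tr", "uk", "ur", "ug", "vi", "cy",
})


def build_job_config(language, expected_languages=None):
    """Return a job config dict for `language` (hypothetical helper).

    `expected_languages` is only meaningful with language="auto". Per the
    Welsh note in the table, "cy" has to be listed there explicitly if
    Welsh audio should be identified.
    """
    if language not in SUPPORTED_LANGUAGES:
        raise ValueError(f"unsupported language code: {language!r}")

    # Assumed request shape; not confirmed by this page.
    config = {
        "type": "transcription",
        "transcription_config": {"language": language},
    }

    if expected_languages is not None:
        if language != "auto":
            raise ValueError("expected_languages requires language='auto'")
        unknown = [
            code for code in expected_languages
            if code == "auto" or code not in SUPPORTED_LANGUAGES
        ]
        if unknown:
            raise ValueError(f"unsupported expected languages: {unknown}")
        config["language_identification_config"] = {
            "expected_languages": list(expected_languages),
        }
    return config


if __name__ == "__main__":
    # Language identification restricted to English and Welsh.
    print(json.dumps(build_job_config("auto", ["en", "cy"]), indent=2))
```

Validating on the client side is optional, since the service performs its own checks, but it surfaces typos in language codes before any audio is uploaded.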

Domain Language

Speechmatics SaaS supports domains: specialized language packs that optimize the requested transcription language for a particular field. Domain packs build on our global languages to boost accuracy in specific areas. For usage details, see How to use domain config.

Domain | Supported Languages | Description
bilingual-en | es | Supports transcribing bilingual Spanish and English content in the same media file or stream.
finance | en | Improves accuracy for audio containing financial terms, such as those found in earnings calls or financial broadcasts.
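The domain table can be read as a mapping from each domain to the languages it supports. The following minimal sketch checks a language and domain pair against that mapping before building a configuration. The helper name `build_domain_config` is hypothetical, and placing `domain` inside `transcription_config` is an assumption based on the `domain='bilingual-en'` notation used on this page.

```python
# Domain-to-language mapping copied from the table on this page.
DOMAIN_LANGUAGES = {
    "bilingual-en": {"es"},
    "finance": {"en"},
}


def build_domain_config(language, domain):
    """Return a job config with a domain set (hypothetical helper)."""
    allowed = DOMAIN_LANGUAGES.get(domain)
    if allowed is None:
        raise ValueError(f"unknown domain: {domain!r}")
    if language not in allowed:
        raise ValueError(
            f"domain {domain!r} supports {sorted(allowed)}, not {language!r}"
        )
    # Assumed request shape; not confirmed by this page.
    return {
        "type": "transcription",
        "transcription_config": {"language": language, "domain": domain},
    }


if __name__ == "__main__":
    # Spanish and English bilingual transcription uses language "es".
    print(build_domain_config("es", "bilingual-en"))
```

Note that the bilingual pack is requested with the Spanish code `es`, so a request for `en` with `bilingual-en` is rejected by this sketch.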

Translation Languages

Translation is supported for the majority of Speechmatics' languages. The supported translation pairs are listed below. For more details, see Translation.

Audio Language | Translation Target Language
English (en) | Bulgarian (bg), Catalan (ca), Mandarin (cmn), Czech (cs), Danish (da), German (de), Greek (el), Spanish (es), Estonian (et), Finnish (fi), French (fr), Galician (gl), Hindi (hi), Croatian (hr), Hungarian (hu), Indonesian (id), Italian (it), Japanese (ja), Korean (ko), Lithuanian (lt), Latvian (lv), Malay (ms), Dutch (nl), Norwegian (no), Polish (pl), Portuguese (pt), Romanian (ro), Russian (ru), Slovakian (sk), Slovenian (sl), Swedish (sv), Turkish (tr), Ukrainian (uk), Vietnamese (vi)
Bulgarian (bg), Catalan (ca), Mandarin (cmn), Czech (cs), Danish (da), German (de), Greek (el), Spanish (es), Estonian (et), Finnish (fi), French (fr), Galician (gl), Hindi (hi), Croatian (hr), Hungarian (hu), Indonesian (id), Italian (it), Japanese (ja), Korean (ko), Lithuanian (lt), Latvian (lv), Malay (ms), Dutch (nl), Norwegian (no), Polish (pl), Portuguese (pt), Romanian (ro), Russian (ru), Slovakian (sk), Slovenian (sl), Swedish (sv), Turkish (tr), Ukrainian (uk), Vietnamese (vi) | English (en)
Norwegian Bokmål (no) | Norwegian Nynorsk (nn)

Speechmatics languages that do not currently support translation: Arabic, Bashkir, Belarusian, Welsh, Esperanto, Basque, Hebrew, Interlingua, Irish, Maltese, Mongolian, Marathi, Tamil, Thai, Uyghur, Cantonese.
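The translation pairs listed above follow a simple rule: English pairs with each of the 34 listed languages in either direction, plus one extra pair from Norwegian Bokmål to Norwegian Nynorsk. A minimal sketch of that rule is shown below. The function names are hypothetical, and the `target_languages` field name is an assumption that this page does not confirm.

```python
# Languages that pair with English in both directions, per the table on this page.
ENGLISH_PARTNERS = frozenset({
    "bg", "ca", "cmn", "cs", "da", "de", "el", "es", "et", "fi",
    "fr", "gl", "hi", "hr", "hu", "id", "it", "ja", "ko", "lt",
    "lv", "ms", "nl", "no", "pl", "pt", "ro", "ru", "sk", "sl",
    "sv", "tr", "uk", "vi",
})


def is_translation_supported(source, target):
    """Return True if the table lists `source` -> `target` as a pair."""
    if source == "en":
        return target in ENGLISH_PARTNERS
    if target == "en":
        return source in ENGLISH_PARTNERS
    # The only non-English pair: Norwegian Bokmål -> Norwegian Nynorsk.
    return (source, target) == ("no", "nn")


def build_translation_config(source, targets):
    """Return a translation config fragment (hypothetical helper).

    The "target_languages" field name is an assumption.
    """
    unsupported = [t for t in targets if not is_translation_supported(source, t)]
    if unsupported:
        raise ValueError(f"no translation from {source!r} to {unsupported}")
    return {"target_languages": list(targets)}


if __name__ == "__main__":
    print(build_translation_config("en", ["es", "de"]))
```

The Norwegian pair is one-directional in the table, so the sketch accepts `no` to `nn` but rejects `nn` to `no`.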