Languages
Information about the wide array of languages Speechmatics supports transcription forThis page lists the range of languages supported by Speechmatics.
To automatically identify the language in an audio file, use our Language Identification feature.
To dynamically update your system with the latest languages and features offered by Speechmatics, use our Feature Discovery endpoint.
Speechmatics supports the following languages. Your ability to use any or all of the languages will depend on what languages you are contracted to use.
Speechmatics takes a global-first approach to our languages. In a single language pack, we aim to support many different accents and dialects. This simplifies your workflow when selecting which language to use, not requiring you to know which accent is being spoken in your audio upfront. With this approach we still achieve very high accuracy compared to accent-specific language packs.
Each language above is uniquely identified by a two-letter code (ISO639-1) or three-letter code (ISO639-3) in API requests and responses.
Translation Languages
Translation is supported for the majority of Speechmatics' languages. The supported translation pairs are listed below. For more details, see Translation.
Multilingual speech-to-text
These packs are ideal when transcribing multiple languages in the same media file or stream with high accuracy. For more information on the supported languages, please refer to Supported Language Packs.
Supported multilingual packs are:
Bilingual (excluding Spanish and English) example:
{
"type": "transcription",
"transcription_config": {
"language": "cmn_en",
}
}
Bilingual Spanish and English example:
{
"type": "transcription",
"transcription_config": {
"language": "es",
"domain": "bilingual-en"
}
}