Languages & Models
Information about the wide array of languages Speechmatics supports transcription for.
Operating points
Choose between two accuracy models when configuring your transcription session:
- Standard — optimized for faster turnaround with strong accuracy. Recommended when speed and efficiency are your priorities
- Enhanced — our highest-accuracy model with strong turnaround times. Recommended when precision is critical, and especially for complex audio (e.g. noisy environments, varied accents)
By default, the standard operating point is used. You can specify the enhanced operating point as part of the transcription config. For example:
{
  "type": "transcription",
  "transcription_config": {
    "language": "en",
    "operating_point": "enhanced"
  }
}
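As a minimal sketch, the config above could be assembled in Python before it is sent. The helper name `build_job_config` and the `OPERATING_POINTS` set are hypothetical, introduced only for illustration; the field names and the two operating point values are taken from the example above. Unlike the service default described above, this helper always writes the operating point explicitly.

```python
import json

# Operating points named in this section.
OPERATING_POINTS = {"standard", "enhanced"}


def build_job_config(language: str, operating_point: str = "standard") -> dict:
    """Return a config dict shaped like the example above (hypothetical helper)."""
    if operating_point not in OPERATING_POINTS:
        raise ValueError(f"unknown operating point: {operating_point!r}")
    return {
        "type": "transcription",
        "transcription_config": {
            "language": language,
            "operating_point": operating_point,
        },
    }


if __name__ == "__main__":
    # Print the enhanced English config as JSON.
    print(json.dumps(build_job_config("en", "enhanced"), indent=2))
```

Rejecting unknown values locally is a design choice for this sketch: a typo such as "enhance" fails before any request is made.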
Transcription languages
To automatically identify the language in an audio file, use our Language Identification feature.
To dynamically update your system with the latest languages and features offered by Speechmatics, use our Feature Discovery endpoint.
Speechmatics supports the following languages. Your ability to use any or all of the languages will depend on what languages you are contracted to use.
Speechmatics takes a global-first approach to languages. Each language pack aims to support many different accents and dialects. This simplifies language selection, because you do not need to know in advance which accent is spoken in your audio. With this approach we still achieve very high accuracy compared to accent-specific language packs.
Each language above is uniquely identified by a two-letter code (ISO 639-1) or three-letter code (ISO 639-3) in API requests and responses.
Translation languages
Translation is supported for the majority of Speechmatics' languages. The supported translation pairs are listed below. For more details, see Translation.
Multilingual speech-to-text
Multilingual packs are ideal when transcribing multiple languages in the same media file or stream with high accuracy. For more information on the supported languages, please refer to Supported Language Packs.
Supported multilingual packs are:
Bilingual (excluding Spanish and English) example:
{
  "type": "transcription",
  "transcription_config": {
    "language": "cmn_en"
  }
}
Bilingual Spanish and English example:
{
  "type": "transcription",
  "transcription_config": {
    "language": "es",
    "domain": "bilingual-en"
  }
}
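The two bilingual examples above differ only in whether a "domain" field is present. A minimal Python sketch of that difference follows; the helper name `build_multilingual_config` is hypothetical, and the only values used are those shown in the examples ("cmn_en", and "es" with "bilingual-en").

```python
import json
from typing import Optional


def build_multilingual_config(language: str, domain: Optional[str] = None) -> dict:
    """Build a config like the bilingual examples above (hypothetical helper).

    Packs such as "cmn_en" need only a language code; the Spanish and
    English pack is selected with language "es" plus domain "bilingual-en".
    """
    transcription_config = {"language": language}
    if domain is not None:
        # Only add the domain key when one is requested.
        transcription_config["domain"] = domain
    return {"type": "transcription", "transcription_config": transcription_config}


if __name__ == "__main__":
    print(json.dumps(build_multilingual_config("cmn_en"), indent=2))
    print(json.dumps(build_multilingual_config("es", domain="bilingual-en"), indent=2))
```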
Healthcare transcription
Speechmatics offers industry-leading accuracy in the healthcare domain when using the enhanced operating point. For customers working exclusively in the healthcare domain, we recommend using the medical domain-specific model.
The model is kept up to date using officially maintained sources. This brings significant improvements in recognition of medical terminology such as names of procedures, medications, conditions, and anatomy.
The medical domain-specific model must be used with the enhanced operating point.
Medical domain example:
{
  "type": "transcription",
  "transcription_config": {
    "language": "en",
    "operating_point": "enhanced",
    "domain": "medical"
  }
}
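Because the medical model must be paired with the enhanced operating point, a client could check that constraint before submitting a config. The sketch below assumes a hypothetical helper, `build_medical_config`; the field names and values come from the example above, and the local check is an illustration, not a description of how the service itself validates requests.

```python
import json


def build_medical_config(language: str, operating_point: str = "enhanced") -> dict:
    """Build a medical-domain config like the example above (hypothetical helper)."""
    if operating_point != "enhanced":
        # This section states the medical model requires the enhanced operating point.
        raise ValueError("the medical domain requires the enhanced operating point")
    return {
        "type": "transcription",
        "transcription_config": {
            "language": language,
            "operating_point": operating_point,
            "domain": "medical",
        },
    }


if __name__ == "__main__":
    print(json.dumps(build_medical_config("en"), indent=2))
```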