Accessing Images

Transcription: Batch, Real-Time | Deployments: Container

The Speechmatics Docker images are obtained from the Speechmatics Docker Repository. If you do not have a Speechmatics Docker Repository account or have lost your details, please reach out to Support.

The latest information about the Containers can be found in the knowledge base section of the Support Portal. If a support account is not available or the Containers section is not visible in the Support Portal, please reach out to Support for help.

Prerequisites

Speechmatics Docker repository credentials – speak to Support to get these
Language Code – the ISO language code (for example fr for French)
LICENSE_TOKEN – the value of the signed claims token, which is used to validate the license file. This is required to run the Container. Speechmatics Support provides it within the license file generated for each customer
TAG – which is used to identify the image version

Docker Repository Login

Using the credentials supplied by Speechmatics, log in to our Docker repository:

docker login https://speechmaticspublic.azurecr.io

You will be prompted for the username and password that were provided to you. If successful, you will see the response:

Login Succeeded

If unsuccessful, please verify your credentials and URL. If problems persist, please contact Speechmatics Support.
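
For scripted environments such as CI, the interactive prompt can be avoided. The following is a minimal sketch, not an official Speechmatics script: it assumes the credentials are held in two environment variables, SM_REGISTRY_USER and SM_REGISTRY_PASSWORD, whose names are hypothetical. It relies only on the standard `--username` and `--password-stdin` options of `docker login`.

```shell
#!/bin/sh
# Registry URL taken from the login command above.
REGISTRY_URL="https://speechmaticspublic.azurecr.io"

# registry_login: log in without the interactive prompt.
# SM_REGISTRY_USER and SM_REGISTRY_PASSWORD are hypothetical variable names
# chosen for this sketch; use whatever your secret store provides.
registry_login() {
    if [ -z "${SM_REGISTRY_USER:-}" ] || [ -z "${SM_REGISTRY_PASSWORD:-}" ]; then
        echo "SM_REGISTRY_USER and SM_REGISTRY_PASSWORD must be set" >&2
        return 1
    fi
    # --password-stdin reads the password from standard input, which keeps it
    # off the docker command line.
    printf '%s' "$SM_REGISTRY_PASSWORD" |
        docker login --username "$SM_REGISTRY_USER" --password-stdin "$REGISTRY_URL"
}
```

After exporting both variables, calling `registry_login` should produce the same "Login Succeeded" response as the interactive command.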

Info: Speechmatics require all customers to cache a copy of the Docker image(s) within their own environment. Once you have pulled an image from the Speechmatics Software Repository for the first time, re-host it in a private container registry and reference that private registry in your deployments.
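
Re-hosting usually comes down to a pull, a tag and a push. The sketch below illustrates that sequence under stated assumptions: PRIVATE_REGISTRY and its default value registry.example.com/speechmatics are hypothetical placeholders, and you are assumed to be logged in to both registries already.

```shell
#!/bin/sh
# PRIVATE_REGISTRY is a hypothetical placeholder; replace it with your own registry.
PRIVATE_REGISTRY="${PRIVATE_REGISTRY:-registry.example.com/speechmatics}"
SOURCE_REGISTRY="speechmaticspublic.azurecr.io"

# private_ref IMAGE: map a Speechmatics image reference to the matching
# reference in the private registry.
private_ref() {
    # Strip the source registry prefix and prepend the private one.
    printf '%s/%s\n' "$PRIVATE_REGISTRY" "${1#"$SOURCE_REGISTRY"/}"
}

# rehost_image IMAGE: pull from Speechmatics, then tag and push the image
# to the private registry. Stops at the first failing step.
rehost_image() {
    src="$1"
    dst=$(private_ref "$src")
    docker pull "$src" &&
        docker tag "$src" "$dst" &&
        docker push "$dst"
}
```

For example, `rehost_image speechmaticspublic.azurecr.io/batch-asr-transcriber-en:13.0.0` would publish the image as registry.example.com/speechmatics/batch-asr-transcriber-en:13.0.0, which is the reference your deployments would then use.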

Pulling Core Speech CPU Images

Each supported language pack comes as a separate Docker image, so repeat the pull for each language pack you require, using the relevant language code.

# pulling Batch Global English (en) with the 13.0.0 tag:
docker pull speechmaticspublic.azurecr.io/batch-asr-transcriber-en:13.0.0

# pulling the Batch Spanish (es) model with the 13.0.0 tag:
docker pull speechmaticspublic.azurecr.io/batch-asr-transcriber-es:13.0.0

# pulling Real-Time Global English (en) with the 13.0.0 tag:
docker pull speechmaticspublic.azurecr.io/rt-asr-transcriber-en:13.0.0

See how to run the Core Speech CPU container here.
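
Because the pull has to be repeated per language pack, a small loop can save typing. This is a sketch only: the function names are invented for illustration, and it assumes every language pack follows the batch-asr-transcriber-<code> and rt-asr-transcriber-<code> naming seen in the commands above.

```shell
#!/bin/sh
REGISTRY="speechmaticspublic.azurecr.io"

# core_image_ref MODE LANG TAG: build a Core Speech CPU image reference.
# MODE is "batch" or "rt", matching the image name prefixes shown above.
core_image_ref() {
    case "$1" in
        batch|rt) ;;
        *) echo "unknown mode: $1 (expected batch or rt)" >&2; return 1 ;;
    esac
    printf '%s/%s-asr-transcriber-%s:%s\n' "$REGISTRY" "$1" "$2" "$3"
}

# pull_core_images MODE TAG LANG...: pull one image per language code,
# stopping at the first failure.
pull_core_images() {
    mode="$1"; tag="$2"; shift 2
    for lang in "$@"; do
        ref=$(core_image_ref "$mode" "$lang" "$tag") || return 1
        docker pull "$ref" || return 1
    done
}
```

For example, `pull_core_images batch 13.0.0 en es` issues the same two Batch pulls as the commands above.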

Pulling Transcription GPU Images

The Transcription GPU images are required to use the most accurate models.

Standard Operating Point

A single image supports all languages at the Standard Operating Point. Language-specific images are also available; these support both the Enhanced and Standard Operating Points.

# pulling the Standard operating point Transcription GPU inference server which supports all languages with the 13.0.0 tag:
docker pull speechmaticspublic.azurecr.io/sm-gpu-inference-server-standard-all:13.0.0

# pulling the language-specific Transcription GPU inference server for English (en) with the 13.0.0 tag.
# Language-specific servers are available for en, es, de and fr, and support both Enhanced and Standard operating points:
docker pull speechmaticspublic.azurecr.io/sm-gpu-inference-server-en:13.0.0

Enhanced Operating Point

Depending on which Enhanced Operating Point languages are required, you can pull specific images.

Language Pack 1

Bashkir, Basque, Belarusian, English, English / Mandarin, English / Malay, English / Tamil, Esperanto, Irish, Marathi, Mongolian, Tamil, Turkish, Ukrainian, Uyghur, Welsh

docker pull speechmaticspublic.azurecr.io/sm-gpu-inference-server-enhanced-recipe1:13.0.0

Language Pack 2

Bulgarian, Croatian, Estonian, Galician, Indonesian, Interlingua, Latvian, Lithuanian, Persian, Romanian, Slovakian, Slovenian, Spanish, Urdu

docker pull speechmaticspublic.azurecr.io/sm-gpu-inference-server-enhanced-recipe2:13.0.0

Language Pack 3

Catalan, Czech, Danish, Finnish, German, Greek, Hebrew, Hindi, Hungarian, Italian, Korean, Malay, Swahili, Swedish

docker pull speechmaticspublic.azurecr.io/sm-gpu-inference-server-enhanced-recipe3:13.0.0

Language Pack 4

Arabic, Bengali, Cantonese, Dutch, French, Japanese, Maltese, Mandarin, Norwegian, Polish, Portuguese, Russian, Thai, Vietnamese

docker pull speechmaticspublic.azurecr.io/sm-gpu-inference-server-enhanced-recipe4:13.0.0

See how to run the Transcription GPU container here.
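
To find which Enhanced image covers a given language, the language pack lists above can be encoded as a lookup. The sketch below is illustrative only: the function names are invented, and language names must be spelled exactly as in the lists above.

```shell
#!/bin/sh
# enhanced_recipe LANGUAGE: print the language pack number (1-4) that lists
# the given language, or fail if the language is not in any list above.
enhanced_recipe() {
    case "$1" in
        Bashkir|Basque|Belarusian|English|"English / Mandarin"|"English / Malay"|"English / Tamil"|Esperanto|Irish|Marathi|Mongolian|Tamil|Turkish|Ukrainian|Uyghur|Welsh)
            echo 1 ;;
        Bulgarian|Croatian|Estonian|Galician|Indonesian|Interlingua|Latvian|Lithuanian|Persian|Romanian|Slovakian|Slovenian|Spanish|Urdu)
            echo 2 ;;
        Catalan|Czech|Danish|Finnish|German|Greek|Hebrew|Hindi|Hungarian|Italian|Korean|Malay|Swahili|Swedish)
            echo 3 ;;
        Arabic|Bengali|Cantonese|Dutch|French|Japanese|Maltese|Mandarin|Norwegian|Polish|Portuguese|Russian|Thai|Vietnamese)
            echo 4 ;;
        *)
            echo "no Enhanced language pack listed for: $1" >&2
            return 1 ;;
    esac
}

# enhanced_image_ref LANGUAGE TAG: build the image reference for that pack.
enhanced_image_ref() {
    recipe=$(enhanced_recipe "$1") || return 1
    printf 'speechmaticspublic.azurecr.io/sm-gpu-inference-server-enhanced-recipe%s:%s\n' "$recipe" "$2"
}
```

For example, `docker pull "$(enhanced_image_ref French 13.0.0)"` pulls the Language Pack 4 image.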

Info: To access additional language configurations for containers, please speak to our Support Team.

Pulling Translation GPU Image

This GPU image is required to use Translation in Batch or Real-Time.

# pulling the Translation GPU inference server which supports all translation pairs with the 13.0.0 tag:
docker pull speechmaticspublic.azurecr.io/sm-translation-inference-server:13.0.0

See how to run the Translation GPU container here.

Pulling Bilingual Images

To use Spanish and English bilingual transcription, pull the relevant Core Speech CPU image below for the client and use it with the Spanish GPU Inference Server.

# pulling Batch bilingual Spanish and English with the 13.0.0 tag:
docker pull speechmaticspublic.azurecr.io/batch-asr-transcriber-es-bilingual-en:13.0.0

# pulling Real-Time bilingual Spanish and English with the 13.0.0 tag:
docker pull speechmaticspublic.azurecr.io/rt-asr-transcriber-es-bilingual-en:13.0.0

Info: Bilingual is only available for GPU deployments. Batch and Real-Time are supported.

Pulling Language ID Image

This image is required to use Language ID.

# pulling the Language ID image with the 2.2.1 tag:
docker pull speechmaticspublic.azurecr.io/langid:2.2.1

See how to run the Language ID container here.
