Three new models for speech recognition and text-to-speech are now available in Amazon SageMaker JumpStart

Published
May 14, 2026
https://aws.amazon.com/about-aws/whats-new/2026/05/speech-models-on-sagemaker-jumpstart/

Amazon SageMaker JumpStart Model Updates

AWS has introduced three new models in Amazon SageMaker JumpStart: Qwen3-TTS-12Hz-1.7B-CustomVoice, Qwen3-TTS-12Hz-1.7B-Base, and Qwen3-ASR-1.7B. The two TTS models provide multilingual speech synthesis and the ASR model provides multilingual speech recognition, so voice-powered applications can be built from models in the JumpStart catalog.

New Features

  • Qwen3-TTS-12Hz-1.7B-CustomVoice: Multilingual text-to-speech with customizable voice styles, supporting 10 languages with control over timbre, emotion, and prosody. Ideal for interactive voice applications and content creation.
  • Qwen3-TTS-12Hz-1.7B-Base: Multilingual text-to-speech with rapid voice cloning from audio input. Suitable for custom voice applications and domain-specific speech synthesis.
  • Qwen3-ASR-1.7B: Automatic speech recognition supporting 52 languages and dialects with high accuracy in complex environments. Suitable for transcription services and multilingual customer support.
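
As a rough illustration of how the three models map to tasks and how one might be deployed, here is a minimal Python sketch. The catalog entries restate the list above. The `deploy` helper is an assumption on our part: it uses `JumpStartModel` from the SageMaker Python SDK (v2), and it takes the JumpStart model ID and instance type as arguments because the announcement does not state either. Look both up in the JumpStart catalog before use. `SpeechModel`, `MODELS`, `models_for_task`, and `deploy` are illustrative names, not part of any AWS API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpeechModel:
    name: str  # model name as given in the announcement
    task: str  # "tts" (text-to-speech) or "asr" (speech recognition)
    note: str  # capability summary from the announcement


# The three models named in the announcement.
MODELS = (
    SpeechModel("Qwen3-TTS-12Hz-1.7B-CustomVoice", "tts",
                "10 languages; control over timbre, emotion, and prosody"),
    SpeechModel("Qwen3-TTS-12Hz-1.7B-Base", "tts",
                "voice cloning from audio input"),
    SpeechModel("Qwen3-ASR-1.7B", "asr",
                "52 languages and dialects"),
)


def models_for_task(task: str) -> list[str]:
    """Return the names of the announced models that handle the given task."""
    return [m.name for m in MODELS if m.task == task]


def deploy(jumpstart_model_id: str, instance_type: str):
    """Deploy a JumpStart model to a real-time endpoint (incurs AWS charges).

    Assumes SageMaker Python SDK v2 and configured AWS credentials.
    jumpstart_model_id is NOT given in the announcement; it is a
    placeholder argument that must come from the JumpStart catalog.
    """
    # Imported lazily so the helpers above work without the SDK installed.
    from sagemaker.jumpstart.model import JumpStartModel

    model = JumpStartModel(model_id=jumpstart_model_id,
                           instance_type=instance_type)
    return model.deploy()
```

Calling `models_for_task("asr")` returns only the ASR model's name, while `models_for_task("tts")` returns both TTS models. Remember to delete any endpoint created by `deploy` when it is no longer needed, since a running endpoint is billed until removed.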

What to do

Source: AWS release notes




If you need further guidance on AWS, our experts are available at AWS@westloop.io. You may also reach us by submitting the Contact Us form.
