Amazon SageMaker AI Inference now supports bidirectional streaming

Published: November 25, 2025
https://aws.amazon.com/about-aws/whats-new/2025/11/sagemaker-ai-inference-bidirectional-streaming/


Amazon SageMaker AI Inference now supports bidirectional streaming for real-time speech-to-text transcription, so models process speech continuously instead of waiting for batch input. A model can receive an audio stream and return partial transcripts at the same time, while the user is still speaking, which lets you build voice agents that process speech with minimal latency.
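
To make the pattern concrete, here is a minimal local sketch of bidirectional streaming in Python. It does not call SageMaker AI or use any AWS API. The names `audio_source`, `fake_model`, `collect_transcripts`, and `transcribe` are hypothetical, the "audio" chunks are short byte strings standing in for real audio frames, and the model is a stand-in that echoes each chunk as a word. It only illustrates that sending input and receiving partial transcripts happen concurrently over two independent channels.

```python
import asyncio


async def audio_source(chunks, to_model):
    """Client side: send chunks one by one, as if the user were still speaking."""
    for chunk in chunks:
        await to_model.put(chunk)
        await asyncio.sleep(0)  # yield control so partial results can flow back
    await to_model.put(None)  # sentinel: end of the audio stream


async def fake_model(to_model, from_model):
    """Hypothetical stand-in for a streaming speech-to-text model."""
    words = []
    while True:
        chunk = await to_model.get()
        if chunk is None:
            break
        # Assumption for illustration: each chunk decodes directly to one word.
        words.append(chunk.decode("utf-8"))
        await from_model.put(" ".join(words))  # emit a partial transcript
    await from_model.put(None)  # sentinel: no more transcripts


async def collect_transcripts(from_model):
    """Client side: read partial transcripts while audio is still being sent."""
    partials = []
    while True:
        partial = await from_model.get()
        if partial is None:
            return partials
        partials.append(partial)


async def run_session(chunks):
    to_model = asyncio.Queue()    # client -> model direction
    from_model = asyncio.Queue()  # model -> client direction
    _, _, partials = await asyncio.gather(
        audio_source(chunks, to_model),
        fake_model(to_model, from_model),
        collect_transcripts(from_model),
    )
    return partials


def transcribe(chunks):
    """Run one simulated streaming session and return every partial transcript."""
    return asyncio.run(run_session(chunks))


if __name__ == "__main__":
    for partial in transcribe([b"hello", b"voice", b"agent"]):
        print(partial)
```

Each partial transcript extends the previous one, which is what a voice agent would display or act on before the speaker finishes. In a batch design, by contrast, the client would have to send the whole recording before getting any text back.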

Bidirectional streaming is available in the following AWS Regions:

  • Canada (Central)
  • South America (São Paulo)
  • Africa (Cape Town)
  • Europe (Paris)
  • Asia Pacific (Hyderabad)
  • Asia Pacific (Jakarta)
  • Israel (Tel Aviv)
  • Europe (Zurich)
  • Asia Pacific (Tokyo)
  • AWS GovCloud (US-West)
  • AWS GovCloud (US-East)
  • Asia Pacific (Mumbai)
  • Middle East (Bahrain)
  • US West (Oregon)
  • China (Ningxia)
  • US West (Northern California)
  • Asia Pacific (Sydney)
  • Europe (London)
  • Asia Pacific (Seoul)
  • US East (N. Virginia)
  • Asia Pacific (Hong Kong)
  • US East (Ohio)
  • China (Beijing)
  • Europe (Stockholm)
  • Europe (Ireland)
  • Middle East (UAE)
  • Asia Pacific (Osaka)
  • Asia Pacific (Melbourne)
  • Europe (Spain)
  • Europe (Frankfurt)
  • Europe (Milan)
  • Asia Pacific (Singapore)

To learn more, see the AWS News Blog and the SageMaker AI documentation.




