Amazon SageMaker AI Inference now supports bidirectional streaming

Published: November 25, 2025

Amazon SageMaker AI Inference now supports bidirectional streaming for real-time speech-to-text transcription, so models can process speech continuously rather than as batch input. A model can receive an audio stream and return partial transcripts while the user is still speaking, which lets you build voice agents that process speech with minimal latency.
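To illustrate the interaction pattern, here is a minimal local sketch in Python. It does not call any SageMaker API: `audio_source`, `mock_model`, and `run_session` are hypothetical names, the "model" is an in-process stand-in, and short byte strings of text stand in for audio chunks. It only shows the shape of bidirectional streaming, where partial transcripts flow back while input is still being sent.

```python
import asyncio


async def audio_source(chunks, outbound):
    """Send chunks one at a time, then signal end of stream with None."""
    for chunk in chunks:
        await outbound.put(chunk)
        await asyncio.sleep(0)  # yield so the other tasks can run between chunks
    await outbound.put(None)


async def mock_model(inbound, outbound):
    """Hypothetical stand-in for a hosted model.

    Emits a growing partial transcript after every chunk it receives.
    """
    words = []
    while True:
        chunk = await inbound.get()
        if chunk is None:
            break
        words.append(chunk.decode("utf-8"))
        await outbound.put(" ".join(words))
    await outbound.put(None)


async def run_session(chunks):
    """Run sender, model, and receiver concurrently; return all partials."""
    to_model = asyncio.Queue()
    from_model = asyncio.Queue()
    partials = []

    async def collect():
        while True:
            partial = await from_model.get()
            if partial is None:
                break
            partials.append(partial)

    await asyncio.gather(
        audio_source(chunks, to_model),
        mock_model(to_model, from_model),
        collect(),
    )
    return partials


if __name__ == "__main__":
    for line in asyncio.run(run_session([b"hello", b"voice", b"agent"])):
        print(line)
```

In a real deployment the two queues would be replaced by the request and response sides of a streaming connection to the endpoint. The point of the sketch is that the sender and the receiver run concurrently rather than in a request-then-response sequence.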

Bidirectional streaming is available in the following AWS Regions: