Amazon SageMaker HyperPod now supports NVIDIA Multi-Instance GPU (MIG) for generative AI tasks

Amazon SageMaker HyperPod Update
Amazon SageMaker HyperPod now supports NVIDIA Multi-Instance GPU (MIG) technology, allowing administrators to partition a single GPU into multiple fully isolated GPU instances. This makes it possible to run diverse, smaller generative AI tasks side by side while preserving performance and task isolation.
Administrators can configure GPU partitions through the SageMaker HyperPod console or a custom configuration, and allocate compute quotas so capacity is shared fairly across teams. Real-time performance metrics and resource utilization monitoring help them fine-tune that allocation.
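The announcement itself includes no code, but the following minimal sketch illustrates what MIG partitions look like from an administrator's point of view. It assumes a HyperPod cluster orchestrated by Amazon EKS where the NVIDIA device plugin advertises MIG slices as extended Kubernetes resources (for example `nvidia.com/mig-1g.5gb`); those resource names are assumptions, not values from the release notes.

```python
# Sketch: list the MIG partitions that cluster nodes advertise as allocatable.
# Assumes an EKS-orchestrated HyperPod cluster with the NVIDIA device plugin
# exposing MIG profiles as extended resources (e.g. "nvidia.com/mig-1g.5gb").
from kubernetes import client, config

config.load_kube_config()  # kubeconfig pointing at the HyperPod EKS cluster
v1 = client.CoreV1Api()

for node in v1.list_node().items:
    # Extended resources show up alongside cpu/memory in node.status.allocatable.
    mig_slices = {
        name: qty
        for name, qty in (node.status.allocatable or {}).items()
        if name.startswith("nvidia.com/mig-")
    }
    if mig_slices:
        print(node.metadata.name, mig_slices)
```

Team-level compute quotas themselves are managed through the HyperPod console or custom configuration as described above; the sketch only covers discovering which MIG slices a node exposes.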
Data scientists can accelerate time-to-market by scheduling lightweight inference tasks and running interactive notebooks in parallel on GPU partitions.
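As a hedged illustration of that workflow, the sketch below submits a lightweight inference pod that requests a single MIG slice instead of a whole GPU. The namespace, container image, and the `nvidia.com/mig-1g.5gb` profile name are illustrative assumptions, not values from the announcement.

```python
# Sketch: schedule a lightweight inference task on one MIG partition.
# Namespace, image, and MIG profile name are placeholders for illustration.
from kubernetes import client, config

config.load_kube_config()
v1 = client.CoreV1Api()

pod = client.V1Pod(
    metadata=client.V1ObjectMeta(name="mig-inference-demo", namespace="team-a"),
    spec=client.V1PodSpec(
        restart_policy="Never",
        containers=[
            client.V1Container(
                name="inference",
                image="public.ecr.aws/docker/library/python:3.11",  # placeholder image
                command=["python", "-c", "print('inference task on a MIG slice')"],
                resources=client.V1ResourceRequirements(
                    # Request exactly one MIG slice rather than a full GPU.
                    limits={"nvidia.com/mig-1g.5gb": "1"},
                ),
            )
        ],
    ),
)

v1.create_namespaced_pod(namespace="team-a", body=pod)
```

Because each pod consumes only a partition, several such tasks (or interactive notebooks) can run in parallel on a single physical GPU.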
Available Regions
- US West (Oregon)
- US East (N. Virginia)
- US East (Ohio)
- US West (N. California)
- Canada (Central)
- South America (São Paulo)
- Europe (Stockholm)
- Europe (Spain)
- Europe (Ireland)
- Europe (Frankfurt)
- Europe (London)
- Asia Pacific (Mumbai)
- Asia Pacific (Jakarta)
- Asia Pacific (Melbourne)
- Asia Pacific (Tokyo)
- Asia Pacific (Sydney)
- Asia Pacific (Seoul)
- Asia Pacific (Singapore)
What to do
- Visit the SageMaker HyperPod webpage for more information.
- Read the SageMaker HyperPod documentation to get started.
Source: AWS release notes
If you need further guidance on AWS, our experts are available at AWS@westloop.io. You may also reach us by submitting the Contact Us form.



