SageMaker HyperPod Slurm clusters now support minimum capacity
aiawsengineer
feature
Amazon SageMaker HyperPod for Slurm clusters now supports specifying minimum instance requirements (MinCount) with continuous provisioning. This allows users to guarantee a baseline number of nodes before training jobs start, improving reliability for distributed workloads and meeting SLA targets. Engineers and architects running large-scale AI/ML training jobs on SageMaker HyperPod will benefit from this enhanced control over cluster availability.
Read the original announcement →
https://aws.amazon.com/about-aws/whats-new/2026/05/amazon-sagemaker-hyperpod-mincount/
