AWS Parallel Computing Service now supports Slurm 25.11

AWS Parallel Computing Service (AWS PCS) Release Notes
AWS PCS now supports Slurm version 25.11, introducing new features and log types.
New Features
- Expedited Re-queue: Automatically reschedules jobs affected by node issues at the highest priority to help workloads recover faster.
- OpenMetrics Endpoint: Provides real-time visibility into jobs, nodes, and scheduling using existing monitoring tools.
- New Log Types: Includes scheduler audit logs, slurmdbd logs, and slurmrestd logs delivered to Amazon CloudWatch Logs, Amazon S3, or Amazon Data Firehose.
What to do
- Enable the OpenMetrics endpoint for real-time monitoring.
- Configure log delivery to CloudWatch Logs, S3, or Data Firehose for better diagnostics and debugging.
These features are available in all AWS Regions where AWS PCS is available. Standard charges apply for log delivery destinations.
Source: AWS release notes
If you need further guidance on AWS, our experts are available at AWS@westloop.io. You may also reach us by submitting the Contact Us form.



