Announcing Azure CycleCloud Workspace for Slurm: Version 2025.12.01 Release
Summary
We are excited to announce the latest release of Azure CycleCloud Workspace for Slurm, now available with the powerful features and enhancements introduced in CycleCloud 8.8.1. This update brings significant improvements to cluster management, monitoring, security, and platform support, empowering technical communities to build and operate scalable HPC environments with greater efficiency and flexibility.
The integration of Prometheus self-agent enables automated collection of metrics from compute nodes and Slurm jobs, providing real-time insights into cluster performance and resource utilization. Coupled with managed Grafana, users can visualize these metrics through customizable dashboards, making it simple to track system health, identify bottlenecks, and optimize workloads. This seamless monitoring solution reduces operational overhead and enhances the reliability of your HPC environment.
Create the managed monitoring infrastructure
To use this feature, simply set up an Azure Monitor Workspace for Prometheus and an Azure Managed Grafana environment.
Follow the steps outlined in Azure/cyclecloud-monitoring (Cluster-init project and related tools for adding managed monitoring to a CycleCloud cluster):
1. Create a resource group for the monitoring infrastructure
2. Deploy with the provided commands
   git clone
   cd cyclecloud-monitoring..
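The resource group step above can be sketched with the Azure CLI. This is a minimal illustration, not the project's own script: it assumes the Azure CLI is installed and you are logged in, and the group name and region are hypothetical placeholders. By default the script only prints the command; set DRY_RUN=0 to run it.

```shell
#!/bin/sh
set -eu

# Hypothetical placeholder values; replace with your own.
RESOURCE_GROUP="${RESOURCE_GROUP:-ccw-monitoring-rg}"
LOCATION="${LOCATION:-eastus}"

# Defaults to a dry run so nothing is created by accident.
DRY_RUN="${DRY_RUN:-1}"

create_resource_group() {
  if [ "$DRY_RUN" = "1" ]; then
    # Only show the command that would be executed.
    echo "az group create --name $RESOURCE_GROUP --location $LOCATION"
  else
    # Create the resource group that will hold the monitoring resources.
    az group create --name "$RESOURCE_GROUP" --location "$LOCATION"
  fi
}

create_resource_group
```

The deployment itself should still follow the commands provided in the Azure/cyclecloud-monitoring repository.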
Official source
Microsoft Tech

