Skip to main content
Cloud / Azure / Products / Azure CycleCloud - HPC Cluster Orchestration

Azure CycleCloud - HPC Cluster Orchestration

Azure CycleCloud orchestrates HPC environments on Azure: provision infrastructure, deploy schedulers and autoscale clusters with familiar tooling.

compute
Pricing Model No license or subscription fee, only the underlying Azure resources
Availability Multiple regions incl. EU (e.g. Europe West, Germany West Central)
Data Sovereignty EU regions available
Reliability N/A (tool without its own SLA, depends on the Azure services used) SLA

What is Azure CycleCloud?

Azure CycleCloud is a tool for orchestrating and managing High Performance Computing (HPC) environments on Azure. It provisions the required infrastructure, deploys familiar HPC schedulers and automatically scales clusters so jobs run efficiently at any scale. With CycleCloud you can create different file systems and mount them to the compute nodes to support HPC workloads.

Organizations with long-standing HPC environments often build deep expertise and custom tooling around a specific scheduler. CycleCloud solves the problem of moving these setups to Azure without a redesign: it abstracts the Azure building blocks such as VMs, scale sets, network interfaces and disks, so HPC administrators keep working with the cluster and scheduler they already know. CycleCloud is not a scheduler itself but a management platform for deploying and operating HPC resources.

Core Features

  • Scheduler-agnostic orchestration: Built-in support with autoscaling plugins for Slurm, PBS Pro, IBM LSF, Grid Engine and HTCondor. You can integrate your own scheduler through the autoscaling REST API.
  • Automatic scaling: CycleCloud adjusts cluster size and components based on job load, availability and time requirements, so users do not need to build their own autoscaling routines.
  • Declarative templates: Complete HPC environments including NFS servers, parallel file systems, authentication and license servers are defined through a single management plane and can be shared and replicated.
  • Azure integration and APIs: Integrates with Azure Monitor and Microsoft Cost Management as well as services such as Azure NetApp Files and Microsoft Entra ID, complemented by a RESTful API and a built-in CLI.

Typical Use Cases

Migrating existing HPC environments: Teams move long-standing on-premises setups with established schedulers to Azure without rebuilding their workflows and tools.

Simulation and research: Scientific and engineering simulations run on clusters that grow on demand and shrink again once the jobs complete.

Reproducible environments for multiple teams: HPC setups defined as templates are rolled out consistently across different teams and projects.

Benefits

  • Keep using familiar schedulers instead of learning new tooling
  • Automatic scaling reduces idle costs and manual effort
  • No license fee for CycleCloud, billing only for the Azure resources you use
  • EU regions available for workloads with data sovereignty requirements

Integration with innFactory

As a Microsoft Solutions Partner, innFactory supports you with the adoption and operation of this service.

Typical Use Cases

Migrate existing on-premises HPC environments to Azure with a familiar scheduler
Run scientific and engineering simulations on automatically scaling clusters
Provision reproducible HPC environments across multiple teams via templates
Adjust compute capacity dynamically to job load and reduce idle costs

Frequently Asked Questions

What is Azure CycleCloud?

Azure CycleCloud is a tool for orchestrating and managing High Performance Computing environments on Azure. It provisions the infrastructure, deploys familiar HPC schedulers and autoscales clusters based on job load. CycleCloud is not a scheduler itself but a platform for building and running complete HPC systems.

When should I use Azure CycleCloud?

CycleCloud fits when you move an existing HPC environment with an established scheduler such as Slurm, PBS Pro, LSF, Grid Engine or HTCondor to Azure while keeping your familiar workflows. It also suits simulation and research workloads that need clusters scaling on demand, as well as reproducible HPC setups shared across multiple teams.

How much does Azure CycleCloud cost?

There is no license or subscription fee for CycleCloud itself. You pay only for the underlying Azure resources such as VMs, scale sets, disks, storage and networking on a pay-as-you-go basis. Spot VMs and reserved capacity can be used to further reduce compute costs.

Which HPC schedulers does Azure CycleCloud support?

CycleCloud is scheduler-agnostic and provides built-in support with autoscaling plugins for Slurm, PBS Pro, IBM LSF, Grid Engine and HTCondor. You can also integrate your own scheduler through the autoscaling REST API.

Microsoft Solutions Partner

innFactory is a Microsoft Solutions Partner. We provide expert consulting, implementation, and managed services for Azure.

Microsoft Solutions Partner Microsoft Data & AI

Similar Products from Other Clouds

Other cloud providers offer comparable services in this category. As a multi-cloud partner, we help you choose the right solution.

AWS

AWS Lambda durable functions - Stateful Workflows

AWS Lambda durable functions: stateful, multi-step workflows with checkpointing, runtimes up to one year, and no compute …

Pricing Pay-per-use (standard Lambda: requests + …
SLA 99.95% (AWS Lambda SLA)
Compare →
AWS

AWS Lambda Managed Instances - Functions on EC2

AWS Lambda Managed Instances runs Lambda functions on dedicated EC2 compute: predictable performance, up to 32 GB …

Pricing Pay for EC2 compute (instance-hours), …
SLA N/A
Compare →
Google Cloud

Cloud Run - Serverless Container Platform

Cloud Run: Serverless container platform on Google Cloud. Scale-to-zero, pay-per-use, GPU for AI inference, worker …

Pricing Pay-per-use (100ms granularity) or …
SLA 99.95% monthly availability
Compare →
Google Cloud

Cloud Run worker pools - Pull-Based Background Work

Cloud Run worker pools: serverless resource for non-HTTP, pull-based workloads from queues like Pub/Sub and Kafka, with …

Pricing Pay-per-use, resource-based (vCPU plus …
SLA 99.95% (Cloud Run SLA)
Compare →
Google Cloud

Google Axion Processors - Arm-Based CPUs for Compute Engine

Google Axion are Google's own Arm-based CPUs for Compute Engine, offering up to 50% better performance per watt than …

Pricing Pay-per-use (same models as Compute …
SLA 99.9%
Compare →
STACKIT

STACKIT Run Command

STACKIT Run Command enables secure remote execution of scripts and commands on STACKIT Compute Engine instances without …

Pricing Included in Compute Engine
SLA 99.9% availability
Compare →

38 comparable products found across other clouds.

Ready to start with Azure CycleCloud - HPC Cluster Orchestration?

Our certified Azure experts help you with architecture, integration, and optimization.

Schedule Consultation