Seeking a Senior GCP DevOps HPC Engineer to lead work on cloud-based HPC environments. Responsibilities include migrating on-prem SLURM clusters to GCP, designing scalable architectures, and optimising high-performance workloads. Requires 5+ years of HPC experience and strong GCP, Terraform, and Ansible skills.
Job Description
GCP DevOps HPC Engineer (Senior)
About the Role
We’re hiring a Senior GCP DevOps HPC Engineer to join a high-performing engineering team working on large-scale, cloud-based HPC environments. This role is ideal for an experienced HPC engineer who enjoys leading complex migrations, designing scalable architectures, and optimising high-performance workloads in Google Cloud Platform (GCP).
You’ll take ownership of migrating on-prem SLURM HPC clusters to GCP, while acting as a technical authority across HPC, DevOps, and cloud infrastructure.
What You’ll Be Doing
- Lead end-to-end migrations of SLURM-based HPC clusters from on-prem to GCP
- Design, build, and operate secure, scalable HPC architectures in the cloud
- Optimise SLURM scheduling, workload performance, and resource utilisation
- Automate cluster deployment and operations using Terraform, Ansible, Python, and Bash
- Manage HPC software stacks using Spack
- Deploy and support parallel workloads using MPI, OpenMP, and related frameworks
- Troubleshoot performance issues and drive continuous optimisation
- Collaborate with engineering teams and stakeholders in a fully remote environment
What We’re Looking For
Essential
- 5+ years’ experience in HPC environments (SLURM, MPI, parallel workloads)
- Strong Linux systems expertise in performance-critical environments
- Hands-on experience running or migrating HPC workloads in the cloud (GCP preferred)
- Solid experience with Terraform and Ansible
- Strong scripting skills (Python, Bash)
- Deep understanding of GCP services (GCE, VPC, Cloud Storage)
Nice to Have
- GCP certifications (e.g. Professional Cloud DevOps Engineer, Associate Cloud Engineer)
- Experience with Preemptible or Spot VMs and cloud cost-optimisation strategies
- Familiarity with HPC performance profiling and debugging tools
- Experience with containers in HPC (Singularity, Docker)
- Exposure to Spark or big data tooling
Why Apply
- Work on complex, high-impact HPC systems at scale
- Influence architecture and technical decisions
- Fully remote role based in Spain
- Collaborative, engineering-led culture with strong technical ownership
Interested? Apply directly or message me to learn more.