Senior Kubernetes Engineer - Bare Metal and GPU

circle b Netherlands
Visa Sponsorship
Apply
AI Summary

We are looking for a Senior Kubernetes Engineer to build and maintain our sovereign EU GPU cloud. The role involves designing and deploying datacenter, edge, and AI/HPC systems on Open Compute Project (OCP) hardware. The ideal candidate will have 5+ years of experience in infrastructure or SRE, including 3+ years running Kubernetes in production.

Key Highlights
Design and deploy datacenter, edge, and AI/HPC systems on OCP hardware
Build and maintain a sovereign EU GPU cloud
5+ years of experience in infrastructure or SRE
Key Responsibilities
Design and deploy datacenter, edge, and AI/HPC systems on OCP hardware
Build and maintain a sovereign EU GPU cloud
Manage HA management cluster (etcd and core services) on the management hardware
Technical Skills Required
Kubernetes Linux NVIDIA GPU stack
Benefits & Perks
Competitive salary
Pension scheme
Visa sponsorship
Nice to Have
Experience building to or operating NVIDIA reference architectures for AI compute (NCP, HGX)
Distributed storage in production (Ceph/Rook or MinIO)

Job Description


What we do-

Circle B builds sustainable IT infrastructure for the AI and cloud era. For over a decade— we have designed and deployed datacenter, edge, and AI/HPC systems on Open Compute Project (OCP) hardware. We are independent, vendor-neutral, and ISO 9001 / 14001 / 27001 certified, with deployments across multiple countries.

Our newest initiative is a sovereign EU GPU cloud — operating under full Dutch/EU jurisdiction and beyond the reach of the US CLOUD Act, for regulated European organizations that cannot compromise on where their data lives.


The Role

You look after the servers and the clusters that run on them: bringing machines up from their BMC, provisioning them through automated tooling, and handing over GPU-ready Kubernetes clusters for tenants to use.


What you will own

  • The HA management cluster (etcd and core services) on the management hardware.
  • Automated bare-metal provisioning for the GPU fleet: BMC/Redfish, PXE and virtual media, inspection, and lifecycle.
  • The NVIDIA GPU stack on Kubernetes through the GPU Operator (driver, container toolkit, device plugin, NFD/GFD, DCGM exporter), including whole-GPU allocation and per-tenant isolation.
  • The host side of the GPU fabric: NVLink and NVSwitch health through NVIDIA Fabric Manager, plus the RDMA NICs (ConnectX-8 class), jumbo frames, and GPUDirect setup on the node. The Network Engineer owns the switch fabric.
  • Secrets and certificate management (Vault or OpenBao, cert-manager). Identity, SIEM, and overall security posture will sit with a dedicated security hire.
  • GitOps-driven provisioning, so infrastructure changes are repeatable and auditable.
  • Secure tenant decommissioning: cryptographic NVMe wipe and GPU memory zeroing between tenants.


What We Are Looking For


Required

  • 5+ years in infrastructure or SRE, including 3+ years running Kubernetes in production with real node and cluster lifecycle ownership. On-prem or bare-metal experience counts for more here than managed-cloud Kubernetes.
  • Strong Linux fundamentals: systemd, kernel modules, NUMA, cgroups, NVMe and LVM storage, and host networking (bonding, VLANs, nftables). You are comfortable debugging at the hardware and driver level.
  • Hands-on server provisioning through IPMI, Redfish, or BMC tooling, with config management such as Ansible.
  • The NVIDIA GPU stack on Kubernetes: the GPU Operator and its components, GPU health and telemetry through DCGM, and GPU scheduling and per-tenant allocation.
  • Infrastructure as code and GitOps in production: Helm, Kustomize, and Argo CD or Flux.
  • Comfortable scripting and automating in Python and/or Go.
  • Self-directed: you find the problem, propose a fix, carry it out, and write it down.


Nice to have

  • Experience building to or operating NVIDIA reference architectures for AI compute (NCP, HGX).
  • A bare-metal provisioning system in production: Metal3/Ironic, MAAS, Tinkerbell, or Foreman.
  • Container security basics: RBAC, Pod Security Standards, network policies, and secrets management (Vault or OpenBao).
  • Distributed storage in production (Ceph/Rook or MinIO), or readiness to own it with vendor support.
  • KubeVirt with GPU passthrough for VM-level tenant isolation.
  • Awareness of the EU rules that shape this work: GDPR, the AI Act, DORA, NIS2, and NEN 7510
  • NVIDIA Run:ai or the KAI Scheduler for GPU scheduling and quota.
  • NVIDIA certifications (NCP-AIO, NCA-AIIO)
  • Experience at a GPU cloud, AI provider, or HPC centre.

We don't expect one person to be an expert in bare metal, GPUs, storage, and security all at once. The core we're hiring for is bare-metal Kubernetes, the NVIDIA GPU stack, and host-side GPU networking. The storage and security depth can be built up on the job, with support.


Why Join Us

  • Help build a sovereign EU GPU cloud from the ground up.
  • Own a critical platform layer, not just tickets or maintenance.
  • Work on modern AI infrastructure, GPU platforms, Kubernetes, observability, and automation.
  • Join a company with deep experience in OCP, datacenter, AI/HPC, and cloud infrastructure.
  • Build infrastructure for organizations where data location, compliance, and reliability truly matter.


Benefits

  • Competitive salary.
  • Pension scheme.
  • Visa sponsorship.
  • Company gym.
  • Modern office in Hoofddorp.
  • Informal and open working culture.
  • Participation in relevant conferences and exhibitions across Europe.
  • Opportunity to develop your skills in a fast-growing technology environment.


Our Work Culture

Circle B offers an informal working atmosphere with energetic people who enjoy being part of a growing technology company. We have an open management culture and encourage colleagues to contribute to improving our products, services, and processes.

If this sounds like a good fit, please send your CV and motivation letter to:

surbi@tauruseu.com


Similar Jobs

Explore other opportunities that match your interests

Visa Sponsorship Relocation Remote
Job Type Contract
Experience Level Mid-Senior level

A2G Consulting BV (A2G Technol...

Netherlands

Java Backend Developer

Devops
1d ago
Visa Sponsorship Relocation Remote
Job Type Contract
Experience Level Mid-Senior level

A2G Consulting BV (A2G Technol...

Netherlands
Visa Sponsorship Relocation Remote
Job Type Contract
Experience Level Mid-Senior level

A2G Consulting BV (A2G Technol...

Netherlands

Subscribe our newsletter

New Things Will Always Update Regularly