Datacenter Operations Director - AI Infrastructure

Hamilton Barnes 🌳 • United States
Relocation
AI Summary

Lead datacenter operations for a hyper-growth startup, overseeing a datacenter footprint slated to reach 300MW. Ensure compute and network infrastructure meets performance and capacity targets. Develop custom cluster plans and manage operations at scale.

Key Highlights
Provide full rack lifecycle support across hyperscale datacenters
Ensure datacenter delivery workflows meet business Service Level Objective (SLO) targets
Lead and mentor a team of engineers and contingent datacenter technicians
Develop custom cluster plans for network and compute capacities
Manage operations at scale, involving production datacenter space generating multi-billion-dollar revenue streams
Technical Skills Required
Hyperscale datacenter operations
Compute and network infrastructure
Mechanical and electrical (M&E) systems
Control systems monitoring and commissioning
Custom capacity planning
AI/GPU hardware platforms
Benefits & Perks
High-visibility, high-stakes leadership opportunity
Instrumental in scaling operational capabilities to meet future demands of artificial intelligence
Opportunity to work with a rapidly expanding datacenter footprint

Job Description




Join an ambitious, hyper-growth startup focused on building world-class AI infrastructure. This critical leadership role will directly oversee the operational readiness and performance of their rapidly expanding datacenter footprint, which is slated for a build-out of 300MW by 2026, with the first 50MW facility going live in January 2026. You will be responsible for ensuring that their compute and network infrastructure consistently meets aggressive performance and capacity targets, providing the essential foundation for next-generation AI platforms.


This is a high-visibility, high-stakes leadership opportunity where your expertise will be instrumental in scaling their operational capabilities to meet the future demands of artificial intelligence.



Responsibilities

  • Provide full rack lifecycle support across a growing fleet of hyperscale datacenters, including asset receipt, relocation, configuration, and decommissioning.
  • Ensure datacenter delivery workflows maintain consistent performance in line with business Service Level Objective (SLO) targets.
  • Oversee and ensure that new data halls, network rooms, and supporting cooling and electrical infrastructure meet or exceed server readiness dates for new AI deployments.
  • Identify and drive cross-functional workflow improvements, representing the interests of regional delivery teams to standardise processes globally.
  • Lead and mentor a team of engineers and contingent datacenter technicians across multiple facilities, overseeing the end-to-end lifecycle of compute and network infrastructure.
  • Develop custom cluster plans for network and compute capacities based on upstream demand signals, power, available floor space, and mechanical system capabilities to fully leverage resources.
  • Manage operations at scale, involving production datacenter space generating multi-billion-dollar revenue streams and operating at 200+ MW.
  • Oversee the construction, commissioning, and validation of mechanical and electrical systems for new data hall builds.
  • Pilot and operationalise strategic initiatives, such as rack relocations and lifecycle extension programs, focused on driving operational efficiencies and cost avoidance.
  • Direct local teams in the successful deployment and stabilisation of new AI GPU platforms, including advanced hardware configurations.




Required Skills & Experience

  • Proven experience leading large, multi-site datacenter operations encompassing 200+ MW.
  • Deep expertise in the end-to-end lifecycle of hyperscale compute and network infrastructure (receiving, deployment, decommissioning, strategic reuse).
  • Advanced knowledge of mechanical and electrical (M&E) systems, including control systems monitoring and commissioning within a critical datacenter environment.
  • Demonstrated ability to build complex, custom capacity plans, optimising distribution based on power, floor space, and cooling capabilities.
  • Experience with the landing and operationalisation of cutting-edge AI/GPU hardware platforms.
  • Experience working at sites of 200+ MW scale.
