Senior Site Reliability Engineer

3b staffing llc United State
Remote
Apply
AI Summary

Seeking a highly experienced Senior Site Reliability Engineer to design, implement, and support Kubernetes on baremetal and hypervisor platforms. Expert-level understanding of compute hardware management, Kubernetes, OpenStack, and Linux Operating systems required. Collaborate with platform and SRE teams to maintain secure, performant, and multi-tenant-isolated services.

Key Highlights
Design, implement, and support Kubernetes on baremetal and hypervisor platforms
Expert-level understanding of compute hardware management, Kubernetes, OpenStack, and Linux
Collaborate with platform and SRE teams to maintain secure, performant, and multi-tenant-isolated services
Technical Skills Required
Kubernetes OpenStack Linux Redfish APIs PXE boot Ansible Terraform Python Bash KVM
Benefits & Perks
Contract, with potential for Contract-to-hire

Job Description


Title: Sr. Site Reliability Engineer (Compute Platform) 

Visa: USC only

Job Type: Contract, with potential for Contract-to-hire – the client only wants to see candidates that are willing to convert to full-time employment for this role that do not require any type of sponsorship.

Worksite Requirement: Fully Remote 

Interview Process: 2-3 rounds of video conference interviews

 

Note: I need Only US citizen, not any other visa.


Job Summary:

We are seeking a highly experienced Sr Site Reliability Engineer – Compute Platforms to design, implement, and support Kubernetes on baremetal and hypervisor platforms in a private cloud environment. This role is responsible for the architecture, design, and

standardization of enterprise compute and hypervisor environments spanning bare metal infrastructure, operating systems, hypervisors, private cloud orchestration, and Kubernetes using Infrastructure-as-Code and GitOps practices.

 

This is a deeply technical role requiring expert-level understanding of compute hardware management, Kubernetes, OpenStack, hypervisors and extensive working knowledge on Linux Operating systems. You will also collaborate with platform and SRE teams to maintain secure, performant, and multi-tenant-isolated services that serve high-throughput, mission critical applications.

 

Key Responsibilities

  • Lead the architecture and design of enterprise compute and hypervisor platform solutions across hardware, OS, virtualization, cloud orchestration, and container orchestration layers
  • Define standards and automation frameworks for bare metal provisioning and lifecycle management
  • Design and implement Bare Metal as a Service (BMaaS) capabilities for scalable infrastructure consumption
  • Architect and design Kubernetes platforms on bare metal with QoS and Affinity (ArgoCD)
  • Architect and validate automated deployments of operating systems and hypervisors including Ubuntu and Harvester
  • Design and maintain PXE-based provisioning environments leveraging Redfish APIs for large-scale server deployments
  • Develop Infrastructure-as-Code using Ansible, Terraform, Helm and Git, with Python/Bash automation.
  • Implement CI/CD pipelines for infrastructure updates, patching, upgrades, testing, and rollback.
  • Design automated workflows for server build, firmware lifecycle management, patching, and hardware validation
  • Evaluate and standardize enterprise hardware platforms to meet performance, scalability, and reliability requirements
  • Produce detailed high-level and low-level design documentation, build guides, and operational handoff materials
  • Perform deep troubleshooting across storage, Kubernetes, hypervisors, networking, and Linux systems
  • Partner with operations, network, storage, and platform teams to ensure designs are supportable and production-ready
  • Participate in on-call escalation support for complex platform-related issues
  • Collaborate globally on change management, documentation, and operational best practices.


Must Have: 

  • 8+ years of experience in infrastructure engineering, platform engineering, or DevOps with a strong focus on Compute system design
  • Proven experience designing and automating bare metal compute environments at scale
  • Strong hands-on experience with PXE boot, network-based OS provisioning, and automated server imaging
  • Experience implementing or supporting Bare Metal as a Service (BMaaS) platforms
  • Practical experience using Redfish APIs for hardware provisioning, power management, and remote lifecycle operations
  • Deep expertise with Ubuntu Linux in enterprise environments
  • Strong Hands-on experience with KVM hypervisors (Suse Harvester, OpenStack).
  • Experience designing and deploying production-grade Kubernetes clusters
  • Strong background with enterprise compute hardware platforms, including Cisco UCS, Dell PowerEdge, Supermicro systems & HPE
  • Proficiency with Infrastructure as Code tools (e.g., Terraform, Ansible, or similar)
  • Experience building or supporting CI/CD pipelines for infrastructure and platform automation
  • Strong scripting skills in Python, Bash, or similar languages


  • OpenStack, Ubuntu KVM administration.
  • BareMetal as a Service (PXE, Redfish).
  • Kubernetes on BareMetal
  • CIS/NIST security and infrastructure lifecycle management.
  • ITIL Foundation/advanced certifications in support of ITSM standard methodology.
  • Background in telco, edge cloud, or large enterprise environments.
  • Ubuntu Certifications, CNCF Certified Kubernetes Administrator (CKA), Certified Kubernetes Security Specialist (CKS)
  • Master’s degree in computer science, IT, Engineering, or a related field preferred; equivalent experience and relevant industry certifications will also be considered.



Thanks & Regards

Shivam Mishra

Technical Recuriter

3B Staffing LLC

Contact: (973)-384-0655

Email: Shivam.mishra@3bstaffing.com 

485B US Highway 1 S, STE 300, Iselin, New Jersey 08830

www.3bstaffing.com

USA|Canada|India



Similar Jobs

Explore other opportunities that match your interests

Senior Director of Cloud Operations

Devops
1h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

horizon3.ai

United State

Senior Azure Engineer

Devops
1h ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

Amicus

United State

AWS Cloud Administrator

Devops
3h ago
Visa Sponsorship Relocation Remote
Job Type Contract
Experience Level Mid-Senior level

IO Datasphere, Inc.

United State

Subscribe our newsletter

New Things Will Always Update Regularly