Senior Site Reliability Engineer / Network Engineer (MAAS)

Jobgether • United States
Remote
AI Summary

Design, operate, and automate large-scale bare-metal and cloud-adjacent infrastructure. Ensure reliability across distributed systems. Work autonomously in a fast-paced, highly distributed environment. Influence next-generation cloud infrastructure stability and scalability.

Key Highlights
Design and maintain large-scale Linux-based infrastructure
Manage bare-metal systems and MAAS-based provisioning workflows
Implement and manage network architectures using VLANs, L2/L3 routing, and VPNs
Automate infrastructure provisioning and operations using Ansible, Bash, Python, and Git
Key Responsibilities
Operate and maintain large-scale Linux-based infrastructure
Manage bare-metal systems at the hardware level
Design, implement, and maintain scalable network architectures
Automate infrastructure provisioning and operations
Build and maintain MAAS-based provisioning workflows
Implement and manage observability stacks
Develop internal tooling and APIs
Deploy and support virtualization and containerization platforms
Technical Skills Required
Linux System Administration
MAAS
Networking Fundamentals
Ansible
Python
Bash
Git
Observability Stacks (Prometheus, Grafana, ELK/Graylog, Loki)
Virtualization and Containerization Platforms (OpenStack, Proxmox, VMware, Kubernetes)
Benefits & Perks
Competitive compensation
Fully remote role with flexibility across LATAM / global distributed teams
Opportunity to work on cutting-edge decentralized cloud infrastructure

Job Description


This position is listed on behalf of a partner company, which manages all applications and next steps. Our partner is looking for an SRE / Network Engineer (MAAS) based in the United States.

This role is centered on designing, operating, and automating large-scale bare-metal and cloud-adjacent infrastructure in a highly distributed environment. You will work at the core of a decentralized compute platform focused on performance, efficiency, and sovereignty over cloud resources. The position combines deep systems engineering, networking expertise, and infrastructure automation, with a strong emphasis on Metal-as-a-Service (MAAS) environments. You will be responsible for ensuring reliability across hundreds of nodes spanning multiple sites while building the tooling needed to scale operations efficiently. The environment is fast-paced and highly autonomous, requiring strong ownership and problem-solving skills. Your work will directly influence the stability, scalability, and evolution of next-generation cloud infrastructure.

Accountabilities

  • Operate and maintain large-scale Linux-based infrastructure (Debian/Ubuntu), ensuring reliability and performance across distributed systems.
  • Manage bare-metal systems at the hardware level, including BIOS configurations, IPMI, RAID setups, and diagnostic troubleshooting.
  • Design, implement, and maintain scalable network architectures using VLANs, L2/L3 routing, VPNs, and enterprise-grade networking equipment.
  • Automate infrastructure provisioning and operations using Ansible, Bash, Python, and Git-based workflows to support Infrastructure-as-Code practices.
  • Build and maintain MAAS-based provisioning workflows, including PXE booting, Preseed/Cloud-init automation, and OS deployment pipelines.
  • Implement and manage observability stacks using tools such as Prometheus, Grafana, ELK/Graylog, or Loki for metrics, logs, and system insights.
  • Develop internal tooling and APIs for compute and GPU resource tracking, infrastructure monitoring, and system integrations.
  • Deploy and support virtualization and containerization platforms such as OpenStack, Proxmox VE, VMware ESXi, and container orchestration systems.

Requirements

  • Strong expertise in Linux system administration, particularly Debian and Ubuntu environments.
  • Hands-on experience with MAAS, Ironic, or other bare-metal provisioning and automation systems.
  • Solid understanding of networking fundamentals, including VLANs, routing, VPNs, and multi-site infrastructure design.
  • Proven experience with Infrastructure-as-Code tools such as Ansible and scripting languages like Bash and Python.
  • Familiarity with observability and monitoring stacks including Prometheus, Grafana, ELK/Graylog, or Loki.
  • Experience with automated deployment workflows (PXE, Preseed, Cloud-init) and infrastructure automation pipelines.
  • Background in virtualization and orchestration technologies such as OpenStack, Proxmox, VMware, or Kubernetes-based environments.
  • Ability to work autonomously in fast-paced, high-growth or startup-like environments with strong problem-solving skills.

Benefits

  • Competitive compensation aligned with experience and market benchmarks
  • Fully remote role with flexibility across LATAM / global distributed teams
  • Opportunity to work on cutting-edge decentralized cloud infrastructure
  • High-impact engineering role with ownership over production-scale systems
  • Exposure to advanced bare-metal, networking, and cloud orchestration technologies
  • Fast-paced, autonomy-driven environment with strong technical ownership
  • Continuous learning opportunities in large-scale infrastructure and distributed systems
  • Inclusive and collaborative engineering culture focused on innovation and reliability

How Jobgether Works

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.

We appreciate your interest and wish you the best!

Why Apply Through Jobgether?

Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, assessing responses, and identifying potential inconsistencies or verification signals in application materials. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are made by humans. If you would like more information about how your data is processed, please contact us.

