Senior Site Reliability Engineer

Haystack United Kingdom
Remote
This Job is No Longer Active This position is no longer accepting applications
AI Summary

High-impact Senior SRE role where you will be the architect of reliability for a massive distributed systems landscape. Design, deploy, and scale high-performance observability platforms and Prometheus monitoring systems. Drive 'Infrastructure as Code' (IaC) initiatives and build custom internal tools.

Key Highlights
High-impact Senior SRE role
Design, deploy, and scale high-performance observability platforms
Drive 'Infrastructure as Code' (IaC) initiatives
Key Responsibilities
Design, deploy, and scale high-performance observability platforms and Prometheus monitoring systems
Architect and maintain massive Elasticsearch clusters and robust data pipelines leveraging Kafka for real-time streaming
Drive 'Infrastructure as Code' (IaC) initiatives by automating complex cloud environments using Terraform and Ansible
Build custom internal tools and sophisticated automation scripts using Python, Go, or Ruby to eliminate toil and boost system performance
Optimize Linux systems (Debian/Ubuntu) and participate in a collaborative on-call rotation to maintain 24/7 service availability
Technical Skills Required
Prometheus Kafka ELK Stack Elasticsearch Logstash Kibana Terraform Ansible Python Go Ruby Linux systems administration
Benefits & Perks
Competitive day rate of £55 - £62 per hour
Long-term stability with an initial 12-month contract
100% remote working flexibility

Job Description


SRE - Site Reliability Engineer | £55 - £62

We're working with a global technology powerhouse supporting millions of connected devices on this exciting opportunity.

Step into a high-impact Senior SRE role where you will be the architect of reliability for a massive distributed systems landscape. You will take the lead on scaling mission-critical observability and monitoring platforms using a cutting-edge stack including Prometheus, Kafka, and the ELK stack to ensure seamless performance for a global user base.

The Role

  • Design, deploy, and scale high-performance observability platforms and Prometheus monitoring systems to support millions of global devices.
  • Architect and maintain massive Elasticsearch clusters and robust data pipelines leveraging Kafka for real-time streaming.
  • Drive "Infrastructure as Code" (IaC) initiatives by automating complex cloud environments using Terraform and Ansible.
  • Build custom internal tools and sophisticated automation scripts using Python, Go, or Ruby to eliminate toil and boost system performance.
  • Optimize Linux systems (Debian/Ubuntu) and participate in a collaborative on-call rotation to maintain 24/7 service availability.

What You'll Need

  • 5+ years of battle-tested experience in Site Reliability Engineering (SRE) or DevOps within enterprise-scale cloud environments.
  • Mastery of the Observability stack, specifically Prometheus, Grafana, and the full ELK Stack (Elasticsearch, Logstash, Kibana).
  • Expert-level Linux systems administration skills and deep knowledge of distributed systems architecture and Kafka messaging.
  • Hands-on proficiency with automation and configuration tools, including Terraform, Ansible, and programming in Python or Golang.
  • The ability to thrive in a fast-paced environment, tackling complex scaling challenges for high-traffic cloud services.

What's On Offer

  • Competitive day rate of £55 - £62 per hour (Inside IR35).
  • Long-term stability with an initial 12-month contract and high potential for extension.
  • 100% remote working flexibility while supporting a premier London-based technology hub.
  • Opportunity to work on a truly global scale, impacting the experience of millions of daily active users.

Apply via Haystack today!


Similar Jobs

Explore other opportunities that match your interests

Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

Oliver Bernard

United Kingdom
Visa Sponsorship Relocation Remote
Job Type Contract
Experience Level Mid-Senior level

Tenth Revolution Group

United Kingdom
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

Oliver Bernard

United Kingdom

Subscribe our newsletter

New Things Will Always Update Regularly