Site Reliability Engineer / Platform Engineer

James Adams United Kingdom
Remote
Apply
AI Summary

Join a growing digital marketplace platform as a Site Reliability Engineer / Platform Engineer to improve platform reliability, scalability, and performance. This role combines hands-on engineering with operational responsibility and offers the chance to shape reliability processes, tooling, and best practices. The successful candidate will play a key role in building a more proactive reliability function.

Key Highlights
Improve platform reliability, scalability, and performance
Take ownership of platform reliability, monitoring, automation, incident response, and operational improvements
Work closely with engineering teams to reduce repeat incidents and improve resilience
Key Responsibilities
Improve the reliability, scalability, and performance of the platform through upgrades and service improvements
Identify opportunities for automation to reduce manual operational workload
Build internal tooling, dashboards, and admin utilities to support operational efficiency
Technical Skills Required
AWS environments and cloud infrastructure Linux / Unix systems, networking, and system security Scripting or building tooling using languages such as Python or Bash Monitoring and observability tooling such as CloudWatch, Datadog, or similar
Benefits & Perks
Fully remote position
£50,000 - £55,000 annual salary

Job Description


Site Reliability Engineer / Platform Engineer

Fully Remote | £50,000 - £55,000


A growing digital marketplace platform is looking for a Site Reliability Engineer / Platform Engineer to help scale and improve the reliability, resilience, and operational maturity of a high traffic consumer platform used by millions of fans.


This is an opportunity to join a business where customer experience and platform stability are genuinely central to the engineering culture. The successful candidate will play a key role in building a more proactive reliability function, helping move operational ownership from reactive support into a modern, scalable engineering practice.


The role would suit someone already working within SRE, Platform Engineering or DevOps, but the business is equally open to software engineers with strong operational experience who are looking to transition further into infrastructure and reliability engineering.


The Role

Working closely with the wider engineering function, this individual will take ownership of platform reliability, monitoring, automation, incident response, and operational improvements across the production environment.


The position combines hands on engineering with operational responsibility and offers the chance to shape reliability processes, tooling, and best practice within a scaling technology business.


Key Responsibilities

Platform Operations & Reliability

  • Improve the reliability, scalability, and performance of the platform through upgrades and service improvements
  • Identify opportunities for automation to reduce manual operational workload
  • Build internal tooling, dashboards, and admin utilities to support operational efficiency
  • Own service monitoring and platform health visibility across production systems
  • Monitor newly released features and provide rapid operational feedback to engineering teams
  • Support configuration management and performance optimisation
  • Monitor bot protection and platform security tooling
  • Support technical onboarding and configuration for new partners and integrations


Incident Management & Support

  • Participate in an on-call rota and respond to platform or partner incidents
  • Lead issue resolution for operational and reliability related problems
  • Coordinate escalation of more complex technical issues where required
  • Run blameless post-mortems and help drive continuous improvement initiatives
  • Work closely with engineering teams to reduce repeat incidents and improve resilience


Platform & Engineering Standards

  • Support CI/CD pipelines and release management processes
  • Help manage technical platform requirements including accessibility, SEO, and app ecosystem compliance
  • Manage production accounts, certifications, renewals, and operational governance
  • Contribute to disaster recovery planning and testing


Skills & Experience

  • Experience within an SRE, DevOps, Platform Engineering, Infrastructure, or operationally focused software engineering role
  • Strong understanding of AWS environments and cloud infrastructure
  • Experience working with Linux / Unix systems, networking, and system security
  • Ability to script or build tooling using languages such as Python or Bash
  • Experience with monitoring and observability tooling such as CloudWatch, Datadog, or similar
  • Understanding of CI/CD pipelines and modern SDLC practices
  • Comfortable working within Agile engineering teams
  • Experience supporting high availability or customer facing digital platforms would be beneficial


Desirable Experience

  • Infrastructure as Code (IaC)
  • Experience automating operational processes
  • Exposure to mobile app release environments across iOS and Android
  • Understanding of GDPR or production environment compliance standards
  • Experience within marketplace, ecommerce, ticketing, or live event platforms


The Opportunity

This is a fully remote position offering the chance to join a collaborative engineering environment where reliability and customer experience are key priorities.


The business is looking for someone who enjoys solving operational challenges, improving systems, and helping engineering teams build resilient, scalable services in a fast moving product environment.


Similar Jobs

Explore other opportunities that match your interests

DevOps Engineer

Devops
1h ago
Visa Sponsorship Relocation Remote
Job Type Other
Experience Level Not Applicable

atom learning

United Kingdom
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

MBN Solutions

United Kingdom
Visa Sponsorship Relocation Remote
Job Type Contract
Experience Level Not Applicable

io associates

United Kingdom

Subscribe our newsletter

New Things Will Always Update Regularly