Job Description
Our client is looking for a Senior Site Reliability Engineer (SRE) to help scale and optimise critical infrastructure in preparation for a significant increase in traffic. You'll collaborate closely with the founding team, take charge of essential components, and ensure system stability under high demand.
Responsibilities:
- Design and maintain distributed databases at a large scale (ScyllaDB).
- Build high-throughput monitoring infrastructure handling 1M+ series per second.
- Automate everything from monitoring dashboards to database installations and cluster management.
- Implement Kubernetes Pod Autoscalers to maximise CPU efficiency while maintaining service quality.
- Collaborate daily with backend engineers and external partners (GCP, ScyllaDB) to maximise the potential of technology.
- Build CI/CD pipelines and develop tools to accelerate company growth.
Requirements:
- Deep knowledge of system architecture and software engineering principles.
- Strong command of a low-level programming language; Rust or Go.
- Practical experience with setting up and managing monitoring and alerting systems.
- Advanced proficiency in Kubernetes and cloud services such as AWS or GCP.
- Proven ability to work closely and effectively with backend engineering teams.
Why Join?
Work culture & benefits
- Onsite work in a central Paris office (on site 5 days per week and commuting expenses fully covered).
- Competitive benefits: 100% healthcare coverage, generous parental leave, and two company-wide shutdowns (two weeks in summer, one week in winter).
- Relocation support: Sponsored visa, Airbnb covered for a month, and help with French paperwork.
If you're ready to build world-class infrastructure, we’d love to hear from you!