Job Description
Our client is on the lookout for an additional seasoned Senior SRE to bolster their global team and will be relocated to their tech hub in Malaysia.
What You Do.
What You Do.
- Work for large-scale distributed systems and actively contribute to the management of critical incidents
- Offer problem-solving approaches and eliminate obstacles, demonstrating creativity and adaptability
- Spearhead the management of incidents and implement automated solutions for incident resolution within assigned clusters
- Elevate the reliability of products and services by conducting thorough post-incident analyses and minimizing operational burdens
- Champion automation initiatives aimed at simplifying repetitive tasks across diverse toolsets
- Establish Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to uphold superior reliability standards
- Bachelor’s/Master’s degree in Computer Engineering, Computer Science, or relevant field
- Min. 5 years of experience in SRE/DevOps
- Clear communication skills in English with strong problem-solving, customers centric, and critical thinking
- Proficiency in Agile methodologies, and ability to collaborate effectively with diverse teams across different time zones
- Technical; Proficient in Unix commands, strong understanding of cloud concepts, DevOps methodologies, and CI/CD pipelines and tools
- Expertise in Java full-stack/Python development
- Experience with microservices development, auto-scaling, and REST API integration