Senior Site Reliability Engineer

drake international β€’ Japan
Remote
Apply
AI Summary

Transforming global manufacturers' engineering drawings, documents, and supply-chain data management. Full reliability posture, automation-first infrastructure, and direct partnership with engineering. 9+ years of hands-on software development experience and 7+ years in SRE or a closely related role.

Key Highlights
Full reliability posture
Automation-first infrastructure
Direct partnership with engineering
Key Responsibilities
Full reliability posture of the product
Automation-first infrastructure on GCP/GKE
Direct partnership with engineering to embed reliability into product design
Technical Skills Required
Cloud infrastructure IaC (Terraform) Kubernetes CI/CD pipelines Web application development Production troubleshooting GCP Datadog Observability platforms Rust TypeScript
Benefits & Perks
Full remote work
Full-time employment
Nice to Have
GCP hands-on experience
Datadog or observability platforms
Experience scaling SRE culture in a 50+ engineer org
Business-level Japanese (JLPT N2 or equivalent)

Job Description


πŸ”§ Hiring: Senior Site Reliability Engineer (SRE) πŸ“ Tokyo, Japan | Full-time | Full remote


My client is a product team within a B2B SaaS company transforming how global manufacturers manage engineering drawings, documents, and supply-chain data. Their platform is trusted across the manufacturing industry, and they're now preparing for the full-scale launch of a new product built on top of it.


What you'll own:

  • Full reliability posture of the product: monitoring, alerting, SLIs/SLOs, incident response, and post-mortem culture
  • Automation-first infrastructure on GCP/GKE so the team scales without operational drag
  • CI/CD pipelines with a focus on delivery safety and developer productivity
  • Direct partnership with engineering to embed reliability into product design
  • Reliability culture and leadership


What they're looking for:

  • 9+ years of hands-on software development experience
  • 7+ years in SRE, platform engineering, or a closely related role
  • Strong cloud infrastructure background with IaC (Terraform or equivalent)
  • Production-grade Kubernetes experience
  • Experience designing and running CI/CD pipelines with reliability in mind
  • Web application development and production troubleshooting experience


Bonus points for: GCP hands-on experience, Datadog or observability platforms, experience scaling SRE culture in a 50+ engineer org, and business-level Japanese (JLPT N2 or equivalent).


Tech stack highlights:

GCP Β· GKE Β· Terraform Β· ArgoCD Β· GitHub Actions Β· Helm Β· Kustomize Β· Istio Β· Datadog Β· Sentry Β· AlloyDB Β· BigQuery Β· Cloud Pub/Sub Β· Rust Β· TypeScript Β· Cloudflare Β· Auth0


SRE #SiteReliabilityEngineering #DevOps #GCP #Kubernetes #Infrastructure


Similar Jobs

Explore other opportunities that match your interests

Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Not Applicable

Specialized Group

Japan

Director of Engineering

Programming
β€’
1w ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Director

drake international

Japan
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

peoplex inc. active connector...

Japan

Subscribe our newsletter

New Things Will Always Update Regularly