DevOps/Observability Engineer

Jobgether • Canada
Remote
Apply
AI Summary

Design and build a next-generation observability ecosystem, leading architecture of unified telemetry pipelines, and collaborating with engineering teams to integrate observability into CI/CD workflows and production systems. Strong technical expertise in DevOps, cloud infrastructure, and observability engineering required. 8+ years of experience in DevOps, SRE, or observability engineering roles.

Key Highlights
Design and build a next-generation observability ecosystem
Lead architecture of unified telemetry pipelines
Collaborate with engineering teams to integrate observability into CI/CD workflows and production systems
Key Responsibilities
Design, implement, and evolve a unified observability platform that supports large-scale distributed systems and ensures operational visibility across environments
Architect and implement end-to-end observability pipelines using OpenTelemetry, Prometheus, Grafana, and related tooling in AWS environments
Design scalable log, metric, and trace collection strategies, including cross-account AWS telemetry integration and centralized monitoring frameworks
Build and optimize log aggregation, filtering, and routing systems, including integrations with Splunk and other enterprise tools
Develop advanced alerting, dashboards, and monitoring solutions using PromQL, CloudWatch, and Alertmanager
Implement Infrastructure as Code using Terraform to deploy and manage observability and cloud infrastructure components
Support Kubernetes-based observability across EKS/ECS environments, ensuring full-stack visibility and reliability
Drive cost optimization initiatives by improving telemetry efficiency, storage strategies, and data filtering approaches
Technical Skills Required
OpenTelemetry Prometheus Grafana Splunk CloudWatch Terraform Kubernetes AWS Python Go
Benefits & Perks
Competitive compensation aligned with experience and market benchmarks
Fully remote work setup across Canada
Opportunity to work on large-scale, cloud-native systems and cutting-edge observability platforms
Exposure to advanced AI, cloud, and distributed engineering environments
Career growth within a high-performance, innovation-driven engineering culture

Job Description


This position is posted by Jobgether on behalf of a partner company. We are currently looking for a DevOps/Observability Engineer in Canada.

This role is focused on designing and building a next-generation observability ecosystem that enables deep visibility across large-scale, distributed cloud environments. You will lead the architecture of unified telemetry pipelines, ensuring logs, metrics, and traces are efficiently collected, processed, and analyzed. Working within a modern AWS-based infrastructure, you will leverage OpenTelemetry, Kubernetes, and industry-leading monitoring tools to enhance system reliability and performance. The environment is highly technical, cloud-native, and centered on automation, scalability, and continuous improvement. You will collaborate closely with engineering teams to integrate observability into CI/CD workflows and production systems. This position offers the opportunity to shape enterprise-wide monitoring standards and directly influence operational excellence at scale.

Accountabilities

In this role, you will design, implement, and evolve a unified observability platform that supports large-scale distributed systems and ensures operational visibility across environments.

  • Architect and implement end-to-end observability pipelines using OpenTelemetry, Prometheus, Grafana, and related tooling in AWS environments
  • Design scalable log, metric, and trace collection strategies, including cross-account AWS telemetry integration and centralized monitoring frameworks
  • Build and optimize log aggregation, filtering, and routing systems, including integrations with Splunk and other enterprise tools
  • Develop advanced alerting, dashboards, and monitoring solutions using PromQL, CloudWatch, and Alertmanager
  • Implement Infrastructure as Code using Terraform to deploy and manage observability and cloud infrastructure components
  • Support Kubernetes-based observability across EKS/ECS environments, ensuring full-stack visibility and reliability
  • Drive cost optimization initiatives by improving telemetry efficiency, storage strategies, and data filtering approaches
  • Collaborate with engineering and platform teams to embed observability into deployment pipelines and production systems

Requirements

This position requires strong technical expertise in DevOps, cloud infrastructure, and observability engineering, with a proven ability to build scalable monitoring systems in complex environments.

  • 8+ years of experience in DevOps, SRE, or observability engineering roles
  • Strong expertise in AWS cloud services and multi-account observability architectures
  • Hands-on experience with OpenTelemetry, Prometheus, Grafana, Splunk, and CloudWatch
  • Strong proficiency with Infrastructure as Code tools, particularly Terraform
  • Advanced programming/scripting skills (Python, Go, or similar) for automation and tooling
  • Experience with Kubernetes (EKS) and containerized environments (Docker, ECS)
  • Deep understanding of logging, metrics, tracing, and distributed system observability principles
  • Strong analytical, problem-solving, and systems-thinking abilities with a focus on scalability and reliability
  • Excellent communication skills and ability to work in cross-functional, fast-paced engineering teams

Benefits

  • Competitive compensation aligned with experience and market benchmarks
  • Fully remote work setup across Canada
  • Opportunity to work on large-scale, cloud-native systems and cutting-edge observability platforms
  • Exposure to advanced AI, cloud, and distributed engineering environments
  • Career growth within a high-performance, innovation-driven engineering culture
  • Collaborative and knowledge-sharing work environment with global teams
  • Continuous learning opportunities and access to modern DevOps and cloud technologies
  • Inclusive and flexible work culture supporting work-life balance.

How Jobgether Works

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.

We appreciate your interest and wish you the best!

Why Apply Through Jobgether?

Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.


Similar Jobs

Explore other opportunities that match your interests

Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

breed staffing

Canada

Cloud and Infrastructure Engineer

Devops
•
3d ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

Jobgether

Canada

SRE Operations Engineer

Devops
•
1w ago
Visa Sponsorship Relocation Remote
Job Type Contract
Experience Level Mid-Senior level

net2source (n2s)

Canada

Subscribe our newsletter

New Things Will Always Update Regularly