Senior Data Engineer (Spark Specialist)

retailogists β€’ Canada
Remote
Apply
AI Summary

We're looking for a Senior Data Engineer with expertise in Spark to lead the development of large-scale data pipelines on AWS Glue. As a Spark specialist, you'll design, build, and maintain pipelines that move, transform, and reshape large volumes of retail data. You'll work with analysts, data scientists, and client teams to land production-ready data where it needs to go.

Key Highlights
Lead the development of large-scale data pipelines on AWS Glue
Design, build, and maintain pipelines that move, transform, and reshape large volumes of retail data
Work with analysts, data scientists, and client teams to land production-ready data
Key Responsibilities
Designing, building, and maintaining large-scale Spark pipelines on AWS Glue
Tuning Spark jobs for performance and cost
Debugging and stabilizing production Spark workloads
Technical Skills Required
Spark AWS Glue PySpark Scala Parquet Iceberg Delta Hudi S3 Athena Lake Formation Step Functions EMR
Benefits & Perks
Flexible hours
Collaborative culture
High-impact work
Direct exposure to clients and real business problems
Nice to Have
Experience with cloud data warehouses (Redshift, Snowflake, BigQuery)
Familiarity with dbt and semantic-layer modelling
Exposure to BI tooling (Metabase, Looker Studio, Power BI, etc.)

Job Description


πŸš€ Data Engineer (Spark Specialist)


Location: Remote

Experience Level: Senior (5+ years)

Type: Full-time, Permanent


About Retailogists Retailogists is a fast-growing startup at the intersection of retail consulting and technology. We combine deep retail domain expertise with technical excellence in big data, full-stack engineering, and AI/ML. Our clients range from fast-scaling digital brands to large, multi-location retailers.

We're a nimble team of technologists, consultants, and builders β€” and we're looking for a Senior Spark Engineer who lives and breathes distributed data processing. If tuning a misbehaving Spark job is your idea of a good afternoon, we want to talk to you.


What You'll Do As our Spark specialist, you'll play a leadership role the heavy-lifting layer of our clients' data platform: the pipelines that move, transform, and reshape large volumes of retail data for both internal tools and client-facing products. Most of this runs on AWS Glue today, and you'll be the person we turn to for getting it fast, reliable, and cost-efficient.


Responsibilities include:

  • Designing, building, and maintaining large-scale Spark pipelines on AWS Glue (PySpark and/or Scala)
  • Tuning Spark jobs for performance and cost β€” partitioning, shuffles, joins, caching, executor sizing, the works
  • Debugging and stabilizing production Spark workloads, including spill, skew, and OOM issues
  • Architecting batch and incremental ETL/ELT patterns across S3-based data lakes (Parquet, Iceberg, Delta, or Hudi)
  • Integrating Glue with the broader AWS data stack (S3, Athena, Lake Formation, Step Functions, EMR where relevant)
  • Establishing engineering standards for Spark code β€” testing, modularity, reusability, and CI/CD for Glue jobs
  • Partnering with analysts, data scientists, and client teams to land production-ready data where it needs to go


What We're Looking For (must-haves)

  • 5+ years of professional data engineering experience, with a heavy Spark focus
  • Deep, hands-on Spark expertise: you understand the execution model, the Catalyst optimizer, and how to read a Spark UI to find the real bottleneck
  • Strong production experience with AWS Glue β€” Glue jobs, Glue Catalog, crawlers, bookmarks, and the quirks that come with them
  • Proficiency in PySpark (Scala is a plus)
  • Comfort working with columnar formats and modern lakehouse table formats (Parquet, Iceberg, Delta, or Hudi)
  • Solid SQL fundamentals


Nice to Have

  • Experience with cloud data warehouses (Redshift, Snowflake, BigQuery)
  • Familiarity with dbt and semantic-layer modelling
  • Exposure to BI tooling (Metabase, Looker Studio, Power BI, etc.)
  • Background in analytics engineering or BI workflows
  • Orchestration experience (Airflow, Step Functions, Dagster)
  • Retail or e-commerce data experience


Work Environment

  • Fully remote with the option to use offices in Montreal / Toronto
  • Flexible hours, collaborative culture, and high-impact work
  • Direct exposure to clients and real business problems β€” your pipelines will power decisions, not sit in a backlog



Similar Jobs

Explore other opportunities that match your interests

Business Analyst, IT Business Solutions

Data Science
β€’
3d ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

westland insurance group ltd.

Canada

Azure Migration Business Analyst

Data Science
β€’
4d ago
Visa Sponsorship Relocation Remote
Job Type Contract
Experience Level Mid-Senior level

peopletogo inc.

Canada

Senior Data Analyst, Fraud and Identity

Data Science
β€’
4d ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Associate

Jobgether

Canada

Subscribe our newsletter

New Things Will Always Update Regularly