Data Engineer for Embodied AI and Vision-Language-Action Models

sensmore • Germany
Relocation
Apply
AI Summary

Design, build, and maintain data infrastructure for Sensmore's embodied AI and Vision-Language-Action Models. Collaborate with Robotics, ML, and Software engineers to ensure clean, reliable data flows. Blend classic data engineering with ML Ops best practices.

Key Highlights
Design and operate data pipelines
Collaborate with cross-functional teams
Ensure data quality and performance
Key Responsibilities
Build and operate data pipelines
Design scalable storage
Enable ML Ops workflows
Ensure data quality
Collaborate cross-functionally
Optimize performance
Document and evangelize
Technical Skills Required
Python SQL AWS GCP Azure DVC MLflow Kubeflow Redshift Snowflake BigQuery Delta Lake Parquet Avro
Benefits & Perks
Attractive compensation package
Stock options
Beverages on-site
Regular social events
Nice to Have
Background in robotics or sensor data
Knowledge of real-time data processing and edge-computing constraints
Experience with infrastructure as code and CI/CD for data workflows

Job Description


sensmore automates the world's largest machines with unprecedented intelligence. Our proprietary Physical AI enables heavy machines such as wheel loaders to instantly adapt to dynamic environments and execute new tasks without prior training.

We integrate cutting-edge robotics into a platform powering intelligence and automation products - transforming productivity and safety for customers in mining, construction, and adjacent industries today.

Join us and play a pivotal role in transforming the automation landscape in heavy industries.

Role Overview:

As our Data Engineer, you will design, build, and maintain the data infrastructure that powers Sensmore’s embodied AI and Vision-Language-Action Models (VLAMs). You’ll collaborate with Robotics, ML and Software engineers to ensure clean, reliable data flows from our sensor arrays (radar, LiDAR, cameras, IMUs) into training and inference pipelines. This role blends classic data engineering (ETL/ELT, warehouse design, monitoring) with ML Ops best practices: model versioning, data drift detection, and automated retraining.

Key Responsibilities:

  • Build & operate data pipelines: Ingest, process, and transform multi-sensor telemetry (radar point-clouds, video frames, log streams) into analytics-ready and ML-ready formats.
  • Design scalable storage: Architect high-throughput, low-latency data lakes and warehouses (e.g., S3, Delta Lake, Redshift/Snowflake).
  • Enable ML Ops workflows: Integrate DVC or MLflow, automate model training/retraining triggers, track data/model lineage.
  • Ensure data quality: Implement validation, monitoring, and alerting to catch anomalies and schema changes early.
  • Collaborate cross-functionally: Partner with Embedded Systems, Robotics, and Software teams to align on data schemas, APIs, and real-time requirements.
  • Optimize performance: Tune distributed processing, queries, and storage layouts for cost-efficiency and throughput.
  • Document & evangelize: Maintain clear documentation for data schemas, pipeline architectures, and ML Ops practices to uplift the whole team.

Required Qualifications:

  • 3+ years of hands-on experience building production data pipelines in the cloud (AWS, GCP, or Azure).
  • Proficiency in Python, SQL, and at least one big-data framework.
  • Familiarity with ML Ops tooling: DVC, MLflow, Kubeflow, or similar.
  • Experience designing and operating data warehouses/data lakes (e.g., Redshift, Snowflake, BigQuery, Delta Lake).
  • Strong understanding of distributed systems, data serialization (Parquet, Avro), and batch vs. streaming paradigms.
  • Excellent problem-solving skills and the ability to work in ambiguous, fast-paced environments.

Preferred Skills:

  • Background in robotics or sensor data (radar, LiDAR, camera pipelines).
  • Knowledge of real-time data processing and edge-computing constraints.
  • Experience with infrastructure as code (Terraform, CloudFormation) and CI/CD for data workflows.
  • Familiarity with Kubernetes and containerized deployments.
  • Exposure to vision-language or action-planning ML models.

What we offer:

  • Build physical AI for the world's largest off-highway machinery – making them intelligent, safe, and ready for every tough task
  • Join the pioneer in intelligent robotics backed by Point Nine & other Tier 1 investors
  • Combine cutting-edge robotics research in end-to-end learning & Vision Language Action Model with real-world heavy mobile equipment
  • Tailor your own career path, whether you like to become technical specialist or technical team lead
  • Experience a great team culture, beverages, and an amazing office environment

Benefits:

  • Attractive compensation package and stock options.
  • Beverages on-site and regular social events.
  • Engage with top-tier researchers, engineers, and thought leaders.
  • Influence the future of robotic technologies and tackle significant technological challenges.
  • Assistance with relocation to Berlin.

About Us:

Heavy machinery, light years ahead.

sensmore automates the world's largest machines with unprecedented intelligence. Our proprietary Physical AI enables heavy machines such as wheel loaders to instantly adapt to dynamic environments and execute new tasks without prior training.

We integrate cutting-edge robotics into a platform powering intelligence and automation products - transforming productivity and safety for customers in mining, construction, and adjacent industries today.

We are proudly backed by Point Nine and other Tier 1 investors.


Similar Jobs

Explore other opportunities that match your interests

Business Operations Analyst

Data Science
•
10h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

adjoe

Germany

Data Analyst - Marketing Strategy and Intelligence

Data Science
•
1d ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

trivago

Germany

Data Scientist

Data Science
•
2d ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

omegga

Germany

Subscribe our newsletter

New Things Will Always Update Regularly