Low-Level Systems Engineer for Emerging ML Hardware

netpreme • United State
Visa Sponsorship Relocation
Apply
AI Summary

We are looking for experienced low-level systems engineers to design device drivers and systems software for emerging ML hardware. The ideal candidate should have experience with communication devices, protocols, and memory models. The role will involve working with both Hardware and ML Software teams to design and implement efficient low-level system layers for our devices.

Key Highlights
Design device drivers and systems software for emerging ML hardware
Develop low-latency, high-throughput data exchange systems between GPUs
Develop high-performance data movement kernels
Define and expose data movement interfaces to high-level ML frameworks
Technical Skills Required
low-level systems programming OS internals PCIe/IO sub-systems memory management CUDA JAX/Pallas ROCM nccl GPUDirect RDMA
Benefits & Perks
competitive salary
incentive-based bonus
early stage equity grant
health insurance
dental insurance
vision insurance
life insurance
401k match
daily lunch stipend
relocation assistance
visa sponsorship

Job Description


About The Role

We are looking for experienced low-level systems (OS, drivers, low-level networking) engineers to lead our effort to design device drivers and systems software for emerging ML hardware.

This project is in the high-performance communication space, therefore an ideal candidate should have experience with communication devices (e.g. NICs, RDMA, CXL, etc.) and protocols, memory models (ideally for GPUs), and a broad understanding of accelerator (GPUs, TPUs, FPGAs, custom compute ASICs) communication and memory models. In this role, you will work with both our Hardware and ML Software teams to design and implement efficient low-level system layers for our devices.

This role will be performed onsite from one of our offices in Santa Clara, CA or Boston, MA.

Essential Duties & Responsibilities

  • Develop low-latency, high-throughput data exchange systems between GPUs;
  • Develop high-performance data movement kernels;
  • Define and expose data movement interfaces to high-level ML frameworks (e.g. PyTorch);
  • Develop Linux device drivers for custom hardware.

Qualifications

  • Hacker mentality.
  • Deep experience with low-level systems programming.
  • Knowledge of OS internals, especially PCIe/IO sub-systems and memory management.
  • Prior experience in accelerator programming (e.g. CUDA, JAX/Pallas, ROCm).
  • Prior experience with collective communication libraries (e.g. nccl).
  • Experience with GPUDirect and RDMA is a strong plus.

Compensation & Benefits

  • Competitive salary commensurate with experience including base salary, incentive-based bonus, and early stage equity grant.
  • Comprehensive benefits including health, dental, vision, and life insurance.
  • Well-equipped, sunny offices in Santa Clara, CA and Boston, MA.
  • Relocation assistance and visa sponsorship.
  • Perks include a daily lunch stipend, 401k match, and more.
  • A collaborative, continuous-learning work environment with smart, dedicated colleagues engaged in developing the next generation of architecture for high-performance computing.

The Opportunity

  • Impact: We are tackling a fundamental challenge at the infrastructure layer: unlocking greater AI capability while dramatically improving efficiency. The work we do here compounds across state-of-the-art AI models, systems, and real-world applications.
  • Timing: Joining now means real ownership of the company and meaningful influence over product direction and execution. You’ll work from first principles, move quickly from insight to execution, and see your contributions directly reflected in what we build.
  • Culture: You’ll work alongside a group of people who care deeply about rigor, clarity, and impact. We value thoughtful disagreement, fast learning, and intellectual fearlessness. This is a place where strong ideas shine, curiosity is encouraged, and growth is a daily practice.

Subscribe our newsletter

New Things Will Always Update Regularly