Low-Level Systems Engineer for Emerging ML Hardware

netpreme • United State

Visa Sponsorship Relocation

Apply

AI Summary

We are looking for experienced low-level systems engineers to design device drivers and systems software for emerging ML hardware. The ideal candidate should have experience with communication devices, protocols, and memory models. The role will involve working with both Hardware and ML Software teams to design and implement efficient low-level system layers for our devices.

Key Highlights

Design device drivers and systems software for emerging ML hardware

Develop low-latency, high-throughput data exchange systems between GPUs

Develop high-performance data movement kernels

Define and expose data movement interfaces to high-level ML frameworks

Technical Skills Required

low-level systems programming OS internals PCIe/IO sub-systems memory management CUDA JAX/Pallas ROCM nccl GPUDirect RDMA

Benefits & Perks

competitive salary

incentive-based bonus

early stage equity grant

health insurance

dental insurance

vision insurance

life insurance

401k match

daily lunch stipend

relocation assistance

visa sponsorship

Job Description

About The Role

We are looking for experienced low-level systems (OS, drivers, low-level networking) engineers to lead our effort to design device drivers and systems software for emerging ML hardware.

This project is in the high-performance communication space, therefore an ideal candidate should have experience with communication devices (e.g. NICs, RDMA, CXL, etc.) and protocols, memory models (ideally for GPUs), and a broad understanding of accelerator (GPUs, TPUs, FPGAs, custom compute ASICs) communication and memory models. In this role, you will work with both our Hardware and ML Software teams to design and implement efficient low-level system layers for our devices.

This role will be performed onsite from one of our offices in Santa Clara, CA or Boston, MA.

Essential Duties & Responsibilities

Develop low-latency, high-throughput data exchange systems between GPUs;
Develop high-performance data movement kernels;
Define and expose data movement interfaces to high-level ML frameworks (e.g. PyTorch);
Develop Linux device drivers for custom hardware.

Qualifications

Hacker mentality.
Deep experience with low-level systems programming.
Knowledge of OS internals, especially PCIe/IO sub-systems and memory management.
Prior experience in accelerator programming (e.g. CUDA, JAX/Pallas, ROCm).
Prior experience with collective communication libraries (e.g. nccl).
Experience with GPUDirect and RDMA is a strong plus.

Compensation & Benefits

Competitive salary commensurate with experience including base salary, incentive-based bonus, and early stage equity grant.
Comprehensive benefits including health, dental, vision, and life insurance.
Well-equipped, sunny offices in Santa Clara, CA and Boston, MA.
Relocation assistance and visa sponsorship.
Perks include a daily lunch stipend, 401k match, and more.
A collaborative, continuous-learning work environment with smart, dedicated colleagues engaged in developing the next generation of architecture for high-performance computing.

The Opportunity

Impact: We are tackling a fundamental challenge at the infrastructure layer: unlocking greater AI capability while dramatically improving efficiency. The work we do here compounds across state-of-the-art AI models, systems, and real-world applications.
Timing: Joining now means real ownership of the company and meaningful influence over product direction and execution. You’ll work from first principles, move quickly from insight to execution, and see your contributions directly reflected in what we build.
Culture: You’ll work alongside a group of people who care deeply about rigor, clarity, and impact. We value thoughtful disagreement, fast learning, and intellectual fearlessness. This is a place where strong ideas shine, curiosity is encouraged, and growth is a daily practice.

Job Overview

Posted Date Dec 24, 2025

Employment Type Full-time

Experience Level Mid-Senior level

Location United State

Annual Salary 100000-150000 USD

Category Programming

Company netpreme

Low-Level Systems Engineer for Emerging ML Hardware

Key Highlights

Technical Skills Required

Benefits & Perks

Job Description

Job Overview

Mentioned Skills

Industries

Low-Level Systems Engineer for Emerging ML Hardware

Key Highlights

Technical Skills Required

Benefits & Perks

Job Description

Job Overview

Mentioned Skills

Industries

Subscribe our newsletter