Low-Level Kernel & CUDA Engineer

Itur AI, Israel
Remote
AI Summary

Design, develop, and optimize low-level systems software with a focus on CUDA and kernel development. Troubleshoot system-level performance issues and collaborate with the team to enhance software architecture. Work on a custom C++ driver for direct hardware and CUDA orchestration.

Key Responsibilities
Design, develop, and optimize low-level systems software
Troubleshoot system-level performance issues
Collaborate with the team to enhance software architecture
Help define best practices and ensure functionality across a variety of hardware platforms
Technical Skills Required
C++
CUDA
NVIDIA
AMD ROCm
GPU architecture
Memory-mapped I/O
Low-level system architecture
Transformer/MoE structures
Tensor libraries
Benefits & Perks
Full-time
Fully remote
Nice to Have
AI/ML workflows

Job Description


Company Description

Itur AI is building a proprietary, hardware-native acceleration stack that skips the standard runtime fluff for deterministic AI performance. You’ll be engineering the C++ driver layer that serves as our core technical moat.


The Mission


Driver Playground: Build a custom C++ driver for direct hardware and CUDA orchestration.


Role Description

This is a full-time, fully remote role for a Low-Level Kernel & CUDA Engineer. You will design, develop, and optimize low-level systems software, with a focus on CUDA and kernel development. You will troubleshoot system-level performance issues, create hardware-efficient algorithms, and collaborate with the team to enhance software architecture. You will also help define best practices and ensure functionality across a variety of hardware platforms.


Qualifications

  • Mastery of C++, GPU programming (CUDA on NVIDIA, ROCm on AMD), and GPU architecture
  • Experience in low-level kernel development and system programming
  • Serious experience with memory-mapped I/O and low-level system architecture
  • Solid grasp of Transformer/MoE structures and what it takes to deploy them at enterprise grade
  • Understanding of operating system concepts and hardware architectures
  • Ability to debug and troubleshoot complex system-level issues
  • Experience in multi-threading, parallel processing, and memory management
  • Good communication skills and ability to work collaboratively in a remote environment
  • Familiarity with AI/ML workflows, tensor libraries, or similar technologies is a plus
