Linux Systems Engineer - HPC/AI

aithyra research institute for biomedical artificial intelligence • Austria

Relocation

Apply

AI Summary

We are seeking a Linux Systems Engineer with experience in HPC and AI to build and operate an HPC cluster, deploying, configuring, and maintaining server hardware, and administering a large-scale Linux environment.

Key Highlights

Deploy, configure, and maintain server hardware and HPC cluster

Administer and harden a large-scale Linux environment

Configure and manage workload manager, high-performance storage solutions, and key software components

Key Responsibilities

Deploy, rack, cable, configure, and maintain server hardware, GPU nodes, networking equipment, and storage systems

Administer and harden a large-scale Linux environment (Debian/Ubuntu)

Configure and manage the workload manager (SLURM) to efficiently schedule, monitor, and manage diverse jobs including AI training and inference

Implement and optimize high-performance storage solutions (e.g., BeeGFS, Lustre) tailored for large-scale AI/HPC datasets and model training

Troubleshoot and resolve complex technical issues related to hardware, software, and networking components during the cluster build and initial operation phases

Provide technical support and guidance to scientists for running their AI workloads on the cluster, including job submission, monitoring, and basic troubleshooting

Technical Skills Required

Linux system administration Scripting (Bash, Python, Lua) Server hardware Configuration management (Ansible, Puppet, Salt) Networking fundamentals (TCP/IP, VLANs, firewalls, DNS/DHCP) High-speed networking (InfiniBand) HPC systems Cluster management tools Job schedulers (SLURM, PBS) Containers (Docker, Apptainer, Kubernetes) Parallel file systems (BeeGFS, Lustre, GPFS) GPU management (CUDA toolkits) AI frameworks (TensorFlow, PyTorch)

Benefits & Perks

Competitive salary

Support for wellbeing

Flexible working arrangements

Meal allowance

Relocation support

Nice to Have

Experience with HPC systems, cluster management tools, or job schedulers (SLURM, PBS)

Experience with containers and orchestration (e.g., Docker, Apptainer, Kubernetes)

Familiarity with parallel or network file systems (e.g., BeeGFS, Lustre, GPFS)

Exposure to GPU management, CUDA toolkits, or AI frameworks (TensorFlow, PyTorch)

Job Description

Your role

Employment Type Full Time

Application Deadline 18.05.2026

Apply now

Application details

We are seeking a talented Linux Systems Engineer – HPC/AI who will join a world-class team and help build and operate the foundational infrastructure needed to support groundbreaking research. This is a unique opportunity to be part of something from the early beginning.

As a Linux Systems Engineer with a focus on HPC/AI, you will help build and operate an HPC cluster specialized for AI workloads. This role is ideal for someone with solid Linux systems administration experience who is excited to grow into the world of High-Performance Computing and AI infrastructure. You will contribute to bringing advanced AI solutions to life, using your technical skills to support scalable, reliable, and high-performance systems for cutting-edge research.

Reporting to Stephan Stadlbauer, Head of Scientific Computing, your role combines Linux systems engineering, hardware and infrastructure support, and close collaboration with multidisciplinary teams. This position focuses on helping design, implement, and operate infrastructure for innovative AI research. If you are passionate about Linux systems, on-premises infrastructure, and want to develop further in HPC and AI, this role is an excellent opportunity.

Start date: flexible

Contract: full-time, 40h/week

Positions available: 1

Your Tasks

Deploy, rack, cable, configure, and maintain server hardware, GPU nodes, networking equipment, and storage systems in our on-premises data centers.
Administer and harden a large-scale Linux environment (Debian/Ubuntu) that forms the backbone of the HPC/AI cluster.
Assist in designing, building, and scaling our HPC cluster specifically optimized for AI workloads - learning HPC best practices along the way.
Configure and manage the workload manager (SLURM) to efficiently schedule, monitor, and manage diverse jobs including AI training and inference.
Implement and optimize high-performance storage solutions (e.g., BeeGFS, Lustre) tailored for large-scale AI/HPC datasets and model training.
Install and configure key software components, including parallel file systems, networking fabrics, and AI-specific libraries and frameworks (e.g., TensorFlow, PyTorch).
Troubleshoot and resolve complex technical issues related to hardware, software, and networking components during the cluster build and initial operation phases.
Provide technical support and guidance to scientists for running their AI workloads on the cluster, including job submission, monitoring, and basic troubleshooting.
Monitor system performance, resource utilization, and job efficiency to optimize throughput and infrastructure.
Document system design, configurations, procedures, and best practices for building and operating the AI HPC cluster.

Looking to advance your Devops career with relocation support? Explore Devops Jobs with Relocation Packages that include comprehensive packages to help you move and settle in your new role.

Your Profile

Education in Computer Science, Information Technology, or a related field (or equivalent practical experience).
Solid, hands-on experience in Linux system administration (e.g., Ubuntu, Debian, RHEL) in professional or large-scale environments.
Proficiency in scripting and automation (e.g., Bash, Python, Lua) for system management, deployment, and monitoring tasks.
Practical experience with server hardware — you are comfortable racking equipment, diagnosing hardware faults, and working in a data-center environment.
Familiarity with configuration management and automation tools (e.g., Ansible, Puppet, Salt) and a strong desire to apply automation best practices at scale.
Good understanding of networking fundamentals (TCP/IP, VLANs, firewalls, DNS/DHCP); experience with high-speed networking or InfiniBand is a plus.
Interest in or initial exposure to HPC concepts (job schedulers, parallel file systems, cluster management) — with a genuine eagerness to learn and develop deep expertise.
Interest in or initial exposure to GPU-accelerated computing and AI workloads — with a willingness to grow into this area.
Excellent problem-solving skills and a proactive, hands-on attitude towards tackling complex technical challenges in a fast-paced environment.
Ability to communicate effectively in English and collaborate with technical and research teams.

Desired Skills

Experience with HPC systems, cluster management tools, or job schedulers (SLURM, PBS).
Experience with containers and orchestration (e.g., Docker, Apptainer, Kubernetes).
Familiarity with parallel or network file systems (e.g., BeeGFS, Lustre, GPFS).
Exposure to GPU management, CUDA toolkits, or AI frameworks (TensorFlow, PyTorch).
Experience working with research scientists or in an academic environment.
Familiarity with monitoring and observability stacks (Prometheus, Grafana, CheckMK).

Discover our full range of relocation jobs with comprehensive support packages to help you relocate and settle in your new location.

What We Offer

A competitive salary (minimum gross annual salary of EUR 58000)
Support for your wellbeing, including access to a company doctor
Fresh fruits, sweet treats, and free coffee & tea are available every day
Flexible working arrangements, with the option for one home office day per week Core hours: Monday–Thursday 09:00-15:00, Friday 09:00-13:00
Meal allowance to make your day a little easier
A welcoming community with diverse social and cultural activities
Relocation support to help you settle in comfortably if you’re moving to join us

Ready to to build cutting-edge AI infrastructure from the ground up and shape its future?

Application Process

Please submit your application until: 18.05.2026

We will be in touch afterward.

CV highlighting your relevant experience and achievements
Cover letter outlining your interest in the position and how your background aligns with the job requirements
Contact details of at least two professional references
Completed application questionnaire

Apply Online: https://aithyra.onlyfy.jobs/job/ogi8bhvxj6wkhmms8o7901hlcjeu8oz

We are a curiosity-driven, globally minded organization committed to building an inclusive and flexible workplace. At AITHYRA, we believe diverse perspectives strengthen collaboration and spark innovation. We welcome applicants from all backgrounds, cultures, and experiences to help us create teams that reflect the communities our science serves. Your unique contribution matters here - come realize your full potential with us.

Should you have any questions, please contact the AITHYRA HR Team at

recruitment@aithyra.ac.at

Job Overview

Posted Date May 11, 2026

Employment Type Full-time

Experience Level Entry level

Location Austria

Annual Salary 58,000 EUR

Category Devops

Company aithyra research institute for biomedical artificial intelligence

Mentioned Skills

Similar Jobs

Explore other opportunities that match your interests

Senior Java Full-Stack Engineer

Devops

•

3d ago

Visa Sponsorship Relocation Remote

Job Type Temporary

Experience Level Mid-Senior level

DL Remote

Austria

VP of Technology

Devops

•

2w ago

Visa Sponsorship Relocation Remote

Job Type Full-time

Experience Level Executive

Van Kaizen

Austria

Senior Manager, Enterprise Integration

Devops

•

2h ago

Premium Job

•••••• •••••• ••••••

Job Type ••••••

Experience Level ••••••

versigent

United State

Linux Systems Engineer - HPC/AI

Key Highlights

Key Responsibilities

Technical Skills Required

Benefits & Perks

Nice to Have

Job Description

Job Overview

Mentioned Skills

Industries

Similar Jobs

Senior Java Full-Stack Engineer

DL Remote

VP of Technology

Van Kaizen

Senior Manager, Enterprise Integration

Premium Job

versigent

Subscribe our newsletter