Design and develop high-performance compute cluster configurations optimized for performance and reliability. Collaborate with cross-functional teams for system integration and problem-solving. Requires expertise in HPC environments, InfiniBand architectures, and DevOps automation.
Job Description
We are seeking a Compute Cluster / HPC Engineer to join our project in Singapore. The role requires relocation.
We seek someone who can design and develop high-performance compute cluster configurations optimized for performance, reliability, and scalability within the client's systems. The role involves selecting, integrating, and validating hardware components such as CPUs, memory, storage, networking, and specialized accelerators, with a strong focus on InfiniBand-based architectures. You will collaborate closely with hardware, software, and systems engineering teams to ensure seamless system integration, participate in design reviews and integration planning, and contribute to cross-functional problem-solving efforts.
The position also requires DevOps and Ansible knowledge for automation. You will document hardware design decisions, integration procedures, and diagnostic workflows, and support rack-level design considerations including power, cooling, and cabling.
Required Skills & Qualifications
The ideal candidate has solid exposure to HPC environments running SUSE Linux, with hands-on experience in Linux system administration and OS customization. You should be familiar with InfiniBand fundamentals, common IB tools, and troubleshooting practices (candidates are expected to specify the tools they have used), as well as an understanding of CMU and golden image concepts. Experience with rack design fundamentals, FRU replacement or qualification processes, and system-level performance tuning is required, along with a strong understanding of hardware–software interaction. Excellent documentation and communication skills are essential to effectively support internal teams and cross-functional collaboration.