Senior HPC Systems Administrator - Modeling & Simulation

asenium consulting • European Union
Remote
This Job is No Longer Active This position is no longer accepting applications
AI Summary

Seeking a Senior Simulation Engineer with HPC expertise for a 6+ month renewable remote role in Europe. Responsibilities include user support, platform maintenance, security, and incident management for CAE, CFD, and Molecular Modeling. Requires strong Linux administration and scientific application support skills.

Key Highlights
Support, maintenance, and security of HPC computing resources for modeling and simulation.
User support including scripts, compilation, and incident management.
Management of major incidents and coordination with the server team.
Comprehensive supervision of HPC platforms (excluding hardware).
User management (onboarding, departures, access rights).
Technical Skills Required
HPC system administration Linux system administration RedHat Rocky Linux Scientific application installation Scientific application maintenance Scripting (implied for user assistance)
Benefits & Perks
6+ months renewable contract
100% remote
International context (worldwide scope)

Job Description


For one of my customer we are looking for a Senior Simulation Engineer.

Duration: 6+ months renewable

Location: Europe(100% remote)

Description:

We are seeking an IT HPC expertise to ensure the support, maintenance, and security of its computing resources dedicated to modeling and simulation (CAE, CFD, Molecular Modeling) in an international context (worldwide scope).

Main objectives:

  • User support (assistance with scripts, compilation, incident management)
  • Operational maintenance and security of the platforms
  • User management (onboarding, departures, access rights management)
  • Management of major incidents and communication with the server team for hardware issues
  • Regular updates of documentation


Goals and deliverables:

  • Comprehensive supervision of HPC platforms (excluding hardware supervision, which is already handled by the server team)
  • Management of user tickets (incidents, installation requests, user management, assistance with scripts/compilation)
  • Management of major incidents (major outages, license issues, disk space management, prioritization during peak periods)
  • Keeping systems up to date
  • Keeping existing documentation up to date
  • Communication and coordination with the server team during interventions or planned downtimes


Skills:

  • HPC system administration and support
  • User support and incident management
  • Documentation and reporting
  • Linux system administration (RedHat, Rocky Linux)
  • Installation and maintenance of scientific applications

Subscribe our newsletter

New Things Will Always Update Regularly