👉 GCP HPC DevOps Engineer
DevOps

5 907 - 7 759 USD/month, net per month - B2B
Type of work
Full-time
Experience
Senior
Employment Type
B2B
Operating mode
Remote
Xebia sp. z o.o.

Xebia is a global group of companies made up of more than 5,000 experts. For 20 years we have been advising and building tailor-made IT solutions for clients around the world. We share a drive for continuous growth. That is why we take on every challenge, even the most difficult ones.

Tech stack

    GCP: advanced

    Python: regular

    Ansible: regular

    Bash: regular

    Terraform: regular

    High-Performance Computing: regular

Job description

Online interview

🟣 You will be:


  • leading the migration of on-premises SLURM-based HPC (High-Performance Computing) clusters to Google Cloud Platform,

  • designing, implementing, and managing scalable and secure HPC infrastructure solutions on GCP,

  • optimizing SLURM configurations and workflows to ensure efficient use of cloud resources,

  • managing and optimizing HPC environments, focusing on workload scheduling, job efficiency, and scaling SLURM clusters,

  • automating cluster deployment, configuration, and maintenance tasks using scripting languages (Python, Bash) and automation tools (Ansible, Terraform),

  • integrating HPC software stacks using tools like Spack for dependency management and easy installation of HPC libraries and applications,

  • deploying, managing, and troubleshooting applications using MPI, OpenMP, and other parallel computing frameworks on GCP instances,

  • collaborating with engineering, support teams, and stakeholders to ensure smooth migration and ongoing operation of HPC workloads,

  • providing expert-level support for performance tuning, job scheduling, and cluster resource optimization,

  • staying current with emerging HPC technologies and GCP services to continually improve HPC cluster performance and cost efficiency.


🟣 Your profile:

  • 5+ years of experience with HPC (High-Performance Computing) environments, including SLURM workload manager, MPI, and other HPC-related software,

  • extensive hands-on experience managing Linux-based systems, including performance tuning and troubleshooting in an HPC context,

  • proven experience migrating and managing SLURM clusters in cloud environments, preferably GCP,

  • proficiency with automation tools such as Ansible and Terraform for cluster deployment and management,

  • experience with Spack for managing and deploying HPC software stacks,

  • strong scripting skills in Python, Bash, or similar languages for automating cluster operations,

  • in-depth knowledge of GCP services relevant to HPC, such as Compute Engine (GCE), Cloud Storage, and VPC networking,

  • strong problem-solving skills with a focus on optimizing HPC workloads and resource utilization.



Work from the European Union region and a work permit are required.

Candidates must have an active VAT status in the EU VIES registry: https://ec.europa.eu/taxation_customs/vies/



🟣 Nice to have:

  • Google Cloud Professional DevOps Engineer or similar GCP certifications,

  • familiarity with GCP’s HPC-specific offerings, such as Preemptible VMs, HPC VM images, and other cost-optimization strategies,

  • experience with performance profiling and debugging tools for HPC applications,

  • advanced knowledge of HPC data management strategies, including parallel file systems and data transfer tools,

  • understanding of container technologies (e.g., Singularity, Docker) specifically within HPC contexts,

  • experience with Spark or other big data tools in an HPC environment.



🟣 Recruitment Process:

CV review – HR call – Interview – Client Interview – Hiring Manager Interview – Decision



🎁 Benefits 🎁


Development:

  1. development budgets of up to 6,800 PLN,

  2. funded certifications, e.g. AWS, Azure,

  3. access to Udemy, O'Reilly (formerly Safari Books Online) and more,

  4. events and technology conferences,

  5. technology Guilds,

  6. internal training,

  7. Xebia Upskill.


🩺 We take care of your health:

  1. private medical healthcare,

  2. subsidised MultiSport card,

  3. mental health support.


🤸‍♂️ We are flexible:

  1. B2B or employment contract,

  2. contract for an indefinite period.
