The Next Chapter W&S

Senior Performance & Infrastructure Engineer - HPC

Posted 3 months ago

Netherlands

⭐ 5-10 years experience

Apply Now

Please mention DailyRemote when applying

AI Summary

You will trace, profile, and optimize Linux kernel subsystems for massive GPU clusters and InfiniBand fabrics. Additionally, you will troubleshoot performance bottlenecks and integrate new hardware into the existing distributed infrastructure.

The organization

Our client operates one of the largest GPU infrastructures in the world — 100,000+ GPUs. Their infrastructure doubles in size every year. We’re looking for engineers who love getting deep into Linux systems, pushing hardware and software to their limits, and making the world’s fastest AI and HPC workloads run even faster

The role

You’ll join a small, senior team that works between the hardware and Linux OS layers, solving performance problems that affect tens of thousands of GPUs. This is hands-on, high-impact engineering where microsecond gains matter and every optimization is felt at global scale.

What you’ll do

Trace, profile, tune and optimize Linux kernel & subsystems (CPU scheduling, memory management, networking stack) for GPU clusters and InfiniBand fabrics
Troubleshoot and resolve complex performance bottlenecks
Integrate and validate new GPU hardware & infra (KVM/QEMU, PCIe devices, Kubernetes)
Improve monitoring, alerting, and automation for large-scale, distributed systems
Occasionally assist customers in optimizing workloads

Your profile

Key requirements (non-negotiable):

Solid Linux internals knowledge, with kernel tracing, profiling and tuning experience (eg. perf, ftrace, eBPF, sysctl, kgdb etc.)
Excellent programming skills, C or C++ system-level code, with a good grasp of data structures & algorithms
Experience in performance optimization (eg. high-load/high-throughput, low-latency, low-jitter, memory bypasses, zero-copy, lock-free, synchronization across large-scale clusters etc.)
Scripting or development skills in Go, Python, or similar

Nice-to-haves (not key):

Large-scale clusters (GPU or CPU)
Virtualization stacks (KVM/QEMU), Slurm, Kubernetes
Deep learning frameworks (eg. PyTortch, Tensorflow...)
GPU-specific stack (eg. CUDA, NCCL....)

This is for you if you

Love solving deep technical challenges, care about performance downto the microsecond, and want to work on infrastructure that pushes the limits of what’s possible.

What's offered

Salary: up to 160k + 25% bonus.
Flexible working arrangements.
A dynamic and collaborative work environment that values initiative and innovation.
Location: Amsterdam or full-remote from anywhere within the EU/EER

Automatically Apply to the Best Remote Jobs

Stop the endless job search. Our AI finds and applies to the best jobs for you.

Try it Now

The Next Chapter W&S

🧑‍💻 Employees 2-10 employees 🏢 Industry Staffing and Recruiting

View More Jobs From The Next Chapter W&S

The Next Chapter W&S

Senior Performance & Infrastructure Engineer - HPC

AI Summary

The organization

The role

What you’ll do

Your profile

This is for you if you

What's offered

Automatically Apply to the Best Remote Jobs

Ace Your Job Interview

How to Answer "How Do You Handle Criticism"?

How to Answer "Tell Me About Yourself?" in an Interview

How to Answer "What is your Experience with Customer Service?"

How to Answer "Describe Your Experience Working With Diverse Teams Or Different Cultures?"

How to Answer The Interview Question "What Sets You Apart From Other Candidates?"

How to Answer "Why Are You The Best Person For This Job"?

How to Answer "Tell Me About A Time When You Had To Balance Competing Priorities?"

How to Answer "Why Should We Hire You?"

How to Answer "What Areas Need Improvement?"

How to Answer "Tell Me About A Time When You Had To Balance Competing Priorities?"

How to Answer "Tell Me About a Time You Received Constructive Feedback"

How to Answer "What Is Your Greatest Accomplishment?"

Similar Jobs

Generative AI Analyst | German (Germany)

Generative AI Analyst | Spanish (Argentina)

Generative AI Analyst | French (France)

Generative AI Analyst | Italian (Italy)

Generative AI Analyst | Portuguese (Brazil)

Go-to-Market Solution Architect – Post Sale

The Next Chapter W&S

Senior Performance & Infrastructure Engineer - HPC

AI Summary

The organization

The role

What you’ll do

Your profile

This is for you if you

What's offered

Automatically Apply to the Best Remote Jobs

Share This Job:

Similar Jobs

Generative AI Analyst | German (Germany)

Generative AI Analyst | Spanish (Argentina)

Generative AI Analyst | French (France)

Generative AI Analyst | Italian (Italy)

Generative AI Analyst | Portuguese (Brazil)

Go-to-Market Solution Architect – Post Sale

Personalize your Remote Job Search in 3 Easy Steps!