Senior Cluster Engineer - High-performance Computing
We usually respond within three days
The organization
Our client operates one of the largest GPU infrastructures in the world, with more than 100,000 GPUs, and that infrastructure doubles in size every year. We’re looking for engineers who love getting deep into Linux systems, pushing hardware and software to their limits, and making the world’s fastest AI and HPC workloads run even faster.
The role
You’ll join a small, senior team that works between the hardware and Linux OS layers, solving performance problems that affect tens of thousands of GPUs. This is hands-on, high-impact engineering where microsecond gains matter and every optimization is felt at global scale.
What you’ll do
Trace, profile, tune and optimize the Linux kernel and its subsystems (CPU scheduling, memory management, networking stack) for GPU clusters and InfiniBand fabrics
Troubleshoot and resolve complex performance bottlenecks
Integrate and validate new GPU hardware & infra (KVM/QEMU, PCIe devices, Kubernetes)
Improve monitoring, alerting, and automation for large-scale, distributed systems
Occasionally assist customers in optimizing workloads
Your profile
Key requirements (non-negotiable):
Solid Linux internals knowledge, with kernel tracing, profiling and tuning experience (e.g. perf, ftrace, eBPF, sysctl, kgdb)
Excellent programming skills in system-level C or C++, with a good grasp of data structures and algorithms
Experience in performance optimization (e.g. high-load/high-throughput, low-latency, low-jitter, memory bypasses, zero-copy, lock-free, synchronization across large-scale clusters)
Scripting or development skills in Go, Python, or similar
Nice-to-haves (not key):
(Large-scale) clusters (GPU or CPU)
InfiniBand or other high-performance interconnect knowledge
Virtualization stacks (KVM/QEMU), Slurm, Kubernetes
Deep learning frameworks (e.g. PyTorch, TensorFlow)
GPU-specific stack (e.g. CUDA, NCCL)
This is for you if you
Love solving deep technical challenges, care about performance down to the microsecond, and want to work on infrastructure that pushes the limits of what’s possible.
What's offered
Salary: up to 150k + 25% bonus.
Flexible working arrangements.
A dynamic and collaborative work environment that values initiative and innovation.
Location: Amsterdam or remote.
- Business unit: The Next Chapter W&S
- Locations: Europe, Amsterdam
- Remote status: Hybrid
- Is work permit / visa sponsorship offered? Yes, but only for candidates already based in Europe.
- Is remote possible? Yes, this role is open to both on-site work in the Netherlands and fully remote work.
- Is freelance possible? No, this is a permanent job with a regular contract of employment.
- Which language skills are required (professional level)? English
- Employment type: Full-time, Regular - indefinite, Regular - temporary
About The Next Chapter W&S
We focus on job opportunities in the Netherlands for IT and engineering professionals. We share relevant tips and tricks with jobseekers, and we can support employers with relocation, work permit rules, the 30% ruling and similar matters. We value transparency, honesty and a no-nonsense approach based on our extensive technical and international recruitment expertise.