Verda

ML Infrastructure Engineer, Forward-Deployed

Posted 2 months ago

Finland

⭐ 5-10 years experience

AI Summary

Collaborate with strategic GPU customers to optimize training and inference workloads on the Verda platform. Contribute to the development of internal ML platform features on Kubernetes, including job scheduling and workflow orchestration.

Imagine a future where anyone can train and run large-scale AI workloads instantly - without worrying about infrastructure bottlenecks.

At Verda, we’re building a fully featured European cloud computing platform designed for high-performance AI workloads. Our mission is to make powerful compute accessible, scalable, and efficient for the teams building the future of AI.

We’re ambitious, curious, and pragmatic builders. We operate with low hierarchy, high ownership, and a strong bias for action. We’ve already achieved a lot, but we’re just getting started.

Now it’s your chance to join the ride. Join Verda while it’s still being built - not once it’s finished!

Your responsibilities

In this role, you will work closely with strategic GPU customers, embedding directly with their teams to help get training and inference workloads running efficiently on Verda. You will collaborate with ML engineers and researchers to troubleshoot and optimize their workloads, and guide them in getting the most out of our infrastructure.

At the same time, you will contribute to building and improving our internal ML platform on Kubernetes, including job scheduling, workflow orchestration, and training infrastructure. You will also help evolve our inference stack, working on model packaging, serving frameworks, and performance optimization.

A key part of your role will be translating customer needs into scalable platform features, helping prioritize what we build to serve the broadest set of users. You will work closely with infrastructure and engineering teams to continuously improve performance, reliability, and developer experience across our platform.

Your key competencies

Strong ML engineering background with hands-on experience training, fine-tuning, or optimizing models at scale
Proficiency with PyTorch (JAX is a plus)
Experience with software or infrastructure engineering, including CI/CD or GitOps workflows
Strong programming skills in Python (additional languages such as Rust are a plus)
Comfortable working in Linux environments, including debugging GPU performance issues (CUDA, drivers, networking, filesystems)
Experience working directly with customers or stakeholders, with the ability to guide, collaborate, and challenge when needed
Ability and willingness to travel to customer sites when needed

Nice to have

Experience with Kubernetes (operators, CRDs, job scheduling, GPU scheduling)
Familiarity with systems such as Kueue, Flyte, Ray, or Slurm
Experience deploying inference workloads using vLLM, SGLang, TensorRT-LLM, or Triton
Knowledge of GPU networking and performance tuning (e.g., InfiniBand, NVLink, NCCL)
Research background (PhD or equivalent)
Experience in forward-deployed, solutions engineering, or consulting roles

Why Verda

Cash + equity compensation along with various fringe benefits
Profitable operations with rapid, sustained growth
31 nationalities, 6 of them represented on the management team
An opportunity to work at the intersection of infrastructure and cutting-edge AI workloads, collaborating directly with leading ML teams

Practicalities

Location: Helsinki (hybrid) or remote in Europe

Employment type: Full-time and permanent

What's next

We’re building fast, and this role needs the right person behind it. There’s no artificial deadline, but when we find who we’re looking for, we move.

If this sounds like your next move, apply now.

Please submit your application through our Careers page. We don’t accept applications sent by email.
