Yotta Labs

Research Engineer - AI Systems

Posted 23 days ago

United States

⭐ 5-10 years experience

Apply Now

Please mention DailyRemote when applying

AI Summary

Design and implement high-performance kernels for AI operations across NVIDIA, AMD, and AWS Trainium hardware. Develop scalable distributed training and inference solutions to optimize the performance of Large Language Model infrastructure.

Location: Remote (Global)

Type: Full-time

Company: Yotta Labs

Apply: careers@yottalabs.ai

🧠 About Yotta Labs

Yotta Labs is building the next generation multi-silicon AI cloud and runtime platform. We enable training and inference across NVIDIA GPUs, AMD GPUs, and AWS Trainium, helping AI companies achieve the best performance and economics across heterogeneous hardware.

🛠️ Role Overview

We are seeking a highly motivated AI Systems Research Engineer specializing in Trainium, GPU kernels, and LLM systems optimization. You will work at the intersection of AI Systems, Compiler and Runtime Optimization, Distributed Training & Inference, GPU/Accelerator Kernel Development, and Large Language Model Infrastructure. Your work will directly impact the scalability and performance of AI applications deployed on our platform.

🎯 Responsibilities

Design and implement high-performance kernels for Attention, MoE, GEMM, collective communication, and quantization.
Optimize kernels for NVIDIA, AMD, and AWS Trainium.
Develop custom operators and graph optimizations using Neuron SDK, PyTorch/XLA, Torch Dynamo, and Neuron Compiler.
Improve performance of vLLM, SGLang, TensorRT-LLM, and custom inference runtimes.
Design scalable distributed training and inference solutions across thousands of accelerators.
Contribute to open-source projects, publish technical findings and engage with the developer community.

✅ Qualifications

Proficiency in AI programming languages such as Python and C++.
Deep understanding of GPU architecture and performance optimization.
Experience with CUDA, Triton, ROCm/HIP, or AWS Neuron.
Strong understanding of AI frameworks (e.g., PyTorch, Dynamo, LMCache), model architectures and profiling tools (e.g. Nsight, ROCm Profiler, or Neuron Profiler).
Strong problem-solving skills and the ability to work in a collaborative, remote environment.
A background in computer science, engineering, or a related field is preferred.

🌟 Preferred Experience

Contributions to open-source AI infra projects like vLLM, SGLang, PyTorch, or Triton.
Experience with with FlashAttention, PagedAttention, MoE, RLHF, or distributed AI systems.
Publications in top-tier conferences like MLSys, OSDI, SOSP, NSDI, SC, HPCA, or ISCA

🌐 Why Join Yotta Labs?

Be part of a visionary team aiming to redefine AI infrastructure and influence the future of multi-silicon AI computing.
Work on cutting-edge technologies that solves frontier AI infrastructure problems.
Collaborate with experts from leading institutions and tech companies.
Competitive compensation with equity. Enjoy a flexible, remote work environment that values innovation and autonomy.

📩 How to Apply

Interested candidates should apply directly or send their resume and a brief cover letter to careers@yottalabs.ai. Please include links to any relevant projects or contributions.

Automatically Apply to the Best Remote Jobs

Stop the endless job search. Our AI finds and applies to the best jobs for you.

Try it Now

Yotta Labs

Research Engineer - AI Systems

AI Summary

Automatically Apply to the Best Remote Jobs

Ace Your Job Interview

How to Answer "How Do You Handle Criticism"?

How to Answer "Tell Me About Yourself?" in an Interview

How to Answer "What is your Experience with Customer Service?"

How to Answer "Describe Your Experience Working With Diverse Teams Or Different Cultures?"

How to Answer The Interview Question "What Sets You Apart From Other Candidates?"

How to Answer "Why Are You The Best Person For This Job"?

How to Answer "Tell Me About A Time When You Had To Balance Competing Priorities?"

How to Answer "Why Should We Hire You?"

How to Answer "What Areas Need Improvement?"

How to Answer "Tell Me About A Time When You Had To Balance Competing Priorities?"

How to Answer "Tell Me About a Time You Received Constructive Feedback"

How to Answer "What Is Your Greatest Accomplishment?"

Similar Jobs

Senior Technical Product Manager - AI Innovation, Remote

Staff AI Engineer | Colombia | English C1

[Job - 29879] Senior Mobile Developer, Colombia

Manager, Business/Data Analyst

Product Support Engineer - EMEA

QA Engineer - AI Native

Yotta Labs

Research Engineer - AI Systems

AI Summary

Automatically Apply to the Best Remote Jobs

Share This Job:

Similar Jobs

Senior Technical Product Manager - AI Innovation, Remote

Staff AI Engineer | Colombia | English C1

[Job - 29879] Senior Mobile Developer, Colombia

Manager, Business/Data Analyst

Product Support Engineer - EMEA

QA Engineer - AI Native

Personalize your Remote Job Search in 3 Easy Steps!