Genesis

Member of Technical Staff, Training (Bay Area, Remote)

Posted 2 months ago

United States

⭐ 10+ years experience

Apply Now

Please mention DailyRemote when applying

AI Summary

Optimize the foundation model training stack by eliminating bottlenecks and designing scalable distributed training systems on multi-node GPU clusters. Develop low-level CUDA kernels and monitoring tools to improve hardware efficiency and diagnose performance regressions.

What You’ll Do

Drive down wall-clock time to convergence by profiling and eliminating bottlenecks across the foundation model training stack stack, from data pipelines to GPU kernels
Design, build, and optimize distributed training systems (PyTorch) for multi-node GPU clusters, ensuring scalability, robustness, and high utilization
Implement efficient low-level code (CUDA, cuDNN, Triton, custom kernels) and integrate it seamlessly into high-level training frameworks
Optimize workloads for hardware efficiency: CPU/GPU compute balance, memory management, data throughput, and networking
Develop monitoring and debugging tools for large-scale runs, enabling rapid diagnosis of performance regressions and failures

What You’ll Bring

Deep experience in distributed systems, ML infrastructure, or high-performance computing (8+ years)
Production-grade expertise in Python
Low-level performance mastery: CUDA/cuDNN/Triton, CPU–GPU interactions, data movement, and kernel optimization
Scaling at the frontier: experience with PyTorch and training jobs using data, context, pipeline, and model parallelism
System-level mindset with a track record of tuning hardware–software interactions for maximum utilization

Automatically Apply to the Best Remote Jobs

Stop the endless job search. Our AI finds and applies to the best jobs for you.

Try it Now

Genesis

🧑‍💻 Employees 1,001-5,000 employees 🏢 Industry Technology, Information and Internet

View More Jobs From Genesis

Genesis

Member of Technical Staff, Training (Bay Area, Remote)

AI Summary

What You’ll Do

What You’ll Bring

Automatically Apply to the Best Remote Jobs

Ace Your Job Interview

How to Answer "How Do You Handle Criticism"?

How to Answer "Tell Me About Yourself?" in an Interview

How to Answer "What is your Experience with Customer Service?"

How to Answer "Describe Your Experience Working With Diverse Teams Or Different Cultures?"

How to Answer The Interview Question "What Sets You Apart From Other Candidates?"

How to Answer "Why Are You The Best Person For This Job"?

How to Answer "Tell Me About A Time When You Had To Balance Competing Priorities?"

How to Answer "Why Should We Hire You?"

How to Answer "What Areas Need Improvement?"

How to Answer "Tell Me About A Time When You Had To Balance Competing Priorities?"

How to Answer "Tell Me About a Time You Received Constructive Feedback"

How to Answer "What Is Your Greatest Accomplishment?"

Similar Jobs

Junior Crypto Trader (Remote)

Mid/Senior AI Cinematic Video Editor (Full Remote - Worldwide)

Manager, Advocacy Engagement and Mobilization

Full Time Remote BCBA (With New York LBA & Medicaid Credentialed)

Manager,Field Training Remote-100% Travel

Head of Engineering

Genesis

Member of Technical Staff, Training (Bay Area, Remote)

AI Summary

What You’ll Do

What You’ll Bring

Automatically Apply to the Best Remote Jobs

Share This Job:

Similar Jobs

Junior Crypto Trader (Remote)

Mid/Senior AI Cinematic Video Editor (Full Remote - Worldwide)

Manager, Advocacy Engagement and Mobilization

Full Time Remote BCBA (With New York LBA & Medicaid Credentialed)

Manager,Field Training Remote-100% Travel

Head of Engineering

Personalize your Remote Job Search in 3 Easy Steps!