Genesis

Member of Technical Staff, Inference (Bay Area, Remote)

Posted 2 months ago

United States

⭐ 10+ years experience

Apply Now

Please mention DailyRemote when applying

AI Summary

Build and optimize low-latency inference pipelines for on-device robotics and distributed GPU clusters. Develop low-level kernels and monitoring tools to ensure high throughput, reliability, and efficient resource utilization.

What You’ll Do

Build low-latency inference pipelines for on-device deployment, enabling real-time next-token and diffusion-based control loops in robotics
Design and optimize distributed inference systems on GPU clusters, pushing throughput with large-batch serving and efficient resource utilization
Implement efficient low-level code (CUDA, Triton, custom kernels) and integrate it seamlessly into high-level frameworks
Optimize workloads for both throughput (batching, scheduling, quantization) and latency (caching, memory management, graph compilation)
Develop monitoring and debugging tools to guarantee reliability, determinism, and rapid diagnosis of regressions across both stacks

What You’ll Bring

Deep experience in distributed systems, ML infrastructure, or high-performance serving (8+ years)
Production-grade expertise in Python, with strong background in systems languages (C++/Rust/Go)
Low-level performance mastery: CUDA, Triton, kernel optimization, quantization, memory and compute scheduling
Proven track record scaling inference workloads in both throughput-oriented cluster environments and latency-critical on-device deployments
System-level mindset with a history of tuning hardware–software interactions for maximum efficiency, throughput, and responsiveness

Automatically Apply to the Best Remote Jobs

Stop the endless job search. Our AI finds and applies to the best jobs for you.

Try it Now

Genesis

🧑‍💻 Employees 1,001-5,000 employees 🏢 Industry Technology, Information and Internet

View More Jobs From Genesis

Genesis

Member of Technical Staff, Inference (Bay Area, Remote)

AI Summary

What You’ll Do

What You’ll Bring

Automatically Apply to the Best Remote Jobs

Ace Your Job Interview

How to Answer "How Do You Handle Criticism"?

How to Answer "Tell Me About Yourself?" in an Interview

How to Answer "What is your Experience with Customer Service?"

How to Answer "Describe Your Experience Working With Diverse Teams Or Different Cultures?"

How to Answer The Interview Question "What Sets You Apart From Other Candidates?"

How to Answer "Why Are You The Best Person For This Job"?

How to Answer "Tell Me About A Time When You Had To Balance Competing Priorities?"

How to Answer "Why Should We Hire You?"

How to Answer "What Areas Need Improvement?"

How to Answer "Tell Me About A Time When You Had To Balance Competing Priorities?"

How to Answer "Tell Me About a Time You Received Constructive Feedback"

How to Answer "What Is Your Greatest Accomplishment?"

Similar Jobs

Junior Crypto Trader (Remote)

Mid/Senior AI Cinematic Video Editor (Full Remote - Worldwide)

Tagalog/Filipino Remote Interpreter

Senior Environmental Scientist

Director, Cybersecurity

Strategic Initiatives Research Analyst

Genesis

Member of Technical Staff, Inference (Bay Area, Remote)

AI Summary

What You’ll Do

What You’ll Bring

Automatically Apply to the Best Remote Jobs

Share This Job:

Similar Jobs

Junior Crypto Trader (Remote)

Mid/Senior AI Cinematic Video Editor (Full Remote - Worldwide)

Tagalog/Filipino Remote Interpreter

Senior Environmental Scientist

Director, Cybersecurity

Strategic Initiatives Research Analyst

Personalize your Remote Job Search in 3 Easy Steps!