About the Team:
On the ML Training and DevEx team, our mission is to provide a reliable, scalable, and easy-to-use training framework for the modeling needs of Stack AV. In addition, this team is responsible for the overall developer experience of ML engineers, including building tools for testing, validating, and understanding models and the data used to train them. Finally, this team is responsible for model optimization and deployment.
What Success Looks Like:
- Experience with both ML platforms and building ML-based applications (bonus points if you have modeling experience).
- Experience building scalable, reliable infra in a fast-paced environment.
- Ability to work across teams.
- Experience building or using ML infra built for a large number of customer teams.
- A deep understanding of design tradeoffs, and the ability to articulate those tradeoffs and work with others to reach alignment.
- Experience with building ML models or ML infra in the domains of autonomous vehicles, perception, and decision making (desirable but not required).
- Experience with model training, model optimization, or large data processing pipelines.
Preferred Experience:
- Knows how to push the GPU to its limit, from Python down to the CUDA kernel level.
- Built the inference or training loop for a large model (ideally an LLM-style model).
- Shipped ML products (NLP, computer vision, recommender systems, etc.) at scale to make a business impact.
- Knows how to build low-latency / high-throughput batch or stream processing pipelines.
- Knows how to write (readable) high-performance C++.
- Prior AV experience.