Lead high-stakes cross-functional programs to align engineering teams with the commercial autonomy roadmap. Design operational frameworks and technical roadmaps to accelerate the development of autonomous trucking systems.
Stack AV
14 Remote Job Openings at Stack AV
Ensure the health, reliability, and scalability of production systems through observability and automation. Design scalable distributed systems and implement an incident management framework to maximize uptime.
Ensure the reliability, scalability, and performance of the compute platform powering large-scale autonomous systems. Orchestrate batch workloads across Kubernetes clusters and maintain production service SLOs through automation and observability.
Design and operate high-scale distributed storage systems to support large-scale batch workloads and data platform components. Collaborate across teams to optimize storage resource utilization and improve platform reliability and fault tolerance.
Design and operate high-scale distributed systems for scheduling and executing batch workloads across Kubernetes clusters. Optimize compute resource utilization and improve the reliability and fault tolerance of platform components.
Senior Software Engineer, Machine Learning Inference Platform
Stack AV · Full Time · 9 days ago
Design and deliver high-throughput, low-latency subsystems for a multi-tenant ML inference platform. Develop API layers, optimize GPU performance, and build observability tools to monitor system economics and utilization.
Staff Software Engineer, Machine Learning Inference Platform
Stack AV · Full Time · 9 days ago
Define and drive the architecture for a high-throughput, multi-tenant ML inference platform including the control plane and API layers. Optimize system performance across the stack and establish observability and SLOs for GPU utilization and cost accounting.
Develop and maintain threat detection capabilities and lead security investigations and incident response efforts. Secure Stack AV's infrastructure, data, and users across cloud, on-prem, and remote environments.
Orchestrate safe and compliant freight movements while leading the implementation and execution of commercial pilots. Act as the primary operational contact for customers and partner with engineering teams to improve internal dispatching and tracking tools.
Develop foundational ML architecture and systematic approaches to solve complex perception problems for autonomous trucking. Collaborate with AI teams to prototype and implement core components that ensure the system scales robustly.
Drive the delivery of classical and machine learning solutions for real-time tracking of actors and obstacles in self-driving systems. Lead the design of onboard algorithms and contribute to the long-term technical vision of the perception organization.
Analyze and optimize machine learning models to resolve performance bottlenecks and improve training system scalability. Collaborate with researchers to streamline model deployment across hardware platforms while promoting engineering excellence.
Design and build the next generation of data infrastructure, focusing on low-latency, high-throughput, fault-tolerant batch or stream processing systems. Build scalable backend services for data search and curation systems while writing high-quality Python and SQL.
Build multimodal data mining and semantic search solutions to support AV product development. Develop infrastructure for real-time querying and batch/stream processing.