Scale the operational maturity of cloud infrastructure by implementing automation, tooling, and clear support processes. Improve platform reliability, observability, and incident response while bridging the gap between technical teams and customer support.
NexGen Cloud
6 Remote Job Openings at NexGen Cloud
Own the end-to-end architecture cycle for large-scale NVIDIA GPU clusters, from initial customer requirements to production deployment. Provide technical oversight for hardware validation, performance testing, and integration while acting as a senior technical leader across engineering teams.
You will own the design, implementation, and evolution of core MLOps systems to ensure reliable, scalable, and repeatable production AI workloads. This includes providing technical leadership, defining operational standards, and managing the infrastructure that powers model training and inference.
Senior Infrastructure Engineer (OpenStack) - Europe
NexGen Cloud · Full Time · 2 months ago
You will own the design, deployment, and operation of global OpenStack and Kubernetes environments to ensure platform scalability for GPU workloads. Additionally, you will drive infrastructure automation, implement security controls, and lead incident response to maintain high system reliability.
Senior Infrastructure Engineer (OpenStack) - Australia
NexGen Cloud · Full Time · 2 months ago
The Senior Infrastructure Engineer will own the design, deployment, and operation of OpenStack and Kubernetes environments to support high-performance GPU workloads. They will also drive infrastructure automation, ensure platform reliability, and implement robust security controls across the stack.
The Senior Software Engineer will own the design, implementation, and evolution of complex product areas and backend systems for the AI Studio platform, building robust user-facing features, APIs, and backend services that orchestrate GenAI workflows at scale. Key duties include leading technical design, ensuring code quality, driving performance improvements, and collaborating closely with Product and Design teams.