You will be responsible for the secure, scalable, and observable operation of the AI platform using Kubernetes and IaC principles. Additionally, you will collaborate with AI engineers to manage model serving infrastructure, including GPU scheduling and performance monitoring.