I design, operate, and stabilize production-grade cloud platforms on AWS, Azure, and GCP. My work sits at the intersection of cloud architecture, Kubernetes operations, and reliability engineering, where the goal is not just to deploy software but to ensure systems deliver 99% availability and remain cost-optimized, secure, observable, and predictable under real-world failure conditions.

I specialize in building Kubernetes-based platforms (EKS, AKS, GKE) using Terraform as the infrastructure control plane and CI/CD pipelines as the delivery backbone. I am deeply involved in decisions around VPC design, IAM boundaries, cluster architecture, deployment strategies, and observability, with a strong focus on limiting blast radius and reducing operational risk.

A core part of my role is owning production reliability. I regularly troubleshoot incidents across AWS, Azure, and GCP infrastructure, Kubernetes clusters, and deployment pipelines, perform root cause analysis, and implement long-term fixes that eliminate entire classes of failure rather than treating symptoms.

I believe strong platforms are built by:
• Designing for failure, not assuming success
• Removing manual steps from critical paths
• Using Infrastructure as Code as a safety mechanism
• Treating observability as a first-class system component

Over time, my work has helped teams ship faster with fewer incidents, reduced manual operational effort through automation, and improved confidence in production deployments.

I am looking for Senior / Lead Cloud Platform or SRE roles where I can take ownership of platform reliability, influence cloud architecture decisions, mentor engineers, and help organizations operate large-scale distributed systems with confidence.
Member Since
April 23, 2026
Last Active
a month ago