Position Overview
Design, deploy, and manage Kubernetes clusters at scale across multiple production sites using VMware Tanzu Kubernetes Grid (TKG) and VMware Telco Cloud Automation (TCA). Operate and maintain VMware-based infrastructure including vSphere, VCF, NSX‑T, TCA, and TKG/VKS. Manage cluster lifecycle activities including upgrades, patching, capacity planning, and security hardening. Contribute to platform automation, monitoring, observability, and disaster recovery practices. Troubleshoot complex production issues spanning Kubernetes, networking, storage, and underlying infrastructure. Configure and maintain ingress, load balancing (AVI/AKO), and service mesh solutions. Implement and maintain GitOps‑based deployment pipelines and Infrastructure as Code. Work closely with application teams to onboard workloads and improve developer experience. Document architectures, runbooks, and operational procedures for knowledge sharing across teams. Collaborate with cross‑functional teams across networkin...