Position Overview
Join a leading technology firm in Toronto as a Senior Site Reliability and Platform Engineer. Apply your expertise in Kubernetes, cloud architecture, and observability to improve platform reliability and performance.
In this full-time role, you will engage in Site Reliability Engineering practices to build robust, fault-tolerant systems. The position demands 7–10+ years of experience in Platform and Site Reliability Engineering, with a strong focus on managing multi-cloud ecosystems, particularly AWS and GCP. Key areas include Kubernetes management, Infrastructure as Code using Terraform, and advanced deployment strategies.
Key Responsibilities:
• Design and manage production Kubernetes clusters across clouds
• Implement Infrastructure as Code with standardized Terraform scripts
• Maintain GitOps practices using ArgoCD for environment management
• Architect complex cloud networking solutions and enforce security policies
• Implement observability monito...