Position Overview
Role Overview
We’re modernising our infrastructure by transitioning core components to managed services and significantly improving cluster reliability, deployment automation, and observability.
This role is central to scaling infrastructure and enhancing system reliability across services. If you enjoy working on high-impact systems and solving real-world reliability challenges, this role is for you.
⚙️ What You’ll Own
Manage infrastructure upgrades and system improvements
Maintain and optimise production Kubernetes environments
Improve deployment pipelines and release processes
Drive capacity planning and autoscaling strategies
Handle production incidents and conduct RCA (Root Cause Analysis)
Build and maintain infrastructure automation scripts
Strengthen performance monitoring and alerting systems
What We’re Looking For
2-5 years of experience in Dev Ops / Platform Reliability roles.
Strong Experience In:
Kubernetes
CI/CD pipelines
Clo...