As a Site Reliability Engineer reporting to Director, System Operations, you'll play a critical role in the delivery, integration, and support of complex, distributed, high-availability solutions for financial institutions and organizations around the globe.
You'll thrive in this position if you're analytical, collaborative, and passionate about building resilient, high-performance systems in a fast-paced, client-focused environment.
Key Responsibilities
Design, implement, and maintain cloud infrastructure and automation tooling to ensure platform reliability, scalability, and high availability.Monitor system performance, respond to incidents, and lead root cause analysis to drive continuous improvement across distributed environments.Partner with engineering and product teams to embed reliability and DevOps best practices throughout the software development lifecycle.Leverage AI tooling and automation to reduce manual effort, ...