Cloud Infrastructure Management: Design, implement, and manage cloud-based infrastructure on AWS and Azure, ensuring optimal scalability, performance, and security.
CI/CD Pipeline Development: Develop and maintain CI/CD pipelines using GitHub Actions for automated code deployments and testing.
System Monitoring and Incident Management
Implement and configure Datadog for comprehensive system monitoring.
Develop and maintain Datadog dashboards to visualize system performance and metrics.
Set up proactive alerts in Datadog to detect and respond to incidents swiftly, ensuring high system reliability and uptime.
Conduct root cause analysis of incidents and implement corrective actions using Datadog insights.
Collaboration with AI Teams: Work closely with AI teams to s...
Ready to Apply?
Join thousands of Americans building their careers