Our team brings a wealth of cutting-edge, specialized expertise in Machine Learning and Speech Technologies, which are used daily by hundreds of millions of people worldwide.
We already have several major projects underway and are looking to strengthen our team with a DevOps/SRE Engineer!
- Minimum 5 years of experience in a DevOps and/or Site Reliability Engineering role
- Strong hands-on experience with Linux system administration
- Extensive experience deploying, operating, and scaling Kubernetes in both cloud and bare-metal environments
- Deep expertise and practical experience with at least one major cloud provider (preferably Google Cloud Platform)
- Experience with ML inference on GPU/CPU is a strong plus
- Proven experience implementing SRE practices and building observability stacks using Grafana, Prometheus, and Loki
- Strong adherence to GitOps, Infrastructure as Code (IaC), and CI/CD principles
- Advanced expe...