Position Overview
Lead advancements in AI as a Senior Performance and Reliability Engineer. Focus on benchmarking and optimizing hardware/software systems to enhance power management and reliability.
In this role, you will contribute to the development of the next-generation AI architecture. Characterize advanced ML hardware performance, while collaborating closely with ML engineers and researchers to drive impactful system-level improvements. Your software solutions will be essential in enhancing reliability and performance across innovative applications.
Key Responsibilities: β’ Characterize and optimize advanced ML systems β’ Analyze workloads for performance and power impacts β’ Develop solutions for enhanced software reliability β’ Influence AI architecture design through analysis β’ Collaborate with cross-disciplinary engineering teams
Requirements: β’ BS, MS, or PhD degree in a related field β’ 3+ years in performance engineering/optimization β’ Skilled in Python and C/C++ prog...