Position Overview
Elevate AI inference systems as a Performance Engineer, specializing in model evaluations and optimization on wafer-scale technology. Engage with the latest innovations to implement enhancements for greater efficiency.
This position focuses on bringing state-of-the-art AI models to production through rigorous validation and architectural prototyping. You will develop automation solutions for experimentation and collaborate with cross-functional teams to push the boundaries of AI technology. This is a unique chance to work at the intersection of software and hardware.
Key Responsibilities: β’ Prototype and benchmark novel AI methodologies β’ Design automation for streamlined experimental workflows β’ Collaborate with silicon, runtime, and compiler teams β’ Assess and optimize newly released models
Requirements: β’ 3+ years in high-performance ML or systems engineering β’ Strong grasp of Transformer mathematics and methodologies β’ Skilled with AI toolchains and profi...