Position Overview
Join the Annapurna Labs team at Amazon Web Services as an ML Kernel Performance Engineer focused on optimizing deep learning performance with AWS Neuron on custom hardware. Shape the future of AI acceleration technology while working on critical machine learning projects.
In this engineering role, you'll contribute to enhancing the performance of AWS's ML accelerators, Inferentia and Trainium. Your expertise in designing high-performance compute kernels and optimizing kernel-level performance will be essential. Collaborate with cross-functional teams and directly interface with customers to maximize their ML models’ efficiency on AWS.
Key Responsibilities:
• Design and implement high-performance compute kernels for ML operations
• Analyze and optimize kernel performance on Neuron hardware
• Conduct detailed performance analysis using profiling tools
• Implement compiler optimizations like tiling and scheduling
• Collaborate with customers to enhance their ML models...