Position Overview
Drive innovation in AI as a Deep Learning Optimization Engineer at RedHat. Focus on enhancing open-source LLM technologies and streamlining GenAI performance across enterprises.
In this senior position, you will collaborate with the RedHat AI Inference team to develop cutting-edge optimization algorithms for deep learning applications. Your role will emphasize collaborating with research scientists and mentoring junior engineers, while supporting impactful open-source projects that shape the AI landscape.
Key Responsibilities:
β’ Design and implement inference optimization algorithms
β’ Enhance model compression using quantization methods
β’ Profile LLM end-to-end performance for optimization
β’ Collaborate to translate experimental solutions into production
β’ Guide teams and participate in open-source contributions
Requirements:
β’ Deep understanding of machine learning fundamentals
β’ Experience with PyTorch a...