Responsible for the R&D and optimization of online inference services for deep learning models in large-scale sparse feature scenarios, supporting high-efficiency inference needs across Shopee's various business lines.
Conduct in-depth research into various inference acceleration algorithms to reduce the computational cost of model deployment.
Collaborate across the business pipeline to tune the end-to-end online service system, ensuring high availability and stability.
Research and implement efficient inference solutions that combine Large Language Models (LLMs) with Search, Ads, and Recommendation (GR).
Requirements:
Bachelor's degree or above in Computer Science, Electronics, Automation, Software Engineering, or related fields, with at least 2 years of relevant work experience.
Expertise in C++ programming with a solid foundation in low-level systems prof...
Ready to Apply?
Join thousands of Americans building their careers