We build the core machine translation models that power Localization across Services in an efficient and scalable manner. We work on a wide spectrum of approaches, including agentic workflows, foundation modeling, deep learning, model compression, transfer learning, federated learning, and more. We also build the systems that power Apple Music lyrics translations and lyrics transliterations (phonetic pronunciation). This position calls for a wide range of skills and a drive to innovate. This role offers a unique chance to work at the forefront of machine learning, where Localization meets state-of-the-art software development.
IN THIS ROLE, YOU WILL:
- Design and implement sophisticated machine translation models to optimize user experiences across Apple's ecosystem, including Apple Music, the App Store, Subscription services, and marketing campaigns.
- Lead research and development initiatives, including LLM fine-tuning and the exploration of emerging AI technologies such as agentic workflows and RAG techniques.
- Collaborate with cross-functional teams to translate business objectives into technical solutions.
- Develop and evaluate prototypes, proofs of concept, and production-ready solutions at very large scale.
- Communicate findings and recommendations effectively to technical and non-technical audiences, including leadership and operations partners.
- Contribute to the team's research goals by authoring publications and filing patents in alignment with Apple's innovation standards.
- Mentor junior researchers and foster a culture of collaboration, innovation, and excellence within the team.
Does this sound like you? Join our team!
BS/MS/PhD in a quantitative field (Computer Science, Math, Statistics, Physics, etc.) and 5+ years of experience
Proficiency in Python programming
Hands-on experience working with deep learning toolkits such as JAX, TensorFlow, or PyTorch
Proven track record in training or deployment of large models or building large-scale distributed systems
Deep understanding of Deep Learning, Large Language Models (LLMs), and Natural Language Processing
BS, MS or PhD in a quantitative field, including Computer Science, Math, Statistics, Physics, etc.
Experience with the ReAct pattern in agentic workflows and with building LLM-based agents