As a pivotal member of Apple's enterprise generative AI efforts, you will support the architecture and optimizing backend systems for high availability and scalability. We design and implement scalable RESTful APIs and microservices using Python, Java, or Go. Our team tackles unique challenges in privacy-preserving generation, efficient inference, and multimodal integration. We deliver production-grade models that meet Apple's rigorous standards for quality, performance, and scalability.
Bachelor of Science in Computer Science, Machine Learning, or a related quantitative field or equivalent experience
2+ years of hands-on experience in machine learning and backend development in industry
Experience with microservices architecture and distributed systems
Experience in ML frameworks (PyTorch, JAX) for training, fine-tuning, and deploying ML/generative models at scale
Proven track record of building enterprise-grade ML pipelines (data prep, distributed training, optimization, monitoring) in cloud environments (AWS, GCP, Azure) or on-prem infrastructure
MS in Computer Science, Machine Learning, or a related quantitative field
Deep expertise in one of Enterprise GenAI's critical domain, such as document-AI, LLM, agent, embedding, or search
Contributions to major open-source ML frameworks or research communities