We are looking for a strong ML applied scientists and engineers to build ground breaking AI infrastructures to power the infrastructures that Apple in-house ML experts use everyday to optimize models shipped on devices and servers for Apple Intelligence.
We are part of a collaborative group of software developers and deep learning authorities working in the area of neural network optimization, on device inference, and model evaluation. You will work with world-class talents in visualization, LLM training, on-device optimization, ML tools/platforms. You will develop reliable and scalable web services for ML developers: e.g., model optimization pipeline, effective ML dev workflow, infrastructure to serve internal service.
Experience developing/optimizing/training large language models (LLMs), or large computer vision models, or generative AI models.
Software engineering skills in Python and general purpose system admin and infrastructure management abilities.
History of applied research in neural network model life cycle or training or a related area application.
Track record to drive scientific investigations and experiments and overcome obstacles and uncertainty in a research environment.
BS degree and 3+ years of proven experience.
Publication record at top AI/ML venues
Experience with LLM LoRA fine-tuning, neural network optimization (e.g. quantization and compression)
Experience with on-device/server scale deployment
Experience with languages like C/C++
Infrastructure management and debugging experience
Experimental rigor when training/evaluating LLMs for the purpose of benchmarking LLM optimization algorithms
Strong communication and accountability skills; hard-working, strong work ethic, and collaboration abilities