3+ years of experience as a technical lead, guiding teams through complex design decisions and setting high benchmarks for code quality, performance, and scalability
In-depth understanding of large language models (LLMs) and their application in AI-driven solutions, including inferencing, embedding, and knowledge base integration (RAG) for improved data retrieval and contextualization
Hands-on experience designing and building GenAI platforms that allow users to create, configure, and deploy AI applications supporting features like agent orchestration, prompt engineering, RAG integration, and model selection
Experience building AI agents capable of complex multi-step reasoning and tool usage, with a focus on reliability, traceability, and composability
Proven experience in fine-tuning and customizing foundation models to improve task-specific performance and domain alignment
Deep knowledge of LLM inference optimization techniques, including prompt tuning, caching, quantization, and latency reduction across different model families
Strong programming skills in Python, Java, or similar languages, with an emphasis on AI/ML systems development and platform engineering
Demonstrated ability to work cross-functionally and influence product development through a combination of technical leadership and user-centered thinking
Passion for operational excellence, automation, and delivering scalable, developer-friendly AI infrastructure
B.S, M.S. or PhD Degree in Computer Science/Engineering, or equivalent work experience
Expertise in AWS Cloud
Hands on experience in using Kubernetes as orchestration layer