AI Engineer II
United States
Tech, Product, & Design – Engineering /
Remote
Mission
We are building the next generation of hyper-personal AI systems that fundamentally transform how humans interact with businesses. Led by our Chief AI Scientist, the Personal Superintelligence Lab at Gopuff develops cutting-edge agentic AI solutions that reason about complex user contexts, preferences, and real-world constraints to deliver superhuman performance in personalized shopping assistance. You will advance the state-of-the-art in prediction, alignment, grounding, and multi-agent orchestration while deploying these breakthroughs at massive scale.
Scope and Impact
You will own core components of our instant shopping personal intelligence—context engineering, grounding, alignment, and serving—so technology accelerates customers to their goals, leveraging our unparalleled delivery speed. You’ll design context engineering strategies and real‑time retrieval, build SFT pipelines, and implement reinforcement learning from human feedback (RLHF) and verifiable rewards (RLVR) using GRPO/GSPO. You’ll use relevant APIs for data augmentation and ship low‑latency, reliable inference (batching, caching, quantization, streaming) with strong observability and safety. In close partnership with Engineering and Data Science, you’ll run experiments end‑to‑end and deliver production models that drive measurable lifts in business KPI under privacy‑by‑design and rigorous evaluation.
Research Areas & Contributions
Advanced Context & Grounding Research:
- Pioneer novel architectures for multi-modal context integration across temporal, spatial, and behavioral dimensions
- Develop foundational techniques for real-time knowledge grounding with dynamic constraint satisfaction
- Research declarative programming paradigms (DSPy) for robust prompt compilation and systematic LLM behavior specification
- Design and develop multi-agent orchestration systems (LangChain, LangGraph, CrewAI, Autogen, LlamaIndex) that exhibit emergent reasoning capabilities
Alignment & Learning Systems:
- Advance reasoning-centered supervised fine-tuning methodologies with novel data curation, synthetic generation, and quality assurance frameworks
- Develop and evaluate model reasoning (task decomposition, tool-use, and self-correction) to improve grounded recommendations and task completion under real-world constraints.
- Research parameter-efficient adaptation techniques including LoRA variants, Text-to-LoRA synthesis, and dynamic adapter routing
- Pioneer RLHF and RLVR with emphasis on scalable oversight
- Develop next-generation policy optimization algorithms beyond GRPO/GSPO with formal safety guarantees
Safety & Robustness:
- Research interpretability, controllability, and alignment verification for production agentic systems
- Develop formal methods for safe exploration, reward hacking prevention, and distributional robustness
- Pioneer privacy-preserving techniques and federated learning approaches for personal AI systems
Systems & Deployment:
- Research efficient inference architectures including novel quantization, caching, and streaming paradigms
- Develop evaluation frameworks that bridge offline metrics with online performance and safety criteria
- Build foundational infrastructure for rapid experimentation and deployment of research breakthroughs
Requirements:
- MSc or PhD in Computer Science, Machine Learning, or equivalent research experience with significant contributions to AI/ML literature
- Established experience in large-scale machine learning research with demonstrated impact on real-world systems
- Deep expertise in transformer architectures, large language models, and modern pre- and post-training paradigms
- Mastery of advanced fine-tuning techniques including LoRA/QLoRA, adapter methods, and parameter-efficient transfer learning
- Research experience with agentic AI frameworks, multi-agent systems, and declarative programming approaches (DSPy, LangChain ecosystem)
- Strong systems engineering capabilities with PyTorch, distributed training, and cloud-native ML infrastructure
- Track record of publications in top-tier venues (NeurIPS, ICML, ICLR, AAAI) or equivalent industry impact
- Commitment to responsible AI development and alignment research
Preferred Qualifications:
- Research background in reinforcement learning, multi-agent systems, or decision theory
- Experience with formal verification, program synthesis, or automated reasoning
- Contributions to open-source AI frameworks or foundational model development
- Background in cognitive science, human-computer interaction, or behavioral economics
- Experience with privacy-enhancing technologies, federated learning, or on-device AI
What We Offer:
- Opportunity to shape the future of personal AI and human-machine collaboration
- Access to world-class compute resources and datasets
- Collaboration with leading researchers across academia and industry
- Publication support and conference travel for research dissemination
- Competitive compensation with significant equity upside in breakthrough AI applications
Compensation
- Gopuff pays employees based on market pricing and pay may vary depending on your location. The salary range below reflects what we’d reasonably expect to pay candidates. A candidate’s starting pay will be determined based on job-related skills, experience, qualifications, interview performance, and market conditions. These ranges may be modified in the future. Exceptions may be made for exceptional individuals. For additional information on this role’s compensation package, please reach out to the designated recruiter for this role.
- This role is eligible for a discretionary annual cash bonus and participation in Gopuff’s equity incentive plan.
- Base Salary Range: $175,000 - $220,000
Benefits Overview
- Medical/Dental/Vision Insurance
- 401(k) Retirement Savings Plan
- HSA or FSA eligibility
- Long and Short-Term Disability Insurance
- Mental Health Benefits
- Fitness Reimbursement Program
- 25% employee discount & FAM Membership
- Flexible PTO
- Group Life Insurance
- EAP through AllOne Health (formerly Carebridge)
The only predictable thing about life is that it’s wildly unpredictable. That’s where we come in.
When life does what it does best, customers turn to Gopuff to deliver their everyday essentials, and to get through their day & night, work day and weekend.
We’re assembling a team of thinkers, dreamers & risk-takers...the kind of people who know the value of peace of mind in an unpredictable world. (And people who love snacks.)
Like what you’re hearing? Welcome to Gopuff.
The Gopuff Fam is committed to an inclusive workplace where we do not discriminate on the basis of race, sex, gender, national origin, religion, sexual orientation, gender identity, marital or familial status, age, ancestry, disability, genetic information, or any other characteristic protected by applicable laws. We believe in diversity and encourage any qualified individual to apply. We are an equal employment opportunity employer.
#LI-GOPUFF
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.