Software Engineer, Deployment (US)

Palo Alto / New York

Engineering & Infra

Full-time

Hybrid

About Mistral

At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.

We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include Le Chat, the AI assistant for life and work.

We are a dynamic, collaborative team passionate about AI and its potential to transform society.

Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed across France, the USA, the UK, Germany and Singapore. We are creative, low-ego and team-spirited.

Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.

Role Summary

We are seeking experienced backend software engineers to join our Deployment team. You'll help deploy and integrate our products (models, APIs, AI Studio...) across multiple infrastructure configurations, from leading cloud service providers to self-hosted (private cloud and on-premises) solutions. You'll work closely with the research, product, solution architect and program management teams to serve our frontier models to customers wherever they use our technology.

What you will do

As a Software Engineer in the Deployment team, you will be responsible for:

• New releases – you will ensure fast and reliable launch of new products (from models to APIs) to customers

• Build and test infrastructure – you will work to improve and extend the infrastructure needed to package, deploy and integrate our core technology within first-party systems and third-party platforms

• Safety – you will help solve the unique challenges that come with maintaining AI safety on third-party platforms

• Observability and Monitoring – you will collaborate closely with both internal and external stakeholders to ensure our services achieve high availability and deliver state-of-the-art performance for our users

• Build automation to increase deployment performance (velocity, scalability)

• Foster architecture improvements to make our products deployable on all configurations (including on-premises)

• Drive cross-functional feature improvements with other product engineering teams (Le Chat, API/SDK, Mistral Code...)

• Contribute to key technology and architecture trade-offs to break our deployment stack down into small, maintainable and testable pieces

About you

• 5+ years of relevant professional work experience

• Master’s degree in Computer Science, Information Technology or a related field

• Excellent proficiency in backend software development (Python, Golang)

• Strong proficiency in infrastructure management (Docker, CI/CD, K8s, Helm, Terraform...)

• Good knowledge of cloud ecosystems and understanding of the challenges of deploying LLMs in multiple environments (public cloud, private cloud, on-premises)

• Autonomous self-starter

• Ability to communicate with influence

What We Offer

💰 Competitive salary, bonus and equity structure

🧑‍⚕️ Health: Highly competitive healthcare program

👴🏻 Pension: 401(k) (6% matching) for US-based employees

🧢 Transportation and meal stipends, gym membership, coaching...

🪪 Visa sponsorship
