Site Reliability Engineer

Burlingame, CA /
ML + Compchem + Software /
Full-time
/ Hybrid
Genesis Therapeutics is building a world-class software team to solve problems in drug discovery through machine learning, biophysical simulation, and computational chemistry. We are looking for engineers excited to help develop new medicines and play a critical role in building out our software platform.

You will

    • Improve our job scheduling and workflow management system called Flow, written in Python on top of Kubernetes, to manage our cluster of 1,000s of GPUs and 10,000s of CPUs
    • Help maintain our production infrastructure within Google Cloud, refining our scripts and procedures for testing, monitoring, and deployment automation
    • Work closely with our machine learning engineers and computational chemists to optimize our infrastructure for both research throughput and production robustness

You are

    • 3+ years of experience creating and maintaining infrastructure at scale, on top of AWS or Google Cloud
    • Proficient in Python, Bash, and preferably Terraform
    • A responsible and tenacious engineer who loves to architect secure, scalable systems and dive deep to solve obscure issues

What we offer

    • The opportunity to work on high impact infrastructure that is used to accelerate the discovery of new medicines
    • A world-class, tight-knit team of good-hearted people across software, machine learning, computational chemistry, medicinal chemistry, and biology
    • Competitive salary and equity. Medical, dental, and vision insurance, and a 401(k) program