Senior Software Engineer
South San Francisco
Computational Biology /
Hybrid
About Tahoe Therapeutics
Tahoe Therapeutics is a biotechnology company pioneering a fundamentally new approach to drug discovery — one that begins with the biology of real patients. Our Mosaic platform is the first to make in vivo data generation scalable, with single-cell resolution, allowing us to map how drugs affect patient-derived cells in the body across a wide range of biological contexts. We are building the world’s largest in vivo single-cell perturbation atlas — and using it to train multimodal foundation models that learn the context-dependent nature of gene function, disease progression, and drug response. By combining cutting-edge machine learning with the most biologically relevant datasets ever assembled in drug discovery, our mission is to find better drugs, faster — and bring them to more patients who need them.
Your Role
You will be responsible for maintaining our bioinformatics pipelines, and ensuring that we optimally leverage cloud resources for large scale data processing and storage. You will also provide dev-ops support to the machine learning team. You will closely interact with bioinformaticians, computational biologists, machine learning scientists, and experimental biology scientists.
Qualification - Required
- BSc in computer science or related discipline
- 4+ years of experience as software engineer in an industry setting
- Knowledge of cloud infrastructure; familiarity with one of leading cloud providers, AWS, GCP or Azure; proficiency with Docker
- Solid Python programming skills
- Solid knowledge of pipeline orchestrator frameworks (preferably Metaflow, Snakemake)
- Experience working with big data and building scalable software solutions
- Familiarity with Linux based operating systems
Qualification - Desirable
- Basic knowledge of biology and bioinformatics
- Prior experience working in a biotechnology or pharmaceutical company, or other setting requiring interaction with experimental scientists
- Exposure to large-scale bioinformatics data quality control and visualization
- Familiarity with R
Key Responsibilities
- Execute bioinformatics pipeline in a data production cloud environment
- Maintain bioinformatics pipeline and other software systems
- Improve scalability of bioinformatics pipeline
- Support ML scientists using the RunAI GPU scheduling environment and adopting future environments further improving scalability
- Administration of AWS cloud accounts
- More bespoke software engineering tasks related to experimental data generation and analysis
Benefits
- Unlimited Paid Time Off (PTO).
- Monthly Lunch budget
- One-time Office set up budget
- US Employees: HMO Kaiser Platinum and PPO Anthem Gold medical as well as vision and dental plans for both the employee and dependents.
This hybrid role does not necessitate daily on-site attendance, but it does require the ability to access our offices in either South San Francisco, CA, or Toronto, ON; we welcome applications from candidates in these regions or those willing to relocate to the Bay Area or the Greater Toronto Area. Please note, we have one role open to two geographical locations.