(Associate) Data Engineer

New York, New York
Research Institute – Global Stem Cell Array /
Full Time /
Hybrid
The New York Stem Cell Foundation (NYSCF) Research Institute is a rapidly growing and highly successful nonprofit whose mission is to accelerate cures through stem cell research.

We are seeking an (Associate) Data Engineer who will be responsible for optimizing our in-house data processing, handling and storage workflows. Through the optimization and continued development of centralized workflows for data retrieval, this role will be responsible for deploying and building custom pipelines to ingest and process biological data generated by teams within the NYSCF Research Institute.  

You describe yourself as a skilled data engineer who has knowledge and experience in working with large datasets in Python and wide experience with databases including SQL. You will report directly to the Principal Scientist, AI and Data Science and though you’ll primarily interact with our data science and software engineering teams, you’ll be part of a larger team composed of hardware engineers and biologists. Level will be commensurate with experience.

What you'll do:

    • Develop, deploy, and document software that supports the analysis, annotation, and quality control pipelines for data
    • Work with both data science and software engineering teams, as well as end user biologists, on requirements for processing, analyzing, and generate appropriate logs and reports of data 
    • Ingest, process, and perform first quality and balance controls of large datasets of microscopy images, both on premise and on cloud servers
    • Deploy existing pipelines for image processing for data standardization, quality control, characterization and feature extraction, both on premise and on cloud servers
    • Develop and implement novel data visualization strategies to summarize results and QC features
    • Ingest data from screens and process it to look for first impressions, outliers and nuances
    • Optimize the pipelines to be operated on different clusters and virtual machines
    • Centralize and optimize our existing workflows, manage data migration and distribution

What we're looking for:

    • B.S. or M.S. in computer science, engineering, data science, or mathematics
    • 2+ years of experience developing data models and implementing data mining, migration and management
    • Experience with pipeline deploying and resource optimization
    • Strong database experience, must know SQL
    • Strong programming experience, must know Python 
    • Strong experience with cloud computing infrastructures on AWS, Google Cloud, etc. (AWS preferred)
    • Familiarity with Python libraries for data framing and visualization (eg Pandas, Seaborn, pyplot)
    • Familiarity with GPU programming and resource optimization and parallel computing
    • Experience with Git repository systems
    • Knowledge of image processing techniques (preferable)
    • Experience with microscopy images and fluorescence images (preferable)
$65,000 - $115,000 a year
The base starting salary range for the Associate Data Engineer level is $65,000 - $85,000. The base starting salary range for the Data Engineer level is $80,000-$115,000. Multiple factors, including your experience, determine final offer amounts and levels, which may vary from the amounts listed above. NYSCF has a 35-hour workweek and this position is exempt and not eligible for overtime.
At NYSCF, we believe diversity in all forms makes us a better team, and we celebrate it. Yet studies have shown that women, people of color, and other minoritized individuals in STEM may be less likely to apply to jobs where they do not meet all of the criteria. Therefore, if you are excited about this role but your past experience does not align perfectly with every qualification in the job description, we encourage you to apply anyways. You may be just the right candidate for this or other roles.

We offer all full-time employees a comprehensive benefits package that goes into effect on the first of the month following your start date. It includes a choice of medical, dental, and vision insurance (with 100% of the premiums paid for employees and subsidies for any dependents), 403b retirement plan with 5% employer match (immediate vesting schedule which starts after your first 90 days), short and long term insurance, life insurance, inclusive paid parental leave program, pretax transit and parking, legal aid benefits and wellness benefits. Our paid time off includes vacation, sick, personal days, flexible holidays, summer flex program, and all company holidays. If a visa is required, NYSCF will cover all of those costs. Relocation will sometimes be necessary; therefore, we will provide you with an allowance. 

In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and complete the required employment eligibility verification upon hire.  

NYSCF is an equal opportunity employer, and we value diversity in our organization. We provide equal opportunities to all applicants for employment without discrimination or harassment based on race, color, religion, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity or expression, age, disability, national origin, marital or domestic/civil partnership status, genetic information, citizenship status, veteran status, or any other characteristic protected by law.

The position is based at our location in Manhattan.

Recruitment Phishing Scams:
Fake job advertisements and offers are increasingly appearing on the internet. If you have encountered a job posting or have been approached with a job offer that you suspect may be fraudulent, we strongly recommend you do not respond and report it to the Federal Trade Commission and the FBI at https://www.ic3.gov/Home/ComplaintChoice.You can also contact our team jobs@nyscf.org to report details of your experience.

Please be mindful of the following:
·       NYSCF will only reach out to you through an “@nyscf.org” email address.
·       Other than your email address or telephone number, which you may provide via a job application portal, NYSCF will never ask you to provide personally identifiable information about yourself (such as a Social Security Number or Driver’s License Number) via a messaging application (like that used on the LinkedIn platform or Microsoft Teams or Zoom).
·       NYSCF will conduct interviews face-to-face over Zoom or in person.
·       All job postings will be listed on NYSCF's official career page (nyscf.org/careers). If someone contacts you about a job or position that is not listed on the official career page, please contact the NYSCF recruitment team at the contact information below.
·       If you have any questions regarding the validity of a recruitment inquiry or an interview, please contact the NYSCF’s recruitment team at jobs@nyscf.org to confirm before proceeding.