Data Engineer

New York, NY / Technology
Data is a core part of our product. Our data pipeline today consists of thousands of datasets from hundreds of unique sources and continues to grow every day. This pipeline powers our machine learning models and API, which in turn power production workflows that affect real patients.

We’ve experienced rapid growth in data scale over the past few years and are looking for an experienced developer to build scalable systems that can support the next phase of our growth. We are looking for someone who can architect efficient and scalable systems and set data engineering standards for the engineering organization, and who enjoys rolling up their sleeves and coding.

What we look for at Ribbon:

    • Passion and drive to simplify healthcare by building products that increase access to care and power every healthcare decision to be high-quality, cost-effective, and convenient
    • Commitment to Ribbon Health company values, working on an exceptional team, and building an exceptional company
    • Grit, hustle, desire, and a “get-it-done” attitude; strong comfort with a lean startup environment, where everyone is encouraged to participate in and contribute across all teams

What we’re looking for in this role:

    • You have experience designing and implementing data warehouses and ETL architecture in the cloud and are excited to leverage your experience to build from the ground up
    • You are very comfortable with SQL (Hive, Oracle, Vertica, etc.) and relational databases; Python experience is a big plus but not required
    • You have a unique ability and passion for transforming large and complex datasets into information that is useful for real-life decision-making
    • You are able to break down ambiguous problems and propose clear data modeling designs
    • You have the ability to make thoughtful trade-offs between long-term scalability and moving quickly in the short term
    • You care deeply about implementing best practices that ensure data integrity and reliability because you understand how our products meaningfully impact patients downstream
    • Helpful but not required: You have experience working with healthcare data (e.g. claims, directory, medical records)

Your day-to-day:

    • Architect and build our data warehouse: You will design and build a data warehouse that will serve as the foundation for our data pipelines and machine learning
    • Scale our machine learning efforts: You will build data infrastructure to enable our machine learning engineers to deploy existing models to our production data pipeline and expand their analytics efforts
    • Build a data extraction framework: You will design and improve upon our current system of record for ingesting data from hundreds of different sources
    • Build data pipelines: You will integrate many data sources into the Ribbon data pipeline. You will set the standards that other engineers building data pipelines will follow
    • Build lightweight automation: You will develop systems and tools to configure, monitor, and orchestrate our data infrastructure
    • Develop data standards: You will develop Ribbon’s internal standards for data management and data governance