Data Engineer (Curia)
Washington, DC / Curia – Engineering / Full-Time (Remote Available)
Curia.ai is a startup focused on delivering measurable business and clinical value through the implementation of practical AI. We use cutting-edge ML and Causal Inference to forecast health risks and the effectiveness of treatments on an individual patient basis. We partner with healthcare providers, ACOs, and insurers to deliver our recommendations to doctors, who then target care based on our insights.
Curia.ai is part of Foundry.ai, a technology fund & studio that creates AI software companies in partnership with large global enterprises. Foundry’s operating companies focus on practical applications of AI that cut through the hype cycle and drive immediate, measurable, and recurring results. Foundry is backed by approximately $100MM in capital from leading private equity and venture capital partners.
Curia's team is close-knit and aims to grow along with our business. You'll have the opportunity to contribute to the design and implementation of our products and to the direction and aspirations of the company as a whole.
We embrace communication and integration between fields. At Curia, you will have the opportunity to work closely with all segments of our team, from the data scientists who design and implement our ML models, to the engineers behind the Curia AI Platform, to the business strategists who market our products.
As a Data Engineer at Curia, you have proven strengths in software development and a keen desire to help grow our products' reach from a few clients to the whole of the healthcare industry. At Curia, you will help design, build and maintain the data infrastructure that turns raw medical data from our clients into the standardized ML features that fuel the Curia AI Platform.
We are a data-driven organization that uses cutting-edge technology to deliver the most value to customers from our products. The Data Engineer role is fast-paced and dynamic: you will be expected to wear many hats and provide a vision for the future development of data products.
On a given day, you may do any of the following:
- Coordinate with customers on ingesting large clinical datasets
- Talk with our Data Science team about how to implement a new ML feature generation pipeline
- Work on improvements to our core ETL/ELT pipelines
- Deploy infrastructure changes to improve data lake read/write performance
- Deliver/integrate results with clients
Qualifications
- Bachelor's Degree in Computer Science, Engineering or related field
- Relevant experience as a Data Engineer, or Software Engineer working with data
- Strong understanding of Python
- Experience with PySpark, Spark + Scala, or similar
- Strong understanding of distributed data pipelines
- A strong "will to win" mindset and a desire to take ownership of data products
- Excellent communication skills, both as a collaborative team member and as an independent worker keeping others apprised of your projects
- Experience architecting cloud native data platforms in AWS/GCP/Azure
- Experience with Infrastructure as Code (Terraform, Pulumi, CloudFormation, etc.)
- Experience supporting ML/AI and a strong understanding of MLOps principles
Benefits and Culture
Curia offers an extremely competitive compensation package, including equity, employer-covered health/vision/dental insurance (with an optional FSA), and 401(k) matching. Employees receive a generous PTO allowance and can work both remotely and from our office in downtown Washington, D.C. Successful candidates will join a small team of ambitious, supportive co-workers with ample opportunities to take on responsibilities beyond their assigned role.
For additional information on benefits & what it's like to work at Curia, please visit curia.ai/careers.
Finally, we highlight that excellence has no single mold, particularly in a field as rapidly evolving as AI. We're looking for excellent candidates of all backgrounds with strong business intuition and coding skills, and welcome applicants regardless of ethnic/national origin, gender, race, religious beliefs, disability, sexual orientation or age.
A Note on the Interview Process
Curia.ai interviews share some common features with other technical hiring processes and have some important differences. These reflect the unique roles our employees play, which often involve early-stage development of products and environments where idea generation, product-market fit, and partner interaction may be significant aspects of their jobs.
All our roles have technical interviews that test core machine learning competencies and the ability to discuss technical work, formalize generic problems into a quantitative system, problem-solve, and act as part of a team.
We also ask case-study interview questions, which are less common for technical roles. Case studies are open-ended business problems that do not have set correct answers. They require the interviewee to consider the provided information, decide what is most important, and then build a structure to answer the key questions in discussion with the interviewer. We've incorporated these into our process because Curia.ai is, similarly, working on problems that haven't been solved before and that have no single correct answer. These challenges require business acumen, problem-solving skills, and the ability to think on your feet and prioritize information and actions.
Our staff often describe this unique mindset as not only wanting to write the code that solves a problem but also being able to define the problem we are solving, and our interview process is designed to help candidates showcase their skills in this vein.