Forward Deployed Data Engineer

United States
Software & Infrastructure Engineering – Cloud & DevOps /
Full-time /
Remote
About Sayari: 
Sayari is the counterparty and supply chain risk intelligence provider trusted by government agencies, multinational corporations, and financial institutions. Its intuitive network analysis platform surfaces hidden risk through integrated corporate ownership, supply chain, trade transaction and risk intelligence data from over 250 jurisdictions. Sayari is headquartered in Washington, D.C., and its solutions are used by thousands of frontline analysts in over 35 countries.

Our company culture is defined by a dedication to our mission of using open data to enhance visibility into global commercial and financial networks, a passion for finding novel approaches to complex problems, and an understanding that diverse perspectives create optimal outcomes. We embrace cross-team collaboration, encourage training and learning opportunities, and reward initiative and innovation. If you like working with supportive, high-performing, and curious teams, Sayari is the place for you.

Positions Description:
Sayari’s flagship product, Sayari Graph, provides instant access to structured business information from hundreds of millions of corporate, legal, and trade records. As a member of Sayari's data team you will work with our Product and Software Engineering teams to collect data from around the globe, maintain existing ETL pipelines, and develop new pipelines that power Sayari Graph. 

Job Responsibilities:

    • Working with customers to help them ETL their data into a format which is usable by Sayari’s on premise offering
    • Working with customers pre-sales to help them design solutions focused around Sayari’s product offerings for Entity Resolution and bulk data
    • Working with customers post-sale to ensure that they are getting value from Sayari’s bulk data product
    • Managing the process of producing customized bulk data products for customers and bulk data samples for prospective customers

Required Skills & Experience:

    • Professional experience with Python and a JVM language (e.g., Scala) 
    • 4+ years of experience designing and maintaining ETL pipelines 
    • Experience using Apache Spark
    • Experience with SQL (e.g., Postgres) and NoSQL databases (e.g., Cassandra, ElasticSearch, etc.)
    • Experience working on a cloud platform like GCP, AWS, or Azure 
    • Experience working collaboratively with git 

Desired Skills & Experience:

    • Understanding of Docker/Kubernetes 
    • Understanding of or interest in knowledge graphs
    • Experienced in supporting and working with internal teams and customers in a dynamic environment
    • Passionate about open source development and innovative technology
Benefits: 
·       Limitless growth and learning opportunities
·       A collaborative and positive culture - your team will be as smart and driven as you
·       A strong commitment to diversity, equity & inclusion
·       Exceedingly generous vacation leave, parental leave, floating holidays, flexible schedule, & other remarkable benefits
·       Outstanding competitive compensation & commission package
·       Comprehensive family-friendly health benefits, including full healthcare coverage plans, commuter benefits, & 401K matching
 
Sayari is an equal opportunity employer and strongly encourages diverse candidates to apply. We believe diversity and inclusion mean our team members should reflect the diversity of the United States. No employee or applicant will face discrimination or harassment based on race, color, ethnicity, religion, age, gender, gender identity or expression, sexual orientation, disability status, veteran status, genetics, or political affiliation. We strongly encourage applicants of all backgrounds to apply.