Senior Data Engineer

Mountain View, CA/Remote / Engineering / Full-time

Cape Analytics provides instant property intelligence for buildings across the United States. Cape Analytics enables insurers and other property stakeholders to access valuable property attributes at the time of underwriting, with the accuracy and detail that traditionally required an on-site inspection, but with the speed and coverage of property record pre-fill. Founded in 2014, Cape Analytics is backed by leading venture firms and innovative insurers and is composed of computer vision, data science, and risk analysis experts.

Position Summary:
We are looking for a Senior Data Engineer to join our growing Data Platform and Engineering teams. The ideal candidate has significant experience in building scalable data platforms that enable business intelligence, analytics, data science, and data products. They must have strong, hands-on technical expertise in a variety of technologies and a proven ability to build robust, scalable solutions. They must be at ease working in an agile environment with little supervision. The ability to work across teams with product managers, data scientists, and business stakeholders to translate sometimes vague business requirements into working code will be critical to success in this role. This person should embody a passion for continuous improvement and data quality.

What You'll Do:

    • Design and implement data ingestion pipelines
    • Integrate data from multiple sources and develop cross-platform ETL processes
    • Develop new tools and processes for managing our data workflows and data infrastructure
    • Collaborate with our Engineering and Data Science teams on building, maintaining and monitoring the database infrastructure
    • Collaborate with product managers, data scientists, business users and other engineers to define requirements and design solutions
    • Evaluate third-party datasets for ingestion

Skills/Requirements:

    • Experienced in designing and maintaining data warehouses or raw and model tables
    • Experienced in developing dashboards from analytics tables 
    • Experienced in designing models that unify disparate data sources 
    • Experienced in big data compute infrastructure such as Hadoop, HDFS, EMR, S3
    • Comfortable choosing technologies that fit the application (e.g. MySQL versus PostgreSQL, Hadoop versus Cassandra)
    • More than 5 years of experience in object-oriented development with Python
    • Experienced in designing and implementing data ingestion applications using Spark
    • Familiarity with Docker
    • Experience with machine learning libraries and frameworks like scikit-learn, TensorFlow, PyTorch a plus
    • Familiarity with an orchestration framework like Airflow

We believe:

    • Talent is critical, but best when tempered with humility
    • Self-motivation leads to the best outcomes
    • Open, direct communication is a sign of respect
    • Teamwork drives success
    • Having fun together is an important part of the job

***Cape Analytics is an E-Verify participant.***