Data Engineer

Los Angeles, CA /
Engineering Team /
/ Remote
Full-Time, Remote, US-Based
Compensation: $115K - $150k

Our mission is to protect enterprises and government agencies from disinformation attacks by uncovering threatening online trends and neutralizing them.

PeakMetrics is a cybersecurity solution that extracts insights and creates actionable data from millions of unstructured, cross-channel media datasets in real-time. We use ML to spot adversarial online messaging, understand the audiences behind them, present context such as source credibility, and help customers respond.

PeakMetrics has been battle-tested on some of today’s most complex media issues – from responding to crisis management situations to combating state-sponsored disinformation.

As a Data Engineer, you will collaborate and work closely with the data/product/engineering teams.  You will take ownership over a vast amount of data, implementing crucial data crawling capabilities and manipulating data structures to obtain solutions and insights. In this role you will own the creation process of these tools, services, and workflows to improve data scraping processes and data management. 

You will report directly to the Director of Engineering. 

What you will do at PeakMetrics:

    • Create and maintain optimal data pipeline architecture
    • Assemble large, complex data sets that meet business requirements
    • Identify, design, and implement internal process improvements
    • Optimize data delivery and re-design infrastructure for greater scalability
    • Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS technologies
    • Build analytics tools that utilize the data pipeline to provide actionable insights into customer acquisition, operational efficiency, and other key business performance metrics
    • Work with internal and external stakeholders to assist with data-related technical issues and support data infrastructure needs

Would love to hear from you if:

    • You have experience with version control and open source tools
    • You have familiarity with data analytics and visualization
    • You are comfortable automating data flows with resilient code using Python and/or Node.js.
    • You are comfortable working with large, complex data structures.
    • You love working with heterogeneous data to answer real human questions, and you know how to write the code that makes that happen.
    • You have strong database architecture design and management knowledge in both structured and unstructured data.
    • Have extensive experience working with SQL, SQLAlchemy and ElasticSearch.
    • Have familiarity with in-memory databases and other database types (e.g. graph dbs, vector dbs) a plus.
    • You identify and document customer requirements when on-boarding new data assets for data analysis and data science work. 
    • You have supported the data needs of ML / data science teams
    • You are able to define and communicate data architecture requirements, keeping current with data management best practices.
    • You have maintained data management systems and participated in the design and development of new data pipelines.
    • Have experience running large scale web scrapers, working with APIs and open source datasets
    • Familiarity with social media and search APIs.
    • Ability to work collaboratively across teams
    • You are Curious - You want to understand how the data you work with is used and get to know new codebases. 
    • You have excellent communication - Your clarity of thought is always apparent in your crisp and articulate emails, Slack chats, phone calls, and in-person conversations.
    • You ask questions to better understand the context around your work and what you need to do to be successful.
    • Remote co-working skills - We are 100% remote and distributed across multiple time zones, and we use remote collaboration tools (e.g. Slack, Notion, GSuite, Zoom) to stay connected and productive.
    • Experience with Postgres, git, github, github codespaces, K8’s

We are looking to build a truly diverse, equitable, and inclusive engineering team. We have a number of outstanding benefits to support that.

We are a fully distributed team. Without a central office, everyone’s on an equal footing wherever you’re based in the United States.  

We offer flexible schedules and unlimited time off. Take vacations to recharge! We’re about long-term growth and success, so that means working sustainably.

We provide outstanding health care and vision coverage for you and your family.

We offer a 401(k) and HSA programs.

We are an equal opportunity employer and are committed to building an organization that empowers and promotes diversity, inclusion, equity and belonging (DEIB). We do not discriminate on the basis of race, religion, color, national origin, gender, gender identity or expression, sexual orientation, age, marital status, veteran status, or disability status.