DevOps Engineer (Developer Tooling)

San Francisco, New York, and Remote
Engineering - Infrastructure
Our mission is to bring blockchain to a billion people. The Alchemy Platform is a world-class developer platform designed to make building on the blockchain easy. We've built leading infrastructure in the space, powering over $45 billion in transactions for tens of millions of users in 99% of countries worldwide.

The Alchemy team draws from decades of deep expertise in massively scalable infrastructure, AI, and blockchain from leadership roles at leading companies and universities like Google, Microsoft, Facebook, Stanford, and MIT.

Alchemy recently raised a Series C led by a16z at a $3.5B valuation, having previously raised from Coatue, Addition, Stanford University, Coinbase, the Chairman of Google, Charles Schwab, and the founders and executives of leading organizations.

Alchemy powers the top blockchain companies globally and has been featured in TechCrunch, Forbes, Bloomberg, and elsewhere.

The Role
As an engineer focused on DevOps and developer tooling at Alchemy, you'll work with the wider engineering team on the design, deployment, and continuous improvement of the infrastructure that supports our globally used developer platform. You'll leverage your knowledge of metrics, logs, and traces to own the observability use cases for our infrastructure and backend applications, and use your curiosity to identify and automate high-impact tasks.


    • Dual focus on developer productivity and product reliability
    • Improve important infrastructure and systems from an operational standpoint (e.g., deployment, logging, monitoring, and alerting)
    • Develop and own best practices for managing production infrastructure: provisioning, application scaling, configuration management, capacity planning, monitoring, etc.
    • Develop and own best practices for developer processes: CI/CD, dev and staging environments, etc.
    • Identify and automate repetitive, high-leverage tasks
    • Provide input into long-term platform requirements and operational guidelines with a focus on reliability
    • Continuously raise our standard of engineering excellence by implementing best practices for coding, testing, and deployment
    • Build and maintain documentation for processes and workflows

What We're Looking For

    • 4+ years of experience as a DevOps or Site Reliability Engineer
    • Experience designing and operating large-scale, multi-region production systems
    • Experience working with AWS and cloud infrastructure in general
    • Experience with real-time telemetry and tracing tools like Prometheus, Stackdriver, and Datadog
    • Experience building deployment pipelines leveraging common CI/CD tools
    • Experience with Infrastructure-as-Code (e.g., Terraform, Ansible, CloudFormation, Chef, or Puppet)
    • Experience with networking and configuring / managing VPC networks
    • Experience with container runtimes and schedulers such as Docker and Kubernetes
    • An understanding of security best practices
    • (Preferred) Experience with streaming infrastructure (e.g., Kinesis or Kafka)
    • (Preferred) Good understanding of web applications and microservice architecture
    • Passion for blockchain technologies is a plus