Senior Site Reliability Engineer

Anywhere
Engineering
Full-time
As a Senior Site Reliability Engineer at Skillshare, you’ll play a key role in balancing our current operations with building for the future. We’re scaling quickly and are excited to bring someone on board who can help us proactively tackle resulting challenges – both in the day-to-day operations, and anticipating those further out.

This role is an exciting blend of both Infrastructure and DevOps, which means opportunity for impact across the board. We’ll look to your strategic expertise, reliable execution, and sound judgment to improve and maintain our infrastructure, along with creating increasingly smooth processes for our engineers as we grow the platform.

You’ll be joining a team that’s passionate about technology, and helping pave the way for building products together that we’re proud of. We’re excited to meet you.

What you’ll do:

    • Improve, monitor and maintain our infrastructure
    • Ensure site uptime and performance
    • Maintain and improve development and QA environments
    • Work with web developers to improve tooling for initiatives like unit testing, deployment processes, etc.
    • Proactively prep and train developers for improvements or updated workflows
    • Quickly and proactively resolve developer issues
    • Support the platform team in building new application platform on Node.js
    • Make strategic recommendations and improvements to our application and infrastructure security

What you’ll need to be successful:

    • Experience building and supporting cloud-based web infrastructure with AWS
    • Docker experience (Kubernetes experience is a plus)
    • Continuous integration and deployment experience (preferably with CircleCI)
    • Relational databases and queueing systems knowledge (we use MySQL, Redshift, Redis)
    • Experience with application monitoring and alerting systems (we use New Relic and Datadog)
    • Understanding of web infrastructure: load balancing, high availability configurations, disaster recovery, DNS configuration, security best practices, etc.
    • Working knowledge of software engineering practices
    • Strong communication skills – you’re a natural collaborator and can report out to stakeholders of all levels
    • Ability to balance strategy and execution

Why you want this job:

    • Impact: you’ll play a key role in shaping the direction of our infrastructure and developer processes long-term
    • Growth: You’ll have the opportunity to wear a lot of hats and take on more responsibility over time.
    • Our team: We have a passionate, talented team that is a lot of fun to work with.
    • Our mission: We’re doing work that matters – connecting lifelong learners around the world and empowering them to pursue their creativity.
    • Flexibility: We believe that doing your best work means living a full life. That means different things for everyone, so we optimize for trust, invest to support remote teams, have an unlimited vacation policy (with a required minimum!), and encourage work-life balance.
About Skillshare

Skillshare is an online learning community whose mission is to connect curious, lifelong learners everywhere – and, in so doing, build a more creative, more generous, and more prosperous world. Today, our community has grown to millions of members who come to Skillshare to learn creative and entrepreneurial skills, network with peers, and even teach a class themselves. We are backed by Union Square Ventures, Spark Capital, Amasia, Spero Ventures, and Burda Principal Investments.

Skillshare is committed to building a diverse team that reflects a variety of backgrounds, perspectives, and skills. We work to ensure a consistent interview process, fair compensation, and inclusive work environment for all.