Senior Infrastructure Engineer (SRE)

Toronto (Remote)
R & D Teams – Engineering /
Full-Time /
Remote
Are you ready to redefine the future of work with cutting-edge AI? At Cresta, we're on a groundbreaking mission to supercharge the effectiveness of knowledge workers, making them 100x more productive, 10x faster, and 10x better.

Imagine transforming Call Center operations with our real-time agent assist product and harnessing the power of AI with our comprehensive suite of post-call analytics and coaching tools. Born from the prestigious Stanford AI lab, Cresta was co-founded by visionary Sebastian Thrun, the genius behind Google-X, Waymo, Udacity, and more. Our company is now driven by Ping Wu, the co-founder of Google Contact Center AI and the Vertex AI platform and Tim Shi (co-founder), an early member of Open AI. 

Our world-class team comprises AI and ML experts, dynamic go-to-market leaders, and top-tier investors and advisors from Andreessen Horowitz, Greylock Partners, Sequoia Capital, and former AT&T CEO John Donovan. With an impressive roster of clients like Intuit, Porsche, and Verizon, and accolades from Business Insider, Forbes, and Gartner, Cresta is a startup that's capturing the world's attention. We’re also recognized on the Forbes AI50 in 2024!

About the role:

As a member of the infrastructure team you are responsible for designing, building, and advancing our core infrastructure that allows the engineering team to execute quickly, productively, and securely. You will join a collaborative but highly autonomous working environment in which each member has a defined role with clear expectations, as well as the freedom to pursue projects they find interesting.

Join us on this thrilling journey to revolutionize the workforce with AI. The future of work is here, and it's at Cresta.

What you'll do

    • Developer Toolchain. Partner with engineers to build dev tools that empower developer workflows and deployment infrastructure.
    • Ensure reliability of multi-cloud Kubernetes clusters and pipelines.
    • Metrics, logging, analytics, and alerting for performance and security across all endpoints and applications.
    • Infrastructure-as-code deployment tooling and supporting services on multiple cloud providers.
    • Automate operations and engineering. focus on automation so we can spend energy where it matters. 
    • Building machine learning infrastructure that enables AI teams to train, test, and deploy on large-scale datasets.

What we're looking for

    • 5+ years experience in DevOps, Site Reliability Engineering, Production Engineering, or equivalent field.
    • Deep proficiency with coding languages such as Golang or Python.
    • Deep familiarity with container-related security best practices.
    • Production experience working with Kubernetes, and a deep understanding of the Kubernetes ecosystem, including popular open-source tooling such as cert-manager or external-dns.  Experience with GPU-enabled clusters is a bonus.
    • Production experience with Kubernetes templating tools such as Helm or Kustomize.
    • Production experience with IAC tools such as Terraform or CloudFormation.
    • Production experience working with AWS and services such as IAM, S3, EC2, and EKS. Production experience with database software such as PostgreSQL
    • Experience with GitOps tooling such as Flux or Argo.
    • Experience with CI/CD and feature gating systems.Fluency in Linux operations and configurations.
If you want to make an impact with an amazing product, want to improve your tech skills by working with other exceptional engineers, and like to be part of an amazing international team, then you should join us. We pay an attractive salary and with the Cresta stock options, you can benefit from the company's growth.