Software Engineer- II, Infra

Product & Engineering – Engineering /
Full Time /
Remote
About the Role:

We are currently looking for a Software Engineer II - Infrastructure to join our dynamic team. In this role, you will play a pivotal part in debugging technical issues within a sophisticated stack that encompasses virtualization, containers, microservices, and beyond. Your expertise will be instrumental in ensuring the seamless operation of our cloud infrastructure and delivering timely support to our valued customers. If you're ready to make a significant impact in a fast-paced environment, we invite you to be a key player in our mission to drive excellence and innovation.

What will you do? 🤔

    • Technical Issue Resolution:
    • Debug complex technical issues across virtualization, containers, and microservices to ensure platform stability and performance.
    • Customer Cloud Issue Management:
    • Take ownership of customer cloud issues, effectively communicating with workgroups on behalf of end customers to resolve problems promptly.
    • Service Level Management:
    • Proactively manage service levels and abandon rates during business-critical needs to ensure a seamless user experience.
    • Major Incident Handling:
    • Handle major incidents by coordinating with multiple teams and leading as a Priority Incident Manager to ensure swift resolution.
    • Automation and Scripting:
    • Generate scripts and templates to automate resource provisioning, enhancing operational efficiency.
    • Continuous Improvement:
    • Implement proactive measures to manage outage events, driving incident response and fostering blameless postmortems for continuous improvement.
    • Knowledge Sharing:
    • Stay up-to-date with the latest developments and challenges related to cloud infrastructure, providing insights to enhance our platform's performance and reliability.

What makes you a match for us? 😍

    • In depth experience in one of the major cloud platforms (AWS, Azure, or GCP) with a deep understanding of its services and best practices.
    • Strong knowledge of Kubernetes with hands-on experience in managing Kubernetes clusters.
    • Working experience with tools such as Argo CD, Argo Workflow, Loft vCluster, or Prometheus/Grafana.
    • Proficiency in one or more programming languages such as Java, Python, or Go, with the ability to create efficient scripts and automation.
    • Familiarity with databases like Cassandra, Elasticsearch, PostgreSQL, or Redis is a plus.
    • Experience with multi-tenancy on Kubernetes for seamless resource management.
    • Exceptional verbal, presentation, and written communication skills to convey technical information effectively.
    • Availability to work during EST/EDT hours (7 PM to 4 AM IST) to align with global operations.