DevOps Engineer - II

Gurugram
Product Engineering – DevOps & Infrastructure / Full Time / Remote
About the Company:
Netomi is the leading agentic AI platform for enterprise customer experience. We work with the largest global brands like Delta Airlines, MetLife, MGM, United, and others to enable agentic automation at scale across the entire customer journey. Our no-code platform delivers the fastest time to market, lowest total cost of ownership, and simple, scalable management of AI agents for any CX use case. Backed by WndrCo, Y Combinator, and Index Ventures, we help enterprises drive efficiency, lower costs, and deliver higher quality customer experiences.

Want to be part of the AI revolution and transform how the world’s largest global brands do business? Join us!

We are looking for a DevOps Engineer to work collaboratively with the software development team to deploy and operate our systems. You will be responsible for ensuring that these systems run smoothly and are monitored continuously so that issues are detected and resolved quickly.

Responsibilities

    • Kubernetes Management
        • Design, deploy, and manage Kubernetes clusters.
        • Configure and enforce RBAC policies and access control mechanisms.
        • Optimize cluster performance and scalability.

    • CI/CD Pipeline Development
        • Build, maintain, and enhance CI/CD pipelines using tools like Jenkins, ArgoCD, Spinnaker, or similar.
        • Automate deployment processes to improve delivery efficiency and reduce lead times.

    • Helm Templates
        • Develop, maintain, and optimize Helm charts for Kubernetes applications.
        • Ensure proper versioning and deployment strategies through Helm.

    • Service Mesh Implementation
        • Integrate and manage service meshes such as Istio or Linkerd for service discovery, load balancing, and secure communication.
        • Implement observability and traffic management within the service mesh.

    • Scripting and Automation
        • Write and maintain automation scripts using Python, Ruby, or Golang.
        • Leverage scripts for infrastructure provisioning, monitoring, and debugging.

    • (Bonus) Chaos & Resilience Engineering
        • Implement chaos engineering principles to test and improve system reliability.
        • Simulate failures and measure system resilience to ensure production readiness.

Requirements

    • Experience: At least 3 years in DevOps or a related field.

    • Tools and Technologies:
        • Kubernetes (RBAC, cluster administration, Helm).
        • CI/CD tools such as Jenkins, ArgoCD, or Spinnaker.
        • Service mesh solutions like Istio, Linkerd, or similar.
        • Hands-on experience with Amazon ECS and Terraform (mandatory for this role).

    • Skills:
        • Proficiency in scripting languages (Python, Ruby, Golang).
        • Familiarity with cloud platforms (AWS, GCP, Azure).
        • Strong understanding of system design, networking, and infrastructure management.

    • Bonus Skills: Knowledge of and experience with chaos engineering tools like Chaos Monkey, LitmusChaos, or Gremlin.

Netomi is an equal opportunity employer committed to diversity in the workplace. We evaluate qualified applicants without regard to race, color, religion, sex, sexual orientation, disability, veteran status, or other protected characteristics.