Staff Cloud Operations Engineer-Cloud Operations team

Hangzhou, China
Products – Engineering /
Fulltime /
Hybrid
Job Description

Staff Cloud Operations Engineer
Location: China-HANGZHOU


Extreme’s Cloud Operations team is a group of talented engineers passionate about building highly reliable, scalable and secure solutions in public/private cloud environments. We are looking to hire a highly motivated Cloud Operations engineer with strong working experience in production operation and deployment automation. You will work with the team to design, develop and implement deployment automation solutions end-to-end. You will also be expected to participate in continuous cloud service operations and troubleshoot and resolve complex issues in production.

We will work together to design, develop and implement the best public/private/local cloud solutions for our customers. Extreme Networks is the right place to be and now is the right time to join us and be part of our spectacular growth and success. We're looking for the best and the brightest 'A' players who want to make a difference doing a job they love.


Responsibilities

Manage and maintain ExtremeCloud service infrastructure in AWS, GCP & Azure.
Participate in continuous cloud service operations with US, EU, and China teams.
Troubleshoot and follow up on production infrastructure/application-related issues.
Driving root cause analysis and resolution.
Communicate with Dev/QA as well as external carriers to resolve and prevent issues.
Participate in release deployment, system maintenance and cloud expansion.
Design and implement deployment automation platform for Kubernetes-based microservices.
Improve service availability and scalability through tuning, automation, tools, and processes.
Analyze service performance, identify bottlenecks and provide actionable improvement plans.
Improve service monitoring coverage, accuracy and efficiency.
 
Minimum Qualifications:

BS level technical degree required; Computer Science or Engineering background preferred.
8+ years of experience in a CloudOps / DevOps role.
Hands-on experience with AWS or any public cloud (Azure, GCP etc).
Knowledge of Linux, security and networking fundamentals.
Working knowledge of container-based architecture and deployment (Docker, Kubernetes.)
Working knowledge of deployment automation development (Argo Workflows, Terraform, Helm).
Experience in diagnosing and resolving complex application problems.
Working knowledge of Elasticsearch, PostgreSQL, Redis, Ignite, Kafka and RabbitMQ.
Experience with monitoring tools (Nagios, Kibana, Prometheus)
Experience with cloud security and compliance implementation is a plus.
Strong follow-through and initiative to stay with issues until they are resolved.
Comfortable working within a distributed team located in multiple time zones.