Senior DevOps Engineer

San Francisco, CA
Software /
Full-Time /
Hybrid
About Gridware
Gridware is a San Francisco-based technology company dedicated to protecting and enhancing the electrical grid. We pioneered a groundbreaking new class of grid management called active grid response (AGR), focused on monitoring the electrical, physical, and environmental aspects of the grid that affect reliability and safety. Gridware’s advanced Active Grid Response platform uses high-precision sensors to detect potential issues early, enabling proactive maintenance and fault mitigation. This comprehensive approach helps improve safety, reduce outages, and ensure the grid operates efficiently. The company is backed by climate-tech and Silicon Valley investors. For more information, please visit www.Gridware.io.

Role summary: 
We are seeking a Cloud Engineer to drive the automation, security, and reliability of our cloud infrastructure. This role focuses on integrating security best practices into every stage of the development lifecycle, from infrastructure provisioning to deployment pipelines. You will be responsible for automating infrastructure management with IaC tools, implementing secure CI/CD pipelines, managing Kubernetes clusters, and ensuring strong security controls through SSO, IAM, SIEM integration, and endpoint protection platforms. Working closely with security and development teams, you will play a key role in building secure, scalable, and efficient cloud systems. 

Responsibilities

    • Design, implement, and manage infrastructure 
    • Maintain and optimize Kubernetes clusters for high availability and performance 
    • Build and maintain CI/CD pipelines 
    • Integrate and manage identity and access management for our infrastructure 
    • Ensure security best practices are followed, including integration with SIEM tools 
    • Collaborate with security teams to deploy and monitor EPP/EDR/XDR platforms on our cloud 
    • Work closely with developers to streamline deployment workflows and improve system reliability 
    • Provide support and incident response for production systems, troubleshooting and resolving issues as needed 

Required Skills

    • Proficiency with Terraform and Terragrunt in production environments 
    • Strong experience with Kubernetes( EKS or similar) 
    • Working knowledge of AWS and its core services 
    • Experience with ArgoCD for GitOps workflows 
    • Experience building CI/CD workflows with GitHub Actions 
    • Hands-on experience integrating and managing Auth0 
    • Familiarity with SIEM tools for security monitoring 
    • Experience working with EPP/EDR/XDR security solutions 

Bonus Skills

    • Working with RDS or managed databases on AWS 
    • Experience with MSK (Managed Streaming for Apache Kafka) 
    • Exposure to Databricks or a background inML-Ops 
$170,000 - $190,000 a year
This describes the ideal candidate; many of us have picked up this expertise along the way. Even if you meet only part of this list, we encourage you to apply!

Benefits
Health, Dental & Vision (Gold and Platinum with some providers plans fully covered) 
Paid parental leave 
Alternating day off (every other Monday)
“Off the Grid”, a two week per year paid break for all employees. 
Commuter allowance 
Company-paid training