Senior Site Reliability Engineer (SRE)

United States (Remote) /
Engineering – Site Reliability /
Full time
As a Senior Site Reliability Engineer (SRE) you will help build a meaningful engineering discipline, combining software and systems to develop creative engineering solutions to operations problems.
 
Much of our support and software development focuses on optimizing existing systems, building infrastructure and reducing work through automation. You’ll join a team of curious problem solvers with a diverse set of perspectives who are thinking big and taking risks. In this environment you’ll take the lead on relevant projects, supported by an organization that provides the support and mentorship you need to learn and grow. As an SRE you’ll be focused on running better production applications and systems.

Responsibilities:

    • Design, code, test and deliver software to automate manual operational work
    • Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents
    • Engage with development team throughout the life cycle to help develop software for reliability and scale, ensuring minimal refactoring or changes
    • Identify application patterns and analytics in support of better service level objectives
    • Design self-healing and resiliency patterns
    • Design automated software and product upgrades, change management, and release management solutions
    • Participate in the 24×7 support coverage as needed
    • Mentor and guide junior developers

Requirements:

    • Expertise in at least one technology stack designing, coding, testing, and delivering software
    • Proficiency in one or more technology domains, may be a cross-domain expert able to solve complex and mission critical problems within a business or across the firm
    • Working knowledge of infrastructure components (e.g. routers, load balancers , cloud products , container systems , compute, storage and networks)
    • Excellent debugging and trouble shooting skills
    • Prior experience in DevOps and/or application development teams
    • Hands on experience using large scale software development, preferably in one of these languages: Java, Python, scripting languages
    • Hands on experience of Kubernetes, Docker, Docker Swarm style deployments
    • Exposure on data-dog and data-dog monitoring
    • Hands on experience of Continuous Delivery tools
    • Hands on experience in Unix: Linux and Solaris
    • Exposure to Orchestration and configuration management tools for applications
    • Experience with infrastructure components utilized in data warehousing or big data environments
    • Excellent communication skills, both written and oral appropriately scaled for senior technical and senior business audience
    • Ability to work and effectively prioritize in a highly dynamic work environment that includes a global focus


Zimperium is an Equal Opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex including sexual orientation and gender identity, national origin, disability, protected veteran status, or any other characteristic protected by applicable federal, state, or local law.