Lead Site Reliability Engineer (SRE)

Finland
Engineering / Full Time / Hybrid
The Lead SRE is responsible for the security, reliability, and cost efficiency of DISQO platforms. This role operates with autonomy and discretion, leading internal and cross-functional teams through the planning and execution of large, complex initiatives.

What you will do:

    • Exhibit a comprehensive understanding of multiple technical domains, providing guidance on both technical and operational challenges faced by the team.
    • Lead by example in conducting design reviews, simplifying complex problems for execution, and overseeing the technical direction for significant projects. Mentor and provide support to fellow engineers.
    • Champion a culture of SecDevOps, influencing teams across the company to adopt practices that enhance system security and reliability.
    • Define and lead the technical roadmap and cross-organizational projects, aligning them with company goals and industry trends.
    • Engage with team members and stakeholders to define clear service level indicators, objectives, and error budgets, ensuring alignment with customer expectations.
    • Demonstrate exceptional technical proficiency, proactively addressing technology bottlenecks and driving solutions.
    • Serve as the primary contact during critical incidents, showcasing your ability to swiftly diagnose and resolve issues, minimizing financial impact.
    • Lead knowledge-sharing initiatives, documenting insights and best practices, and contributing to internal forums and communities of practice.
    • Possess and share expert knowledge of a wide range of SRE principles, including reliability, scalability, observability, performance, security, and enterprise system architecture.
    • Leverage generative AI in a production environment.
    • Apply Agile development practices and lead Agile ceremonies.

What you bring to the role:

    • Degree in Computer Science, Engineering, or equivalent experience.
    • At least 5 years of experience in Site Reliability Engineering, with a proven track record in technical leadership roles.
    • 5 years of experience working with AWS, Linux/Unix administration, and containerization technologies (Docker, Kubernetes).
    • AWS Professional and/or Specialty Certification preferred.
    • Expert knowledge of automation and scripting (Python, Bash).
    • Expert knowledge of CI/CD practices and IaC (e.g., Terraform, GitLab, Argo).
    • Demonstrated ability to lead in high-pressure situations and to solve complex problems.
    • Excellent communication, mentorship, and leadership skills, capable of motivating and guiding a team towards success.
    • A proactive approach to identifying and solving technical challenges, with a commitment to continuous learning and improvement.