Senior DevOps Engineer(SRE)

Singapore
SG Engineering – DevOps&Security /
Full Time /
On-site
About PatSnap
Patsnap empowers IP and R&D teams by providing better answers, so they can make faster decisions with more confidence. Founded in 2007, Patsnap is the global leader in AI-powered IP and R&D intelligence. Our domain-specific LLM, trained on our extensive proprietary innovation data, coupled with Hiro, our AI assistant, delivers actionable insights that increase productivity for IP tasks by 75% and reduce R&D wastage by 25%.
IP and R&D teams collaborate better with a user-friendly platform across the entire innovation lifecycle. Over 15,000 companies trust Patsnap to innovate faster with AI, including NASA, Tesla, PayPal, Sanofi, Dow Chemical, and Wilson Sonsini.

About the Role

    • We are looking for a skilled and experienced Senior DevOps Engineer / Site Reliability Engineer (SRE) to ensure the high availability, stability, and performance of our business platform.
    • This role will be central to designing and implementing scalable and maintainable DevOps architecture and automation systems to enhance operational efficiency.
    • As a senior member, you will lead efforts in optimizing our operational standards, managing risk assessments, and fostering collaboration with our China-based operations team.
    • If you are passionate about high-performance systems, security, and automation, we welcome you to join our team.

Key Responsibilities

    • Ensure high availability, stability, and performance of business platforms, developing optimization strategies and refining operational standards and procedures.
    • Lead the design and implementation of scalable, maintainable DevOps architecture and automation systems to streamline and enhance operational processes.
    • Oversee security risk assessments, and lead the creation and implementation of security strategies to maintain system security.
    • Evaluate and review the system architecture, process logic, performance, and stability, working closely with SRE and developer teams in China to address challenges effectively.
    • Act as the primary incident commander for production environment issues, leading team efforts in troubleshooting and resolution, and ensuring timely response and resolution.
    • Stay updated on the latest trends in technology advancements, organizing team learning sessions to foster continuous improvement.

Requirements

    • Bachelor’s degree in Computer Science or a related field, with at least 4 years of experience in internet system operations or SRE roles.
    • In-depth understanding of internet technology architecture, including expertise in microservices, Kubernetes, Docker, monitoring and alerting systems, CI/CD, logging systems, distributed caching, and database systems.
    • Extensive experience in distributed systems and high-concurrency operations, with strong skills in fault diagnosis and system optimization.
    • Proficient in cloud platform operations (e.g., AWS, Azure), with knowledge of MySQL, PostgreSQL, Redis, and familiarity with big data technologies and hybrid cloud architectures preferred.
    • Skilled in at least one programming language such as Python, Go, or Java, with relevant development experience.
    • Strong organizational and coordination skills, with the ability to guide team members in solving complex issues.
    • Fluent in Mandarin to facilitate effective communication within a multilingual team environment.

Why Join Us

    • Work with innovative DevOps and cloud technologies to drive impactful solutions.
    • Be part of a collaborative, growth-oriented environment that emphasizes continuous learning.
    • Engage in diverse DevOps areas, including system automation, security, and performance tuning, for a comprehensive experience.