Site Reliability Engineer (Enterprise Solution)
Taipei / KKStream – Engineering / Permanent
Asia’s leading technology group, KKCompany Technologies (KKCompany), is a leader in software services. Our mission is to build “Freeways to Inspiration” and help industries achieve digital transformation. By creating technology highways with partners, we deliver our services around the world and drive value creation through future technology.
In addition to our flagship brands KKBOX, KKStream, and Going Cloud, our core technologies cover fields such as music streaming, multimedia, and cloud services. Through a range of products and services, we help customers create commercial value. We also offer software services and solutions to tens of millions of customers, with corporate clients across Asia in industries such as telecommunications, entertainment and multimedia, media, education, and fitness centers.
We have over 500 employees across offices in Tokyo, Singapore, Taipei, Kaohsiung, and Hong Kong.
Responsibilities:
- Develop and maintain the service monitoring software stack.
- Develop and maintain infrastructure orchestration on cloud platforms.
- Improve the reliability and scalability of online services.
- Engage in and improve the whole lifecycle of services.
- Write documentation for knowledge sharing.
- Participate in the on-call rotation.
Requirements:
- Bachelor's degree in Computer Science or a related technical field involving software or systems engineering, or equivalent practical experience.
Nice to Have:
- Good skills and experience in communication and documentation.
- Experience with programming or scripting languages such as Python or Bash.
- Experience with infrastructure deployment automation tools such as Terraform, AWS CloudFormation, or AWS CDK.
- Experience with Git and CI/CD stacks such as GitLab.
- Experience in operating and deploying services on AWS.
- Experience in cloud networking, storage, computing, and streaming-related services.
- Experience building monitoring dashboards and collecting metrics, logs, and traces with tools such as AWS CloudWatch or Container Insights.
- Experience coordinating internal and external audits to ensure compliance with ISO 27001 requirements.