Senior Data Engineer

Taipei / Music – Engineering / Permanent / Hybrid
KKCompany Technologies Group is a leading technology group in software services. We created KKBOX, the world’s first legal music streaming platform, and are an international software technology group whose core business is “multimedia technologies, digital cloud, and AI applications” that create value for our customers. The group’s self-owned brands include KKBOX, BlendVision, and Going Cloud, serving enterprise customers across Asia. KKCompany Technologies has attained OpenChain ISO/IEC 5230 and ISO 27001 third-party international certifications.

We have over 500 employees across offices in Tokyo, Singapore, Taipei, Kaohsiung, and Hong Kong. For more information, please visit our website at www.kkcompany.com and our blog at blog.kkcompany.com.

Team Information
The Data Platform team’s main aim is to turn data into insights and evidence that help stakeholders take meaningful business actions. The team’s key findings and reports give business leaders what they need to make informed decisions. If you’re passionate about using data to drive the business and help organizational leaders act on it, we want to hear from you.

Responsibilities

    • Design and implement ETL/ELT processes that integrate data from a variety of internal and external sources into an analytical data model supporting business analytics and product development
    • Develop procedures for tracking data quality, completeness, redundancy, and improvement
    • Create processes for governing the identification, accuracy, collection, and use of metadata
    • Take responsibility for data accuracy, maintenance, and support within the analytical data model
    • Work closely with data analysts and development teams to implement solutions that enable them to do their jobs
    • Document ER diagrams and the data catalog to ensure long-term maintainability

Requirements

    • 5+ years of experience in building and operating large-scale distributed systems or applications
    • Strong programming skills in one or more of the following: Python, Scala, PySpark, or Spark SQL
    • Experience in ETL or ELT (designing, coding, and tuning data processes from various sources)
    • Experience with cloud solutions like AWS or GCP
    • Experience with workflow and CI tools such as Airflow or Jenkins
    • Ability to translate ‘tech speak’ into meaningful business language
    • Team-oriented, open-minded, communicative and eager to learn
    • A proactive, data-driven and logical mindset

Nice to Have

    • Experience in container technology, such as Docker and Kubernetes
    • Experience in backend system design and API development
    • Experience in implementing machine learning algorithms