Data Quality Engineer

Seoul, South Korea / San Francisco, United States
Tech – ML & Research Engineering /
Full-time /
Who we are

At Twelve Labs, we are pioneering the development of cutting-edge multimodal foundation models that have the ability to comprehend videos just like humans do. Our models have redefined the standards in video-language modeling, empowering us with more intuitive and far-reaching capabilities, and fundamentally transforming the way we interact with and analyze various forms of media.

With a remarkable $77 million in Seed and Series A  funding, our company is backed by top-tier venture capital firms such as NVIDIA’s NVentures, NEA, Radical Ventures, and Index Ventures, and prominent AI visionaries and founders such as Fei-Fei Li, Silvio Savarese, Alexandr Wang and more. Headquartered in San Francisco, with an influential APAC presence in Seoul, our global footprint underscores our commitment to driving worldwide innovation.

We are a global company that values the uniqueness of each person’s journey. It is the differences in our cultural, educational, and life experiences that allow us to constantly challenge the status quo. We are looking for individuals who are motivated by our mission and eager to make an impact as we push the bounds of technology to transform the world. Join us as we revolutionize video understanding and multimodal AI.

About the Role

You will be a vital member of the ML Data Team – the team which delivers the data and “ground truth” labels that are critical to our efforts to build world class video models (currently State of the Art [SoTA] on several industry benchmarks!). Your primary role is to help us build and extend our existing partnerships with companies that specialize in human data labeling for machine learning systems; you will be responsible for planning and prioritizing projects, defining project requirements in consultation with our research team, and executing prioritized projects through both individual contributions and via project/partner management. You will also be responsible for automating as much of the repetitive partnership and annotation-quality-evaluation work as possible. A desire to work cross functionally and to build relationships is critical for success in this position.

In this role, you will

    • Oversee, plan, and take care of data collection and labeling projects. Keep an eye out for automation opportunities to make things easier over time
    • Build and keep up solid relationships with our outside vendors and contractors: ensure our collaboration is smooth and valuable
    • Create labeling instructions and evaluate data quality. Make sure we've got a good mix of quality, diversity, and quantity of data
    • Brainstorm ways to make our tools or instructions more user-friendly
    • Keep tabs on ongoing projects to make sure we're putting our resources in the right places. Be ready to tweak project scope and instructions when new information comes in
    • Share updates on projects, including by building diagnostics/dashboards and data analysis tools/reports
    • Work hand in hand with the rest of the Engineering org to make our interfaces (both code interfaces and human interfaces) even better

You may be a good fit if you have

    • Strong professional english speaking and writing skills
    • 3+ years of software development or analytics-heavy operations experience or 2+ years of experience with Python or other popular industry tools for automation
    • Enjoy paying attention to details and analyzing information and data
    • Have excellent project management skills, and can work with internal and external teams
    • Understand the workings of LLMs or VLMs and prompt engineering
    • Have experience in gathering, labeling, and analyzing data
    • Agree that data is the key ingredient for the performance of AI models
    • Have worked with data collection and labeling for multimodal language models
    • Have managed a team of external contractors or vendors
    • Have launched new technical programs
    • Have worked with research scientists and engineers

Even if there are a few checkboxes that aren’t ticked through your prior experience, we still encourage you to apply! If you are a 0-to-1 achiever, a ferocious learner, and a kind and fun team player who motivates others, you will find a home at Twelve Labs.

We welcome applicants from all walks of life and are committed to equal-opportunity employment. We cherish and celebrate diversity not just because it is the right thing to do, but because it makes our company much stronger.

Benefits and Perks
🤝 An open and inclusive culture and work environment.
🧑‍💻 Work closely with a collaborative, mission-driven team on cutting-edge AI technology.
✈️ Extremely flexible PTO and parental leave policy. Office closed the week of Christmas and New Years.
🏙 Remote-flexible, offices in San Francisco and Seoul and coworking stipend.