Research Scientist - Audio 3D Vision

Santa Monica, CA 90401
Science – Research Science /
Flawless is an award winning film technology company pioneering the generative AI revolution in filmmaking. Solving some of the biggest problems filmmakers face by empowering them with groundbreaking AI powered post production tools, allowing on screen dialogue to be visually changed without the need to reshoot or go back to set. The world’s first system has won multiple awards including TIME Best Inventions.

Founded in 2020 by Hollywood director, Scott Mann and serial technology entrepreneur Nick Lynes, Flawless is opening a world of new possibilities for filmmakers whilst ensuring the responsible adoption of generative AI. With headquarters in London and LA, Flawless has established an exceptional team of 100 world leaders in science, film and technology that have come from the likes of Adobe, Google, Lionsgate, NASA, Sony, Facebook, Microsoft and Apple (click here to find out more).

Flawless are looking for a Research Scientist - Audio 3D Vision
You will be working in an environment based on trust, autonomy and collaboration, and this is a great opportunity for someone who wants to be part of a growing company in its most exciting stage of development. You can play a part in shaping the future of a company that’s caring, creative and collaborative.

As a Research Scientist on the science team at Flawless, you will work with and lead a close-knit, passionate group of world-class individuals tackling some of the most challenging problems in deep learning, including Audio-driven 3D facial animation, GAN models for visual speech synthesis, speech-based modeling, multi-modal fusion for audio-visual learning, and a much more.
Our work in automated visual translation is just the beginning, we’re developing countless exciting products based on the application of our proprietary, cornerstone research.


    • Ph.D. or Postdoctoral researcher in 3D Computer Vision, Speech Synthesis, Computer Graphics, or related field
    • Demonstrable research experience with a strong publication record in major 3D Computer Vision, Speech Processing, and Computer Graphics venues and journals like CVPR, SIGGRAPH, or NeurIPS
    • 6+ years of experience in Python with proficiency in deep learning frameworks such as PyTorch or Tensorflow
    • High degree of proficiency in math and statistical methods for signal processing
    • Experience with to audio-visual learning, multimodal fusion, and/or 3D audio-driven face animation
    • Outstanding communication skills to collaborate in a team with Scientist, Research/ML Engineers and VFX artists.
    • Experience with speech modeling and speed synthesis with deep neural networks
    • Generative and cross-domain attention models for 3D visual speech synthesis applications