Research Scientist - Speech Synthesis

Santa Monica, CA 90401
Science – Research Science /
On-site
The most talked about AI company in Hollywood

Flawless is shattering the boundaries of traditional filmmaking with its groundbreaking suite of Gen AI film editing tools. Our mission is to empower filmmakers with cutting-edge technology that allows creativity without compromise, expands storytelling possibilities, and delivers unparalleled visual and emotional experiences.

What we are looking for:

Flawless is looking for a smart, motivated, and committed Research Scientist for our Speech Synthesis team out of the LA Office.  As a Research Scientist on the speech/audio team at Flawless, you will work with and lead a close-knit, passionate group of world-class individuals to tackle some of the most challenging problems in generative speech synthesis, text-to-speech, accent modeling, and voice conversions. Our work in automated visual translation is just the beginning, we’re developing countless exciting products based on the application of our proprietary, cornerstone research.

Qualifications

    • Ph.D. or Postdoctoral researcher in audio synthesis, speech processing, or related field.
    • 3+ years of experience with audio synthesis techniques such as TTS (Text-to-Speech), SST (Speech-to-Speech translation), or voice conversion methods.
    • Demonstrable research experience with a strong publication record in major Speech Synthesis, Speech Processing at venues such as ICASSP, Interspeech, or NeurIPS.
    • Proficiency in linear algebra, deep learning, and numerical optimization.
    • Proficiency in Python, PyTorch, or Tensorflow.
    • Proficiency in compute platforms such as GCP or AWS.

Preferred Qualifications

    • Research experience with demonstrated publication record at top-tier conferences in computer vision/graphics, and deep learning such as ICASSP, Interspeech, or NeurIPS.
    • Experience with audio identity embedding, accent modeling, style-transfer, multi-language audio synthesis.
    • Experience with attention, diffusion models and speech signal processing.
$200,000 - $250,000 a year
Flawless is proud to emphasize an equal opportunity, safe environment for people to do their best work. We are committed to providing equal employment opportunities regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements.