Transcription Software Engineer

Austin, TX

Platform /

Fulltime /

Remote

At LeoTech, we are passionate about building software that solves real-world problems in the Public Safety sector. Our software has been used to help the fight against continuing criminal enterprises, drug trafficking organizations, identifying financial fraud, disrupting sex and human trafficking rings and focusing on mental health matters to name a few.

Role

This is a remote, WFH role.
We are seeking a highly skilled Transcription Engineer to join our Platform Team. This role is core to our mission of extracting intelligence from audio in some of the most challenging environments. You will be responsible for advancing transcription quality across our workflows—experimenting with ASR models, integrating third-party services, and building tooling that ensures accuracy, reliability, and scalability. The ideal candidate has a strong software engineering background with expertise in Python, audio processing, and applied machine learning techniques for speech. A mix of hands-on engineering, data science, and DevOps skills is essential, as the role involves both experimentation with ASR models and deploying services into production at scale. This is a challenging and rewarding role for someone who is passionate about audio, language, and building high-quality systems that power real-world intelligence use cases.

Core Responsibilities

Lead efforts to improve transcription quality by evaluating, testing, and fine-tuning ASR models (both commercial APIs and open-source).
Build pipelines that handle speaker identification, diarization, multi-language support, and noise-robust transcription in difficult audio environments.
Develop and maintain services that integrate multiple ASR providers, ensuring resilience and flexibility across transcription workflows.
Collaborate with platform engineers to ensure seamless ingestion and persistence of transcription outputs in data pipelines.
Use data wrangling and exploratory analysis to deeply understand transcription accuracy and error patterns. - Explore and apply audio engineering techniques (denoising, voice isolation, codecs, signal processing) to improve speech clarity.
Deploy and maintain transcription-related services with basic DevOps practices, ensuring scalability and reliability.
Participate in all stages of the development lifecycle: ideation, design, prototyping, implementation, deployment, and iteration.

What We Value

Strong software engineering background in fields such as Computer Science, Software Engineering, or related disciplines.
5+ years of professional development experience, with significant focus on speech processing, NLP, or transcription systems.
Proficiency in Python and comfort with system-level programming when needed.
Experience with ASR frameworks (e.g., Whisper, Kaldi, Vosk, NVIDIA NeMo, or similar).
Familiarity with audio engineering tools (e.g., ffmpeg, Sox) and denoising/voice enhancement techniques.
Knowledge of speaker diarization, speaker recognition, and multi-language ASR challenges.
Experience with data analysis and wrangling (e.g., Pandas, NumPy, Jupyter) to evaluate model performance.
Understanding of cloud deployment and DevOps basics (e.g., Docker, Kubernetes, serverless workloads).
Comfort working in a fast-paced environment with dynamic objectives and quick iteration cycles.
Demonstrated ability to work independently, make tradeoffs, and deliver results with minimal supervision.
Bonus Points
Hands-on experience fine-tuning ASR models on domain-specific datasets.
Familiarity with real-time streaming pipelines for audio ingestion and transcription.
Exposure to search and retrieval systems (e.g., Elasticsearch) for indexing transcribed text.
Prior experience in audio forensics or noisy-channel speech analysis.
Experience with applying heuristics to improve transcription results.

Technologies We Use

We are hosted on AWS Cloud and use numerous AWS services.
Our backend languages primarily consist of Elixir, NodeJS and some Python.
TypeScript and React are central to our front-end development.
Terraform, CloudFormation, Ansible are leveraged for our Infrastructure deployment and automation.
Industry-standard build tooling and CI/CD using AWS CodePipeline and GitHub Actions.
A low-code test automation framework for end-to-end testing.
A mix of open-source and proprietary technologies that are tailored to the problems at hand.

What You Can Expect

Work from home opportunity
Enjoy great team camaraderie.
Thrive on the fast pace and challenging problems to solve.
Modern technologies and tools.
Continuous learning environment.
Opportunity to communicate and work with people of all technical levels in a team environment.
Grow as you are given feedback and incorporate it into your work.
Be part of a self-managing team that enjoys support and direction when required.
3 weeks of paid vacation – out the gate!!
Competitive Salary.
Generous medical, dental, and vision plans.
Sick, and paid holidays are offered.

$140,000 - $170,000 a year

Please note the national salary range listed in the job posting reflects the new hire salary range across levels and U.S. locations that would be applicable to the position. The final salary will be commensurate with the candidate's accepted hiring level and work location. Also, this range represents base salary only and does not include equity, or benefits if applicable.

LeoTech is an equal opportunity employer and does not discriminate on the basis of any legally protected status.

Apply for this job