Compiler Optimization Engineer

SF Bay Area / Toronto

Engineering /

Full-time /

Hybrid

At Lemurian Labs, we’re on a mission to bring the power of AI to everyone—without leaving a massive environmental footprint. We care deeply about the impact AI has on our society and planet, and we’re building a rock-solid foundation for its future, ensuring AI grows sustainably and responsibly. Because let’s face it, what good is innovation if it doesn’t help the world?

We are building a high-performance, portable compiler that lets developers “build once, deploy anywhere.” Yes, anywhere. We’re talking about seamless cross-platform compatibility, so you can train your models in the cloud, deploy them to the edge, and everything in between—all while optimizing for resource efficiency and scalability.

If the idea of sustainably scaling AI motivates you and you’re excited about making AI development both powerful and accessible, then we’d love to have you. Join us at Lemurian Labs, where you can have fun building the future—without leaving a mess behind.

Key Duties

Design, develop, maintain and improve our heterogeneous AI compiler
Propose improvements to and expand our IR with respect to new advancements in machine learning model architectures
Design and implement new passes and innovative scheduling techniques
Use the latest techniques in parallelization and partitioning to automate generation and exploit highly optimized kernels
Rapid prototyping and data driven exploration of new ideas
Benchmark and analyze the outputs produced by our optimizing compiler on target hardware
Work closely with our product team to understand the evolving needs of ML engineers and drive improvements in architecture
Build tools to collect and analyze performance bottlenecks

Essential Skills and Experience

BS degree in computer science, computer engineering, electrical engineering, or equivalent practical experience
4+ years of experience working with compilers, with a focus on optimization passes, graph construction and traversal, and graph transformations.
Very strong knowledge of compiler algorithms and data structures
4+ years of experience with C/C++ (C++14 or newer)
Strong written and oral communication, and able to write clear and concise documentation
Team first attitude
Detail oriented

Preferred Skills and Experience

Masters or PhD degree in computer science, computer engineering, electrical engineering, or equivalent practical experience
Knowledge of DL frameworks such as PyTorch, JAX or Triton
Experience with programming large compute clusters
Working knowledge of LLVM and polyhedral models
Working knowledge of MLIR
Experience with ML compilers such as TVM, XLA, Glow, TensorRT, IREE, nGraph, etc would be ideal
Experience with loop optimizations (vectorization, unrolling, fusion, parallelization, etc)

Salary depends on experience and geographical location.

This salary range may be inclusive of several career levels and will be narrowed during the interview process based on a number of factors, such as candidate’s experience, knowledge, skills and abilities, as well as internal equity among our team.

Additional benefits for this role may include: equity, company bonus opportunities; medical, dental, and vision benefits; retirement savings plan; and supplemental wellness benefits.

Lemurian Labs ensures equal employment opportunity without discrimination or harassment based on race, color, religion, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity or expression, age, disability, national origin, marital or domestic/civil partnership status, genetic information, citizenship status, veteran status, or any other characteristic protected by law.

EOE

Apply for this job