Software Engineer - Compiler

Toronto, ON / Remote, USA / San Francisco Bay Area, CA

Engineering /

Full-Time /

Hybrid

About Us

We believe AI will fundamentally transform how people live and work. CentML's mission is to massively reduce the cost of developing and deploying ML models so we can enable anyone to harness the power of AI and everyone to benefit from its potential.

Our founding team is made up of experts in AI, compilers, and ML hardware and has led efforts at companies like Amazon, Google, Microsoft Research, Nvidia, Intel, Qualcomm, and IBM. Our co-founder and CEO, Gennady Pekhimenko, is a world-renowned expert in ML systems who holds multiple academic and industry research awards from Google, Amazon, Facebook, and VMware.

Overview:

We are seeking highly motivated and skilled compiler engineers to join our team in developing a state-of-the-art compiler for machine learning systems. Are you interested in pushing the frontier of AI programmability and performance? Do you like building kernels and tools that make programs run with the speed of light? As a compiler engineer, you will play a critical role in developing high-performance GPU kernels using the Hidet compiler, extending the compiler with new capabilities, and deploying it in collaboration with peer teams to accelerate LLM inference and training.

Join our team, and help us in our mission to democratize AI for the world!

Responsibilities:

Write modular, high-quality code for the Hidet compiler in Python/C++.
Implement new and classical compiler optimizations to accelerate the performance of AI workloads on GPU and CPU.
Build highly efficient GPU kernels using Hidet Python-based DSL.
Write tests and benchmarks to validate and evaluate our solutions.
Collaborate with peer teams to deploy the compiler in inference and training systems.

Who you are:

Bachelor’s or higher degree in Computer Science or related technical field.
Strong coding skills in Python, C++, CUDA, or OpenCL.
Excellent communication skills and the ability to work in a team.
2+ industry or academic experience in compiler development, GPGPU programming or performance engineering.
Experience or interest in accelerating AI workloads.
Solid fundamentals in other computer science and computer engineering topics: algorithms and data structures, operating systems, computer architecture, etc.

You will stand out from the crowd if you have:

Experience with ML systems, such as PyTorch, JAX, or TensorFlow.
Experience with ML compilers, such as TVM, IREE, XLA, pytorch.compile, etc.
Experience developing high-performance kernels for NVIDIA or AMD GPUs.
Experience with AI accelerator hardware.
Experience with MLIR or LLVM.
Contributions to open-source projects.
Strong publication record in ML, HPC, compiler or computer system conferences.

Benefits & Perks

- An open and inclusive work environment

- Employee stock options

- Best-in-class medical and dental benefits

- Parental Leave top-up

- Professional development budget

- Flexible vacation time to promote a healthy work-life blend

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, disability, and any other protected ground of discrimination under applicable human rights legislation.

CentML strives to respect the dignity and ‎‎independence of people with disabilities and is committed to giving them the same ‎‎opportunity to succeed as all other employees.

Inclusiveness is core to our culture at CentML, and we strive to ensure you get the most from your interview experience. CentML makes reasonable accommodations for applicants with disabilities. If a reasonable accommodation is needed to participate in the job application or interview process, please reach out to the Talent team.

Apply for this job