CUDA Kernels Engineer

Palo Alto, CA
Engineering /
Full Time /
On-site

Submit your application

  • File exceeds the maximum upload size of 100MB. Please try a smaller size.

Links

Work Permit

  • Are you authorized to work in the country where this position is based?
  • If you don't have work authorization, will you require work authorization sponsorship?

Relocation

  • Are you willing to relocate if you do not live close to the local office of choice?
  • How soon can you move to the area of the local office?

Short Answer

  • Briefly describe the most relevant project you have worked on. Be sure to outline your specific contributions.

Performance Engineering

  • What is your experience level with writing and optimizing GPU kernels using CUDA or similar low-level programming frameworks (e.g., Triton, OpenCL)?
  • What is your experience level with AI accelerators or GPU/CPU hardware architecture and performance optimization?
  • What is your experience level with foundation model architectures and training infrastructure (e.g., Transformers, LLMs)?