Solutions Architect (AI/ML)
Las Vegas, Nevada
Engineering /
Full Time /
Hybrid
At Tensorwave, we’re leading the charge in AI compute, building a versatile cloud platform that’s driving the next generation of AI innovation. We’re focused on creating a foundation that empowers cutting-edge advancements in intelligent computing, pushing the boundaries of what’s possible in the AI landscape.
Job Description:
TensorWave is seeking a Solutions Architect with a strong background in AI/ML infrastructure, model deployment, and MLOps to support enterprise customers in designing and implementing scalable AI inference and training solutions. This role serves as a technical bridge between sales and engineering teams, ensuring that customers can fully leverage TensorWave’s high-performance GPU cloud platform. As a technical pre-sales expert, you will work closely with sales teams to understand customer requirements, propose tailored solutions, and drive POCs that demonstrate the advantages of TensorWave’s AI cloud. You will also collaborate with the MLE/MLOps engineering teams to refine deployment strategies and optimize AI workloads for cost-efficiency and performance. This is a customer-facing role that requires both deep technical expertise and strong communication skills to guide organizations in deploying LLMs, fine-tuning models, and scaling inference workloads.
Responsibilities
- Pre-Sales Technical Support: Assist sales teams in technical discussions, aligning TensorWave’s GPU infrastructure with customer AI workloads.
- Solution Design: Architect scalable ML pipelines, AI inference, and fine-tuning workflows that meet enterprise needs.
- Proof-of-Concept (POC) Execution: Lead POC engagements with customers, working alongside MLE/MLOps teams to demonstrate TensorWave’s capabilities.
- Customer Enablement: Educate customers on best practices for LLM inference, model optimization, and deployment using TensorWave’s infrastructure.
- Optimization & Performance Tuning: Analyze customer workloads and recommend optimizations for GPU utilization, inference speed, and memory efficiency.
- Technical Presentations & Demos: Develop and deliver technical presentations, live demos, and solution briefs for enterprise clients.
- Cross-Team Collaboration: Work with product, engineering, and marketing teams to incorporate customer feedback into platform improvements.
- AI Infrastructure & MLOps Strategy: Guide customers in MLOps best practices, containerized deployments (Docker, Kubernetes), and model serving frameworks.
Essential Skills & Qualifications
- Equivalent of a Bachelor’s Degree in Computer Science, AI/ML, or a related field.
- 3+ years of experience in AI/ML model deployment, MLOps, or cloud AI solutions.
- Hands-on expertise with LLM inference, fine-tuning, and optimization techniques (e.g., quantization, distillation, tensor parallelism).
- Strong understanding of GPU acceleration, model serving optimizations, and AI infrastructure scaling.
- Proficiency in model deployment frameworks such as Triton, vLLM, TGI, TensorRT, or ONNX Runtime.
- Experience with cloud architectures and scalable AI pipelines.
- Excellent communication and presentation skills, with the ability to translate complex AI concepts for technical and non-technical stakeholders.
- Hands-on experience with containerization (Docker, Kubernetes)
Preferred Qualifications
- Experience in a sales engineering, solutions architecture, or pre-sales technical consulting role.
- Hands-on experience with AMD ROCm ecosystem or similar AI acceleration frameworks.
- Contributions to open-source AI inference and optimization projects.
- Familiarity with networking and HPC environments for AI workloads.
- Strong ability to analyze customer AI workloads and recommend cost-effective scaling strategies.
We’re looking for resilient, adaptable people to join our team—folks who enjoy collaborating and tackling tough challenges. We’re all about offering real opportunities for growth, letting you dive into complex problems and make a meaningful impact through creative solutions. If you're a driven contributor, we encourage you to explore opportunities to make an impact at Tensorwave. Join us as we redefine the possibilities of intelligent computing.
What We Bring:
In addition to a competitive salary, we offer a variety of benefits to support your needs, including:
Stock Options
100% paid Medical, Dental, and Vision insurance
Life and Voluntary Supplemental Insurance
Short Term Disability Insurance
Flexible Spending Account
401(k)
Flexible PTO
Paid Holidays
Parental Leave
Mental Health Benefits through Spring Health