AI Engineer
Noida, Uttar Pradesh
Technology /
Full-Time /
Hybrid
Position Overview-
We are looking for an experienced AI Engineer to design, build, and optimize AI-powered applications, leveraging both traditional machine learning and large language models (LLMs). The ideal candidate will have a strong foundation in LLM fine-tuning, inference optimization, backend development, and MLOps, with the ability to deploy scalable AI systems in production environments.
ShyftLabs is a leading data and AI company, helping enterprises unlock value through AI-driven products and solutions. We specialize in data platforms, machine learning models, and AI-powered automation, offering consulting, prototyping, solution delivery, and platform scaling. Our Fortune 500 clients rely on us to transform their data into actionable insights.
Key Responsibilities:
- Design and implement traditional ML and LLM-based systems and applications.
- Optimize model inference for performance and cost-efficiency.
- Fine-tune foundation models using methods like LoRA, QLoRA, and adapter layers.
- Develop and apply prompt engineering strategies including few-shot learning, chain-of-thought, and RAG.
- Build robust backend infrastructure to support AI-driven applications.
- Implement and manage MLOps pipelines for full AI lifecycle management.
- Design systems for continuous monitoring and evaluation of ML and LLM models.
- Create automated testing frameworks to ensure model quality and performance.
Basic Qualifications:
- Bachelor’s degree in Computer Science, AI, Data Science, or a related field.
- 4+ years of experience in AI/ML engineering, software development, or data-driven solutions.
- LLM Expertise
- Experience with parameter-efficient fine-tuning (LoRA, QLoRA, adapter layers).
- Understanding of inference optimization techniques: quantization, pruning, caching, and serving.
- Skilled in prompt engineering and design, including RAG techniques.
- Familiarity with AI evaluation frameworks and metrics.
- Experience designing automated evaluation and continuous monitoring systems.
- Backend Engineering
- Strong proficiency in Python and frameworks like FastAPI or Flask.
- Experience building RESTful APIs and real-time systems.
- Knowledge of vector databases and traditional databases.
- Hands-on experience with cloud platforms (AWS, GCP, Azure) focusing on ML services.
- MLOps & Infrastructure
- Familiarity with model serving tools (vLLM, SGLang, TensorRT).
- Experience with Docker and Kubernetes for deploying ML workloads.
- Ability to build monitoring systems for performance tracking and alerting.
- Experience building evaluation systems using custom metrics and benchmarks.
- Proficient in CI/CD and automated deployment pipelines.
- Experience with orchestration tools like Airflow.
- Hands-on experience with LLM frameworks (Transformers, LangChain, LlamaIndex).
- Familiarity with LLM-specific monitoring tools and general ML monitoring systems.
- Experience with distributed training and inference on multi-GPU environments.
- Knowledge of model compression techniques like distillation and quantization.
- Experience deploying models for high-throughput, low-latency production use.
- Research background or strong awareness of the latest developments in LLMs.
- Tools & Technologies We Use
- Frameworks: PyTorch, TensorFlow, Hugging Face Transformers
- Serving: vLLM, TensorRT-LLM, SGlang, OpenAI API
- Infrastructure: Docker, Kubernetes, AWS, GCP
- Databases: PostgreSQL, Redis, Vector Databases
We are proud to offer a competitive salary alongside a strong healthcare insurance and benefits package. We pride ourselves on the growth of our employees, offering extensive learning and development resources.