AI Engineer

Noida, Uttar Pradesh
Technology /
Full-Time /
Hybrid
Position Overview-
We are looking for an experienced AI Engineer to design, build, and optimize AI-powered applications, leveraging both traditional machine learning and large language models (LLMs). The ideal candidate will have a strong foundation in LLM fine-tuning, inference optimization, backend development, and MLOps, with the ability to deploy scalable AI systems in production environments.

ShyftLabs is a leading data and AI company, helping enterprises unlock value through AI-driven products and solutions. We specialize in data platforms, machine learning models, and AI-powered automation, offering consulting, prototyping, solution delivery, and platform scaling. Our Fortune 500 clients rely on us to transform their data into actionable insights.

Key Responsibilities:

    • Design and implement traditional ML and LLM-based systems and applications.
    • Optimize model inference for performance and cost-efficiency.
    • Fine-tune foundation models using methods like LoRA, QLoRA, and adapter layers.
    • Develop and apply prompt engineering strategies including few-shot learning, chain-of-thought, and RAG.
    • Build robust backend infrastructure to support AI-driven applications.
    • Implement and manage MLOps pipelines for full AI lifecycle management.
    • Design systems for continuous monitoring and evaluation of ML and LLM models.
    • Create automated testing frameworks to ensure model quality and performance.

Basic Qualifications:

    • Bachelor’s degree in Computer Science, AI, Data Science, or a related field.
    • 4+ years of experience in AI/ML engineering, software development, or data-driven solutions.
    • LLM Expertise
    • Experience with parameter-efficient fine-tuning (LoRA, QLoRA, adapter layers).
    • Understanding of inference optimization techniques: quantization, pruning, caching, and serving.
    • Skilled in prompt engineering and design, including RAG techniques.
    • Familiarity with AI evaluation frameworks and metrics.
    • Experience designing automated evaluation and continuous monitoring systems.
    • Backend Engineering
    • Strong proficiency in Python and frameworks like FastAPI or Flask.
    • Experience building RESTful APIs and real-time systems.
    • Knowledge of vector databases and traditional databases.
    • Hands-on experience with cloud platforms (AWS, GCP, Azure) focusing on ML services.
    • MLOps & Infrastructure
    • Familiarity with model serving tools (vLLM, SGLang, TensorRT).
    • Experience with Docker and Kubernetes for deploying ML workloads.
    • Ability to build monitoring systems for performance tracking and alerting.
    • Experience building evaluation systems using custom metrics and benchmarks.
    • Proficient in CI/CD and automated deployment pipelines.
    • Experience with orchestration tools like Airflow.
    • Hands-on experience with LLM frameworks (Transformers, LangChain, LlamaIndex).
    • Familiarity with LLM-specific monitoring tools and general ML monitoring systems.
    • Experience with distributed training and inference on multi-GPU environments.
    • Knowledge of model compression techniques like distillation and quantization.
    • Experience deploying models for high-throughput, low-latency production use.
    • Research background or strong awareness of the latest developments in LLMs.
    • Tools & Technologies We Use
    • Frameworks: PyTorch, TensorFlow, Hugging Face Transformers
    • Serving: vLLM, TensorRT-LLM, SGlang, OpenAI API
    • Infrastructure: Docker, Kubernetes, AWS, GCP
    • Databases: PostgreSQL, Redis, Vector Databases
We are proud to offer a competitive salary alongside a strong healthcare insurance and benefits package. We pride ourselves on the growth of our employees, offering extensive learning and development resources.