Senior Data Scientist

Pune
Engineering – Integrations /
Full-time /
On-site
AppZen is the leader in autonomous spend-to-pay software. Its patented artificial intelligence accurately and efficiently processes information from thousands of data sources so that organizations can better understand enterprise spend at scale to make smarter business decisions. It seamlessly integrates with existing accounts payable, expense, and card workflows to read, understand, and make real-time decisions based on your unique spend profile, leading to faster processing times and fewer instances of fraud or wasteful spend. Global enterprises, including one-third of the Fortune 500, use AppZen’s invoice, expense, and card transaction solutions to replace manual finance processes and accelerate the speed and agility of their businesses. To learn more, visit us at www.appzen.com.

About the Role

    • We are looking for a Senior Data Scientist to come and work on our growing AI
    • stack. You will be working with a team of highly skilled and motivated data scientists and machine learning engineers. If you are excited about natural
    • language understanding and machine translation, AppZen is the right place for you to apply and grow your skills.

Key Responsibilities:

    • Solid understanding of machine learning fundamentals, and familiar
    • with standard algorithms and techniques
    • Solid understanding of Python programming, including core data
    • structures (lists, dicts, sets, tuples), object-oriented design, and writing
    • clean, modular, and maintainable code.
    • Experience with code optimization techniques, including efficient
    • memory and runtime usage, vectorization (NumPy/Pandas), and
    • debugging performance bottlenecks.
    • Hands-on experience with LLMs such as GPT, LLaMA, or Claude,
    • including evaluation and integration into production pipelines.
    • Proficiency in using GenAI frameworks and tools (e.g., Hugging Face
    • Transformers, LangChain, OpenAI APIs, vLLM, or AutoGen) to build,
    • customize, and deploy language-based solutions.
    • Ability to analyze, evaluate, and optimize LLM outputs for accuracy,
    • safety, relevance, and bias mitigation in enterprise use cases.
    • Work with internal and external stakeholders on program management
    • Manage your own process: identify and execute on high impact projects,
    • triage external requests, and make sure you bring projects to conclusion
    • in time for the results to be useful
    • Excellent written and verbal technical communication skills;
    • communicate proposals and results in a clear manner backed by data
    • and coupled with actionable conclusions to drive business decisions
    • M.Tech/B.Tech. or equivalent experience in Computer Science,
    • Engineering, Statistics, or other relevant technical field
    • Must have 6+ years of industry experience
    • You are a team player

Nice-to-Have:

    • Track record of having developed novel algorithms, e.g. publications in
    • one or more of the following: KDD, WWW, NIPS, ISWC, NAACL, ACL,
    • SIGIR, EMNLP, ICML etc
    • Good Understanding of MLOps tools/processes like ElasticSearch, Jenkins,
    • Docker is plus.
    • Expertise in building and fine-tuning LLM models using
    • Transformers and RAG systems.