Senior Machine Learning Engineer (4+ years) | Nepal

Kathmandu, Nepal
Engineering – Machine Learning /
Full-time /
Hybrid
About Docsumo:
Docsumo is your go-to Document AI solution for streamlining business operations. We turn complex documents like bank statements, policies, and financial statements into valuable, actionable data. Our cutting-edge technology helps businesses make smarter decisions faster. We are backed by marquee investors such as Sequoia, Barclays, Fifth Wall, Common Ocean, and Techstars.

At Docsumo, we're on a mission to revolutionize how businesses handle data. We empower companies to:
Boost efficiency by 6-10 times
Make quick, accurate decisions from unstructured information
Scale operations effortlessly through innovative technology

The opportunity as Senior Machine Learning Engineer:

    • We are seeking senior professionals with over 4 years of experience in the field. This is a role for candidates with a proven track record in machine learning, deep learning, NLP, and computer vision. 
    • If you have led and managed a team of ML scientists and engineers and have a strong foundation in deploying end-to-end ML solutions, this opportunity is for you.
    • This role offers the chance to rapidly advance into leadership positions, such as Lead ML Engineer, where you'll spearhead critical projects with creative autonomy.
    • You'll work closely with our CTO, Data Science, and Engineering teams, shaping the future of intelligent document processing for our expanding client base in the US.
    • This is a full-time role with flexible options, including hybrid work in Kathmandu or remote work from India.
    • Working hours are 10:00 am to 7:00 pm IST, with a 1-hour lunch break.
    • You will report directly to the Data Science Lead or the CTO, collaborating with a talented team to deliver innovative ML solutions that drive customer success.

Key Responsibilities

    • Collaborate with cross-functional teams of scientists and engineers to design, develop, and implement advanced machine learning systems that transform innovative ideas from conceptual stages into operational APIs.
    • Conduct cutting-edge research in machine learning, focusing on the application and fine-tuning of Large Language Models (LLMs) to develop robust, scalable solutions for intelligent document processing.
    • Lead a team of data scientists and machine learning engineers, providing mentorship and fostering a culture of collaboration and continuous learning to achieve high performance and innovative outcomes.
    • Plan, manage, and oversee the full lifecycle of ML projects, ensuring alignment with business goals and timely delivery of high-quality solutions. Develop and apply sophisticated machine learning algorithms to address complex business challenges, particularly those involving the processing and analysis of unstructured data.
    • Engage in Agile development processes including regular standups, sprint planning, and retrospectives to facilitate iterative progress and maintain high standards of output.
    • Ensure the documentation of machine learning methodologies, model development processes, and maintain rigorous testing standards to ensure reliability and efficiency of models in production.
    • Drive the integration and optimization of LLMs and other advanced models to enhance performance and operational efficiency, continuously seeking improvements.

Need to Have:

    • At least 4 years of industry experience in machine learning, deep learning, NLP, and computer vision, ideally within tech companies, product startups, or R&D environments.
    • 1-2 years of experience leading teams of 4-5+ ML scientists and engineers, demonstrating effective leadership and project management skills. Proficiency in PyTorch and TensorFlow, with experience in training deep neural networks, implementing transfer learning, and optimizing models.
    • Strong skills in classification and regression techniques, with hands-on experience in Scikit-learn, Numpy, Pandas, and Scipy. Practical experience with Transformers, such as BERT and GPT, for tasks like text classification, entity recognition, and sentiment analysis.
    • Proficiency in Python and strong understanding of Object-Oriented Programming (OOP) principles.
    • Experience with version control using Git, cloud platforms like AWS and Google Cloud, and containerization technologies such as Docker and Kubernetes.
    • Ability to work effectively in a team, demonstrating motivation, resourcefulness, and a growth mindset.
    • Bachelor’s degree in Computer Science, Statistics, Machine Learning, Physics, Mathematics, or a related field.

Nice to Have:

    • Experience with Large Language Models, including techniques like PEFT, prompt engineering, few-shot learning, and RLHF.
    • Knowledge of computer vision applications like OCR, OpenCV, CNNs, and multimodal AI.
    • Experience with advanced data visualization libraries.
    • Familiarity with Agile development practices and sprint management.
    • A track record of innovative research contributions or publications in the machine learning domain.