Senior Data Scientist-Large Language Models
Asia / UAE, Dubai / Taiwan, Taipei / Australia, Melbourne / Australia, Brisbane / Australia, Sydney / Austria, Vienna / United Kingdom, London / Canada
Engineering – Data Science/AI /
Full-time Onsite or Remote /
Hybrid
Large Language Models (LLMs) represent a cutting-edge advancement in artificial intelligence, capable of understanding, generating, and processing human language with remarkable accuracy. Leveraging vast amounts of data, these models can perform a wide range of tasks, from text generation to complex problem-solving, making them invaluable across various industries. This scope extends beyond LLMs to include multi-modal LLMs that integrate video, image, audio, text, and code, as well as exploring the frontier of AGI (Artificial General Intelligence) technologies.
As a company operating within specialized domains such as finance, web3, and cryptocurrency, we require LLM Scientists to help us tailor these powerful models to our unique needs. Our goal is not only to fine-tune existing LLMs but also to potentially build custom models that can address the specific challenges and opportunities within these fields. We aim to leverage multi-modal LLMs and cutting-edge AGI technologies to enhance our capabilities further. In the role of an LLM Scientist, you will be at the forefront of AI innovation, applying your expertise to develop and refine models that possess deep domain knowledge and enhanced analytical capabilities. Your work will enable us to tackle more complex applications, such as:
Crypto Market Analysis: Developing models that can analyze and predict market trends, providing insights and forecasts that are crucial for making informed decisions in the fast-paced world of cryptocurrency.
Smart Investment Advisors: Creating intelligent systems that offer personalized investment advice based on real-time data and sophisticated risk assessment, helping clients navigate the financial landscape with confidence.
Advanced Data Interpretation: Utilizing your skills to enhance the model’s ability to interpret complex financial data, regulatory changes, and market signals, ensuring our clients stay ahead of the curve.
Responsibilities:
- Drive core technology development for LLM, continuously optimizing comprehension, reasoning, and generation capabilities.
- Collaborate with cross-functional teams, to integrate advanced LLM solutions into existing systems, ensuring seamless operation and maximum impact.
- Work closely with prompt engineers to refine and optimize prompt design, enabling more accurate and contextually relevant outputs from LLMs.
- Conduct cutting-edge research to stay ahead of the latest developments in LLM and AGI technologies, applying these advancements to solve complex business challenges.
- Develop scalable and robust LLM frameworks that can be adapted to various domains, driving innovation and maintaining a competitive edge in the market.
Requirements:
- PhD degree required, with top artificial intelligence conference papers (NeurIPS, ICML, ICLR, CVPR, ACL, OSDI, NSDI, SC and SigMOD, etc.) in machine learning (ML), computer vision (CV), natural language processing (NLP) and other fields.
- Programming skills, data structure and algorithm skills, proficient in C/C++ or Python programming language, candidates with awards in ACM/ICPC, NOI/IOI, Top Coder, Kaggle and other competitions are preferred.
- Research experience in the field of machine learning, especially in large language models (LLMs) and generative artificial intelligence.
- Excellent problem analysis and solving skills, and passionate about solving challenging problems.
- Passionate about technology, good communication skills and teamwork spirit.