AI Safety Analyst | Full Time | Singapore or Palo Alto

Singapore / Palo Alto
General & Administrative /
Hybrid /
Hybrid
About TrustLab

Online misinformation, hate speech, child endangerment, and extreme violence are some of the world's most critical and complex problems. TrustLab is a fast-growing, VC-backed startup, founded by ex-Google, TikTok and Reddit executives determined to use software engineering, ML, and data science to tackle these challenges and make the internet healthier and safer for everyone. If you’re interested in working with the world’s largest social media companies and online platforms, and building technologies to mitigate these issues, you’ve come to the right place. 

About the Role

As an AI Safety analyst, you will be engaging on the full spectrum of policy issues on AI Safety and play an integral role in building deep expertise within the team. You will work directly on solving real world complex trust & safety and fraud issues. Your work will be critical in the design & development of our AI safety product & service offerings.

Day-to-day work may encompass anything from risk helping to shape strategic initiatives, technical/policy research, risk evaluations and investigations. You will also get to work on adversarial and red-teaming opportunities to protect real users and improve AI security.

This role can be performed remotely from anywhere in Singapore or Palo Alto.

Responsibilities

    • Develop deep subject matter expertise in role of AI safety in cyber security risks 
    • Discover and exploit Responsible AI vulnerabilities end-to-end in order to assess the safety of systems by developing responsible AI red teaming methodologies
    • Develop a framework for testing and benchmarking the safety of AI Models
    • Play a role in building & improving Gen AI fraud & risk detection capabilities
    • Monitor the policy landscape to identify relevant questions and emerging policy areas to build our expertise in the subject
    • Keep up to date with new and existing AI policy norms and standards, particularly those related to cyber security, and use these to inform our decision-making on policy areas

Minimum qualifications

    • Bachelor's degree or equivalent practical experience
    • 3+ years track record in trust & safety, risk evaluations, fraud investigations, technical/data analysis 
    • Experience and familiarity with AI or a demonstrated interest in AI policy issues
    • Experience in data analysis or data science - identifying trends and drawing actionable insights
    • Have a deep practical familiarity with understanding of how AI technology contributes to online risks & threats
    • Worked on topics around: AI risk assessment, model safety, prompting
    • Stay up-to-date and informed by taking an active interest in emerging research and industry
    • Passion for using AI to create safe and beneficial products

Preferred skills

    • Experience and familiarity with AI or a demonstrated interest in AI policy issues and research
    • Strong familiarity with existing GenAI / LLM / ML standards - prior experience exploring, testing and evaluation of  language model behavior.
    • Experience in benchmarking Generative AI issues and quantify improvements
    • Experience with SQL and a programming language (e.g., Python or R)

Opportunities and perks

    • Competitive compensation at a rapidly growing Series A, VC-backed startup 
    • Remote-first, with the ability to work from home or co-locate with our Singapore or Palo Alto teams
    • Influence new product direction from idea to commercialization
    • Help develop critical tech to solve one of the 21st century’s trickiest societal problems
EOE

We are an equal-opportunity employer that celebrates diversity and believes that people of varied backgrounds working together lead to the best outcomes across all levels of a company. This is your chance to help build world-class technology in a space that greatly matters to society, working with a team that is invested in your professional growth! Our team members come from all different parts of the world, perspectives, ages, and experiences.