Researcher

Berkeley

Open Positions: ARC – Research /

Employee /

On-site

What does ARC do?

The Alignment Research Center (ARC) is a non-profit whose mission is to align future machine learning systems with human interests. (ARC is not to be confused with METR, which was formerly known as "ARC Evals" but has since been spun out.)

ARC's high-level agenda is described by the report on Eliciting Latent Knowledge (ELK): roughly speaking, we’re trying to design ML training objectives that incentivize systems to honestly report their internal beliefs.

For the last couple of years, we’ve mostly been focused on an approach to ELK based on formalizing a kind of heuristic reasoning that could be used to analyze neural network behavior. You can read more about this approach in our recent blog post, A bird’s eye view of ARC’s research, and see more details in some of our other blog posts, such as:

- Low Probability Estimation in Language Models (ICLR 2025 spotlight)

- Backdoors as an analogy for deceptive alignment (ITCS 2025)

- Formal verification, heuristic explanations and surprise accounting

- Estimating tail risk in neural networks

- Formalizing the presumption of independence

Our research has reached a stage where we’re coming up against concrete problems in mathematics, theoretical computer science and machine learning, and so we’re particularly excited about hiring researchers with relevant background, regardless of whether they have worked on AI alignment before.

Who is ARC looking to hire?

Most successful candidates have a strong theoretical background (in math, physics or computer science, for example). Empirical machine learning background is a plus, but is not required for a strong application.

We also remain open to anyone who is excited about getting involved in AI alignment, even if they do not have an existing research record.

Ultimately, we are excited to hire people who could contribute to our research agenda. One way to figure out whether you might be able to contribute would be to take a look at some of our recent research, as described on our blog.

What is working at ARC like?

ARC currently has five permanent team members (see our team here), alongside a varying number of temporary team members (recently, anywhere from 0–4).

Most team members work on research problems independently, but with significant collaboration. (A typical researcher spends about 25% of their time collaborating with others.) This work is often somewhat similar to academic research in pure math or theoretical computer science, or machine learning.

In addition to this, we also allocate a significant portion of our time to higher-level questions surrounding research prioritization, which we often discuss at our weekly group meeting. Since the team is still small, we are keen for new team members to help with this process of shaping and defining our research.

ARC shares an office with several other groups working on AI safety such as METR and Redwood Research, so even though our team is small, the office is lively with lots of AI-related discussion.

Hiring process

Our current interview process involves:

- 3-hour take-home test involving math and computer science puzzles

- 30-minute non-technical phone call

- 1-day onsite interview

We will compensate candidates for their time when this is logistically possible.

Employment details

ARC is based in Berkeley, California, and we would prefer people who can work full-time from our office, but we are open to discussing remote or part-time arrangements in some circumstances. We can sponsor visas and are H-1B cap-exempt.

We are accepting applications for both visiting researcher (1–3 months) and full-time positions. The intention of the visiting researcher position is to assess potential fit for a full-time role, and we expect to invite around one half of visiting researchers to join full-time. We are also able to offer straight-to-full-time positions, but we anticipate that we will only be able to do this for people with a legible research track-record. We are currently only interested in applicants who can start by Dec 31, 2026.

Salaries are in the $150k–400k range for most people depending on experience.

Further information

If you have any questions about anything in this posting, please email hiring@alignment.org.

Apply for this job