A remote, non-profit research group focused on mechanistic interpretability of deep learning models, developing causal abstraction frameworks, open-source course materials, and mentorship programs for the AI safety community.
A remote, non-profit research group focused on mechanistic interpretability of deep learning models, developing causal abstraction frameworks, open-source course materials, and mentorship programs for the AI safety community.
People
Updated 05/18/26principal researcher
principal investigator
research scholar
researcher
Funding Details
Updated 05/18/26- Annual Budget
- $368,500
- Current Runway
- -
- Funding Goal
- -
- Funding Raised to Date
- $737,000
Org Details
Updated 05/18/26The Practical AI Alignment and Interpretability Research Group, also known as the Pr(Ai)²R Group, is a remote, non-profit research organization focused on mechanistic interpretability of AI systems. The group was founded by Atticus Geiger in late 2023, building directly on his doctoral dissertation at Stanford University, which examined the problem of uncovering and inducing interpretable causal structure in deep learning models. The group's central technical focus is causal abstraction, a theoretical framework that provides rigorous foundations for mechanistic interpretability. Mechanistic interpretability aims to explain the computations performed by neural networks in terms of intelligible algorithms that faithfully capture the behavior of otherwise opaque models. The group applies and extends this framework to language models and other deep learning systems. Beyond original research, the group's mandate includes creating open-source course materials on mechanistic interpretability and running mentorship programs to grow the pipeline of researchers working on AI interpretability. This educational and community-building component reflects a theory that the field needs more trained researchers to make progress on AI safety. In October 2024, Open Philanthropy announced a grant of $737,000 over two years to support the group's work under Geiger's leadership. The group's website was hosted at prair.group. Known members include Atticus Geiger (PI), Amir Zur, and Jiuding Sun. Atticus Geiger subsequently joined Goodfire, a commercial AI interpretability company, in September 2025.
Theory of Change
Updated 05/18/26By developing rigorous mathematical frameworks (particularly causal abstraction) for understanding how neural networks compute, and by training more researchers in mechanistic interpretability methods, the group aims to make it possible to verify AI system behavior, detect misalignment, and build safer AI systems. The open-source course materials and mentorship programs multiply impact by expanding the community of researchers able to perform interpretability work on frontier models.
Grants Received
Updated 05/18/26Projects– no linked projects
Updated 05/18/26Discussion
No comments yet. Be the first to share your thoughts.