Center for AI Risk Management & Alignment (CARMA)

Berkeley, CA, US (virtual-first)

5 peopleFounded 2022

CARMA is a research and policy think tank working to lower the risks to humanity and the biosphere from transformative AI through integrated risk management, policy research, and technical safety work.

People

Updated 04/02/26

Abra Ganz

Senior Researcher & Geostrategic Dynamics Lead

Richard Mallah

Executive Director & Founder

Sterlin Waters

Senior Researcher & U.S. Public Policy Lead

Anna Katariina Wisakanto

Senior Risk Assessment Associate

Funding Details

Updated 04/02/26

Annual Budget: -
Current Runway: -
Funding Goal: -
Funding Raised to Date: -

Org Details

Updated 04/02/26

The Center for AI Risk Management & Alignment (CARMA) is a research and policy think tank dedicated to helping society navigate the complex and potentially catastrophic risks arising from increasingly powerful AI systems. Its mission is to lower the risks to humanity and the biosphere from transformative AI by grounding risk management in rigorous analysis, developing policy frameworks that squarely address AGI, advancing technical safety approaches, and fostering global perspectives on durable safety. CARMA was founded in November 2022 by Richard Mallah, who serves as Executive Director. Mallah brings over two decades of machine learning and AI experience, has focused on advanced AI safety since 2010, and previously served as Director of AI Projects at the Future of Life Institute from 2014 to 2022. He continues to serve as Principal AI Safety Strategist at FLI on a part-time basis. The organization is virtual-first, with staff clusters in Berkeley, Cambridge (UK), Washington DC, and London. CARMA operates five distinct research programs. The Comprehensive Risk Assessment program develops frameworks and tools for top-down threat surface scanning and risk propagation modeling across society, including the Probabilistic Risk Assessment (PRA) for AI framework published in 2024. The Public Security Policy program conducts targeted advocacy, policy development, and stakeholder engagement to address AI-related threats to critical infrastructure and national preparedness. The Geostrategic Dynamics Mechanisms program uses game theory and mechanism design to propose frameworks for multiscale international cooperation on AI safety. The Offense/Defense Dynamics program investigates how AI capabilities affect the balance between harmful and protective uses. The Alignment of Dynamical Cognitive Systems program explores architectural strategies to make powerful AI systems safer while respecting stakeholder needs and contextual tradeoffs. Notable outputs include the PRA for AI framework and accompanying workbook (released November 2024), a comprehensive technical paper on probabilistic risk assessment methodology (June 2025), an interim policy brief on national preparedness in the age of AI (July 2025), and a paper on platform-mediated coordination for AI agents (November 2025). CARMA is a fiscally sponsored project of Social & Environmental Entrepreneurs, Inc. (SEE), a California 501(c)(3) nonprofit public benefit corporation.

Theory of Change

Updated 04/02/26

CARMA believes that reducing catastrophic AI risk requires a multi-layered, proactive approach that addresses both technical and governance dimensions before harms materialize. By developing rigorous risk assessment frameworks (such as PRA for AI), CARMA aims to give researchers, developers, regulators, and policymakers structured tools to identify and quantify AI-related hazards early. Simultaneously, the organization works to shape policy frameworks that account for AGI-level capabilities, improve national and international preparedness for AI-related crises, and design cooperation mechanisms that prevent destabilizing arms races or concentration of power. CARMA's theory of change holds that technical safety research, governance innovation, and strategic policy advocacy must be pursued in parallel and integrated into a systems perspective to meaningfully reduce existential and catastrophic risk from advanced AI.

Grants Received– no grants recorded

Updated 04/02/26

Projects

Updated 04/02/26

Alignment of Dynamical Systems

Program exploring architectures and platforms to align powerful dynamical AI and multi‑agent systems, translating alignment theory into experimental systems and tools for safer advanced AI.

active

Comprehensive Risk Assessment

Program developing frameworks, methodologies, and tools to help assessors systematically scan AI threat surfaces and model how resulting risks propagate through society.

active

Geostrategic Dynamics Mechanisms

Program using game theory, mechanism design, and complex-systems modelling to analyse AI-driven geopolitical competition and design mechanisms for multiscale international cooperation.

active

Discussion

No comments yet. Be the first to share your thoughts.