Center for AI Risk Management & Alignment (CARMA)
CARMA is a research and policy think tank working to lower the risks to humanity and the biosphere from transformative AI through integrated risk management, policy research, and technical safety work.
CARMA is a research and policy think tank working to lower the risks to humanity and the biosphere from transformative AI through integrated risk management, policy research, and technical safety work.
People
Updated 04/02/26Senior Researcher & Geostrategic Dynamics Lead
Executive Director & Founder
Senior Researcher & U.S. Public Policy Lead
Senior Risk Assessment Associate
Funding Details
Updated 04/02/26- Annual Budget
- -
- Current Runway
- -
- Funding Goal
- -
- Funding Raised to Date
- -
Org Details
Updated 04/02/26The Center for AI Risk Management & Alignment (CARMA) is a research and policy think tank dedicated to helping society navigate the complex and potentially catastrophic risks arising from increasingly powerful AI systems. Its mission is to lower the risks to humanity and the biosphere from transformative AI by grounding risk management in rigorous analysis, developing policy frameworks that squarely address AGI, advancing technical safety approaches, and fostering global perspectives on durable safety. CARMA was founded in November 2022 by Richard Mallah, who serves as Executive Director. Mallah brings over two decades of machine learning and AI experience, has focused on advanced AI safety since 2010, and previously served as Director of AI Projects at the Future of Life Institute from 2014 to 2022. He continues to serve as Principal AI Safety Strategist at FLI on a part-time basis. The organization is virtual-first, with staff clusters in Berkeley, Cambridge (UK), Washington DC, and London. CARMA operates five distinct research programs. The Comprehensive Risk Assessment program develops frameworks and tools for top-down threat surface scanning and risk propagation modeling across society, including the Probabilistic Risk Assessment (PRA) for AI framework published in 2024. The Public Security Policy program conducts targeted advocacy, policy development, and stakeholder engagement to address AI-related threats to critical infrastructure and national preparedness. The Geostrategic Dynamics Mechanisms program uses game theory and mechanism design to propose frameworks for multiscale international cooperation on AI safety. The Offense/Defense Dynamics program investigates how AI capabilities affect the balance between harmful and protective uses. The Alignment of Dynamical Cognitive Systems program explores architectural strategies to make powerful AI systems safer while respecting stakeholder needs and contextual tradeoffs. Notable outputs include the PRA for AI framework and accompanying workbook (released November 2024), a comprehensive technical paper on probabilistic risk assessment methodology (June 2025), an interim policy brief on national preparedness in the age of AI (July 2025), and a paper on platform-mediated coordination for AI agents (November 2025). CARMA is a fiscally sponsored project of Social & Environmental Entrepreneurs, Inc. (SEE), a California 501(c)(3) nonprofit public benefit corporation.
Theory of Change
Updated 04/02/26CARMA believes that reducing catastrophic AI risk requires a multi-layered, proactive approach that addresses both technical and governance dimensions before harms materialize. By developing rigorous risk assessment frameworks (such as PRA for AI), CARMA aims to give researchers, developers, regulators, and policymakers structured tools to identify and quantify AI-related hazards early. Simultaneously, the organization works to shape policy frameworks that account for AGI-level capabilities, improve national and international preparedness for AI-related crises, and design cooperation mechanisms that prevent destabilizing arms races or concentration of power. CARMA's theory of change holds that technical safety research, governance innovation, and strategic policy advocacy must be pursued in parallel and integrated into a systems perspective to meaningfully reduce existential and catastrophic risk from advanced AI.
Grants Received– no grants recorded
Updated 04/02/26Projects
Updated 04/02/26Program exploring architectures and platforms to align powerful dynamical AI and multi‑agent systems, translating alignment theory into experimental systems and tools for safer advanced AI.
Program developing frameworks, methodologies, and tools to help assessors systematically scan AI threat surfaces and model how resulting risks propagate through society.
Program using game theory, mechanism design, and complex-systems modelling to analyse AI-driven geopolitical competition and design mechanisms for multiscale international cooperation.
Discussion
No comments yet. Be the first to share your thoughts.