George Mason University is a large public research university in Fairfax, Virginia, notable in the AI safety and governance space for housing the Mercatus Center and for faculty research on AI scenarios and policy.
- Team
- 8900
- Led by
Loading results...
Showing 151-200 of 471 results
Clear filtersGeorge Mason University is a large public research university in Fairfax, Virginia, notable in the AI safety and governance space for housing the Mercatus Center and for faculty research on AI scenarios and policy.
MentaLeap is an Israel-based AI safety research group focused on mechanistic interpretability, applying neuroscience and cybersecurity expertise to reverse-engineer neural networks and reduce risks from advanced AI systems.
A nonprofit that helps university students choose high-impact thesis topics and launch research careers focused on the world's most pressing problems, including AI safety, biosecurity, animal welfare, and global health.
GovAI is an independent nonprofit research organization dedicated to helping decision-makers navigate the transition to a world with advanced AI, by producing rigorous research on AI governance and fostering talent in the field.
UC San Diego is a major public research university conducting AI safety-relevant research including LLM persuasion evaluation, trustworthy machine learning, and safe autonomous systems.
Coefficient Giving (formerly Open Philanthropy) is a major philanthropic grantmaker that directs funding toward high-impact causes including AI safety, global health, biosecurity, and farm animal welfare. It is the primary grantmaking vehicle for Dustin Moskovitz and Cari Tuna's philanthropy through Good Ventures.
Major private research university in Los Angeles that received SFF flexHEGs funding for hardware-enabled AI governance research, and hosts multiple labs and centers working on AI safety, alignment, and responsible AI development.
Dioptra is a volunteer AI safety research community founded by Joshua Clymer that builds evaluations for advanced AI systems.
India's national AI safety institute under the IndiaAI Mission, established to ensure the ethical, safe, and responsible development and deployment of AI systems in India.
Astral Codex Ten is Scott Alexander's Substack blog covering reasoning, science, AI, medicine, ethics, and effective altruism, and the home of the ACX Grants program that funds high-impact projects.
Formation Research is a UK-based not-for-profit that researches lock-in risk — the danger that negative features of the world, such as authoritarian power structures or AI-enabled totalitarianism, become permanently entrenched — and develops interventions to minimize it.
Cambridge Effective Altruism is a community group at the University of Cambridge that helps students and local residents explore how to have the most positive impact through their careers and charitable giving. It runs fellowships, discussion groups, and career support programs, and was the seedbed for BlueDot Impact.
Panoplia Laboratories (now operating as Active Site) is a nonprofit that evaluates the risks and capabilities of AI-driven biology through wet lab research, and develops broad-spectrum antivirals for pandemic preparedness.
Modulo Research is a UK-based AI safety research organization that conducts empirical evaluations of large language models and develops datasets to advance scalable oversight research.
The KIRA Center (Center for AI Risks & Impacts) is a Berlin-based independent think tank working to ensure the transition to advanced AI is safe and beneficial. It conducts policy research and engages governments, particularly in Germany and the EU, on AI governance and safety.
The 501(c)(4) advocacy arm of the Center for AI Safety, dedicated to advancing bipartisan public policies that maintain U.S. leadership in AI and protect against AI-related national security threats.
Gray Swan AI is an AI safety and security company that builds tools to assess vulnerabilities in AI deployments and develop more robust, attack-resistant AI models. It was founded in 2024 by Carnegie Mellon University researchers who pioneered automated jailbreaking research.
Arizona State University is a major public research university and one of the largest in the United States, with significant programs in AI governance, responsible innovation, and governance of emerging technologies.
A 501(c)(3) nonpartisan think tank that bridges technology and national security policy, with major programs addressing ransomware, frontier AI security, and the catastrophic risks posed by emerging technologies to nuclear stability.
A nonprofit R&D lab that develops collective intelligence tools and governance models to steer transformative AI development toward better outcomes through democratic public input.
Princeton University is a leading Ivy League research institution that conducts significant AI safety and AI governance research through several interdisciplinary centers and initiatives.
Pivotal Research runs a 9-week in-person research fellowship in London for early-career researchers working on AI safety, AI governance, and biosecurity. Fellows work alongside mentors from leading organizations to produce impactful research and launch careers in reducing global catastrophic risks.
generative.ink is the personal research and creative platform of Janus (also known as "moire" and "@repligate"), a pseudonymous AI safety researcher known for the Simulators framework and the Loom human-AI collaboration tool.
An AI safety research infrastructure nonprofit that builds open-source tools and platforms to accelerate mechanistic interpretability research, including Neuronpedia and SAELens.
The Navigation Fund is a major philanthropic funder that grants over $60 million annually to high-impact organizations working on climate change, farm animal welfare, criminal justice reform, open science, and AI safety.
An academic research group at New York University doing empirical work with language models to address longer-term safety concerns about highly capable AI systems.
SeedAI is a Washington, D.C. nonprofit working at the intersection of AI policy and practical application, helping policymakers and communities across the U.S. understand, adopt, and shape AI responsibly.
A Substack newsletter by Gergő Gáspár covering fieldbuilding strategy, careers, and marketing for the AI Safety and Effective Altruism communities.
SHfHS is a small philanthropic foundation that identifies and funds researchers and organizations working on existential risk reduction. It acts as a funding intermediary rather than conducting direct research.
Michigan State University's Department of Computer Science and Engineering (CSE) conducts AI safety research, notably through the OPTML group's work on trustworthy machine learning and LLM unlearning.
A research funding program run by Schmidt Sciences that supports foundational technical research on understanding, predicting, and controlling risks from frontier AI systems. The program funds academic and nonprofit researchers working on AI safety science, evaluation methodology, and oversight of advanced AI.
ICLR is one of the world's premier annual academic conferences dedicated to deep learning and representation learning research. It was founded in 2013 by Yann LeCun and Yoshua Bengio.
A Canadian registered charity that increases public and scientific awareness of AI's catastrophic risks through education and research.
Northeastern University is a private R1 research university in Boston, Massachusetts, home to notable AI safety and mechanistic interpretability research through its Khoury College of Computer Sciences and Institute for Experiential AI.
Aether is an independent research lab focused on LLM agent safety, conducting technical research on the alignment, control, and evaluation of large language model agents.
London-based for-profit AI safety company working on Cognitive Emulation, an approach to building controllable, bounded AI systems that reason transparently.
A US nonprofit founded by Max Tegmark and Meia Chita-Tegmark to place AI safety on a solid quantitative foundation. BAIF funds research, fellowships, and university partnerships aimed at ensuring advanced AI systems remain safe and beneficial.
No summary available yet.
A leading private research university on Chicago's South Side that hosts several AI safety and existential risk research programs, including the Existential Risk Laboratory (XLab), the Chicago Human+AI Lab, and the Harris School's Technology and Society Initiative.
Earendil is a hardware security startup that builds tamper response systems for AI compute infrastructure, including GPU clusters, to support hardware-enabled governance and compliance verification for AI development.
AISI is a student-led community at Georgia Tech working to ensure AI is developed safely, running fellowships, research projects, and policy programs across technical and governance tracks.
Jeff Clune's AI safety and alignment research lab at UBC's Department of Computer Science, focused on deep learning, AI interpretability, and open-ended AI systems.
Horizon is a non-partisan nonprofit that addresses the US government's critical shortage of emerging technology expertise by recruiting, training, and placing technical talent in federal agencies, congressional offices, and think tanks.
CNAS is a Washington, DC-based bipartisan think tank that develops national security and defense policy, with a dedicated Technology & National Security program focused on AI, compute governance, and great power competition.
A private research university in Nashville, Tennessee, that received SFF Fairness Track funding for research related to AI fairness, algorithmic equity, and the societal implications of AI systems.
Dr Waku is a pseudonymous AI safety educator who creates YouTube videos, a Substack newsletter, and other content explaining AI alignment risks and AI security to general audiences.
BlueDot Impact is a nonprofit talent accelerator that runs free cohort-based courses to train professionals in AI safety, AI governance, and biosecurity. It is the leading pipeline for building the workforce needed to safely navigate transformative AI.
A 501(c)(3) research laboratory in Santa Monica, CA that uses neuroimaging, neuromodulation, VR/AR, and altered states to study consciousness, with an AI safety research program on preventing antisocial AI through artificial empathy.
EleutherAI is a nonprofit AI research institute focused on interpretability, alignment, and open-source foundation model research. It is best known for creating GPT-NeoX, the Pythia model suite, and The Pile dataset.
Poseidon Research is an independent AI safety laboratory conducting deep technical research in interpretability, control, and secure monitoring to make advanced AI systems transparent, trustworthy, and governable.