AI-Plans
AI-Plans is a platform for discovering, critiquing, and advancing AI alignment strategies, hosting a contributable compendium of alignment plans and running community research events.
Scott Viteri is a CS PhD candidate at Stanford University's Center for Automated Reasoning, admitted in Autumn 2019 and advised by Prof. Clark Barrett. He holds a B.S. in Computer Science and Electrical Engineering from MIT (2018), and before starting his PhD he worked on interactive theorem proving at CMU with Simon DeDeo, publishing research on abduction in mathematics in the journal Cognition. His research focus has evolved from formal verification and programming languages to AI alignment, driven by his view that advanced AI poses a substantial existential risk. His core work involves training language models to produce causally grounded chain-of-thought reasoning via reinforcement learning, as demonstrated in his 2024 paper "Markovian Transformers for Informative Language Modeling" (arXiv 2404.18988), which achieved large gains on QA benchmarks. He has also received a grant from the Long-Term Future Fund to research a novel method for training prosociality into large language models, and Open Philanthropy recommended a grant of $153,820 to Stanford University to support his and Barrett's AI alignment research.
Workshop Labs is a public benefit corporation building billions of personalized, privacy-preserving AI models with a mission to keep humans empowered as AI advances.
Six months of salary to turn intuitions such as goals, wanting, and abilities into concepts applicable to computational systems.
OpenAI is an AI research and deployment company working to ensure that artificial general intelligence benefits all of humanity. It is the creator of ChatGPT, GPT-4, and a wide range of frontier AI models.
Jed McCaleb is the founder of the Astera Institute and serves as Co‑Founder and CEO of its Neuro & AGI program, where he is directing a large, long‑term philanthropic commitment to neuroscience‑informed AGI research. A software engineer and serial entrepreneur, he previously co‑founded Ripple and the Stellar Development Foundation, created the eDonkey network and the Mt. Gox bitcoin exchange, and later founded the space company Vast, where he serves as board chair.
Laurence D. (Larry) Fink is Co‑Chair of the Board of Trustees of the World Economic Forum and Chairman and Chief Executive Officer of BlackRock, the global investment and technology solutions firm he co‑founded in 1988.
A UK-based research and advocacy think tank that combines complexity modelling, expert elicitation, and democratic deliberation to improve policymaking around existential and catastrophic risks.
A model-agnostic benchmark for detecting deceptive reasoning in LLMs through behavioral fingerprints — no weight access required.
A global non-profit building AI safety governance capacity across Asia through policy research, training, and multi-stakeholder dialogue, starting in Southeast Asia.
Aya Abdelsalam Ismail is co-founder and chief science officer of Guide Labs. Previously she was a senior machine learning scientist at Prescient Design in Genentech, and her research focuses on making neural networks more interpretable. She earned a PhD in computer science from the University of Maryland and has published over a dozen papers at top machine learning conferences such as NeurIPS and ICLR.
Coordinates and supports rationality-focused community meetup groups worldwide, serving as a hub for ACX (Astral Codex Ten), LessWrong, and broader rationality community organizers.
Seth Lazar is a professor in the Johns Hopkins University School of Government and Policy and a leading scholar in the moral and political philosophy of artificial intelligence. He leads the Machine Intelligence and Normative Theory (MINT) Lab, which works on AI safety, governance, and resilience, and previously served as a professor of philosophy at the Australian National University. He holds a D.Phil., M.Phil., and B.A. (Hons) from the University of Oxford, and his research is supported by funders including the Templeton World Charity Foundation, the Centre for Security and Emerging Technology, the Survival and Flourishing Fund, AI2050, Google, OpenAI, and the Australian Research Council.
LawZero is a nonprofit AI safety research organization founded by Yoshua Bengio to develop safe-by-design AI systems that cannot act autonomously or pursue hidden goals.
Gaia Marcus is Director of the Ada Lovelace Institute. She previously held senior roles across the UK Civil Service, including Deputy Director (Advanced Analytics and Local Capabilities) in the Spatial Data Unit at the Department for Levelling Up, Housing and Communities, Deputy Director for the Integrated Data Service at the Office for National Statistics, Head of Engagement for Civil Service Reform at the Cabinet Office and Head of National Data Strategy at the Department for Digital, Culture, Media and Sport. In the non-profit sector she has led data strategy and participatory approaches to research and innovation at organisations such as Parkinson’s UK, Centrepoint and the RSA, and has served as a trustee of Samaritans.
Starting funds and moving costs for a DPhil project in AI that addresses safety concerns in ML algorithms and positions
UC Berkeley's Center for Long-Term Cybersecurity (CLTC) is a research and collaboration hub advancing future-oriented cybersecurity research, policy, and education, with a growing focus on AI safety governance and risk management for frontier AI systems.
Simon Skade is an independent AI alignment researcher based in Germany. He studied computer science at the Technical University of Munich and began self-studying machine learning and AI safety through the rationalist and effective altruism communities. He conducted mostly non-prosaic alignment research from February 2022 through August 2025, during which time he won $10,000 in the Eliciting Latent Knowledge (ELK) contest and participated in MLAB (ML Alignment Bootcamp) and SERI MATS cohorts 3.0 and 3.1. His research focused on ontology identification and an interdisciplinary approach to understanding minds — drawing on linguistics, psychology, and neuroscience — with the goal of creating more understandable and better-targeted AI systems. He received funding from the Long-Term Future Fund for independent study to deepen his understanding of the alignment problem. More recently, he has turned his attention toward advocacy for international coordination to more safely navigate the AI transition.
Meridian Cambridge is an independent research and incubation hub in Cambridge, UK focused on AI safety, biosecurity, frontier-risk policy, and institutional design. Formerly Effective Altruism Cambridge CIC, it hosts the Cambridge AI Safety Hub, biosecurity and governance hubs, research labs, and fellowships.
Identifying and auditing reasoning circuits in LLMs within Algoverse 2026 using Sparse Autoencoders (SAEs).
An international advocacy organization devoted to reducing global catastrophic risk from all threats and hazards, working with governments worldwide to enact policies that address existential and catastrophic risks.
AI Safety Argentina (AISAR) is a 6-month research scholarship program based at the University of Buenos Aires that connects Argentine students with mentors to conduct AI safety research.
This grant will support Naoya Okamoto in upskilling in AI safety research. Naoya will take the Mathematics of Machine Learning course offered by the University of Illinois at Urbana-Champaign.
Top-up funding for a 3-month new-hire trial to help me connect, expand, and enable the AGI governance and safety community in Canada.
Charlotte Monico is Chief Executive Officer of Founders Pledge. A long-time member of the organization, she previously served for around six years as Chief Operating Officer and has worked in close partnership with founder David Goldberg since 2019, bringing strong strategic and operational leadership as the organization scales.
Pranav Pant is a software and quantitative developer at Graviton Research Capital and an IndiaAI Fellow of the Government of India, with a B.Tech in Computer Science and Engineering from IIT Jodhpur and research experience in deep learning and multimodal AI.
Founder of CEEALAR (formerly the EA Hotel). He has a background in astrophysics and Earth system modelling and previously ran a 3D-printing/open-source hardware business, which he pursued with an eye toward supporting effective altruism.