Funding for having written AI safety distillation posts on the topic of membranes/boundaries
Database
Loading results...
Loading results...
Showing 1801-1850 of 3953 results
Clear filtersFunding for having written AI safety distillation posts on the topic of membranes/boundaries
Showing 1801-1850 of 3953 results
Active filters: Type: Project, Individual
Clear filters to view everything →I am an AI safety Researcher and artist both working on Interpretability as well as Outreach Programs
No summary available yet.
No summary available yet.
H. Akın Ünver is an associate professor of international relations at Özyeğin University and a fellow in the Carnegie Endowment’s Digital Democracy Network, where his research explores how emerging technologies, disinformation, and computational methods shape conflict, diplomacy, and democratic politics.
Philip L is the creator of the AI Explained YouTube channel, and he also runs AI Insiders, a community of more than 1,000 professionals working in generative AI across 30 industries, while authoring the Signal to Noise newsletter on high-signal AI developments.
External Advisor to the Transformative Futures Institute whose research spans several areas relevant to longtermism and is currently focused primarily on AI governance; previously a Senior Research Scholar at the Future of Humanity Institute and holder of a PhD in Materials Science and Engineering from UCLA.
Abram Demski (legal first name Daniel) is an independent AI alignment researcher specializing in agent foundations. He joined MIRI (Machine Intelligence Research Institute) full-time in 2017 as part of the Agent Foundations team, a position he held until summer 2024 when MIRI pivoted toward governance, policy, and outreach. He is best known for co-authoring the "Embedded Agency" sequence with Scott Garrabrant and for his foundational contributions to the development of Logical Induction. His research focuses on deconfusion work around core concepts in AI safety including agency, optimization, trust, embedded world models, and computational uncertainty. He received a $30,000 grant from the Long-Term Future Fund in November 2019 for independent research on agent foundations, building on work developed during MIRI's Summer Fellows Program in 2017 and 2018. Since leaving MIRI he has continued independent research, currently supported through Patreon and serving as a mentor in the MATS (Machine Learning Alignment Theory Scholars) program.
Independent researcher working on internal risk–stability laws for LLMs. Creator of ZTGI-Pro (Tek-Taht), a real-time hazard and collapse-detection framework for safer AI systems.
No summary available yet.
X-Risk Activist, Trader, Coder, Astronomer, Grassroots Lobbyist & Author
Funding towards a 2 year postdoctoral stint to work on Safety in AI, with a focus on developing value aligned systems
No summary available yet.
No summary available yet.

Akbir Khan is a Member of Technical Staff at Anthropic, where he works on the Horizons team focused on building safe superintelligence. He completed his PhD at the UCL DARK Lab under the supervision of Tim Rocktäschel and Edward Grefenstette, with prior academic training in Mathematics and Physics at UCL and Computer Science at Cambridge. His research centers on Scalable Oversight techniques — particularly the use of multi-agent debate to elicit truthfulness from AI systems — as well as AI control protocols and alignment auditing. His work on LLM debate, exploring whether weaker models can assess the correctness of stronger models, received a Best Paper Award at ICML 2024 for the paper "Debating with More Persuasive LLMs Leads to More Truthful Answers." Before his PhD, he co-founded Spherical Defence Labs, an AI-powered API security startup, and also worked as a Research Analyst at Cooperative AI and a Senior Researcher at Tractable.
No summary available yet.
No summary available yet.
No summary available yet.
Kelly Anthis co-founded Sentience Institute and served as its executive director for the organizations first three years, helping to establish it through grantwriting, conference presentations, website development, and hiring and managing staff alongside co-founder Jacy Reese Anthis. Her background includes serving as Director of Communications at Sentience Politics, co-organizing a grassroots animal advocacy group, working as a front-end software engineer, and volunteering her design skills and time to multiple animal advocacy organizations.
Research Editor at ILINA, responsible for reviewing research outputs, preparing them for publication, and supporting the program’s editorial work; she is also a researcher in animal law and ethics at the International Centre for Animal Rights and Ethics.
No summary available yet.
No summary available yet.
Trying to get people to listen to the experts.
Independent researcher and sole proprietor of FSME Logic.
No summary available yet.
Daniel Colson is the co-founder and executive director of the AI Policy Institute (AIPI), a think tank that researches and advocates for government policies to mitigate extreme risks from frontier artificial intelligence technologies. His work focuses on how autonomous weapons systems and other advanced AI capabilities could affect military strategy and global political stability. Before founding AIPI, he co-founded the fintech company Reserve, which provides financial services in high-inflation currency regions, and later founded the executive assistant recruiting firm CampusPA.
Hannah Erlebach is an AI safety researcher based in the UK, currently pursuing an MSc in Machine Learning at University College London (started 2024). She graduated from the University of Cambridge with a degree in mathematics (2018-2021) and subsequently founded and ran the Cambridge AI Safety Hub as its full-time organizer until summer 2023. She was a Summer Research Fellow at the Center on Long-Term Risk in 2023, working on cooperative AI. Her technical research focuses on reinforcement learning, goal misgeneralization, and cooperative AI: she co-authored "Welfare Diplomacy: Benchmarking Language Model Cooperation" (2023), which introduced a general-sum variant of Diplomacy to benchmark cooperative capabilities of language models, and co-authored "Mitigating Goal Misgeneralization via Minimax Regret" (RLC 2025), which demonstrates that minimax regret objectives are more robust to goal misgeneralization than maximum expected value objectives. She has received multiple grants from the Long-Term Future Fund to support her independent AI safety research, including funding to complete a goal misgeneralization project for an ICLR submission.
No summary available yet.
Jessica Cooper (also known professionally as Jessica Rumbelow) is the founder and CEO of Leap Laboratories, an AI interpretability startup based in London. She holds a PhD in model-agnostic AI interpretability, an MSc in Advanced Computer Science, and a BA in Fine Art, all from the University of St Andrews, where she also worked as a Research Scientist applying deep learning to digital pathology. She participated in the MATS (ML Alignment & Theory Scholars) Autumn 2022 cohort, during which she co-authored the widely-cited "SolidGoldMagikarp" paper with Matthew Watkins, discovering anomalous tokens that cause failure modes in GPT-2 and GPT-3 models. She subsequently worked at Aligned AI and the Stanford Existential Risks Initiative before founding Leap Laboratories in 2023, which received seed funding from the AI Risk Mitigation Fund to develop a model-agnostic interpretability engine. She serves on the Advisory Board of the London Initiative for Safe AI (LISA) and is an Affiliated Lecturer at the University of Cambridge, where she co-taught a course on Explainable AI. In March 2022, she received LTFF funding to trial a new London organisation aimed at significantly increasing the number of AI safety researchers.
Lawyer based in Spain specialising in technology regulation and AI governance, with an LL.M. in the field from the University of Edinburgh and experience in Brussels at the Center for Democracy and Technology Europe and other organisations; joined AI Standards Lab to contribute legal and policy expertise to CEN-CENELEC JTC21 standards supporting the EU AI Act.
Support for Marius Hobbhahn for piloting a program that approaches and nudges promising people to get into AI safety faster
No summary available yet.
AISTOF, the AI Safety Tactical Opportunities Fund
No summary available yet.
No summary available yet.
Sumaya Nur Adan is a DPhil candidate at the University of Oxford researching decentralized AI security infrastructure and its role in enabling trustworthy and beneficial AI deployment globally. She previously served as an AI Risk Analyst at the UK Department for Science, Innovation and Technology working on AI risk assessment, is a research affiliate at the AI Governance Initiative leading work on the Global AI majority, and has contributed to international initiatives including the ITU AI for Good Summit 2025 and the African Commission’s work on AI and human rights.
No summary available yet.
Early exploration, agenda-setting, technical infrastructure, and early community building
I finished my contract with CERN last December and, since then, I have been self funding my AI safety studies. Now is time to have an impact.
Founder and Executive Director of CARMA with over two decades of experience in machine learning and AI, focused on advanced AI safety since 2010 and also serving part‑time as Principal AI Safety Strategist at the Future of Life Institute.
Co-founder and research lead at Simplex, with extensive experience in experimental and computational neuroscience. He earned his PhD from Caltech and has over a decade of work investigating the neural basis of intelligent behavior, most recently as a researcher at Stanford, and now focuses on developing principled methods for controlling and aligning increasingly advanced AI systems.
No summary available yet.
No summary available yet.
No summary available yet.
Jason Hoelscher-Obermaier is the Director of Research and former co-director at Apart Research, where he focuses on accelerating AI safety progress through research sprints, fellowships, and safety evaluations of large language models and other advanced AI systems.
No summary available yet.
No summary available yet.
No summary available yet.
No summary available yet.