Compute costs for experiments to evaluate different scalable oversight protocols
Database
Loading results...
Loading results...
Showing 1951-2000 of 3951 results
Clear filtersCompute costs for experiments to evaluate different scalable oversight protocols
Showing 1951-2000 of 3951 results
Active filters: Type: Individual, Project
Clear filters to view everything →Jack Ryan is an AI alignment researcher who received a grant from the Long-Term Future Fund to support work on evaluating alignment theory agendas. Due to the commonality of the name, limited public information could be confirmed about this individual's specific background, affiliations, or other professional details.
No summary available yet.
4-month stipend to continue work on AI Control as a MATS extension
Stipend and expenses to run the second Athena mentorship program for gender-minority researchers in technical AI alignment
French/Australian filmmaker and video producer raising awareness on the risks of AI
Rafal Rohozinski is a CIGI senior fellow and principal of the SecDev Group, where he leads its geopolitical digital risk practice, drawing on extensive experience advising the United Nations and other institutions on cybersecurity and the governance of cyberspace.

Nick Hollman is a legal and AI governance researcher who worked as a Research Assistant at the Legal Priorities Project (later the Institute for Law & AI), where he focused on the long-term challenges of artificial intelligence in judicial systems. He received a $24,000 grant from the Long-Term Future Fund in November 2020 to research and advise legal practitioners on AI in the judiciary, including collaboration with advisors to the Indian Supreme Court. He co-authored "Value Alignment for Advanced Artificial Judicial Intelligence" with Christoph Winter and David Manheim, published in the American Philosophical Quarterly in 2023, which applied AI safety and alignment frameworks to the governance of advanced judicial AI systems. He also contributed to "Legal Priorities Research: A Research Agenda" (2021), a foundational paper for the Legal Priorities Project. Hollman holds a B.A. in Cognitive Science from the University of Michigan (class of 2020) and subsequently moved into a Development Coordinator role at the Reporters Committee for Freedom of the Press.

Suzy Shepherd is a British film editor and director based in Oxford, UK, specialising in documentary short-form and factual content. She holds a BA in Classics from Balliol College, Oxford (2016) and is pursuing an MFA in Film Editing at the National Film and Television School (NFTS). She has created films for the University of Oxford, the WHO, NHS England, and the BBC, and co-directed a documentary that premiered at BFI Flare Festival 2022. Her fiction short 'Mischief Managed' has accumulated over 2 million YouTube views. She received a grant from the Long-Term Future Fund to develop a short fiction film on AI x-risk at a top film school, resulting in 'Writing Doom' (2024), which won the $20,000 Grand Prize at the Future of Life Institute's Superintelligence Imagined contest and has reached over 500,000 views. Her work is noted within the effective altruism community for communicating AI safety concerns through creative storytelling.
Author of "Interplay" (2020), explores the development of generational innovation management programs tailored to address evolving societal cultures across different age groups. With a focus on building capabilities and resources for transformative movements, Laurel brings extensive experience in the venture building sphere. She dissects the valuation of technological breakthroughs and draws inspiration to map their impact on the global human trajectory. Notably, Laurel leads pioneering advancements in substrate intelligence, impacting both robot-intelligence and water-soil relations.
6-month salary for me to continue the SERI MATS project on expanding the "Discovering Latent Knowledge" paper
No summary available yet.
No summary available yet.
Sarah Schwettmann is Chief Science Officer at Transluce and a research scientist at MIT CSAIL with the MIT-IBM Watson AI Lab. Her work focuses on developing tools for understanding artificial neural networks and she holds a PhD in Brain and Cognitive Sciences from MIT, where she was an NSF fellow.
No summary available yet.
No summary available yet.
6-months salary to accelerate my plans of upskilling in order to work on the issue of AI safety
No summary available yet.
Felecia Webb is Chief Strategy Officer, Philanthropy and Partnerships at Partnership on AI, where she is responsible for sustainable growth, partner engagement, and amplifying the organization’s global impact on responsible AI. A social impact strategist and advocate for equity with more than 20 years of experience across the nonprofit and private sectors, she has led data-driven initiatives in arts and culture, social services, workforce, and youth development.
No summary available yet.
Michael Klein is Senior Director for Preparedness and Response at the Institute for Security and Technology, where he focuses on improving the resilience of “target rich, cyber poor” critical infrastructure sectors, drawing on nearly 20 years of experience in K‑12 education and federal cyber policy.
The self-study section of AISafety.com curates courses, textbooks, and reading lists for independent learning in AI safety, covering both technical alignment and AI governance.
Aza Raskin is a technologist and interface designer who co‑founded the Center for Humane Technology and the Earth Species Project. Trained as a mathematician and physicist, he has founded multiple companies, helped shape the Emmy‑winning documentary The Social Dilemma, and co‑hosts the podcast Your Undivided Attention about the societal impacts of information technology.
Lily Ottinger is ChinaTalk’s managing editor and a researcher. She holds a degree in mathematics, learned Mandarin to fluency while teaching policy debate in Taiwan, previously worked as an assistant researcher to Professor J. Andrés Gannon, and is an Emergent Ventures grant recipient with research interests including Sino-Russian relations, Chinese influence in Central Asia, and the diplomacy of unrecognized states.
Year-long stipend to work as the primary maintainer of TransformerLens, and implement large changes to the code base.
Jackson Dean is an engineer at TamperSec based in San Francisco, with prior experience at companies including Cricut, Kairos Autonomi and the Lassonde Entrepreneur Institute.
James (Jim) W. Hinton is a CIGI senior fellow and intellectual property lawyer, founder of Own Innovation and co‑founder of the Innovation Asset Collective, who focuses on patents, trademarks and innovation policy for Canadian technology companies.
Luis Cosio is a Mexico City–based technologist and entrepreneur with about 15 years of experience at the intersection of cloud computing, cybersecurity, and artificial intelligence, who has architected and launched national‑scale e‑government systems and now serves as technical staff on IST’s Security Level 5 Task Force.
No summary available yet.
No summary available yet.
Devina Jain is an AI safety researcher whose work includes leading the "Red-teaming with Mech-Interpretability" project at Apart Research, developing mechanistic-interpretability-based tools to improve red-teaming efficiency and evaluation of large language models.
Dave Cortright is a UP Coach at Upgradable who helps clients clarify and live their true lives. He is a certified professional coach who draws on multiple coaching approaches to craft personalized programs for each client.
Lee Sharkey is a mechanistic interpretability researcher and Principal Investigator at Goodfire AI, based in London. He co-founded Apollo Research, where he served as Chief Strategy Officer, and previously worked as a Research Engineer at Conjecture. His academic background spans preclinical medicine and neuroscience at the University of Cambridge, an MSc in Data Analytics from the University of Glasgow, and an MSc in Neural Systems and Computation from the University of Zurich and ETH Zurich; he also worked in international public health before transitioning to AI research. He is best known for early foundational work on sparse autoencoders (SAEs) as a solution to representational superposition in neural networks, and more recently has developed Attribution-based Parameter Decomposition (APD) and Stochastic Parameter Decomposition (SPD) as improved approaches to reverse-engineering neural network mechanisms. He is the lead author of the 2025 paper "Open Problems in Mechanistic Interpretability," a comprehensive review published in TMLR co-authored with approximately 30 researchers. He also mentors scholars in the MATS program and has contributed key alignment research including work on goal misgeneralization in deep reinforcement learning.
No summary available yet.
Program Officer at Coefficient Giving focused on AI governance and policy, and former Director of FAR Labs, with prior experience at Google and other technology and forecasting-focused organizations; he also serves on the board of the Quantified Uncertainty Research Institute.
No summary available yet.
No summary available yet.
Funding For Humanity: An AI Risk Podcast
6-month salary to dedicate full-time to upskilling/AI alignment research tentatively focused on agent foundations
Request for Retroactive Funding
No summary available yet.
Julius Adebayo is the co-founder and CEO of Guide Labs, a San Francisco-based startup building interpretable and auditable AI systems. He holds a PhD in Electrical Engineering and Computer Science from MIT and is known for research on machine learning interpretability and fairness, including work showing that many saliency-based explanation methods are unreliable.
CEO of Consultants for Impact and social entrepreneur who helped found Accenture’s EA workplace group (the Intentional Impact Collaborative) and Sulis, a venture developing solar-powered water treatment technology for communities in India.
No summary available yet.
6-month stipend on evaluating robustness of AI agents safety guardrails and for running an AI spear-phishing study
No summary available yet.
No summary available yet.
No summary available yet.