No summary available yet.
Database
Showing 151-200 of 308 results
No summary available yet.
Pushing the bounds of organized digital systems
General Support for an AI Safety evals for-profit
Empirical measurement infrastructure proving whether AI systems hold their constraints under real operating conditions.
4M+ views on AI safety: Help us replicate and scale this success with more creators
AI-Plans is a platform for discovering, critiquing, and advancing AI alignment strategies, hosting a contributable compendium of alignment plans and running community research events.
No summary available yet.
A model-agnostic benchmark for detecting deceptive reasoning in LLMs through behavioral fingerprints — no weight access required.
LawZero is a nonprofit AI safety research organization founded by Yoshua Bengio to develop safe-by-design AI systems that cannot act autonomously or pursue hidden goals.
UC Berkeley's Center for Long-Term Cybersecurity (CLTC) is a research and collaboration hub advancing future-oriented cybersecurity research, policy, and education, with a growing focus on AI safety governance and risk management for frontier AI systems.
No summary available yet.
Identifying and auditing reasoning circuits in LLMs within Algoverse 2026 using Sparse Autoencoders (SAEs).
No summary available yet.
Early exploration, agenda-setting, technical infrastructure, and early community building
181,448 evaluations proving no production AI model reliably maintains corrections. Expanding coverage and pursuing multi-pass validation.
Creating a contest for Robust, Detailed Proposals and Red-Teaming of AI Safety Plans: Fast Action for Safe Transformative AI
I plan to investigate what realistic RL training conditions might lead to LLMs developing steganographic capabilities.
No summary available yet.
No summary available yet.
No summary available yet.
Horizon Events is a Canadian non-profit that advances AI safety R&D by organizing high-impact events, including the AI Safety Unconference series and monthly Guaranteed Safe AI Seminars.
No summary available yet.
Funding For Humanity: An AI Risk Podcast
Does harmful fine-tuning data cause broad misalignment only when the model already recognises the target behaviour as a norm violation?
One year of bootstrapped development, four patent filings, seeking support to continue.
Train LLMs to accurately & honestly report on their internal decision-making processes through real-time introspection
Open-Source Runtime Governance Architecture for Structural Alignment Drift in Long-Running AI Agents
Educating the general public about AI and its risks in the most efficient ways, and leveraging this to achieve good policy outcomes
Funding the open-source launch of a working claim-state system and the local firewall bridge that carries verification before voice into governed agent action.
Support my postgraduate law studies and research in AI Governance
12 months of funding for 3 people to work full-time on projects supporting AI safety efforts
No summary available yet.
Funding ends June 2025: Urgent support for proven AI safety pipeline converting technical talent from 26+ countries into published contributors
Help fund our student’s trip to NeurIPS to present his main conference paper on interpretable features in text-to-image diffusion models.
Designing a Project Funding Proposal
A tech-infused immersive musical. Experience the future of storytelling where artificial intelligence meets the depths of human emotion.
The US fundraising arm of the ETH Zurich Foundation, enabling American donors to make tax-deductible gifts that support research, teaching, and talent at ETH Zurich in Switzerland.
No summary available yet.
A scalable, non-infohazardous way to quickly upskill via digestible, repeatable exercises from papers and workshops.

I want to work in AI Safety full time - help me transition my career!
Translating in-person convening to measurable outcomes
Putting explainability at the forefront of AI text detection
AI Safety lab focusing on technical alignment and governance of AI in Africa and the Global South more broadly. We are a grassroots, community-led research lab.
Help us fund 2-3 new employees to support our team
An early-stage AI safety research group based in Sydney, Australia
Mox is San Francisco's largest AI safety coworking and community space, providing workspace, events, and fellowships for researchers and organizations working on high-impact problems.
Creating a fund exclusively focused on supporting AI Safety Research
Collective Action for Existential Safety (CAES) catalyzes coordinated action to reduce existential risks from AI, nuclear weapons, and engineered pandemics. It is an initiative of the Center for Existential Safety, a newly formed U.S. nonprofit.
Athena is a hybrid mentorship program for women in technical AI alignment research, combining remote mentorship with an in-person retreat to build skills, networks, and representation in the field.
Combining Kickstarter-style functionality with transitional anonymity to decrease the risk and raise the expected value of participating in collective action.