The Society Library
A nonprofit that archives humanity's ideas, ideologies, and world-views through structured debate mapping, with a focus on AI safety, alignment, and democratic governance of AI.
Loading results...
Showing 251-300 of 4527 results
A nonprofit that archives humanity's ideas, ideologies, and world-views through structured debate mapping, with a focus on AI safety, alignment, and democratic governance of AI.
No summary available yet.
Kelsey Piper is an American journalist and staff writer at The Argument who previously wrote for Vox’s Future Perfect column, covering global challenges and catastrophic risks from an effective altruism perspective.
Malcolm Murray is Research Lead at SaferAI, where he heads work on quantitative risk assessment for large language models and advanced AI risks such as cybersecurity and biosecurity. He is an AI risk management expert with over two decades of experience in risk and strategy, previously serving as Chief of Research for Risk and Audit and Managing Vice President at Gartner and advising CEOs, prime ministers, and chief risk officers across the US, Europe, and Asia. He holds an MBA from INSEAD, a M.Sc. in Business and Economics from the Stockholm School of Economics, and a MIM from HEC, is a Good Judgment Project Superforecaster, and has been a CFA charterholder.
AFFINE (Agent Foundations FIeld NEtwork) runs intensive superintelligence alignment seminars and fellowships to upskill promising newcomers in agent foundations and AI alignment research.
No summary available yet.
No summary available yet.

Garrett Baker is an independent AI alignment researcher based in Berkeley, CA. He works on using singular learning theory (SLT), neuroscience, and reinforcement learning to build mathematically grounded theories for how values develop during training in ML systems. He has participated in the MATS program twice — as a MATS 3.0 scholar working on mechanistic interpretability of maze-solving agents under Alex Turner, and in the MATS 5.0/5.1 developmental interpretability stream — and has received funding via Manifund for both a MATS stipend and a full-time research salary. His research investigates epoch-wise critical periods in neural networks through an SLT lens, explores connections between ML inductive biases and neuroscience, and aims to create training stories that could produce inner-aligned AI. He is an active contributor to LessWrong and the AI Alignment Forum under the handle d0themath, with over 77 posts and 6,600 karma.
Building model organisms of CoT and Python packages for intervention in reasoning traces
No summary available yet.
A sovereign, encrypted, sharable, persistent memory protocol for AI agents.
No summary available yet.
No summary available yet.
No summary available yet.
No summary available yet.

Project manager at 1Day Sooner. Focused on biosecurity and policy.
No summary available yet.
No summary available yet.
Building a transparent, symbolic AGI that runs millions of tokens/sec on CPUs, making safe, explainable AI accessible to everyone.
Yaya Shi is the Lab Manager at the Institute for Advanced Consciousness Studies and a mental health clinician in training. Her interests span clinical and social neuroscience, particularly the neural and psychological mechanisms underlying attachment, grief, emotional resilience, and transformative emotional states, and she is motivated to translate consciousness research into accessible clinical tools and public engagement.
Cameron King is Operations Lead at Animal Advocacy Africa, having moved from running an e-commerce business to charity entrepreneurship and remaining active in the effective altruism community for over a decade.
Founder of CIRIS, open-source accountability infrastructure for autonomous AI: cryptographic attestation, runtime conscience, the Coherence Ratchet. Live in 14 languages, AGPL, mission-locked. Formerly IBM Associate Partner and AWS Professional Services.
Research on how much language models can infer about their current user, and interpretability work on such inferences
No summary available yet.
No summary available yet.
Mateusz Bagiński is a Polish AI safety researcher currently based in Tallinn, Estonia. He holds a BSc and MSc in cognitive science, and previously worked as a programmer at a startup developing software for enhancing collective sense-making. He transitioned into technical AI safety research after completing his dissertation, receiving a Long-Term Future Fund grant to skill up and gain experience working on AI safety full-time. In 2024, he was a PIBBSS Fellow mentored by Tsvi Benson-Tilsen (ex-MIRI), where he conducted a conceptual investigation of the core drivers of goal-achieving mental activity using the hermeneutic net method, presenting preliminary results at the PIBBSS Symposium '24 under the title "Fixing our concepts to understand minds and agency." His research focus is on theoretical and agent foundations work. He is active on LessWrong and the EA Forum, and has co-authored posts on AI safety policy including arguments for why safety-concerned researchers at capabilities labs should speak out publicly. He is the organizer of the AFFINE Superintelligence Alignment Seminar, a five-weekend intensive program in Hostačov, Czech Republic bringing together approximately 35 participants with leading mentors in the field.
6-month salary for an AI alignment research project on the manipulation of humans by AI
Canada's national AI safety institute, established by the federal government in November 2024 to advance the science of AI safety and ensure governments can understand and act on risks from advanced AI systems.
Spend 3 months (part time) assessing plausible pathways to slowing AI
No summary available yet.
Jeyashree Krishnan (JK) is a researcher at Apart Research, works on generative AI products in Siemens Corporate IT, and is a researcher at RWTH Aachen’s Center for Computational Life Sciences, with expertise spanning interpretability, AI safety and risk, time series modelling, and computational biology.
PhD student at the University of Vermont and Software Engineer. I'm interesting in Programming Languages, Formal Methods, and AI Safety.
Clare Diane Harris is a Research Associate at Macroscopic Ventures, where she researches societal long-term risks; she is a medical doctor who now primarily conducts non-clinical research for organizations aiming for positive social impact.
Suren Pahlevan is a PhD student in the Faculty of Music at the University of Cambridge and a Student Fellow at the Leverhulme Centre for the Future of Intelligence. His ethnomusicological doctoral research, funded jointly by the Arts and Humanities Research Council and the Isaac Newton Trust, examines how British producers in genres such as pop, hip‑hop, R&B and EDM are incorporating AI tools into digital audio workstation production and what this implies for the design of ethical music‑AI systems.
No summary available yet.
No summary available yet.
No summary available yet.
No summary available yet.
Fabian Schimpf is an independent AI alignment researcher based in Stuttgart, Germany, supported by a grant from the Long-Term Future Fund. He received the grant to upskill into AI alignment research and conduct independent research on the limits of predictability, with mentorship from Andrea Iannelli at the University of Stuttgart. His research focus is on improving robustness in deep learning and using insights from that field to advance interpretability as a path toward ensuring AI robustly benefits humanity. He has a background in aerospace engineering from the University of Stuttgart, where he worked on autonomous soaring and asteroid exploration at the Flight Mechanics and Controls lab and completed an internship at NASA. He has contributed to approximately ten publications spanning aerospace and machine learning topics. He is active on LessWrong and the AI Alignment Forum under the handle 'fasc', where he has written on robustness in AI alignment and co-authored work on negative side effect minimization as part of an AI Safety Camp project.
independent
Mathematician Researching AI Safety
No summary available yet.
Ryan Khurana is a senior fellow at the Foundation for American Innovation and an AI practitioner based in Toronto. He has helped launch and lead AI products at companies including WOMBO and Maple Leaf Sports & Entertainment and now leads agentic applications at TwelveLabs, alongside prior research and policy roles with organizations such as the Macdonald-Laurier Institute and the Consumer Choice Center.
No summary available yet.
Naoya Okamoto is an early-career researcher exploring AI safety and alignment, based in the United States. They graduated from Fordham University in 2023 and have a background in proof-based mathematics. Inspired by Brian Christian's book The Alignment Problem, Okamoto pursued upskilling in machine learning through the University of Illinois Urbana-Champaign's Mathematics of Machine Learning course in summer 2023, funded by a Long-Term Future Fund grant. After exploring theoretical alignment research, they shifted focus toward empirical alignment research, working through the MLAB curriculum in 2024. They have also interned at the U.S.-Japan Council and volunteered with the Human Restoration Project, a progressive education nonprofit. Outside of AI safety, they are interested in AI policy advocacy and biosecurity.
No summary available yet.
Aleena Khan is Senior Outreach & Program Manager at TechCongress, where she leads fellowship recruitment and selections. Previously she served as Deputy Director for Content at TEDxFoggyBottom and as a Research Assistant at the Institute for International Economic Policy, supporting research on data and ethics. Aleena holds a B.A. in Political Science with a focus in Public Policy from The George Washington University and is pursuing a Master of Public Administration at American University.
Dr Keegan McBride is Director of Science & Technology at the Tony Blair Institute for Global Change, leading work on AI, digital government and technology policy, including how states can harness emerging technologies to improve competitiveness and public services. Previously he was a departmental research lecturer in AI, Government and Policy at the Oxford Internet Institute, where his research examined digital government, AI in the public sector and the future of the state in the digital age.
AI safety organiser and writer based in Sydney who co-founded AI Safety Australia and New Zealand, has been involved in the AI safety space for more than half a decade, leads local movement-building in Australia and New Zealand, and has experience including a summer fellowship with the Stanford Existential Risk Initiative, facilitation for BlueDot Impact and the Center for AI Safety, and organising the Sydney AI Safety Fellowship.