AGI Inherent Non-Safety
A research project developing non-maximizing, aspiration-based designs for AI agents that avoid objective function maximization, arguing that such optimization is inherently unsafe in sufficiently capable AGI systems.
Loading results...
Showing 2101-2150 of 3786 results
Clear filtersA research project developing non-maximizing, aspiration-based designs for AI agents that avoid objective function maximization, arguing that such optimization is inherently unsafe in sufficiently capable AGI systems.
Showing 2101-2150 of 3786 results
Active filters: Type: Org, Individual
Clear filters to view everything →AI/AI safety Researcher.
No summary available yet.
Director of the UC Berkeley Cybersecurity Clinic and lecturer at the School of Information, Elijah Baucom is a cybersecurity and privacy technologist, engineer, and activist focused on the intersection of technology, humanity, liberation, and political education; through the clinic he teaches and trains students to consult with and support social‑sector organizations that are particularly susceptible to ideologically motivated cyber attacks, and he is also the founder of Everyday Security, which provides cybersecurity and IT support to human rights and movement organizations.
Founder of PauseAI, CEO of Ontola.io
Technical Program Manager at the Technical Alignment Research Accelerator (TARA) and Master of AI student at UNSW, with a background including a Bachelor of Science in Mathematical Physics and a Diploma in Music from the University of Melbourne, completion of TARA v1 with a capstone project on causal interventions on a chess model, acceptance into the SPAR program, leadership roles in Effective Altruism University of Melbourne, and work on AI-focused education seminars.
Alex Flint is an independent AI alignment researcher and monastic. He completed his PhD with the Active Vision Lab at the University of Oxford (2008-2012) in robotics and computer vision on a Clarendon Fund scholarship, following undergraduate studies in computer science at the University of Adelaide. After his PhD he led computer vision engineering at Flyby Media in New York City, then co-founded Zippy.ai, a robotics startup focused on last-mile delivery, which was acquired by Cruise (GM's autonomous vehicle division) in 2018. He subsequently left industry to pursue independent AI alignment research and is affiliated with the Monastic Academy for the Preservation of Life on Earth in Vermont. His alignment research focuses on foundational theories of optimization, knowledge, and agency, including a widely cited post "The ground of optimization" selected as Best of LessWrong 2020. He has received funding from the Long-Term Future Fund for independent research and is active on LessWrong and the AI Alignment Forum.
Ben Hoffman has worked as a researcher for GiveWell and the Open Philanthropy Project and in financial services managing a risk analytics team, and he holds an MS in Mathematics and Statistics from Georgetown University and a BA from St John’s College.
Buck Shlegeris is the CEO of Redwood Research, a nonprofit AI safety and security lab, where he leads work on AI control and related empirical safety research; he previously worked on AI alignment and outreach at the Machine Intelligence Research Institute (MIRI).
No summary available yet.
No summary available yet.
Lorenzo is a full‑stack software engineer at Giving What We Can, supporting product development and maintaining the organisation’s technical infrastructure. Originally from Italy, he previously did small‑scale earning to give as a software developer in the Netherlands and, after winning a donor lottery in 2022, moved back to Italy to reflect on how to do more good.
No summary available yet.
Researcher at the Machine Intelligence Research Institute working on the foundations of reasoning and agency, including logical uncertainty and decision theory; studied pure mathematics at the University of Waterloo and previously worked at Google on automatic construction of deep learning models.
Trajectory Labs is a nonprofit coworking and events space in downtown Toronto dedicated to AI safety research and community building. It provides free workspace, weekly events, and a peer network to grow Toronto's AI safety ecosystem.
test bio
No summary available yet.
Assistant Professor in Natural Language Processing at the University of Vienna, working in natural language processing and related areas of computational linguistics.
Alex Turner is a research scientist at Google DeepMind on the scalable alignment team, where he works on training invariants into model behavior and related alignment problems. He previously formulated and proved the power-seeking theorems, co-formulated the shard theory of human value formation, and proposed the Attainable Utility Preservation (AUP) approach to penalizing negative side effects, and he now mentors researchers through the MATS program’s Team Shard stream.
No summary available yet.
No summary available yet.
No summary available yet.
No summary available yet.
No summary available yet.
Executive Director of Probably Good, whose widely read writing on applied psychology and philosophy has been linked from outlets such as The New York Times and TechCrunch; holds a PhD from the University of Cambridge, where he is an affiliated lecturer teaching quantitative and computational research methods.
Helen Edwards is co-founder of the Artificiality Institute, a nonprofit that explores how AI is transforming how people think, decide and live. She has led large-scale technology transformations as CIO of New Zealand’s national grid and held executive roles at organizations including Pacific Gas & Electric, Fonterra, Meridian Energy, Quartz and Transpower. She also serves as a commissioner on Oregon’s Higher Education Coordinating Commission and is a visiting researcher at UC Berkeley’s Center for Human-Compatible AI.
A leading Israeli research university home to the Governance of AI Lab (GOAL), which conducts cross-disciplinary research on AI governance, legal alignment, and the safe development of advanced AI systems.
Josh Rosenberg is the Chief Executive Officer and a co-founder of the Forecasting Research Institute, where he oversees research, strategy, fundraising, and operations with a focus on making forecasting useful to decision-makers. Previously, he was a Senior Advisor and Senior Research Manager at GiveWell, working on management, research, grantmaking, and hiring, and he holds a B.A. in Economics and Philosophy from Pomona College.
One of the world's oldest and most prestigious research universities, Oxford has been a central hub for AI safety and existential risk research through institutions like the Future of Humanity Institute (FHI) and the Oxford Martin AI Governance Initiative (AIGI).
5050 is a free 12-14 week company-builder program run by Fifty Years that helps scientists, researchers, and engineers become deep-tech startup founders, with a dedicated AI safety track.
No summary available yet.
President and Partner at Flow Research Collective and Venture Partner at multiple early-stage funds including Lionheart Ventures, Pioneer Fund, Essence Venture Capital and 1024 Ventures, as well as Limited Partner Advisor at GTMfund.
No summary available yet.
No summary available yet.
No summary available yet.

Paul Colognese is an AI safety researcher based in the United Kingdom, focused on understanding how AI systems represent and self-reflect on their goals and beliefs. He holds a PhD in Mathematics (Geometry, Topology, and Dynamical Systems) from the University of Warwick, where he researched translation surfaces under the supervision of Professor Mark Pollicott. His AI safety work includes building safety evaluations for Anthropic and the UK AI Security Institute, including control and sabotage evaluations that measure whether deployed AI agents could undermine safety systems. He has conducted AI threat modeling on catastrophic risk scenarios and carried out interpretability research in which he demonstrated the ability to detect an AI system's objectives through technical analysis. He participated in the MATS research program mentored by Evan Hubinger at Anthropic. He founded the London Initiative for Safe AI, a UK-based research center, and serves as AI Alignment Lead at the Center for the Study of Apparent Selves. He is active on LessWrong and the Alignment Forum, and participated in AI Safety Camp (AISC9) working on detecting agentic AI objectives via interpretability methods.
No summary available yet.
No summary available yet.
Director of Safe AI Germany
No summary available yet.
No summary available yet.
Sam F. Brown is an independent AI alignment researcher based in Oxford, UK. He has a background in physics and programming, and previously worked at a climate technology startup before pivoting to full-time alignment research. He has received two grants from the EA Long-Term Future Fund: an initial six-month grant (approximately £40,000) for research on goal-inference and choice-maximisation, and a subsequent twelve-month grant ($82,298) to research technical approaches to value lock-in and minimal paternalism. His work explores empowerment-based alignment — the idea of maximising humans' capacity to reach diverse future outcomes rather than inferring and locking in specific human values. He has published research essays on LessWrong and the EA Forum, including "The Empowerment of Others" and "Questions about Value Lock-in, Paternalism, and Empowerment". He is connected to the Oxford rationalist and EA community and works from spaces including Trajan House, the Centre for Effective Altruism's Oxford building.
Co-director of the Cambridge AI Safety Hub, working on AI safety fieldbuilding in Cambridge, UK. Previously a political science PhD candidate at SUNY Stony Brook, with experience in research and teaching before moving full-time into AI safety.
No summary available yet.
No summary available yet.
No summary available yet.
No summary available yet.
No summary available yet.
Director of Operations at the AI Safety Awareness Project. Trained as a mechanical engineer, he previously worked as a design engineer at Boeing across commercial and defense programs, focusing on structural design, destructive testing, and product development. He holds a B.S.E. and an M.S. in Mechanical Engineering from Arizona State University and, beginning in 2024, transitioned from aerospace engineering into operational work and AI safety outreach.