No summary available yet.
- Team
- Individual
- Endorsed by
- No endorsements yet
Loading results...
Showing 601-650 of 2714 results
Clear filtersNo summary available yet.
No summary available yet.
No summary available yet.
No summary available yet.
No summary available yet.
No summary available yet.
No summary available yet.
Thomas H. Costello is an Assistant Professor in Social and Decision Sciences at Carnegie Mellon University, where he directs the Viewpoints Lab. He also holds affiliated faculty appointments at CMU's Human-Computer Interaction Institute and serves as a Research Affiliate at MIT's Sloan School of Management. He earned his PhD from Emory University under Scott Lilienfeld and completed a postdoctoral fellowship at MIT Sloan with David Rand and Gordon Pennycook. His research integrates psychology, political science, and human-computer interaction to study belief formation, attitude change, and the societal impacts of artificial intelligence on persuasion and misinformation. His landmark 2024 Science paper demonstrated that personalized AI dialogues (using GPT-4 Turbo) reduced conspiracy beliefs by approximately 20% in a large sample, with effects persisting for two months, earning the 2026 AAAS Newcomb Cleveland Prize. A subsequent 2025 Nature paper examined AI's persuasive power in political contexts, and his work on frontier LLMs' ability to persuade humans on extreme and hazardous topics directly addresses AI safety concerns around manipulative AI systems. He is a Research Affiliate at FAR.AI and was named an APS Rising Star in 2025.
Eli Lifland is a forecaster focused on AI alignment who writes the Foxy Scout blog and is ranked first all‑time on CSET‑Foretell/INFER, with strong results in Metaculus tournaments such as the Economist and Salk forecasting challenges.
No summary available yet.
No summary available yet.
Philipp Bongartz is a German machine learning engineer and independent AI alignment researcher who received a Long-Term Future Fund grant of approximately $25,000 for prosaic alignment research using chess as a model domain. He has developed a research agenda centered on building a multi-modal chess-language model with an encoder-decoder architecture, using chess as a testbed for investigating symbol grounding and truthfulness in AI models. His GitHub projects include Chess2Vec, ChessTransformer, and other chess-related ML tools, as well as bioinformatics work suggesting a computational background spanning machine learning and life sciences. He holds a PhD (defended in early 2020) from the Heidelberg Institute for Theoretical Studies (HITS). He has been an active contributor to the LessWrong and Alignment Forum communities under the handle p.b. since December 2020, with over 26 posts and 295 comments on topics ranging from chess AI to model scaling and alignment methodology. He has worked as a Senior Consultant and Data Scientist at Exxeta AG in Germany and is a rated chess player with a FIDE standard rating of approximately 2155.
No summary available yet.
Cole Wyeth is a PhD student in computer science at the University of Waterloo, supervised by Professor Ming Li and advised by Professor Marcus Hutter. He holds an M.S. in mathematics from the University of Minnesota, Twin Cities, and previously worked as a machine learning intern at Dexai Robotics with a publication at ICRA. His research focuses on algorithmic information theory, sequential decision theory, and the AIXI reinforcement learning framework, with a particular interest in theoretical AI safety and agent foundations. He has published work on reflective AIXI, embeddedness failures in universal AI, and the grain of truth problem for multi-agent reasoning. He organizes the AIXI research community at uaiasi.com and serves as an advisor to the AI Safety Research Fund. He received a grant from the Long-Term Future Fund to study extensions of the AIXI model to reflective agents in order to understand the behavior of self-modifying AGI.
Physicist, sustainable energy engineer, entrepreneur, and founder-director of the Existential Risk Observatory, focusing on reducing existential risks, especially from advanced AI, by informing public debate.
No summary available yet.
Co‑Director of Kairos, an AI safety organization focused on supporting early‑career people entering the field, and a generalist software engineer with interests in computer science, cybersecurity, and using technology for social impact.
Joseph Bloom is a mechanistic interpretability researcher currently leading the White Box Control Team at the UK AI Security Institute (AISI), where his team focuses on estimating and addressing risks associated with deceptive alignment in frontier AI systems. He studied computational biology and statistics at the University of Melbourne, Australia, and previously worked as a data scientist at Mass Dynamics, a proteomics startup. He became a MATS 5.0 scholar under Neel Nanda and completed the ARENA 1.0 program, during which he developed expertise in sparse autoencoders (SAEs) and mechanistic interpretability of transformer models. He created SAELens, an open-source library for training and analyzing sparse autoencoders on language models (over 1,300 GitHub stars), and served as a maintainer of the TransformerLens library (over 3,200 stars). He also co-founded Decode Research, an AI safety research infrastructure nonprofit, and helped build Neuronpedia, an open platform for hosting and analyzing sparse autoencoder features. Earlier independent work on decision transformer interpretability was funded by the Long-Term Future Fund, Manifund, and Lightspeed Grants.
No summary available yet.
No summary available yet.
Rob Sherman is Vice President and Deputy Chief Privacy Officer for Policy at Meta, where he leads work on privacy, security, and online trust across the company’s products and technologies. Before joining Meta (then Facebook) in 2012, he was an attorney at Covington & Burling advising technology and digital media companies on regulatory and public policy issues.
No summary available yet.
No summary available yet.
Co-founder and Chief Scientist at Goodfire; previously co-founded the interpretability team at DeepMind before moving from London to South Park Commons in San Francisco to build the company’s interpretability research program.
Dr. Jess Whittlestone is Head of AI Policy at the Centre for Long-Term Resilience (CLTR), a UK-based think tank working to reduce extreme risks. In this role she leads CLTR's work developing and advocating for policy recommendations to reduce extreme risks from AI, including frontier AI regulation and model evaluation for catastrophic risks. She was previously a Senior Research Fellow and Deputy Director of the AI: Futures and Responsibility Programme at the Leverhulme Centre for the Future of Intelligence and the Centre for the Study of Existential Risk (CSER) at the University of Cambridge. She holds a PhD in Behavioural Science from the University of Warwick and a first-class degree combining Mathematics and Philosophy from the University of Oxford. She is also an affiliate of the Centre for the Governance of AI (GovAI) and has received funding from the Long-Term Future Fund to research the links between short- and long-term AI policy while building technical ML knowledge. She was named one of TIME's 100 Most Influential People in AI in 2023.
No summary available yet.
No summary available yet.
No summary available yet.
Imran Khan is a strategist and writer working at the intersection of science, technology, and society, with a focus on AI safety, psychedelics, and how emerging technologies reshape human experience. He publishes essays and resources on these topics at his personal site and collaborates with the Center for Humane Technology on AI‑related explainers and analysis.
No summary available yet.
No summary available yet.
Gregory Kollmer is the Chief Technology Officer and co‑founder at Lucid Computing and holds a Bachelor of Science degree from Georgetown University.
No summary available yet.
AI safety engineer at Gray Swan AI and Member of Technical Staff at Cohere, focusing on high-quality, diverse post-training data for large language models and on keeping AI systems safe and secure.
Elav Horwitz is Chief Innovation Officer at WPP, where she leads applied innovation and helps global brands harness emerging technologies such as generative AI, XR, and Web3 to drive growth. She has over two decades of experience in technology-driven marketing and previously led innovation and partnerships for McCann Worldgroup.
Conrad Helminger is a Senior Associate at the Effective Institutions Project, supporting research, monitoring, and prioritization across institutions and issue areas. He holds an MA in International Political Economy and is a PhD candidate at the University of Amsterdam, where he studies global shifts in power, norms, and institutions.
Sara Price is a Member of Technical Staff at Anthropic working on AI safety, based in the Bay Area. She has worked in machine learning since 2016 and transitioned into AI alignment research in early 2024 through the MATS 5.0 (Spring 2024) program, after which she received a 6-month stipend to continue independent research on situational awareness and deception in AI systems. Her research focuses on adversarial robustness of multimodal LLMs, scheming and deception, control, and model organisms of misalignment. She was a co-author on Petri, an open-source AI safety auditing tool developed at Anthropic that automates evaluation of concerning model behaviors such as deception, sycophancy, and self-preservation. She now serves as a mentor for the MATS Summer 2026 program in the Anthropic and OpenAI Megastream.
Ihor Kendiukhov is a bioinformatician and machine learning and quant finance professional whose research spans AI for biology, AI safety, econophysics, and longevity; he has authored work on single-cell foundation model interpretability and other ML applications in biology, and holds investment and deep-tech roles such as Deputy General Partner at Deep Knowledge Group.
Zaha Hassan is a human rights lawyer and senior fellow in the Middle East Program at the Carnegie Endowment for International Peace; her work focuses on Palestinian‑Israeli peace efforts, the use of international legal mechanisms by political movements, and U.S. foreign policy in the region.
No summary available yet.
No summary available yet.
No summary available yet.
Hafsa Ahmed is a Grants Associate on Coefficient Giving's Grants Management team. She joined in January 2025 after working in a variety of UK and US non-profit operations roles and leading a youth mental health startup in the UAE. She holds an MSc in International Social and Public Policy (Development) from the London School of Economics.
No summary available yet.
Daniel Privitera is the founder and Executive Director of the KIRA Center, an independent AI policy non-profit based in Berlin. He serves as Lead Writer of the International AI Safety Report, which is backed by dozens of governments and international organisations, and holds a master’s degree in economics from the University of Oxford with work at the intersection of economic research and machine learning.
Casey is an Operations Associate at Giving What We Can, helping to ensure smooth operations across the organisation. She brings broad experience from work with peak bodies, universities, research institutes and international organisations across Australia and Europe, and holds a Master of Public Health from the University of New South Wales plus bachelor’s degrees in Psychology and Anthropology from the University of Sydney.
AI policy lead at Observatorio de Riesgos Catastróficos Globales, working on EU AI regulation, international governance, and large-scale AI risk assessments.
Roger Grosse is a Director of The AI Safety Foundation and an Associate Professor of Computer Science at the University of Toronto, where he holds the Schwartz Reisman Chair in Technology and Society and a Canada CIFAR AI Chair. His research focuses on deep learning, neural network training dynamics and AI alignment, and he also works on alignment and training-data attribution as a member of Anthropic’s Alignment Science team.
No summary available yet.
No summary available yet.