Matt Souza
Humanitarian interested in tech
Loading results...
Showing 2101-2150 of 3281 results
Clear filtersHumanitarian interested in tech
Jason Gross is a computer scientist and entrepreneur, co‑founder of Theorem (Theorem Labs), an AI and programming languages research lab focused on program verification; he holds a PhD in Electrical Engineering and Computer Science from MIT, where his research improved proof assistants and contributed verified cryptography now securing large volumes of HTTPS traffic.
Ajeya Cotra works at METR on threat modeling and risk assessment for loss-of-control risks from advanced AI. She previously led the technical AI safety program at Open Philanthropy (now Coefficient Giving), where she developed the influential “biological anchors” framework for forecasting when transformative AI might arrive.
Aaquib Syed is a CS and Mathematics undergraduate student at the University of Maryland, College Park, where he is a Banneker-Key Scholar. He was a fellow in the MATS 5.0 program under Neel Nanda's supervision, conducting mechanistic interpretability research on how refusal is implemented in large language models. His most notable work, "Refusal in Language Models Is Mediated by a Single Direction" (NeurIPS 2024), co-authored with Andy Arditi and others, showed that refusal behavior across 13 open-source chat models is controlled by a single direction in the residual stream. He also co-authored "Attribution Patching Outperforms Automated Circuit Discovery" (NeurIPS 2024 ATTRIB workshop) and work on mechanistic unlearning and model pruning. He is currently a Student Researcher at Google DeepMind on the Frontier Safety team, where he focuses on evaluating and forecasting dangerous AI capabilities.
No summary available yet.
No summary available yet.
No summary available yet.
No summary available yet.
No summary available yet.
No summary available yet.
Robert Miles (also known as Rob Miles) is a British science communicator and educator specializing in AI safety and alignment. He studied Computer Science at the University of Nottingham and dropped out of his PhD program around 2011 to focus on AI safety communication full time. He began creating content for the popular Computerphile YouTube channel around 2015 before launching his own channel, Robert Miles AI Safety, which has accumulated over 169,000 subscribers and more than 7.5 million views covering topics such as the orthogonality thesis, instrumental convergence, and inner misalignment. He is the founder of AISafety.info (also known as Stampy), a community-written interactive FAQ about AI existential risk, and has run the Distillation Fellowship, a paid program funding writers to distill AI safety research into accessible content for the site. He has co-produced the Alignment Newsletter Podcast with Rohin Shah and has received funding from the Long-Term Future Fund to support his educational work. He collaborates with organizations including MIRI and the Future of Humanity Institute to help communicate their research to broader audiences.
No summary available yet.
German computer scientist and leading machine learning researcher, Scientific Director of the ELLIS Institute Tübingen and Director of the Empirical Inference Department at the Max Planck Institute for Intelligent Systems, whose work focuses on machine learning and causal inference with applications ranging from astronomy and computational photography to robotics.
carer of life
No summary available yet.
Ankit Panda is the Stanton Senior Fellow in the Nuclear Policy Program at the Carnegie Endowment for International Peace, specializing in nuclear strategy, escalation dynamics, missiles and missile defense, space security, and U.S. alliances.
No summary available yet.
No summary available yet.
No summary available yet.
Storyboard artist and 2D animator known for work on the independent series HEATHENS, The Amazing Digital Circus, and Rational Animations’ YouTube videos.
No summary available yet.
No summary available yet.
No summary available yet.
No summary available yet.
AI safety researcher & pioneer of distributed systems for finance
Building an AI Safety & Responsible AI Ecosystem | New Zealand
No summary available yet.
President and CEO of Foresight Institute, directing programs in AI, longevity biotechnology, molecular nanotechnology, neurotechnology, and space; founder of ExistentialHope.com, co-editor of Superintelligence: Coordination & Strategy, and co-author of Gaming the Future.
three personalities in one
Experienced management consultant and serial entrepreneur. Seeks to maximise personal and professional impact.
No summary available yet.
No summary available yet.
Sam Clarke is a researcher working at the intersection of AI safety and AI governance. He studied computer science and philosophy at Oxford University, including a master's thesis applying Deep Bayesian Active Learning to the reward modeling approach to AI alignment. In late 2020 he relocated from New Zealand to Cambridge to work as a research assistant at the Leverhulme Centre for the Future of Intelligence, supporting Jess Whittlestone's work on mid-term AI impacts, which led to their co-authored paper 'A Survey of the Potential Long-term Impacts of AI' (AIES 2022). He subsequently held a researcher role at the Centre for the Study of Existential Risk (CSER) at the University of Cambridge, and later became Strategy Manager at the Centre for the Governance of AI (GovAI) in Oxford, where he researches actionable questions related to AI governance field-building strategy. He has also co-authored a chapter on the history of AI existential safety in 'The Era of Global Risk' (2023) and has written on the longtermist AI governance landscape, talent needs, and AI risk scenarios on the EA Forum and LessWrong.
No summary available yet.
No summary available yet.
Associate Professor of Statistics and EECS at UC Berkeley, affiliated with BAIR and CLIMB, and Founder & CEO of the nonprofit research lab Transluce; his research focuses on ensuring that machine-learning systems are understood by and aligned with humans.
Robert F. Trager is Co-Director of the Oxford Martin AI Governance Initiative, International Governance Lead at the Centre for the Governance of AI, and Senior Research Fellow at the Blavatnik School of Government at the University of Oxford. He is a recognised expert in the international governance of emerging technologies, diplomatic practice, institutional design and technology regulation, and has authored books and numerous articles in leading social science journals.
Spencer Case is a Research Fellow at Sentience Institute with a PhD in philosophy from the University of Colorado Boulder and experience as an international research fellow at Wuhan University. His work focuses on debates about moral realism and other topics in moral and political philosophy, he is coauthor of the book Is Morality Real? A Debate and is working on a second book, Why Its OK to Be Patriotic, in addition to publishing essays in outlets such as National Review and Quillette and hosting the Micro-Digressions philosophy podcast.
No summary available yet.
No summary available yet.
Noam Kolt is an Assistant Professor at the Hebrew University of Jerusalem’s Faculty of Law and School of Computer Science and Engineering, where he leads the Governance of AI Lab (GOAL). His research focuses on the governance of AI, automation in the legal system, and institutional design for advanced AI, including the governance of autonomous agents and empirical evaluations of legal alignment. Kolt completed his doctorate at the University of Toronto, where he served as a research advisor to Google DeepMind and was a member of OpenAI’s GPT-4 red team, and he now holds affiliate roles with institutes such as the Schwartz Reisman Institute for Technology and Society and the Institute for Law & AI.
Satvik Golechha is a Research Scientist at the AI Security Institute (AISI), a directorate of the UK Department for Science, Innovation and Technology, where he works on frontier alignment, mechanistic interpretability, and reinforcement learning. He completed a B.E. in Computer Science from BITS Pilani, India, and previously worked as a researcher at Microsoft Research and as an Associate Research Scientist at Wadhwani AI on applied machine learning for healthcare. He participated in the MATS Summer 2024 program under mentor Nandi Schoots, working on neural network modularity in the interpretability stream, and also conducted independent research at the Center for Human-Compatible AI (CHAI) at UC Berkeley. His published work includes "Challenges in Mechanistically Interpreting Model Representations" (arXiv 2024), "Training Neural Networks for Modularity aids Interpretability" (arXiv 2024), and "Intricacies of Feature Geometry in Large Language Models" (ICLR 2025), as well as collaborative research with Anthropic on auditing language models for hidden objectives. He received a Long-Term Future Fund grant to work on safe and robust reasoning via mechanistic interpretation of model representations.
Dr. Rand Waltzman is an adjunct senior information scientist at the RAND Corporation whose work focuses on artificial intelligence and cognitive security in the information environment. He has decades of experience managing AI research applied to domains such as social media and influence operations, and previously served as acting chief technology officer at Carnegie Mellon University’s Software Engineering Institute and as a program manager at DARPA responsible for the Social Media in Strategic Communications program.
Liz Specht is an Emerging Technology Fellow with Horizon’s Executive Branch track, currently placed at the Defense Advanced Research Projects Agency. She previously served as an Executive Branch fellow at the Emerging Technologies Institute and as Senior Vice President of Science and Technology at the Good Food Institute. She holds a B.S. in chemical and biomolecular engineering from Johns Hopkins University and a Ph.D. in biological sciences from the University of California San Diego.

Brian Tan is a co-founder and Operations Director of WhiteBox Research, an AI safety nonprofit based in Manila, Philippines, that trains early-career researchers in mechanistic interpretability. He co-founded WhiteBox in August 2023 alongside Clark Urzo and Kriz Tahimic, and leads the organization's operations, marketing, and project management. He also serves as an Operations Associate at Arcadia Impact, a UK nonprofit focused on AI safety and governance. Brian co-founded Effective Altruism Philippines in 2018 and worked full-time to grow the chapter in 2021, before supporting EA communities globally as a Group Support Contractor at the Centre for Effective Altruism from 2022 to 2023. He holds a degree in IT Entrepreneurship from Ateneo de Manila University, where he graduated as a Merit Scholar in 2019. WhiteBox Research's flagship AI Interpretability Fellowship runs for five months in person in Manila, with the goal of building AI safety research capacity in Southeast Asia.
Patrick Leblond is a CIGI senior fellow and CN–Paul M. Tellier Chair in Business and Public Policy at the University of Ottawa, specializing in international economic governance, including trade, financial and monetary integration, banking regulation and regional economic agreements.
No summary available yet.
Federico Speziali is co-founder of High Impact Professionals and works as a monitoring and evaluation manager at Ambitious Impact. His work focuses on effective giving, incubation programmes, and helping professionals and donors channel resources to high-impact charities.