Ahmed Mhiri
No summary available yet.
Loading results...
Showing 1201-1250 of 3787 results
Clear filtersNo summary available yet.
Showing 1201-1250 of 3787 results
Active filters: Type: Org, Individual
Clear filters to view everything →No summary available yet.
No summary available yet.
Friendly Ambitious Nerd - https://mattbrooks.xyz/
No summary available yet.

Alana Xiang (Chinese name: Zhengbo Xiang) is an AI safety researcher and entrepreneur who previously attended Stanford University before leaving to work in AI. She worked part-time at ARC Evals (now METR) in 2022-2023, contributing to evaluations of frontier AI models. In summer 2023, she was a Research Fellow at the Center on Long-Term Risk, focusing on reducing risks from advanced AI systems. She received funding from the Long-Term Future Fund for six months of independent AI alignment research and upskilling. She has described herself as a "smol alignment researcher" and explored research directions in AI alignment, evaluations, and adjacent AI safety topics. As of recent years, she has been associated with Calaveras AI, an AI data company.

Sarah Hastings-Woodhouse is a UK-based AI safety communicator and writer currently serving as Digital Communications Officer at the UK AI Security Institute (AISI). She holds a BA in English Literature from the University of Exeter (2017-2020) and previously worked as a Content Writer at FindAUniversity creating resources for postgraduate students before transitioning into AI safety work. She has contributed writing and research to the Future of Life Institute, Clearer Thinking, AI Frontiers, and participated in the Pivotal Research Fellowship. She hosts the "Consistently Candid" podcast, which focuses on making AI safety and existential risk accessible to non-technical audiences through in-depth interviews with researchers and advocates in the field. She received Long-Term Future Fund grants supporting her career transition into AI safety communications and for producing her podcast. She also maintains a Substack (longerramblings.substack.com) where she writes about AI safety topics including analyses of AI lab safety plans.
Tim Shavers leads AI safety and governance grantmaking at Astralis Foundation and advises the impact-first VC fund Halcyon Ventures. A graduate of Harvard College and Yale Law School, he spent nearly 20 years at McKinsey and more than a decade in venture capital and now also serves as a Senior Advisor with cFactual.
A16Z
No summary available yet.
No summary available yet.
Britanee TeBrake is a talent leader and consultant who specializes in fractional leadership for early- and mid-stage startups, advising companies on talent strategy, recruitment, and building high-performing teams drawing on extensive experience in talent acquisition and executive search.
No summary available yet.
Celia Ford is an AI reporter at Transformer, covering both technical and policy developments. She was previously a Future Perfect fellow at Vox and a AAAS Mass Media Fellow at Wired, and she holds a PhD in neuroscience from the University of California, Berkeley.

Cyborgism is an AI safety research agenda and community proposing that human-AI collaboration systems — where humans are cognitively augmented by LLMs rather than replaced by autonomous AI agents — can accelerate alignment research while preserving human control.
SAIHM — Sovereign AI Horizontal Memory. A sovereign, encrypted, sharable, persistent memory protocol for AI agents.
No summary available yet.
No summary available yet.
alientologist
No summary available yet.
No summary available yet.
UK government research organization that tests frontier AI systems, advances AI safety science, and informs policymakers about the risks and capabilities of advanced AI.
No summary available yet.
CAISI is the U.S. government's primary point of contact for AI testing and research within NIST, focused on developing voluntary AI standards and conducting evaluations of frontier AI systems. It was renamed from the U.S. AI Safety Institute in June 2025.
Caleb Rak is an operations professional in the AI safety and effective altruism community. He grew up in Norwich, Connecticut and attended Norwich Free Academy, where he was a 2017 National Merit Scholarship Finalist. He went on to Harvard University, graduating with a degree in Comparative Literature and Linguistics; his senior thesis was titled "Translating Zhou Zuoren: The Vernacular Essay and the Individual." After graduating, he became involved in EA community-building through roles at SPARC (a rationality and EA summer program for high school students) and Canopy Retreats (an organization supporting logistics for EA community events). He now works in Operations at Iliad (iliad.ac), an applied mathematics research organization focused on AI alignment that runs the Agent Foundations conference series. In that capacity, he received a $20,700 Long-Term Future Fund grant to organize the Agent Foundations 2025 workshop at Carnegie Mellon University, a five-day gathering of approximately 30 researchers working on mathematical foundations of AI alignment.
Board Member at the Transformative Futures Institute and Harbert Eminent Scholar in Business Analytics at Auburn University’s Raymond J. Harbert College of Business, whose research has focused on information systems support for managerial problem formulation.
No summary available yet.
Dr Jon Truby is a Visiting Research Associate Professor at the Centre for International Law at the National University of Singapore, where his research focuses on the intersection of artificial intelligence, sustainability and digital technology. He has been appointed as a participating expert to the OECD.AI Expert Group on Compute and Climate, a group supported by the GPAI Secretariat that examines the environmental implications of AI’s computing demands.
Head of Advising at 80,000 Hours, having joined after working at Oxford’s Global Priorities Institute, where she now focuses on providing one-on-one career advice to help people pursue high-impact paths.
No summary available yet.
H. Andrew Schwartz (1968–2025) was chief communications officer at the Center for Strategic and International Studies, where for roughly 20 years he directed CSIS’s media relations, digital strategy, events, publications, website, and other external engagement. A former Fox News producer and print journalist, he cohosted several CSIS podcasts—including The Truth of the Matter, The Trade Guys, The Impossible State, and The AI Policy Podcast—and coauthored Overload: Finding the Truth in Today’s Deluge of News with CSIS trustee Bob Schieffer.
Aysja Johnson is an AI safety researcher and policy analyst focused on AI lab scaling policies and responsible scaling frameworks. She holds a background in cognitive science, having completed undergraduate studies in Mathematics at UC Berkeley and graduate work in NYU's Computation and Cognition Lab under Todd Gureckis, where she studied human sense-making, open-ended reasoning, and human-machine intelligence. She was hired as a Research Analyst at AI Impacts in 2022, selected from over 250 applicants, contributing research on comparative cognition and technology adoption patterns relevant to AI risk. In 2023 she was a PIBBSS Summer Fellow, working on a project titled 'Towards a Science of Abstraction' exploring why natural abstractions are favored by agents and what this implies for AI alignment. She received a Long-Term Future Fund stipend for 1.5 years to conduct a thorough investigation and analysis of AI lab scaling policies, and has published critical analyses on LessWrong arguing that current responsible scaling policies lack rigor, fail to specify measurable evidence thresholds, and that behavioral evaluations alone are insufficient for safety assurance. She is active on LessWrong under the handle 'aysja' and has co-authored posts on AI lab governance topics including OpenAI's non-disparagement practices.
No summary available yet.
No summary available yet.
No summary available yet.
Pauline Charazac is Head of Policy Engagement at CeSIA, where she leads international efforts to promote responsible and inclusive AI governance. Conference bios describe her as a senior public policy adviser with experience at institutions such as the OECD and the Bank of Mauritius, working at the intersection of AI ethics, global governance, and financial inclusion.
Yuxiao Li is an AI safety and mechanistic interpretability researcher currently based in Bilbao, Spain, where she is a postdoctoral researcher at the Basque Center for Applied Mathematics (BCAM). She holds a PhD in Computer and Information Sciences from Tsinghua University (2018-2023). Her research focuses on understanding the internal representations of large language models through techniques such as sparse autoencoders, variational inference, and geometric analysis of feature spaces. She was previously affiliated with MIT's Tegmark group and the Beneficial AI Foundation, where she was first author on "The Geometry of Concepts: Sparse Autoencoder Feature Structure" (arXiv 2024), a study of how concepts are geometrically organized in LLM activations. She has also participated in the ML Alignment & Theory Scholars (MATS) program and the Supervised Program for Alignment Research (SPAR), contributing multi-part research on structured priors and block-diagonal geometry in language model activations. She currently serves as a mentor in the Algoverse AI Safety Fellowship and has received independent research funding for inference-based AI interpretability work.
Catherine Régis is a full professor of law at Université de Montréal whose work spans health law, artificial intelligence, and digital innovation. She holds a Canada CIFAR AI Chair, is an associate academic member at Mila, serves as Director of Social Innovation and International Policy at IVADO, and is Co-Director of the Canadian AI Safety Institute Research Program at CIFAR.
No summary available yet.
No summary available yet.
Sviatoslav (Slava) Chalnev is an AI researcher based in Sydney, Australia, with a background in mechanistic interpretability and AI safety. He studied at The Australian National University and subsequently pursued independent interpretability research funded by two Long-Term Future Fund stipends totaling $75,000, focused on mechanistic interpretability methods and open-source tooling. He participated in the MATS 6.0 program under Arthur Conmy, resulting in the paper "Improving Steering Vectors by Targeting Sparse Autoencoder Features" (arXiv:2411.02193, 2024), which introduced SAE-Targeted Steering (SAE-TS), a method for constructing steering vectors that target specific sparse autoencoder features while minimizing unintended side effects. He also co-authored "A Single Direction of Truth" (arXiv:2507.23221, 2025), demonstrating that a linear probe on an observer model's residual stream can detect and causally steer contextual hallucinations in language models. More recently, Chalnev co-founded Integuide, an AI startup building tools to capture and disseminate expert technician knowledge, which was part of the Startmate Winter 2025 accelerator cohort.
Tomislav Kurtovic (Tomislav Kurtović) is a researcher and Computer Vision PhD candidate at the Faculty of Electrical Engineering and Computing (FER), University of Zagreb, Croatia. He holds a university master's degree in computer engineering (univ. mag. ing. comp.) and works in the Department of Electronic Systems and Information Processing. At FER, he teaches laboratory exercises for Information Processing and Statistical Data Analysis at the undergraduate level, and Deep Learning 2 at the graduate level. In Q4 2022, he received a grant from the Long-Term Future Fund (LTFF) to skill up in machine learning and AI alignment, with the goal of developing a streamlined course in mathematics and AI for an alignment-focused audience.
Jenna Peters is Chief of Staff for the Career Services Team at 80,000 Hours. Before joining 80,000 Hours she worked as a project manager at the Centre for Effective Altruism and as a Post‑Baccalaureate Fellow at the Center for Global Women’s Health Technologies at Duke University. Jenna graduated summa cum laude from Duke University with a BS in neuroscience.
Oliver Patel is the Enterprise AI Governance Lead at AstraZeneca, where he leads the global framework of policies, standards and processes to ensure the company can realise the benefits of AI while managing associated risks. He writes the "Enterprise AI Governance" Substack and is a frequent speaker on practical frameworks for scaling AI governance in large organisations.
No summary available yet.
No summary available yet.
Org director studying how social change happens | Climate, animal welfare, AI safety movements
Itay Yona is an AI security researcher and mechanistic interpretability specialist who founded MentaLeap and serves as its founder and principal investigator while also working as an AI security researcher at Google DeepMind.
Chair of the UK’s AI Security Institute, overseeing its work to evaluate and mitigate serious risks from advanced AI systems.
Hoagy Cunningham is an AI safety researcher currently working at Anthropic, where he has contributed to both interpretability and safeguards research. He holds a 2:1 in Politics, Philosophy and Economics from The Queen's College, Oxford, and earlier in his career worked as a researcher at Full Fact, the UK fact-checking charity, and as an economist. He became a SERI MATS scholar under Lee Sharkey and is the lead author of "Sparse Autoencoders Find Highly Interpretable Features in Language Models" (ICLR 2024), a foundational paper demonstrating that sparse autoencoders can recover monosemantic, interpretable features from language model activations. This work was independently developed in parallel with similar research published by Anthropic and generated significant excitement in the mechanistic interpretability community. He received Long-Term Future Fund grants supporting his sparse coding research and work on preventing steganography in interpretable representations. At Anthropic, he has contributed to research on scaling monosemanticity, constitutional classifiers for jailbreak defense, and auditing language models for hidden objectives.