Alex Lyzhov
No summary available yet.
Dr. Philip Fox is European AI Policy Lead at the KIRA Center in Berlin, where he works on AI governance and co-authored the International AI Safety Report. He holds a PhD in philosophy from Humboldt University Berlin and studied philosophy and economics in Bayreuth, Oxford and Berlin.
Compute for an experiment on how steganography in large language models might arise as a result of benign optimization
Fundraising/tech project manager.
AI researcher and entrepreneur who co-founded personalized AI startup Workshop Labs and served as its CTO, helping build a proprietary training stack for trillion-parameter models that was acquired by Thinking Machines Lab in April 2026.
MentaLeap is an Israel-based AI safety research group focused on mechanistic interpretability, applying neuroscience and cybersecurity expertise to reverse-engineer neural networks and reduce risks from advanced AI systems.
PhD in computational biology
10 autonomous agents, 10 different LLMs, $10 each. They pay real money to stay alive. When broke, they die permanently. Every decision is recorded and published.
Ross Nordby is technical staff at Anthropic working on AI safety. Before joining Anthropic, he was an independent AI alignment researcher funded by the Long-Term Future Fund, during which he worked on corrigibility frameworks, interpretability, and reinforcement learning environments. His background is in real-time graphics and physics simulation for video games; he created bepuphysics2, a widely-used open-source C# 3D physics engine, and runs Bepu Entertainment LLC. His published alignment work includes the paper "Soft Prompts for Evaluation: Measuring Conditional Distance of Capabilities" (arXiv, May 2025), exploring optimized input embeddings as a metric for latent capability discovery and automated red-teaming of language models, as well as LessWrong posts on using predictors in corrigible systems and AGI timelines. He received an honorable mention in the AI Alignment Awards Research Contest in the corrigibility category. He is based in Chicago, Illinois, and posts on LessWrong under the handle "porby".
Hannah Nobles is a cyber and AI policy specialist at OpenPolicy, where she designs and hosts policy-focused briefings and events connecting technology companies, security practitioners and policymakers. Her work includes organizing executive briefings and receptions around major security conferences such as Black Hat, helping stakeholders understand the governance, regulatory and business implications of emerging AI and cybersecurity trends.
AI safety practitioner and technical leader based in Dubai, serving as West & Central Asia Lead for Strategic Futures & Global Affairs at AI Safety Asia and co-founder of AI Safety UAE, with prior developer relations roles at IBM, Lightning AI, and SurrealDB and work on reducing medical AI hallucinations.
Continuation of a previous grant to allow me to pursue a PhD in risk and decision analysis related to AI x-risks
Jason Zhou is an Asia Program Officer at Astralis Foundation, where he leads the organization’s grantmaking to support AI safety and governance efforts in the region. He previously worked as a Senior Research Manager at Concordia AI, leading publications on China’s AI safety ecosystem.
A nonprofit that helps university students choose high-impact thesis topics and launch research careers focused on the world's most pressing problems, including AI safety, biosecurity, animal welfare, and global health.
https://raymond.xyz
Co-founder and CEO of Goodfire, an AI interpretability research lab, and previously founder, president, and CTO of RippleMatch, a Series B AI recruiting startup.
Co-CEO with prior experience co-founding a startup and working as an investment banking analyst, holding a Master’s in Quantitative Finance from the University of Amsterdam and dual Bachelor’s degrees in Psychology and Business Administration.
12-month salary to continue working on tools for accelerating alignment and the Supervising AIs Improving AIs agenda
Brian C. Porter is a philosopher and independent AI alignment researcher based in Pittsburgh, PA. He completed his PhD in Philosophy at the CUNY Graduate Center in February 2023, where he wrote his dissertation "Three Essays on Substructural Approaches to Semantic Paradoxes" under the supervision of Graham Priest. He subsequently held a Postdoctoral Research Associate position in the Department of History and Philosophy of Science at the University of Pittsburgh, working on the Geography of Philosophy Project. His academic research focused on logic, semantic paradoxes, and experimental semantics, particularly the methodology of testing theories of reference for kind terms. He received a Long-Term Future Fund grant to support one year of independent research and upskilling to transition from academic philosophy to AI alignment research, motivated by his view that recent developments in AI make it critical to ensure AI systems are safe, ethical, and aligned with human goals. He also co-authored a 2024 paper in Scientific Reports examining whether AI-generated poetry is distinguishable from human-authored poetry.
Associate Professor in Computer Science (Computer Security) at the University of Birmingham and Co-Founder & CTO of Zeroth Research, working on AI safety, cryptography, and zero-knowledge proof techniques for secure and privacy-preserving verification of AI systems.
Montreal-based AI safety researcher and student in international relations and international law at UQAM, involved in AI safety coordination and strategy and working with Horizon Omega, AIGS Canada, and PAUSE AI Canada.
Flynn Devine is a political scientist and technologist interested in AI governance, collective coordination, and systems design, serving as Lead Researcher for Digital Policy at Demos and as a Research Fellow at the AI & Democracy Foundation working on public-interest AI governance.
Guy Katz is a professor in the School of Computer Science and Engineering at the Hebrew University of Jerusalem. His research focuses on applying formal methods to create reliable and correct software systems, with particular emphasis on verifying systems that include machine-learned components such as neural networks and large language models. Katz is widely known for work on verification and analysis of deep neural networks and other safety-critical AI systems, and he leads Hebrew University’s contribution to RobustifAI, a Horizon Europe consortium on robust and trustworthy generative AI.
Multi-model approach to corporate and state actors relevant to existential risk mitigation
GovAI is an independent nonprofit research organization dedicated to helping decision-makers navigate the transition to a world with advanced AI, by producing rigorous research on AI governance and fostering talent in the field.
UC San Diego is a major public research university conducting AI safety-relevant research including LLM persuasion evaluation, trustworthy machine learning, and safe autonomous systems.
Co-founder and Chief Executive Officer of Gray Swan AI, leading the company’s work as a safety and security provider for the AI era and drawing on years of research into vulnerabilities in large language models.
Francine Bennett is a founding member of the Ada Lovelace Institute’s Board and has served as a Board member since 2019, including a period as Interim Director from May 2023 to June 2024. Before joining Ada, she was VP of Data at biotech company Healx, co‑founded the data science consultancy Mastodon C, and was a founding trustee of DataKind UK. She also serves on the Gambling Commission’s Digital Advisory Board and the British Library’s Advisory Council.
Organizer, PauseAI