One year part time spent on AI safety upskilling and concrete research projects
Database
Loading results...
Loading results...
Showing 4351-4400 of 4525 results
One year part time spent on AI safety upskilling and concrete research projects
A cross-institutional Australian research collective focused on multi-objective reinforcement learning approaches to AI safety and alignment, with researchers at Federation University, Deakin University, and UNSW.
Chief simulacrum officer, chronicler of hipster museums, and Potemkin geek
No summary available yet.
No summary available yet.
No summary available yet.

Tushant Jha (known as TJ) is a researcher working at the intersection of AI safety, macrostrategy, and computational economics. He holds a dual degree (MS) in Computer Science and Engineering from the International Institute of Information Technology, Hyderabad (IIIT-H), where his research was conducted at the Centre for Security, Theory, and Algorithms (C-STAR). His academic work includes publications on game-theoretic solution concepts and learning frameworks, with a paper published in ACM Transactions on Economics and Computation. He has been affiliated with the Future of Humanity Institute at Oxford, the AI Objectives Institute, and was a 2022 Foresight Institute Fellow. In April 2020, he received a $40,000 grant from the Long-Term Future Fund to work on long-term macrostrategy and AI alignment research, and to support career transition toward that goal. His work addresses the ethical and evolutionary dimensions of AI progress and the value alignment problem.

Max Kaufmann is a PhD student at the University of Toronto supervised by Roger Grosse, and a Visiting Student Researcher at Google DeepMind where he works on chain-of-thought monitoring for AI safety. He completed his undergraduate degree in computer science at the University of Cambridge. Before his PhD, he was part of the founding team at the UK AI Safety Institute, where he helped scale the organisation and established their LLM agents team. He has also worked as a research intern at UC Berkeley's Human-Compatible AI group, collaborating with Owain Evans on LLM generalization research, including co-authoring papers on the Reversal Curse and situational awareness in LLMs. His research spans adversarial robustness, LLM generalization, LLM evaluation, training data attribution, and chain-of-thought monitoring. He has received funding from the Long-Term Future Fund to support his early-career alignment research.
No summary available yet.
General support of flexHEG prototyping
No summary available yet.
No summary available yet.
Harvard University is a leading private research university with several prominent programs advancing AI safety, AI governance, and AI interpretability research, including the Kempner Institute, Berkman Klein Center, and Harvard AI Safety Team.
Effective Altruism Israel is a Tel Aviv-based nonprofit that builds and supports the Israeli effective altruism community, helping people maximize their social impact through career guidance, education, and effective giving programs.
PhD student in the Algorithmic Alignment Group at MIT whose work uses interpretability and causal methods to explain model and human behaviours, including how people interact with AI-generated content and audits of black-box social media algorithms.

Johannes Heidecke is the Head of Safety Systems at OpenAI, where he leads work on model safety behavior and alignment. He completed a Master's degree in Artificial Intelligence in Barcelona and participated in the MATS Summer 2022 Cohort under the mentorship of Evan Hubinger. Early in his career he organized the second AI Safety Camp in Prague, a retreat for nearly 30 aspiring AI alignment researchers, and received funding to support this field-building work. At OpenAI he has co-authored influential safety research including "Deliberative Alignment: Reasoning Enables Safer Language Models" (2024) and "Improving Model Safety Behavior with Rule-Based Rewards," both of which have shaped how OpenAI's o-series models handle safety-critical decisions. He has spoken at the 2025 Singapore Conference on AI and has been quoted on OpenAI's preparedness framework and the risks posed by advanced reasoning models.
Touro is a large private Jewish university system headquartered in New York City, operating over 38 schools across the US and internationally. It received an Open Philanthropy grant to support Professor Gabriel Weil's legal research on using tort liability to mitigate catastrophic AI risks.
Justin Rockefeller is a philanthropist and impact investing leader who co-founded and chairs The ImPact, a global community of families committed to aligning their assets with their values. He serves on the board and the investment and governance committees of the Rockefeller Brothers Fund and is a board member of Longview Philanthropy; previously he worked at Addepar and in venture capital.
A technical advisory body within Australia's Department of Industry, Science and Resources that monitors, tests, and shares information on emerging AI risks and harms. It was announced in November 2025 and became operational in early 2026.
No summary available yet.
No summary available yet.
No summary available yet.
Shelby Summerfield is an operations and leadership professional with a diverse background in political campaigns, social impact, and startup operations; she serves as Head of Operations at Halcyon Futures (2023–present), is Chief of Staff and Senior Operations Manager at Verbal+Visual, and previously was Chief Operating Officer at Humanity Forward and held senior scheduling and campaign roles for Andrew Yang’s presidential and New York City mayoral campaigns.
Dr. Roxy Mathew Koll is a climate scientist at the Indian Institute of Tropical Meteorology whose research focuses on Indian Ocean warming, monsoon dynamics, and extreme weather in the Indo-Pacific, and who has contributed to IPCC reports and received major honors including India’s 2024 Rashtriya Vigyan Puraskar and the AGU Devendra Lal Medal.
No summary available yet.
No summary available yet.
Talent Partner at the UK AI Security Institute, bringing recruiting experience from previous roles at Chapter 2, Sainsbury’s, John Lewis Partnership and The Crown Estate.
No summary available yet.
Fred Heiding is a postdoctoral research fellow at the Harvard Kennedy School’s Belfer Center for Science and International Affairs. His work focuses on computer security at the intersection of technical capabilities, business implications, and policy remediations. Fred is a member of the World Economic Forum's Cybercrime Center and is on the organizing committee of the Technology and National Security Conference.
Accepted ICML 2026 workshop paper on cross-constitution drift in LLMs; seeking $2,050 travel support to present in Seoul and gather research feedback.
Seeking funding to develop and evaluate a new benchmark for systematically assesing safety of LLM-based agents
ARIA is a UK government research funding agency that backs high-risk, high-reward R&D in underexplored areas, including a major £59 million programme on formal mathematical safety guarantees for AI systems.

Harrison Gietz is an AI Program Associate at Longview Philanthropy, where he conducts grant investigations focused on artificial intelligence. He previously served as Co-Director of the Cambridge AI Safety Hub (CAISH) and Program Director of the ERA Fellowship, collectively supporting over 40 research projects in technical AI safety and AI governance. His background includes published research in machine learning robustness and experimental physics, and he has participated in research programs at MATS and the Center for AI Safety. He holds a Bachelor of Science in Mathematics (summa cum laude) from Louisiana State University and also studied computer science at Georgia Tech. Earlier in his career, he received a Long-Term Future Fund grant for full-time AI safety technical and governance research. He is based in Cambridge, UK, and writes on AI safety topics at his personal Substack.
Joshua Reiners is an AI safety researcher focused on mechanistic interpretability. He received a grant from the Long-Term Future Fund to spend four months investigating the most interpretable directions in GPT-2-small's early residual stream, a project aimed at improving our understanding of how language models represent and process information in their early layers. His work sits within the broader mechanistic interpretability research agenda, which seeks to reverse-engineer neural network computations into human-understandable algorithms.
Co-founder and director at Zeroth Research who develops technical AI-based solutions for customers, with a PhD in mechanical engineering focused on computational mechanics and numerical simulation algorithms.
No summary available yet.
Cindy Wu is an AI safety researcher who participated in the MATS (ML Alignment & Theory Scholars) Winter 2023-24 program (MATS 5.0) and subsequently received funding for an extension period of technical alignment research. She has co-authored two papers in AI safety: "Using Degeneracy in the Loss Landscape for Mechanistic Interpretability" (arXiv:2405.10927, May 2024), which introduces the Interaction Basis technique to address parameter degeneracy that can obscure neural network internal structure, and "Latent Adversarial Training Improves Robustness to Persistent Harmful Behaviors in LLMs" (arXiv:2407.15549, July 2024), where she led experiments on machine unlearning of harmful knowledge. Her research sits at the intersection of mechanistic interpretability and AI safety, with a focus on understanding and improving robustness of large language models. She was affiliated with the MATS program during her research work on both papers.
Develop an LLM-based coordinator and test against consumer spending with 200 people.
6 months of funding for MATS 5.0 extension, with projects on latent adversarial training and persona explainability
Dr Eleanor Drage is a Senior Research Fellow and Assistant Research Professor at the Leverhulme Centre for the Future of Intelligence, where she co‑directs the Narratives and Justice (AI Narratives and Justice) programme. Her research uses feminist and anti‑racist approaches to analyse how AI systems shape power, labour and inequality, and she is co‑investigator on the Desirable Digitalisation project and principal investigator for the HEAT High‑risk EU AI Act Toolkit. She co‑hosts The Good Robot podcast and is author or co‑editor of works including An Experience of the Impossible, Feminist AI and The Good Robot: Why Technology Needs Feminism.
Leonardo Christov-Moore, Ph.D., is a neuroscientist, artist, and complexity theorist who works as a Senior Research Scientist at the Institute for Advanced Consciousness Studies. His research examines trust, empathy, vulnerable AI, mind–body interactions, and the role of non-ordinary and aesthetic experiences in deep belief change, drawing on multiscale neuroscience and neuromodulation methods.
No summary available yet.
No summary available yet.
An interdisciplinary research centre at the University of Cambridge dedicated to studying and mitigating existential and global catastrophic risks, with major focus areas in AI safety, biological risks, and environmental risks.
Intern@Center for Human-Compatible AI, UC Berkeley. Emulating moral progress in AI, to prevent premature value lock-in.
A curated directory of 60+ funding sources for AI safety work, maintained by AISafety.com as part of its broader resource hub for the AI safety ecosystem.
UK-based GP, clinical academic, MPhil candidate at University of Cambridge. Founder of GiveHealth, Volunteer Global Health, and Bridge Health Research.
ETH Zürich (Swiss Federal Institute of Technology) is one of the world's leading technical universities, hosting several prominent AI safety and security research groups including the SPY Lab and SRI Lab.
No summary available yet.