A personal Substack newsletter by AI safety researcher Daniel Paleka covering recent AI safety research papers and technical developments.
A personal Substack newsletter by AI safety researcher Daniel Paleka covering recent AI safety research papers and technical developments.
People
Updated 05/18/26Author
Funding Details
Updated 05/18/26- Annual Budget
- -
- Current Runway
- -
- Funding Goal
- -
- Funding Raised to Date
- -
Org Details
Updated 05/18/26AI Safety Takes is a personal newsletter by Daniel Paleka, published on Substack at newsletter.danielpaleka.com. The newsletter focuses on AI safety research and making the future go well, covering topics such as constitutional AI, mechanistic interpretability, scalable oversight, reinforcement learning from human feedback, adversarial attacks, and model alignment challenges. Daniel Paleka is a PhD student at ETH Zurich, advised by Florian Tramèr, with research focused on security and failure modes of artificial intelligence. He began his PhD in September 2022 and launched the newsletter in November 2022. Paleka also holds notable competitive mathematics credentials, including silver medals at the International Mathematical Olympiad in 2014, 2015, and 2016. The newsletter has grown to over 1,000 subscribers, with a significant fraction working in AI. Paleka describes the newsletter as evolving from structured monthly paper summaries to more freeform posts published whenever he has something interesting to say. He also runs a separate Substack called Random Features for non-AI-safety content. The newsletter has no paid subscription model and is free to all readers.
Theory of Change
Updated 05/18/26By synthesizing and commenting on the latest AI safety research, the newsletter aims to help practitioners and researchers stay informed about important developments in alignment, interpretability, and AI robustness. Paleka's implicit theory of change is that better-informed researchers and decision-makers in AI will contribute to safer AI development outcomes.
Grants Received– no grants recorded
Updated 05/18/26Projects– no linked projects
Updated 05/18/26Discussion
No comments yet. Be the first to share your thoughts.