A Substack blog by PhD mathematician Robert Huben documenting his Open Philanthropy-funded year of AI safety research and writing, covering mechanistic interpretability, AI risk, and related topics.
A Substack blog by PhD mathematician Robert Huben documenting his Open Philanthropy-funded year of AI safety research and writing, covering mechanistic interpretability, AI risk, and related topics.
People
Updated 05/18/26Founder and author
Funding Details
Updated 05/18/26- Annual Budget
- -
- Current Runway
- -
- Funding Goal
- -
- Funding Raised to Date
- -
Org Details
Updated 05/18/26From AI to ZI is a Substack newsletter and blog authored by Robert Huben, a PhD mathematician who completed his doctorate at the University of Nebraska-Lincoln in 2021. After spending a year as a scientist at a defense contractor, Huben received a grant from Open Philanthropy running from October 2022 to September 2023 to pursue a self-directed AI safety research and education program. The blog was created to document this journey, with posts covering AI safety concepts, paper summaries, original research ideas, and Huben's personal views on the existential risks posed by unaligned AI. A signature feature is that each post is rated on a scale from "AI" to "ZI" reflecting how directly relevant the content is to AI safety. During his grant year, Huben co-authored two notable research papers. The first, "Sparse Autoencoders Find Highly Interpretable Features in Language Models" (co-authored with Hoagy Cunningham, Aidan Ewart, Logan Riggs Smith, and Lee Sharkey), was presented at a NeurIPS 2023 workshop. His work focused on mechanistic interpretability — using sparse autoencoders to identify monosemantic features in neural network activations. He also worked on attention-only transformers. After the grant period ended in September 2023, Huben indicated he was applying to work at AI safety organizations and research groups. The blog shifted to infrequent posting. As of early 2026, the blog is semi-dormant, with payments disabled on Substack, but it continues to serve as a public record of his AI safety research and writing. Huben is also active on LessWrong under the handle Robert_AIZI.
Theory of Change
Updated 05/18/26Huben's implicit theory of change is that increasing the number of technically skilled researchers working on AI safety — particularly from adjacent fields like mathematics — is important for reducing existential risk from advanced AI. By learning about and contributing to AI safety research (especially mechanistic interpretability), and by writing accessibly about AI safety topics to grow awareness and community knowledge, he aims to contribute to the field both as a researcher and as an educator. The blog also serves as a demonstration that people from non-CS backgrounds can make meaningful contributions to AI safety.
Grants Received– no grants recorded
Updated 05/18/26Projects– no linked projects
Updated 05/18/26Discussion
No comments yet. Be the first to share your thoughts.