Joshua Reiners
-
Bio
Joshua Reiners is an AI safety researcher focused on mechanistic interpretability. He received a grant from the Long-Term Future Fund to spend four months investigating the most interpretable directions in GPT-2-small's early residual stream, a project aimed at improving our understanding of how language models represent and process information in their early layers. His work sits within the broader mechanistic interpretability research agenda, which seeks to reverse-engineer neural network computations into human-understandable algorithms.
Links
- Personal Website
- -
- -
- Twitter / X
- -
- LessWrong
- -
Grants
LTFF 2023 Q1 - Joshua Reiners
from Long-Term Future Fund
recipient$16,300
Discussion
Sign in to join the discussion.
No comments yet. Be the first to share your thoughts.
Details
- Last Updated
- Mar 22, 2026, 10:30 PM UTC
- Created
- Mar 20, 2026, 2:53 AM UTC