Rusheb Shah
Bio
Rusheb Shah is a Research Engineer at Apollo Research, an AI safety organization focused on evaluating and auditing high-risk failure modes in frontier AI systems. He holds a Master's degree in Materials Science from the University of Oxford and completed the Alignment Research Engineer Accelerator (ARENA) program to transition into technical AI safety work. Before joining Apollo Research in December 2023, he briefly worked at OpenAI and previously held software engineering roles at R3, Brainlabs, and Amazon Web Services. His research at Apollo Research focuses on LLM evaluations, including co-authoring work on evaluations-based safety cases for AI scheming and research on scalable black-box jailbreaks via persona modulation. He also contributed to the mechanistic interpretability library TransformerLens by adding BERT support and won first prize at the ARENA Interpretability Hackathon for work on circuit discovery algorithms.
Links
- Personal Website
- -
- Twitter / X
- -
- LessWrong
- -
Grants
from Long-Term Future Fund
Discussion
Sign in to join the discussion.
No comments yet. Be the first to share your thoughts.
Details
- Last Updated
- Mar 23, 2026, 12:43 AM UTC
- Created
- Mar 20, 2026, 2:57 AM UTC