Mrinank Sharma
Bio
Mrinank Sharma is an AI safety researcher who led the Safeguards Research Team at Anthropic from August 2023 until his resignation in February 2026. He holds a DPhil in Statistical Machine Learning from the University of Oxford, where he was supervised by Tom Rainforth, Eric Nalisnick, and Yee Whye Teh in the Autonomous Intelligent Machines and Systems programme, and an MEng in Information and Computer Engineering from the University of Cambridge, where he graduated top of his cohort. At Anthropic his research focused on frontier model safeguards, including post-deployment monitoring, automated red-teaming, sycophancy in language models, jailbreaking defenses, and protections against AI-assisted bioterrorism. Earlier in his career he developed Bayesian models to evaluate the effectiveness of nonpharmaceutical interventions on COVID-19 transmission; that work was cited in US federal legislation, presented to the Africa CDC modelling group, and shared with the UK's Scientific Advisory Group for Emergencies. He has also contributed to research on Bayesian neural networks and AI interpretability. After leaving Anthropic he announced plans to return to the UK and pursue writing and poetry.
Links
- Personal Website: https://www.mrinanksharma.net/
- Twitter / X
- LessWrong
Grants
- Grant from the Long-Term Future Fund
Details
- Last Updated: Mar 22, 2026, 11:47 PM UTC
- Created: Mar 20, 2026, 2:55 AM UTC