Sara Price
Bio
Sara Price is a Member of Technical Staff at Anthropic working on AI safety, based in the Bay Area. She has worked in machine learning since 2016 and transitioned into AI alignment research in early 2024 through the MATS 5.0 (Spring 2024) program, after which she received a 6-month stipend to continue independent research on situational awareness and deception in AI systems. Her research focuses on adversarial robustness of multimodal LLMs, scheming and deception, control, and model organisms of misalignment. She was a co-author of Petri, an open-source AI safety auditing tool developed at Anthropic that automates the evaluation of concerning model behaviors such as deception, sycophancy, and self-preservation. She now serves as a mentor for the MATS Summer 2026 program in the Anthropic and OpenAI Megastream.
Links
- Personal Website: https://sbp354.github.io/
- Twitter / X
- LessWrong
Grants
Grant from the Long-Term Future Fund
Details
- Last Updated
- Mar 23, 2026, 12:54 AM UTC
- Created
- Mar 20, 2026, 2:58 AM UTC