Joe Kwon
Bio
Joe Kwon is an AI safety researcher and policy analyst based in Washington, DC. He holds a BS in Computer Science and Psychology from Yale University and has conducted research at MIT's Computational Cognitive Science Lab, where he studied moral and social cognition with Josh Tenenbaum and Sydney Levine.

His technical background includes early RLHF work at OpenAI; empirical ML research at UC Berkeley with Jacob Steinhardt and Dan Hendrycks, focused on evals and out-of-distribution detection; and a stint as a Research Engineer at LG AI Research, working on multilingual large language models.

He subsequently transitioned to AI governance work, completing a GovAI DC Fellowship focused on risks from internal AI deployment and automated R&D, and serving as a Technical Policy Analyst at the Center for AI Policy (CAIP). Most recently, he has been an Astra Fellow working with Tom Davidson and Fabien Roger on threat modeling and ML experiments related to secretly loyal AI. He received a Long-Term Future Fund grant as a stipend to work on an ML safety project, with the goal of joining an ML safety team full-time.
Links
- Personal Website: https://www.joe-kwon.com/
- Twitter / X
- LessWrong: joe-kwon
Grants
- Grant from the Long-Term Future Fund
Details
- Last Updated: Mar 22, 2026, 10:17 PM UTC
- Created: Mar 20, 2026, 2:52 AM UTC