Arjun Panickssery

San Francisco Bay Area, USA

Bio

Updated 03/22/26

Arjun Panickssery is an AI safety researcher and entrepreneur based in the San Francisco Bay Area. He studied at the University of Illinois at Urbana-Champaign and has been active in AI alignment research through multiple programs and institutions. He participated in MATS (ML Alignment & Theory Scholars), including an extension phase in London, where he researched the safety implications of LLM self-recognition. That work produced the widely cited paper "LLM Evaluators Recognize and Favor Their Own Generations" (co-authored with Samuel R. Bowman and Shi Feng), which demonstrated that frontier models such as GPT-4 can recognize their own outputs and exhibit a self-preference bias that could undermine safety techniques like reward modeling and constitutional AI. He subsequently worked on scalable oversight benchmarks as part of MATS Summer 2024 and previously held roles at METR Evals and an AI risks organization. He is also building Zembla, an AI-powered platform for accelerated, individualized learning, and writes frequently about AI tutoring and education research.

Community Signal

Updated 03/22/26

0 Upvotes

0 Downvotes

0 Endorsements

No endorsements yet.

Grants

Updated 03/22/26

LTFF 2024 Q1 - Arjun Panickssery

from Long-Term Future Fund (funds.effectivealtruism.org)

Recipient: $34,100
