Akbir Khan

Bio

Updated 03/22/26

Akbir Khan is a Member of Technical Staff at Anthropic, where he works on the Horizons team focused on building safe superintelligence. He completed his PhD at the UCL DARK Lab under the supervision of Tim Rocktäschel and Edward Grefenstette, with prior academic training in Mathematics and Physics at UCL and Computer Science at Cambridge. His research centers on Scalable Oversight techniques — particularly the use of multi-agent debate to elicit truthfulness from AI systems — as well as AI control protocols and alignment auditing. His work on LLM debate, exploring whether weaker models can assess the correctness of stronger models, received a Best Paper Award at ICML 2024 for the paper "Debating with More Persuasive LLMs Leads to More Truthful Answers." Before his PhD, he co-founded Spherical Defence Labs, an AI-powered API security startup, and also worked as a Research Analyst at Cooperative AI and a Senior Researcher at Tractable.

Community Signal

Updated 03/22/26

0Upvotes

0Downvotes

0Endorsements

No endorsements yet.

Grants

Updated 03/22/26

LTFF 2023 Q2 - Akbir Khan

from Long-Term Future Fundfunds.effectivealtruism.org

recipient$55,000

Akbir Khan

Bio

Community Signal

Links

Grants