Kai Sandbrink
Bio
Kai Sandbrink is a DPhil candidate in computational cognitive neuroscience at the University of Oxford's Department of Experimental Psychology, based at Lady Margaret Hall. He is co-supervised by Professor Christopher Summerfield at Oxford and Professor Wulfram Gerstner at EPFL, where he is also an invited guest researcher. Prior to Oxford, he completed an MS in Neural Systems and Computation at ETH Zurich and an MA in China Studies at Peking University. His research uses deep reinforcement learning as a task-driven model of human behavior, with a focus on learning dynamics, cognitive flexibility, and exploration-exploitation trade-offs. His AI-safety-relevant work includes improving how deep learning models handle uncertainty and designing safer, more interpretable reward functions for reinforcement learning algorithms. He is an affiliate at Concordia AI and has an interest in East-West cooperation on AI safety and governance. In 2021 he received a Long-Term Future Fund grant covering start-up funds and moving costs for his DPhil project.
Links
- Personal Website: https://kjsandbrink.github.io/
- Twitter / X
- LessWrong
Grants
from Long-Term Future Fund
Details
- Last Updated: Mar 22, 2026, 10:39 PM UTC
- Created: Mar 20, 2026, 2:53 AM UTC