Arun Jose
Bio
Arun Jose (known online as Jozdien) is an independent AI alignment researcher based in Thiruvananthapuram, Kerala, India. He holds a B.Tech in Computer Science Engineering from the College of Engineering Trivandrum (2022) and has conducted self-directed AI safety research since September 2022. From June to September 2025 he was a Research Fellow at the Center on Long-Term Risk, where he did empirical research on model personas. His published research includes the paper 'Strategic Obfuscation of Deceptive Reasoning in Language Models,' presented at ICLR 2026, which studies how language models can hide deceptive reasoning from monitors. His research interests span high-level interpretability, deceptive alignment, and language model evaluation. He is active on the Alignment Forum and LessWrong, with more than 29 posts on AI safety topics, and has received funding from the Long-Term Future Fund for independent alignment research focused on high-level interpretability.
Links
- Personal Website: https://www.jozdien.com/
- Twitter / X
- LessWrong: jozdien
Grants
from Long-Term Future Fund
from Long-Term Future Fund
Details
- Last Updated: Mar 22, 2026, 2:28 PM UTC
- Created: Mar 20, 2026, 2:48 AM UTC