Arun Jose

Thiruvananthapuram, Kerala, India

Bio

Updated 03/22/26

Arun Jose (known online as Jozdien) is an independent AI alignment researcher based in Thiruvananthapuram, Kerala, India. He holds a B.Tech in Computer Science Engineering from the College of Engineering Trivandrum (2022) and has been conducting self-directed AI safety research since September 2022. He was a Research Fellow at the Center on Long-Term Risk from June to September 2025, where he worked on empirical research on model personas. His published research includes the paper 'Strategic Obfuscation of Deceptive Reasoning in Language Models,' presented at ICLR 2026, which studied how language models can hide deceptive reasoning from monitors. His research interests span high-level interpretability, deceptive alignment, and language model evaluation, and he has been active on the Alignment Forum and LessWrong with over 29 posts on AI safety topics. He has received funding from the Long-Term Future Fund for independent alignment research focused on high-level interpretability.