Iván Arcuschin Moreno

London, UK

Bio

Updated 03/22/26

Iván Arcuschin Moreno is an AI safety researcher based in London, UK, currently serving as Lead Research Scientist at Poseidon Research, a US-based AI safety non-profit where he heads the London research team. He holds a Computer Science PhD from the University of Buenos Aires, Argentina (2018–2024), where his thesis focused on automated test generation for Android apps. He completed two terms at the ML Alignment & Theory Scholars (MATS) program: the first (Jan–Jul 2024) under Adrià Garriga-Alonso at FAR AI, producing InterpBench, a collection of 86 semi-synthetic transformers with known circuits for evaluating mechanistic interpretability techniques (NeurIPS 2024); and the second (Jan 2025–Feb 2026) under Arthur Conmy at Google DeepMind, focusing on chain-of-thought faithfulness. He is lead author on "Chain-of-Thought Reasoning In The Wild Is Not Always Faithful" (ICLR 2025 workshop, 140+ citations within a year) and a contributor to the MIB mechanistic interpretability benchmark (ICML 2025). He also co-authored a position paper with Yoshua Bengio and founded AI Safety Argentina (AISAR), a research scholarship program to grow the AI safety research community in Latin America, supported by a $77,000 grant from Coefficient Giving.

Community Signal

Updated 03/22/26

0Upvotes

0Downvotes

0Endorsements

No endorsements yet.

Grants

Updated 03/22/26

LTFF 2024 Q1 - Iván Arcuschin Moreno

from Long-Term Future Fundfunds.effectivealtruism.org

recipient$67,000

Iván Arcuschin Moreno

Bio

Community Signal

Links

Grants