David Lorell
Bio
Updated 03/22/26
David Lorell is an independent AI alignment researcher who works closely with John Wentworth on the natural abstraction research agenda. He has co-authored multiple posts and a peer-reviewed paper with Wentworth, including "Natural Latents: Latent Variables Stable Across Ontologies" (arXiv:2509.03780, 2025), which develops a mathematical framework for latent variables that remain stable across different agent ontologies. In the collaboration he serves as an active intellectual sounding board: he asks for clarifications, requests examples, and probes how theoretical ideas connect to broader alignment goals. John Wentworth has credited this contribution with multiplying his research productivity severalfold. Lorell has been a member of LessWrong and the AI Alignment Forum since 2022, where he has contributed posts and comments on topics including natural latents, instrumental goals, coherence theorems, and corrigibility. EA and alignment researchers such as Joe Carlsmith have also acknowledged him for discussion. He has received general support funding for his independent alignment research.
Community Signal
Updated 03/22/26
No endorsements yet.
Links
Updated 03/22/26
- Personal Website: -
- Twitter / X
- LessWrong: david-lorell
- EA Forum: -