Ann-Kathrin Dombrowski
Bio
Updated 03/22/26Ann-Kathrin Dombrowski is a Member of Technical Staff and Research Engineer at FAR.AI, where she focuses on explainable AI, AI transparency, and mitigating the malicious use of AI models. She holds a PhD from Technische Universität Berlin, where her research examined a geometrical perspective on counterfactual explanations and attribution methods for deep neural networks. She participated in the ML Alignment and Theory Scholars (MATS) program as a scholar under Dan Hendrycks, contributing to research on representation engineering and knowledge removal, and subsequently received LTFF funding to extend that work on internal concept extraction. She also explored information processing in large language models as a PIBBSS affiliate. Her published work includes contributions to the WMDP benchmark for measuring hazardous knowledge in AI models, safety evaluation toolkits for open-source models, and research on the manipulability of neural network explanations.
Community Signal
Updated 03/22/26No endorsements yet.
Links
Updated 03/22/26- Personal Website
- -
- Twitter / X
- -
- LessWrong
- -
- EA Forum
- -