Ann-Kathrin Dombrowski

Berlin, Germany

Bio

Updated 03/22/26

Ann-Kathrin Dombrowski is a Member of Technical Staff and Research Engineer at FAR.AI, where she focuses on explainable AI, AI transparency, and mitigating the malicious use of AI models. She holds a PhD from Technische Universität Berlin, where her research examined a geometrical perspective on counterfactual explanations and attribution methods for deep neural networks. She participated in the ML Alignment and Theory Scholars (MATS) program as a scholar under Dan Hendrycks, contributing to research on representation engineering and knowledge removal, and subsequently received LTFF funding to extend that work on internal concept extraction. She also explored information processing in large language models as a PIBBSS affiliate. Her published work includes contributions to the WMDP benchmark for measuring hazardous knowledge in AI models, safety evaluation toolkits for open-source models, and research on the manipulability of neural network explanations.

Community Signal

Updated 03/22/26

0Upvotes

0Downvotes

0Endorsements

No endorsements yet.

Grants

Updated 03/22/26

LTFF 2023 Q3 - Ann-Kathrin Dombrowski

from Long-Term Future Fundfunds.effectivealtruism.org

recipient$27,260

Ann-Kathrin Dombrowski

Bio

Community Signal

Links

Grants