Conference publication of interpretability and LM-steering results | grantmaking.ai