WhiteBox Research is a nonprofit organization founded in August 2023 and based in Quezon City, Philippines. Its mission is to develop the next generation of AI safety and mechanistic interpretability researchers in Southeast Asia. The organization runs a free AI Interpretability Fellowship — a five-month part-time program combining curriculum from the ARENA upskilling program with hands-on hackathons — to identify and train talented individuals who can contribute meaningfully to open problems in AI interpretability. WhiteBox has been funded by the Long-Term Future Fund and Manifund, and its advisers include Callum McDougall (Google DeepMind, ARENA founder) and Lee Sharkey (Apollo Research co-founder).
Funding Details
- Annual Budget
- -
- Monthly Burn Rate
- -
- Current Runway
- -
- Funding Goal
- -
- Funding Raised to Date
- -
- Fiscal Sponsor
- -
Theory of Change
WhiteBox Research believes that one constraint on AI safety progress is the global supply of skilled researchers, and that Southeast Asia is an underutilized talent pool. By identifying motivated early-career individuals in the region and providing structured, intensive training in mechanistic interpretability — a technical discipline seen as particularly promising for understanding and controlling AI systems — WhiteBox aims to produce researchers who can contribute directly to solving open problems in AI safety. Mechanistic interpretability research helps make AI models more transparent and understandable, which is a prerequisite for reliably verifying that advanced AI systems are behaving safely. More researchers in this field, especially in geographically diverse locations, increases the probability of important breakthroughs.
Grants Received
No grants recorded.
Projects
No linked projects.
People
No linked people.
Discussion
Sign in to join the discussion.
No comments yet. Be the first to share your thoughts.
Details
- Last Updated
- Apr 2, 2026, 10:00 PM UTC
- Created
- Mar 19, 2026, 10:32 PM UTC
