Guide Labs is a San Francisco-based AI research company building a new class of interpretable foundation models. Rather than applying post-hoc explanations to black-box models, Guide Labs integrates interpretability, safety, and reliability constraints directly into the model development pipeline — from architecture and datasets to training procedures. Their approach enables users to trace any model output back to its input context, human-understandable concepts, and original training data, making AI systems genuinely auditable and steerable without sacrificing downstream performance.
Funding Details
- Funding Raised to Date: $9,820,000
Theory of Change
Guide Labs holds that the fundamental barrier to safe and aligned AI is the opacity of current models: because we cannot reliably understand how they reason, we cannot debug failures, verify alignment, or correct problematic behaviors. Their approach is to make interpretability a first-class design constraint rather than an afterthought. By building models whose every prediction can be traced to human-understandable concepts and to source training data, they aim to give developers, auditors, and policymakers the tools to detect and correct unsafe behavior both before deployment and in production. If interpretable models can match the performance of black-box ones, as they argue their Steerling-8B model demonstrates, the field will have a viable path to AI systems that are both capable and amenable to genuine human oversight, reducing the risk of misaligned or uncontrollable AI.
Grants Received
- From Open Philanthropy
Projects
No linked projects.
People
No linked people.
Details
- Last Updated: Apr 2, 2026, 10:00 PM UTC
- Created: Mar 20, 2026, 2:34 AM UTC
