Canary: Evaluating Frontier AI

active

About

Updated 05/18/26

Canary is a multi‑year collaboration between METR and RAND, selected as part of The Audacious Project’s 2024 cohort, that aims to create a “canary in the coal mine” for dangerous AI capabilities. The project funds METR and RAND to design, validate, and operationalize evaluations that can detect when frontier AI models acquire high‑risk capabilities, such as enabling cyber or biological misuse or supporting runaway autonomy, so that developers and governments can respond before deployment. Canary’s work includes building science‑based tools and benchmarks, running evaluations on cutting‑edge models, and helping integrate these assessments into safety policies and regulatory frameworks for frontier AI.

Community Signal

Updated 05/18/26

0Upvotes

0Downvotes

1Endorsements

0Comments

Endorsements support Model Evaluation & Threat Research (METR).

Endorsed by+1

Discussion

No comments yet. Be the first to share your thoughts.

Details

Start Date: Oct 9, 2024
End Date: -
Expected Duration: -
Funding Raised to Date: -