BELLS benchmark

active

About

Updated 05/18/26

An open-source benchmark suite developed by CeSIA to evaluate and compare large language model supervision and safeguard systems, measuring how reliably they detect problematic or unsafe behaviour in other models.

Theory of Change

By providing standardised, open-source evaluations of how well different guardrail and monitoring systems detect harmful or non-compliant model behaviour, BELLS aims to raise the bar for AI supervision tools and inform regulators, labs, and safety institutes about which approaches best mitigate real-world risks from advanced language models.

Community Signal

Updated 05/18/26

0Upvotes

0Downvotes

1Endorsements

0Comments

Endorsements support CeSIA.

Endorsed by+1

Discussion

No comments yet. Be the first to share your thoughts.

Details

Start Date: -
End Date: -
Expected Duration: -
Funding Raised to Date: -