System Level Safety Evaluations
active
About
Updated 05/18/26Research agenda developing adversarial, multi-agent evaluations to stress-test societal defensive processes—such as democratic mechanisms, scientific consensus formation, and social dynamics—so as to measure system-level AI safety properties under coordinated attacks.
Discussion
Sign in to comment
No comments yet. Be the first to share your thoughts.
Details
- Start Date
- -
- End Date
- -
- Expected Duration
- -
- Funding Raised to Date
- -