AI Safety

active

About

Updated 05/18/26

Research program on making advanced AI systems reliably do what humans intend, using approaches such as provable behavioral guarantees in model-based reinforcement learning agents, zero-shot cooperation in RL systems, and interpretability of what models are learning.

Community Signal

Updated 05/18/26

0Upvotes

0Downvotes

0Endorsements

0Comments

Endorsements support Cavendish Labs.

No endorsements yet.

Discussion

No comments yet. Be the first to share your thoughts.

Details

Start Date: -
End Date: -
Expected Duration: -
Funding Raised to Date: -