AI Safety
active
About
Updated 05/18/26Research program on making advanced AI systems reliably do what humans intend, using approaches such as provable behavioral guarantees in model-based reinforcement learning agents, zero-shot cooperation in RL systems, and interpretability of what models are learning.
Discussion
Sign in to comment
No comments yet. Be the first to share your thoughts.
Details
- Start Date
- -
- End Date
- -
- Expected Duration
- -
- Funding Raised to Date
- -