Alignment Newsletter Podcast

Alignment Newsletter #106: Evaluating generalization ability of learned reward models

Listen on