Alignment Newsletter Podcast

Alignment Newsletter #100: What might go wrong if you learn a reward function while acting

Listen on