2023-10-29
I use outer misaligment interchangably with reward misspecification but this is not uncontroversial.