Attention Sink arxiv.orghttps://arxiv.org/pdf/2402.09221interpreting GPT: the logit lens — AI Alignment ForumThis post relates an observation I've made in my work with GPT-2, which I have not seen made elsewhere. …https://www.alignmentforum.org/posts/AcKRB8wDpdaN6v6ru/interpreting-gpt-the-logit-lens