Vocab size: Although self-certainty comes naturally to LLMs, using it as a reward prevents them from developing genuine reasoning abilities over time. https://www.arxiv.org/pdf/2505.19590