The information content of a message is a function of how predictable it is. The information content (number of bits) needed to encode i is . So Next Token Prediction probability is containing information content itself.
Shannon Game
Creator
Creator
Seonglae ChoCreated
Created
2025 Mar 19 11:37Editor
Editor
Seonglae ChoEdited
Edited
2025 Mar 19 11:43Refs
Refs
