Randomness
Temperature is only applied to output logit softmax
- When T=0, it selects the token with highest probability (deterministic, greedy decoding)
- When T approaches infinity, probability distribution becomes uniform, increasing randomness
ll
Seonglae Cho
Seonglae Cho