Test-time Scaling

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2025 Aug 1 23:31
Editor
Edited
Edited
2025 Aug 1 23:31
Refs
Refs
 
 
 
 
 
In Reasoning Models, increasing test-time computations (thinking tokens) doesn't always lead to improvement, and reverse scaling where accuracy actually decreases has been observed across multiple tasks
 
 

Recommendations