Texonom
/
Engineering
/
Data Engineering
/
Artificial Intelligence
/
Machine Learning
/
Reinforcement Learning
/
Reinforcement Learning Method
/
Reward model
/
Verifiable Reward
/
Accuracy reward
Search
Accuracy reward
Creator
Creator
Seonglae Cho
Created
Created
2025 Apr 16 13:5
Editor
Editor
Seonglae Cho
Edited
Edited
2025 Apr 16 13:7
Refs
Refs
Task reward
grade model outputs according to the correctness of their responses
Recommendations
Texonom
/
Engineering
/
Data Engineering
/
Artificial Intelligence
/
Machine Learning
/
Reinforcement Learning
/
Reinforcement Learning Method
/
Reward model
/
Verifiable Reward
/
Accuracy reward