LLM as a Judge

Creator
Creator
Seonglae Cho
Created
Created
2024 Nov 22 23:37
Editor
Edited
Edited
2025 Mar 26 23:55
Refs
Refs

LLM Judge

Cons

  • egocentric - prefer himself
 
 
 
 

Thinking-LLM-as-a-Judge (
CoT
judge)

EvalPlanner is specific implementation of Thinking-LLM-as-a-Judge which plan

Judge as a Judge (how to improve consistency)

Judge only model

Selene
 
 

Recommendations