- heuristic
- hype
- overstated
- AI-generated
- frame
- statistical robustness
Vision Language Model
- Evaluate each error type in the list separately, with one prompt per type applied to the whole paper
- Automatic score calculation (Aggregator)
- Automatic review generation with overall score and detected issues
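
The three steps above could be sketched roughly as follows. This is a minimal illustration, not the actual pipeline: the `Finding` record, the 0 to 1 severity scale, the 10-point overall score, the mean-based aggregation, and the 0.5 reporting threshold are all assumptions made for the example. The model call itself is left out; per-type severities are passed in as if they had already been returned by the model.

```python
from dataclasses import dataclass

# Error types copied from the list above. Everything else is hypothetical.
ERROR_TYPES = [
    "heuristic",
    "hype",
    "overstated",
    "ai generated",
    "frame",
    "statistical robustness",
]


@dataclass
class Finding:
    error_type: str
    severity: float  # assumed scale: 0.0 (no issue) to 1.0 (severe)
    note: str = ""


def build_prompt(error_type: str) -> str:
    """Hypothetical per-type prompt; one prompt per error type, whole paper as input."""
    return (
        f"Read the whole paper and rate the severity of "
        f"'{error_type}' issues from 0 to 1."
    )


def aggregate(findings: list[Finding], max_score: float = 10.0) -> float:
    """Aggregator: map mean severity to an overall score (assumed formula)."""
    if not findings:
        return max_score
    mean_severity = sum(f.severity for f in findings) / len(findings)
    return round(max_score * (1.0 - mean_severity), 2)


def generate_review(findings: list[Finding], threshold: float = 0.5) -> str:
    """Build a short review: overall score plus issues at or above the threshold."""
    score = aggregate(findings)
    detected = [f for f in findings if f.severity >= threshold]
    lines = [f"Overall score: {score}/10"]
    lines.append("Detected issues:" if detected else "Detected issues: none")
    for f in detected:
        lines.append(f"- {f.error_type}: {f.note}")
    return "\n".join(lines)


if __name__ == "__main__":
    example = [
        Finding("hype", 0.0),
        Finding("overstated", 0.5, "claims exceed evidence"),
        Finding("statistical robustness", 1.0, "no variance reported"),
    ]
    print(generate_review(example))
```

A plain mean treats every error type as equally important; a weighted sum per error type would be a natural variation if some types should count more.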

Seonglae Cho