Meta bias benchmark generation
Integrated Bias Benchmark Pipeline
- Generating
- Evaluating
- Micro-assessment
- Statistical Diagnosis
SAGED
SAGED: A Holistic Bias-Benchmarking Pipeline for Language Models...
The development of unbiased large language models is widely recognized as crucial, yet existing benchmarks fall short in detecting biases due to limited scope, contamination, and lack of a...
https://arxiv.org/abs/2409.11149


Seonglae Cho