RealToxicityPrompts: Evaluating Neural Toxic Degeneration in...
Pretrained neural language models (LMs) are prone to generating racist, sexist, or otherwise toxic language which hinders their safe deployment. We investigate the extent to which pretrained LMs...
https://arxiv.org/abs/2009.11462