RLHF dataset from Anthropic (harmful outputs not only jailbreak)
Anthropic/hh-rlhf · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets/Anthropic/hh-rlhf/viewer/default/train

Seonglae Cho