Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Problem/AI Hacking/AI Redteaming/AI Jailbreak/AI Jailbreak Benchmark/
HH RLHF
Search

HH RLHF

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2025 Jul 21 15:1
Editor
Editor
Seonglae ChoSeonglae Cho
Edited
Edited
2025 Jul 21 15:1
Refs
Refs
RLHF
dataset from Anthropic (harmful outputs not only jailbreak)
Anthropic/hh-rlhf · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Anthropic/hh-rlhf · Datasets at Hugging Face
https://huggingface.co/datasets/Anthropic/hh-rlhf/viewer/default/train
Anthropic/hh-rlhf · Datasets at Hugging Face
 
 
 
 
 
 
 
 
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Problem/AI Hacking/AI Redteaming/AI Jailbreak/AI Jailbreak Benchmark/
HH RLHF
Copyright Seonglae Cho