Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Risk/AI Alignment/Explainable AI/Interpretable AI/Mechanistic interpretability/Activation Engineering/
RepE
Search

RepE

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2025 Oct 9 23:35
Editor
Editor
Seonglae ChoSeonglae Cho
Edited
Edited
2025 Oct 13 9:38
Refs
Refs

Representation Engineering

 
 
 
 
 
 
 
Representation Engineering: A Top-Down Approach to AI Transparency
In this paper, we identify and characterize the emerging area of representation engineering (RepE), an approach to enhancing the transparency of AI systems that draws on insights from cognitive...
Representation Engineering: A Top-Down Approach to AI Transparency
https://arxiv.org/abs/2310.01405
Representation Engineering: A Top-Down Approach to AI Transparency
arxiv.org
https://arxiv.org/pdf/2502.19649
 
 
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Risk/AI Alignment/Explainable AI/Interpretable AI/Mechanistic interpretability/Activation Engineering/
RepE
Copyright Seonglae Cho