There is significant empirical evidence suggesting that neural networks have interpretable linear directions in activation space.
Linear representation hypothesis
Created: 2024 May 24 4:19
Editor: Seonglae Cho
Creator: Seonglae Cho
Edited: 2024 Nov 18 20:32
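
To make "a linear direction in activation space" concrete, here is a minimal sketch of one common way to look for such a direction: the difference between the mean activations of two groups of inputs. Everything in it is an illustrative assumption rather than something taken from this page: the activations are synthetic Gaussian noise with a planted direction, not outputs of a real network, and the names `difference_of_means_direction`, `true_direction`, and `d_model` are hypothetical.

```python
import numpy as np


def difference_of_means_direction(acts, labels):
    """Return the unit vector pointing from the class-0 mean to the class-1 mean."""
    acts = np.asarray(acts, dtype=float)
    labels = np.asarray(labels).astype(bool)
    diff = acts[labels].mean(axis=0) - acts[~labels].mean(axis=0)
    return diff / np.linalg.norm(diff)


# Synthetic stand-in for model activations (an assumption, not real data):
# isotropic noise plus a planted unit direction that is added only when label == 1.
rng = np.random.default_rng(0)
d_model, n = 64, 2000
true_direction = rng.normal(size=d_model)
true_direction /= np.linalg.norm(true_direction)
labels = rng.integers(0, 2, size=n)
acts = rng.normal(size=(n, d_model)) + 4.0 * labels[:, None] * true_direction

# Estimate the direction from the data and compare it with the planted one.
direction = difference_of_means_direction(acts, labels)
cosine = float(direction @ true_direction)

# If the feature is linearly represented, projecting onto the direction
# and thresholding should separate the two classes.
projections = acts @ direction
threshold = projections.mean()
accuracy = float(((projections > threshold) == labels.astype(bool)).mean())

print(f"cosine similarity with planted direction: {cosine:.3f}")
print(f"accuracy of thresholded projection: {accuracy:.3f}")
```

On real models the same recipe would be applied to activations recorded at a chosen layer, and a high-accuracy projection would only be suggestive evidence: it shows the feature is linearly decodable, not that the model itself uses that direction.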