Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Problem/AI Alignment/Explainable AI/Interpretable AI/Mechanistic interpretability/
Universality Hypothesis
Search

Universality Hypothesis

Created
Created
2024 Apr 6 13:13
Editor
Editor
Seonglae ChoSeonglae Cho
Creator
Creator
Seonglae ChoSeonglae Cho
Edited
Edited
2025 Jun 29 19:52
Refs
Refs
Natural Abstraction Hypothesis
Correlation
AI Feature
SAE Transferability

Different models learn similar features and circuits

https://transformer-circuits.pub/2023/monosemantic-features#phenomenology-universality
Universality Hypothesis Approches
Model Diffing
GCR
 
 
Universality Types
Embedding Universality
 
 
 

Convergent learning (2016)

arxiv.org
https://arxiv.org/pdf/1511.07543.pdf
Connectome
Computational Neuroscience
arxiv.org
https://arxiv.org/pdf/2211.12935
arxiv.org
https://arxiv.org/pdf/2210.06756
 
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Problem/AI Alignment/Explainable AI/Interpretable AI/Mechanistic interpretability/
Universality Hypothesis
Copyright Seonglae Cho