Natural Abstraction Hypothesis

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2024 Oct 24 23:7
Editor
Edited
Edited
2025 Dec 13 18:27

NAH

The efficient abstractions learned by AI reflect the inherent characteristics of the environment itself
  • Abstractability - The physical world can be abstracted, and it can be summarized with information of a much lower dimension than the overall complexity of the system
  • Human-Compatibility - Low-dimensional abstraction aligns with the abstractions humans use
  • Convergence - Various cognitive structures are likely to use similar abstractions
Currently, the best world modeling approaches are
Noise Reduction
for visual processing and
Attention Mechanism
for language processing.
If intelligence is strongly dependent on nature as a posterior probability , this means its determination probability is very high. This suggests that in a universe that is an
Entropy
generation machine and information seeking, nature may have adjusted the
Fine-tuned universe
to design specific forms of intelligence.

Multimodal Neuron from OpenAI (2021,
Gabriel Goh
)

In 2005, a letter published in Nature described human neurons responding to specific people, such as Jennifer Aniston or Halle Berry. The exciting thing was that they did so regardless of whether they were shown photographs, drawings, or even images of the person’s name. The neurons were multimodal. You are looking at the far end of the transformation from metric, visual shapes to conceptual information.

World model Interpretability with
Internal Interface Theory

If the way AI interacts with various modules through internal interfaces is consistently formed, the possibility increases that humans can understand the format of these interfaces and interpret the entire world model at once.

key claims theorems and critiques

Proposal (Wentworth, 2021)

Emergent Computations in Artificial Neural Networks and Real Brains

Even the discovery of similar circuits in humans and AI supports this claim

Grandmother cell,

2005 Nature study showed that single neurons in the human medial temporal lobe (MTL) respond selectively to the same person/object across different photo angles, lighting, and contexts with invariant selective responses. These results suggest an invariant, sparse and explicit code, which might be important in the transformation of complex visual percepts into long-term and more abstract memories.
Trump always has dedicated neuron
Mechanistic Interpretability explained | Chris Olah and Lex Fridman
Lex Fridman Podcast full episode: https://www.youtube.com/watch?v=ugvHCXCOmm4 Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/cv8247-sb See below for guest bio, links, and to give feedback, submit questions, contact Lex, etc. *GUEST BIO:* Dario Amodei is the CEO of Anthropic, the company that created Claude. Amanda Askell is an AI researcher working on Claude's character and personality. Chris Olah is an AI researcher working on mechanistic interpretability. *CONTACT LEX:* *Feedback* - give feedback to Lex: https://lexfridman.com/survey *AMA* - submit questions, videos or call-in: https://lexfridman.com/ama *Hiring* - join our team: https://lexfridman.com/hiring *Other* - other ways to get in touch: https://lexfridman.com/contact *EPISODE LINKS:* Claude: https://claude.ai Anthropic's X: https://x.com/AnthropicAI Anthropic's Website: https://anthropic.com Dario's X: https://x.com/DarioAmodei Dario's Website: https://darioamodei.com Machines of Loving Grace (Essay): https://darioamodei.com/machines-of-loving-grace Chris's X: https://x.com/ch402 Chris's Blog: https://colah.github.io Amanda's X: https://x.com/AmandaAskell Amanda's Website: https://askell.io *SPONSORS:* To support this podcast, check out our sponsors & get discounts: *Encord:* AI tooling for annotation & data management. Go to https://lexfridman.com/s/encord-cv8247-sb *Notion:* Note-taking and team collaboration. Go to https://lexfridman.com/s/notion-cv8247-sb *Shopify:* Sell stuff online. Go to https://lexfridman.com/s/shopify-cv8247-sb *BetterHelp:* Online therapy and counseling. Go to https://lexfridman.com/s/betterhelp-cv8247-sb *LMNT:* Zero-sugar electrolyte drink mix. Go to https://lexfridman.com/s/lmnt-cv8247-sb *PODCAST LINKS:* - Podcast Website: https://lexfridman.com/podcast - Apple Podcasts: https://apple.co/2lwqZIr - Spotify: https://spoti.fi/2nEwCF8 - RSS: https://lexfridman.com/feed/podcast/ - Podcast Playlist: https://www.youtube.com/playlist?list=PLrAXtmErZgOdP_8GztsuKi9nrraNbKKp4 - Clips Channel: https://www.youtube.com/lexclips *SOCIAL LINKS:* - X: https://x.com/lexfridman - Instagram: https://instagram.com/lexfridman - TikTok: https://tiktok.com/@lexfridman - LinkedIn: https://linkedin.com/in/lexfridman - Facebook: https://facebook.com/lexfridman - Patreon: https://patreon.com/lexfridman - Telegram: https://t.me/lexfridman - Reddit: https://reddit.com/r/lexfridman
Mechanistic Interpretability explained | Chris Olah and Lex Fridman
 
 
 

Recommendations