Pre Training

Creator
Seonglae Cho
Created
2023 Mar 7 14:1
Edited
2024 Apr 16 2:43

The process in which an artificial neural network learns to extract features from data, with each neuron learning an abstract separation of its inputs.

  • All parameters are updated
  • The model acquires general understanding
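
To make the first point concrete, here is a minimal sketch of a single gradient step in which every parameter receives an update. The two-layer scalar model, its parameter names (`w1`, `b1`, `w2`, `b2`), the squared-error loss, and the learning rate are all assumptions chosen for illustration; real pre-training uses far larger models and self-supervised objectives.

```python
import math

def forward(params, x):
    # Hypothetical toy model: y = w2 * tanh(w1 * x + b1) + b2
    h = math.tanh(params["w1"] * x + params["b1"])
    return params["w2"] * h + params["b2"], h

def sgd_step(params, x, target, lr=0.1):
    """One gradient step on L = 0.5 * (y - target)^2, updating every parameter."""
    y, h = forward(params, x)
    err = y - target                           # dL/dy
    dh = err * params["w2"] * (1.0 - h * h)    # backprop through tanh
    grads = {"w2": err * h, "b2": err, "w1": dh * x, "b1": dh}
    # No parameter is frozen: each one moves against its gradient.
    return {k: v - lr * grads[k] for k, v in params.items()}

params = {"w1": 0.5, "b1": 0.1, "w2": -0.3, "b2": 0.0}
new_params = sgd_step(params, x=1.0, target=1.0)

changed = sorted(k for k in params if new_params[k] != params[k])
print(changed)
```

Approaches that freeze part of the network would instead skip some keys in the update; in this sketch nothing is skipped.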
Pre Training Notion
 
 
 

How the training process and loss value relate to a neural network's ability

Perhaps the most striking phenomenon Anthropic has noticed is that the learning dynamics of toy models with large numbers of features appear to be dominated by "energy level jumps", in which features jump between different feature dimensionalities.
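
As a rough illustration of what "feature dimensionality" measures, the sketch below follows the definition used in Anthropic's Toy Models of Superposition work: for feature i with embedding vector W_i, D_i = ||W_i||^2 / sum_j (Ŵ_i · W_j)^2, where Ŵ_i is the unit vector along W_i. The function name, the list-of-lists layout for `W`, and the choice to return 0.0 for a zero-norm feature are assumptions made for this example.

```python
import math

def feature_dimensionality(W):
    """Return D_i for each feature vector W_i (rows of W, given as lists of floats)."""
    dims = []
    for w_i in W:
        norm_sq = sum(v * v for v in w_i)
        if norm_sq == 0.0:
            dims.append(0.0)   # assumption: an unlearned feature occupies no dimension
            continue
        norm = math.sqrt(norm_sq)
        unit = [v / norm for v in w_i]
        # Sum of squared projections of every feature (including i itself) onto W_i's direction.
        overlap = sum(sum(u * v for u, v in zip(unit, w_j)) ** 2 for w_j in W)
        dims.append(norm_sq / overlap)
    return dims

# Two orthogonal features each get a full dimension.
orthogonal = feature_dimensionality([[1.0, 0.0], [0.0, 1.0]])
# An antipodal pair shares one dimension, so each feature gets half.
antipodal = feature_dimensionality([[1.0], [-1.0]])
print(orthogonal, antipodal)
```

Tracking these values over training steps is one way to observe a feature moving from one dimensionality to another.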