Weight Initialization

Creator: Seonglae Cho
Created: 2023 Jun 6 8:40
Editor: Seonglae Cho
Edited: 2025 Mar 14 16:52

Parameter initialization

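  • Weights: initialize to small random numbers, so that units in the same layer do not start out identical
  • Biases: initialize to zero or to a small nonzero constant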
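
The two rules above can be sketched for a single dense layer as follows. This is a minimal illustration in NumPy, not a reference implementation: the helper name `init_layer`, the default scale of 0.01, and the fixed seed are assumptions chosen for the example.

```python
import numpy as np

def init_layer(fan_in, fan_out, weight_scale=0.01, bias_value=0.0, seed=0):
    """Return (W, b) for a dense layer with fan_in inputs and fan_out outputs.

    weight_scale and bias_value are illustrative defaults (assumptions),
    not values prescribed by the notes above.
    """
    rng = np.random.default_rng(seed)
    # Small random weights: each unit starts from a different point,
    # which breaks the symmetry between units in the same layer.
    W = rng.normal(loc=0.0, scale=weight_scale, size=(fan_in, fan_out))
    # Biases: zero by default, or a small nonzero constant if requested.
    b = np.full(fan_out, bias_value, dtype=W.dtype)
    return W, b

W, b = init_layer(fan_in=4, fan_out=3)
print(W.shape, b.shape)
```

Passing a small positive `bias_value` (for example 0.01) gives the "small nonzero" variant of the bias rule.
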
Weight Initialization Usages
He initialization
Xavier Initialization
Gaussian Initialization
Kaiming initialization
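
As a rough comparison of the schemes listed above, the sketch below draws weights from a zero-mean normal distribution and changes only the standard deviation. He and Kaiming initialization name the same scheme. The function names, the `(fan_in, fan_out)` weight layout, and the choice of the normal (rather than uniform) variants are assumptions made for this example.

```python
import numpy as np

def gaussian_init(fan_in, fan_out, std=0.01, rng=None):
    """Plain Gaussian init: fixed standard deviation, independent of layer size."""
    if rng is None:
        rng = np.random.default_rng(0)
    return rng.normal(0.0, std, size=(fan_in, fan_out))

def xavier_init(fan_in, fan_out, rng=None):
    """Xavier (Glorot) normal init: std = sqrt(2 / (fan_in + fan_out))."""
    if rng is None:
        rng = np.random.default_rng(0)
    std = np.sqrt(2.0 / (fan_in + fan_out))
    return rng.normal(0.0, std, size=(fan_in, fan_out))

def he_init(fan_in, fan_out, rng=None):
    """He (Kaiming) normal init: std = sqrt(2 / fan_in), commonly paired with ReLU."""
    if rng is None:
        rng = np.random.default_rng(0)
    std = np.sqrt(2.0 / fan_in)
    return rng.normal(0.0, std, size=(fan_in, fan_out))

# Compare the empirical spread of each scheme on one layer shape.
for name, init in [("gaussian", gaussian_init), ("xavier", xavier_init), ("he", he_init)]:
    W = init(512, 256)
    print(name, float(W.std()))
```

The only difference between the three functions is how the standard deviation depends on the layer's fan-in and fan-out, which is the point of the size-aware schemes.
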
0025 Initialization - Deepest Documentation
https://deepestdocs.readthedocs.io/en/latest/002_deep_learning_part_1/0025/

Copyright Seonglae Cho