Texonom
Texonom
/
Science
Science
/Mathematics/Math Field/Statistics/Statistical Model/Model Generalization/Model Training/Model Training Tool/
Nanotron
Search

Nanotron

Creator
Creator
Seonglae Cho
Created
Created
2024 Mar 31 12:29
Editor
Editor
Seonglae Cho
Edited
Edited
2024 Mar 31 12:30
Refs
Refs
Apex
Deepspeed
Flash Attention
 
 
 
 
 
GitHub - huggingface/nanotron: Minimalistic large language model 3D-parallelism training
Minimalistic large language model 3D-parallelism training - huggingface/nanotron
GitHub - huggingface/nanotron: Minimalistic large language model 3D-parallelism training
https://github.com/huggingface/nanotron?tab=readme-ov-file
GitHub - huggingface/nanotron: Minimalistic large language model 3D-parallelism training
 
 
 

Recommendations

Texonom
Texonom
/
Science
Science
/Mathematics/Math Field/Statistics/Statistical Model/Model Generalization/Model Training/Model Training Tool/
Nanotron
Copyright Seonglae Cho