Texonom
Texonom
/
Science
Science
/Mathematics/Math Field/Statistics/Statistical Model/Model Generalization/Model Training/Model Training Tool/
Torchtitan
Search

Torchtitan

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2024 Nov 29 21:33
Editor
Editor
Seonglae ChoSeonglae Cho
Edited
Edited
2024 Nov 29 21:33
Refs
Refs
torchao
 
 
 
 
 
 
Supercharging Training using float8 and FSDP2
IBM: Tuan Hoang Trong, Alexei Karve, Yan Koyfman, Linsong Chu, Divya Kumari, Shweta Salaria, Robert Walkup, Praneet Adusumilli, Nirmit Desai, Raghu Ganti, Seetharami Seelam Meta: Less Wright, Wei Feng, Vasiliy Kuznetsov, Driss Guesseous
Supercharging Training using float8 and FSDP2
https://pytorch.org/blog/training-using-float8-fsdp2/
Supercharging Training using float8 and FSDP2
 
 

Recommendations

Texonom
Texonom
/
Science
Science
/Mathematics/Math Field/Statistics/Statistical Model/Model Generalization/Model Training/Model Training Tool/
Torchtitan
Copyright Seonglae Cho