Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Development/Model Training Tool/
Nanotron
Search

Nanotron

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2024 Mar 31 12:29
Editor
Editor
Seonglae ChoSeonglae Cho
Edited
Edited
2026 Jan 10 0:25
Refs
Refs
Apex
Deepspeed
Flash Attention
 
 
 
 
 
GitHub - huggingface/nanotron: Minimalistic large language model 3D-parallelism training
Minimalistic large language model 3D-parallelism training - huggingface/nanotron
GitHub - huggingface/nanotron: Minimalistic large language model 3D-parallelism training
https://github.com/huggingface/nanotron?tab=readme-ov-file
GitHub - huggingface/nanotron: Minimalistic large language model 3D-parallelism training
3
NVIDIA Debuts Nemotron 3 Family of Open Models
The Nemotron 3 family of open models — in Nano, Super and Ultra sizes — introduces the most efficient family of open models with leading accuracy for building agentic AI applications.
NVIDIA Debuts Nemotron 3 Family of Open Models
https://nvidianews.nvidia.com/news/nvidia-debuts-nemotron-3-family-of-open-models
NVIDIA Debuts Nemotron 3 Family of Open Models
 
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Development/Model Training Tool/
Nanotron
Copyright Seonglae Cho