Parallel Training

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2022 Mar 15 11:35
Editor
Edited
Edited
2024 Oct 5 22:29

Multi-GPU or multi-node Distributed Training, Federated learning

Data parallelism or model parallelism

  • In data parallelism, the data is split into multiple parts
  • in model parallelism, different parts of the model are processed by separate processors

These parallelism are states as 4D parallelism or 3D parallelism

Parallel Training Notion
 
 
 
 
https://xiandong79.github.io
 
 
 

Recommendations