DiLoCo

Creator

Created

2024 Oct 13 3:12

Editor

Edited

2025 Feb 13 16:54

Refs

DiLoCo = Distributed Low-Communication (training)

Streaming DiLoCo by DeepMind

Synchronize only subsets of parameters in sequence, rather than all at once, which greatly reduces peak bandwidth

Allow workers to continue training while synchronizing, which decreases wall clock time

Quantize the data exchanged by workers, which further reduces bandwidth across workers

https://arxiv.org/pdf/2501.18512v1
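A rough sketch of the first and third ideas above (staggered fragment synchronization and quantized exchange), in plain Python. Everything here is a toy under stated assumptions: `quantize`, `fragment_due`, and `run` are hypothetical names, the "training" is a quadratic pull toward per-worker targets, the outer update is plain averaging rather than whatever outer optimizer the paper uses, and the uniform quantizer only stands in for a real low-precision format. Overlapping communication with compute (the second idea) is not modeled.

```python
import random


def quantize(values, levels=15):
    """Toy uniform quantizer: snap each value to one of levels + 1 grid points."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return list(values)
    step = (hi - lo) / levels
    return [lo + round((v - lo) / step) * step for v in values]


def fragment_due(step, num_fragments, sync_every):
    """Return the fragment index to synchronize at this step, or None.

    Each fragment is synchronized once every `sync_every` steps, but the
    fragments are offset from each other so only one is sent at a time.
    """
    stride = sync_every // num_fragments
    if step == 0 or step % stride != 0:
        return None
    return (step // stride - 1) % num_fragments


def run(num_workers=4, num_params=12, num_fragments=3, sync_every=6,
        steps=24, inner_lr=0.1, outer_lr=1.0, seed=0):
    rng = random.Random(seed)
    frag_size = num_params // num_fragments
    # Assumption: each worker pulls toward its own target (stand-in for its data shard).
    targets = [[rng.uniform(-1, 1) for _ in range(num_params)]
               for _ in range(num_workers)]
    global_params = [0.0] * num_params
    workers = [list(global_params) for _ in range(num_workers)]
    peak_sent = 0  # largest number of values one worker sends in a single sync

    for step in range(1, steps + 1):
        # Inner step: every worker trains locally with no communication.
        for w, t in zip(workers, targets):
            for i in range(num_params):
                w[i] -= inner_lr * (w[i] - t[i])

        frag = fragment_due(step, num_fragments, sync_every)
        if frag is None:
            continue

        lo, hi = frag * frag_size, (frag + 1) * frag_size
        # Each worker sends a quantized delta for this fragment only.
        deltas = [quantize([global_params[i] - w[i] for i in range(lo, hi)])
                  for w in workers]
        peak_sent = max(peak_sent, hi - lo)

        # Outer step (simplified): average the deltas and apply them.
        for j, i in enumerate(range(lo, hi)):
            avg = sum(d[j] for d in deltas) / num_workers
            global_params[i] -= outer_lr * avg
            for w in workers:
                w[i] = global_params[i]  # workers adopt the merged fragment

    return global_params, workers, peak_sent
```

With the defaults, `run()` sends at most 4 of the 12 parameters per synchronization, which is the peak-bandwidth reduction the first bullet describes; synchronizing everything at once would send all 12 every `sync_every` steps.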

DiLoCo: Distributed Low-Communication Training of Language Models

Large language models (LLM) have become a critical component in many applications of machine learning. However, standard approaches to training LLM require a large number of tightly interconnected...

https://arxiv.org/abs/2311.08105

OpenDiLoCo: An Open-Source Framework for Globally Distributed...

OpenDiLoCo is an open-source implementation and replication of the Distributed Low-Communication (DiLoCo) training method for large language models. We provide a reproducible implementation of the...

https://arxiv.org/abs/2407.07852

OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

Introducing OpenDiLoCo, an open-source implementation and scaling of DeepMind’s Distributed Low-Communication (DiLoCo) method, enabling globally distributed AI model training.

https://www.primeintellect.ai/blog/opendiloco
