EP
Expert Parallelism Tools
Field Notes on Scaling MoE Expert Parallelism with DeepEP
Documenting the journey of scaling expert parallelism to achieve high-throughput pretraining.
https://nousresearch.com/moe-scaling-field-notes/

Seonglae Cho
Seonglae Cho