Distributed Shuffling
How to shuffle a big dataset
At Jane Street, we often work with data that has a very low signal-to-noise ratio, but fortunately we also have a lot of data. Where practitioners in many fiel...
https://blog.janestreet.com/how-to-shuffle-a-big-dataset/


Seonglae Cho