SparseRT: Accelerating Unstructured Sparsity on GPUs for Deep...
In recent years, there has been a flurry of research in deep neural network pruning and compression. Early approaches prune weights individually. However, it is difficult to take advantage of the...
https://arxiv.org/abs/2008.11849