It is good to separate out hard zeros for interpretability. Thisis because sparsity causes a dirac delta in density at zero.But Is sparsity actually a good proxy for interpretability? is still open question regularization forestarxiv.orghttps://arxiv.org/pdf/1406.2035VAEarxiv.orghttps://arxiv.org/pdf/2009.12421