[2303.16529] Importance Sampling for Stochastic Gradient Descent in Deep Neural Networks