Nonlinear
A new preprint on nonlinear SGD under heavy-tailed gradient noise, including analyses on large deviations, mean-sqaure and almost sure convergence rates.
A new preprint on nonlinear SGD under heavy-tailed gradient noise, including analyses on large deviations, mean-sqaure and almost sure convergence rates.