Linearly convergent stochastic heavy ball method for minimizing generalization error
Type
PreprintAuthors
Loizou, NicolasRichtarik, Peter

KAUST Department
Computer Science ProgramComputer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division
Date
2017-10-30Permanent link to this record
http://hdl.handle.net/10754/626510
Metadata
Show full item recordAbstract
In this work we establish the first linear convergence result for the stochastic heavy ball method. The method performs SGD steps with a fixed stepsize, amended by a heavy ball momentum term. In the analysis, we focus on minimizing the expected loss and not on finite-sum minimization, which is typically a much harder problem. While in the analysis we constrain ourselves to quadratic loss, the overall objective is not necessarily strongly convex.Publisher
arXivarXiv
1710.10737Additional Links
http://arxiv.org/abs/1710.10737v1http://arxiv.org/pdf/1710.10737v1