SEGA: Variance Reduction via Gradient Sketching
dc.contributor.author | Hanzely, Filip | |
dc.contributor.author | Mishchenko, Konstantin | |
dc.contributor.author | Richtarik, Peter | |
dc.date.accessioned | 2019-05-29T07:37:28Z | |
dc.date.available | 2019-05-29T07:37:28Z | |
dc.date.issued | 2018-09-09 | |
dc.identifier.uri | http://hdl.handle.net/10754/653114 | |
dc.description.abstract | We propose a randomized first order optimization method--SEGA (SkEtchedGrAdient method)-- which progressively throughout its iterations builds avariance-reduced estimate of the gradient from random linear measurements(sketches) of the gradient obtained from an oracle. In each iteration, SEGAupdates the current estimate of the gradient through a sketch-and-projectoperation using the information provided by the latest sketch, and this issubsequently used to compute an unbiased estimate of the true gradient througha random relaxation procedure. This unbiased estimate is then used to perform agradient step. Unlike standard subspace descent methods, such as coordinatedescent, SEGA can be used for optimization problems with a non-separableproximal term. We provide a general convergence analysis and prove linearconvergence for strongly convex objectives. In the special case of coordinatesketches, SEGA can be enhanced with various techniques such as importancesampling, minibatching and acceleration, and its rate is up to a small constantfactor identical to the best-known rate of coordinate descent. | |
dc.publisher | arXiv | |
dc.relation.url | https://arxiv.org/pdf/1809.03054 | |
dc.rights | Archived with thanks to arXiv | |
dc.title | SEGA: Variance Reduction via Gradient Sketching | |
dc.type | Preprint | |
dc.contributor.department | Applied Mathematics and Computational Science | |
dc.contributor.department | Applied Mathematics and Computational Science Program | |
dc.contributor.department | Computer Science | |
dc.contributor.department | Computer Science Program | |
dc.contributor.department | Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division | |
dc.eprint.version | Pre-print | |
dc.contributor.institution | School of Mathematics, University of Edinburgh, United Kingdom | |
dc.contributor.institution | Moscow Institute of Physics and Technology, Russia | |
dc.identifier.arxivid | 1809.03054 | |
kaust.person | Hanzely, Filip | |
kaust.person | Mishchenko, Konstantin | |
kaust.person | Richtarik, Peter | |
refterms.dateFOA | 2019-05-29T07:37:48Z |
Files in this item
This item appears in the following Collection(s)
-
Preprints
-
Applied Mathematics and Computational Science Program
For more information visit: https://cemse.kaust.edu.sa/amcs -
Computer Science Program
For more information visit: https://cemse.kaust.edu.sa/cs -
Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division
For more information visit: https://cemse.kaust.edu.sa/