On the equivalence of different adaptive batch size selection strategies for stochastic gradient descent methods
dc.contributor.author | Espath, Luis | |
dc.contributor.author | Krumscheid, Sebastian | |
dc.contributor.author | Tempone, Raul | |
dc.contributor.author | Vilanova, Pedro | |
dc.date.accessioned | 2021-10-04T13:12:43Z | |
dc.date.available | 2021-10-04T13:12:43Z | |
dc.date.issued | 2021-09-22 | |
dc.identifier.uri | http://hdl.handle.net/10754/672107 | |
dc.description.abstract | In this study, we demonstrate that the norm test and inner product/orthogonality test presented in [1] are equivalent in terms of the convergence rates associated with Stochastic Gradient Descent (SGD) methods if e2 = θ2 + ν2 with specific choices of θ and ν. Here, controls the relative statistical error of the norm of the gradient while θ and ν control the relative statistical error of the gradient in the direction of the gradient and in the direction orthogonal to the gradient, respectively. Furthermore, we demonstrate that the inner product/orthogonality test can be as inexpensive as the norm test in the best case scenario if θ and ν are optimally selected, but the inner product/orthogonality test will never be more computationally affordable than the norm test if e2 = θ2 + ν2. Finally, we present two stochastic optimization problems to illustrate our results. | |
dc.description.sponsorship | This work was partially supported by the KAUST Office of Sponsored Research (OSR) under Award numbers URF/1/2281 − 01 − 01, URF/1/2584 − 01 − 01 in the KAUST Competitive Research Grants Program Round 8, the Alexander von Humboldt Foundation. | |
dc.publisher | arXiv | |
dc.relation.url | https://arxiv.org/pdf/2109.10933.pdf | |
dc.rights | Archived with thanks to arXiv | |
dc.title | On the equivalence of different adaptive batch size selection strategies for stochastic gradient descent methods | |
dc.type | Preprint | |
dc.contributor.department | Applied Mathematics and Computational Science Program | |
dc.contributor.department | Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division | |
dc.contributor.department | Stochastic Numerics Research Group | |
dc.eprint.version | Pre-print | |
dc.contributor.institution | Department of Mathematics, RWTH Aachen University, Gebaude-1953 1.OG, Pontdriesch 14-16, 161, 52062 ¨Aachen, Germany. | |
dc.contributor.institution | Alexander von Humboldt Professor in Mathematics for Uncertainty Quantification, RWTH Aachen University, Germany. | |
dc.contributor.institution | Department of Mathematical Sciences, Stevens Institute of Technology, Hoboken, NJ 07030 USA. | |
dc.identifier.arxivid | 2109.10933 | |
kaust.person | Tempone, Raul | |
kaust.grant.number | URF/1/2281 − 01 − 01 | |
kaust.grant.number | URF/1/2584 − 01 − 01 | |
refterms.dateFOA | 2021-10-04T13:13:45Z | |
kaust.acknowledged.supportUnit | Competitive Research Grants | |
kaust.acknowledged.supportUnit | KAUST Office of Sponsored Research (OSR) |
Files in this item
This item appears in the following Collection(s)
-
Preprints
-
Applied Mathematics and Computational Science Program
For more information visit: https://cemse.kaust.edu.sa/amcs -
Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division
For more information visit: https://cemse.kaust.edu.sa/