Alternating maximization: unifying framework for 8 sparse PCA formulations and efficient parallel codes
dc.contributor.author | Richtarik, Peter | |
dc.contributor.author | Jahani, Majid | |
dc.contributor.author | Ahipaşaoğlu, Selin Damla | |
dc.contributor.author | Takáč, Martin | |
dc.date.accessioned | 2020-09-29T12:05:47Z | |
dc.date.available | 2020-09-29T12:05:47Z | |
dc.date.issued | 2020-09-22 | |
dc.date.submitted | 2019-10-03 | |
dc.identifier.citation | Richtárik, P., Jahani, M., Ahipaşaoğlu, S. D., & Takáč, M. (2020). Alternating maximization: unifying framework for 8 sparse PCA formulations and efficient parallel codes. Optimization and Engineering. doi:10.1007/s11081-020-09562-3 | |
dc.identifier.issn | 1573-2924 | |
dc.identifier.issn | 1389-4420 | |
dc.identifier.doi | 10.1007/s11081-020-09562-3 | |
dc.identifier.uri | http://hdl.handle.net/10754/665355 | |
dc.description.abstract | Given a multivariate data set, sparse principal component analysis (SPCA) aims to extract several linear combinations of the variables that together explain the variance in the data as much as possible, while controlling the number of nonzero loadings in these combinations. In this paper we consider 8 different optimization formulations for computing a single sparse loading vector: we employ two norms for measuring variance (L2, L1) and two sparsity-inducing norms (L0, L1), which are used in two ways (constraint, penalty). Three of our formulations, notably the one with L0 constraint and L1 variance, have not been considered in the literature. We give a unifying reformulation which we propose to solve via the alternating maximization (AM) method. We show that AM is equivalent to GPower for all formulations. Besides this, we provide 24 efficient parallel SPCA implementations: 3 codes (multi-core, GPU and cluster) for each of the 8 problems. Parallelism in the methods is aimed at (1) speeding up computations (our GPU code can be 100 times faster than an efficient serial code written in C++), (2) obtaining solutions explaining more variance and (3) dealing with big data problems (our cluster code can solve a 357 GB problem in a minute). | |
dc.publisher | Springer Nature | |
dc.relation.url | http://link.springer.com/10.1007/s11081-020-09562-3 | |
dc.relation.url | http://arxiv.org/pdf/1212.4137 | |
dc.rights | Archived with thanks to Optimization and Engineering | |
dc.rights | This file is an open access version redistributed from: http://arxiv.org/pdf/1212.4137 | |
dc.title | Alternating maximization: unifying framework for 8 sparse PCA formulations and efficient parallel codes | |
dc.type | Article | |
dc.contributor.department | Computer Science Program | |
dc.contributor.department | Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division | |
dc.identifier.journal | Optimization and Engineering | |
dc.rights.embargodate | 2021-09-22 | |
dc.eprint.version | Pre-print | |
dc.contributor.institution | Industrial and Systems Engineering, Lehigh University, 200 West Packer Avenue, Bethlehem, PA, 18015, USA | |
dc.contributor.institution | Mathematical Sciences, University of Southampton, University Road, Southampton, SO17 1BJ, UK | |
dc.identifier.arxivid | 1212.4137 | |
kaust.person | Richtarik, Peter | |
dc.date.accepted | 2020-09-07 | |
dc.identifier.eid | 2-s2.0-85091319469 | |
refterms.dateFOA | 2020-12-07T13:19:29Z | |
dc.date.published-online | 2020-09-22 | |
dc.date.published-print | 2021-09 |
Files in this item
This item appears in the following Collection(s)
-
Articles
-
Computer Science Program
For more information visit: https://cemse.kaust.edu.sa/cs -
Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division
For more information visit: https://cemse.kaust.edu.sa/