Hyperfast second-order local solvers for efficient statistically preconditioned distributed optimization

dc.bibliographicCitation.firstPage100045
dc.bibliographicCitation.journalTitleEURO journal on computational optimizationeng
dc.bibliographicCitation.volume10
dc.contributor.authorDvurechensky, Pavel
dc.contributor.authorKamzolov, Dmitry
dc.contributor.authorLukashevich, Aleksandr
dc.contributor.authorLee, Soomin
dc.contributor.authorOrdentlich, Erik
dc.contributor.authorUribe, César A.
dc.contributor.authorGasnikov, Alexander
dc.date.accessioned2023-03-01T09:28:13Z
dc.date.available2023-03-01T09:28:13Z
dc.date.issued2022
dc.description.abstractStatistical preconditioning enables fast methods for distributed large-scale empirical risk minimization problems. In this approach, multiple worker nodes compute gradients in parallel, which are then used by the central node to update the parameter by solving an auxiliary (preconditioned) smaller-scale optimization problem. The recently proposed Statistically Preconditioned Accelerated Gradient (SPAG) method [1] has complexity bounds superior to other such algorithms but requires an exact solution for computationally intensive auxiliary optimization problems at every iteration. In this paper, we propose an Inexact SPAG (InSPAG) and explicitly characterize the accuracy by which the corresponding auxiliary subproblem needs to be solved to guarantee the same convergence rate as the exact method. We build our results by first developing an inexact adaptive accelerated Bregman proximal gradient method for general optimization problems under relative smoothness and strong convexity assumptions, which may be of independent interest. Moreover, we explore the properties of the auxiliary problem in the InSPAG algorithm assuming Lipschitz third-order derivatives and strong convexity. For such problem class, we develop a linearly convergent Hyperfast second-order method and estimate the total complexity of the InSPAG method with hyperfast auxiliary problem solver. Finally, we illustrate the proposed method's practical efficiency by performing large-scale numerical experiments on logistic regression models. To the best of our knowledge, these are the first empirical results on implementing high-order methods on large-scale problems, as we work with data where the dimension is of the order of 3 million, and the number of samples is 700 million.eng
dc.description.versionpublishedVersioneng
dc.identifier.urihttps://oa.tib.eu/renate/handle/123456789/11626
dc.identifier.urihttp://dx.doi.org/10.34657/10659
dc.language.isoeng
dc.publisherAmsterdam : Elsevier
dc.relation.doihttps://doi.org/10.1016/j.ejco.2022.100045
dc.relation.essn2192-4414
dc.relation.issn2192-4406
dc.rights.licenseCC BY-NC-ND 4.0 Unported
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0
dc.subject.ddc510
dc.subject.otherDistributed optimizationeng
dc.subject.otherEmpirical risk minimizationeng
dc.subject.otherStatistical preconditioningeng
dc.subject.otherTensor optimization methodseng
dc.titleHyperfast second-order local solvers for efficient statistically preconditioned distributed optimizationeng
dc.typeArticleeng
dc.typeTexteng
tib.accessRightsopenAccess
wgl.contributorWIAS
wgl.subjectMathematikger
wgl.typeZeitschriftenartikelger
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1-s2-0-S2192440622000211-main.pdf
Size:
932.97 KB
Format:
Adobe Portable Document Format
Description:
Collections