Hyperfast second-order local solvers for efficient statistically preconditioned distributed optimization

Dvurechensky, Pavel; Kamzolov, Dmitry; Lukashevich, Aleksandr; Lee, Soomin; Ordentlich, Erik; Uribe, César A.; Gasnikov, Alexander

doi:http://dx.doi.org/10.34657/10659

dc.bibliographicCitation.firstPage	100045
dc.bibliographicCitation.journalTitle	EURO journal on computational optimization	eng
dc.bibliographicCitation.volume	10
dc.contributor.author	Dvurechensky, Pavel
dc.contributor.author	Kamzolov, Dmitry
dc.contributor.author	Lukashevich, Aleksandr
dc.contributor.author	Lee, Soomin
dc.contributor.author	Ordentlich, Erik
dc.contributor.author	Uribe, César A.
dc.contributor.author	Gasnikov, Alexander
dc.date.accessioned	2023-03-01T09:28:13Z
dc.date.available	2023-03-01T09:28:13Z
dc.date.issued	2022
dc.description.abstract	Statistical preconditioning enables fast methods for distributed large-scale empirical risk minimization problems. In this approach, multiple worker nodes compute gradients in parallel, which are then used by the central node to update the parameter by solving an auxiliary (preconditioned) smaller-scale optimization problem. The recently proposed Statistically Preconditioned Accelerated Gradient (SPAG) method [1] has complexity bounds superior to other such algorithms but requires an exact solution of a computationally intensive auxiliary optimization problem at every iteration. In this paper, we propose an Inexact SPAG (InSPAG) and explicitly characterize the accuracy to which the corresponding auxiliary subproblem needs to be solved to guarantee the same convergence rate as the exact method. We build our results by first developing an inexact adaptive accelerated Bregman proximal gradient method for general optimization problems under relative smoothness and strong convexity assumptions, which may be of independent interest. Moreover, we explore the properties of the auxiliary problem in the InSPAG algorithm assuming Lipschitz third-order derivatives and strong convexity. For this problem class, we develop a linearly convergent Hyperfast second-order method and estimate the total complexity of the InSPAG method with a hyperfast auxiliary problem solver. Finally, we illustrate the proposed method's practical efficiency by performing large-scale numerical experiments on logistic regression models. To the best of our knowledge, these are the first empirical results on implementing high-order methods on large-scale problems, as we work with data where the dimension is of the order of 3 million and the number of samples is 700 million.	eng
dc.description.version	publishedVersion	eng
dc.identifier.uri	https://oa.tib.eu/renate/handle/123456789/11626
dc.identifier.uri	http://dx.doi.org/10.34657/10659
dc.language.iso	eng
dc.publisher	Amsterdam : Elsevier
dc.relation.doi	https://doi.org/10.1016/j.ejco.2022.100045
dc.relation.essn	2192-4414
dc.relation.issn	2192-4406
dc.rights.license	CC BY-NC-ND 4.0 International
dc.rights.uri	https://creativecommons.org/licenses/by-nc-nd/4.0
dc.subject.ddc	510
dc.subject.other	Distributed optimization	eng
dc.subject.other	Empirical risk minimization	eng
dc.subject.other	Statistical preconditioning	eng
dc.subject.other	Tensor optimization methods	eng
dc.title	Hyperfast second-order local solvers for efficient statistically preconditioned distributed optimization	eng
dc.type	Article	eng
dc.type	Text	eng
tib.accessRights	openAccess
wgl.contributor	WIAS
wgl.subject	Mathematik	ger
wgl.type	Zeitschriftenartikel	ger

Files

Original bundle

Name: 1-s2-0-S2192440622000211-main.pdf
Size: 932.97 KB
Format: Adobe Portable Document Format

Collections

Mathematik