Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
158 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Randomized Block-Coordinate Primal-Dual Method for Large-scale Stochastic Saddle Point Problems (1907.03886v5)

Published 8 Jul 2019 in math.OC

Abstract: We consider (stochastic) convex-concave saddle point (SP) problems with high-dimensional decision variables, arising in various machine learning problems. To contend with the challenges in computing full gradients, we employ a randomized block-coordinate primal-dual scheme in which randomly selected primal and dual blocks of variables are updated. We consider both deterministic and stochastic settings, where deterministic partial gradients and their randomly sampled estimates are used, respectively, at each iteration. We investigate the convergence of the proposed method under different blocking strategies and provide the corresponding complexity results. While the best-known complexity result for deterministic primal-dual methods using full gradients is $\mathcal O(\max{M,N}2/\varepsilon)$ where $M$ and $N$ denote the number of primal and dual blocks, respectively, we show that our proposed randomized block-coordinate method can achieve an improved convergence rate of $\mathcal O(MN/\varepsilon)$. Moreover, for the stochastic setting where a mini-batch sample gradient is utilized, we show an optimal oracle complexity of $\tilde{\mathcal{O}}(M2N2/\varepsilon2)$ through acceleration. Finally, almost sure convergence of the iterate sequence to a saddle point is established.

Summary

We haven't generated a summary for this paper yet.