Biconvex Landscape In SDP-Related Learning

En-Liang Hu, Bo Wang

Many machine learning problems can be reduced to learning a low-rank positive semidefinite matrix (denoted as $Z$), which leads to a semidefinite program (SDP). Existing SDP solvers are often too expensive for large-scale learning. To avoid solving the SDP directly, some works convert it into a nonconvex program by factorizing $Z$ as $XX^\top$. However, this factorization introduces higher-order nonlinearity, leaving little structure to exploit in the subsequent optimization. In this paper, we propose a novel surrogate for SDP-related learning that exploits the structure of the subproblems. More specifically, we replace the unconstrained SDP with a biconvex problem by factorizing $Z$ as $XY^\top$ and using a Courant penalty to penalize the difference between $X$ and $Y$; the resulting subproblems are convex. Furthermore, we provide a theoretical bound on the associated penalty parameter under the assumption that the objective function is $L$-Lipschitz-smooth and $\sigma$-strongly convex, such that the proposed surrogate solves the original SDP whenever the penalty parameter exceeds this bound (that is, $\gamma>\frac{1}{4}\min\{L^X-\sigma^X, L^Y-\sigma^Y\}$). Experiments on two SDP-related machine learning applications demonstrate that the proposed algorithm is as accurate as the state-of-the-art, but faster on large-scale learning.
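To make the construction concrete, the sketch below illustrates the biconvex surrogate on a simple SDP-related instance: symmetric low-rank approximation of a PSD matrix $M$, i.e. $\min_{X,Y} \|XY^\top - M\|_F^2 + \gamma\|X - Y\|_F^2$. This particular objective, the closed-form updates, and all parameter values are illustrative assumptions rather than the authors' implementation; the sketch only shows how fixing one factor leaves a convex least-squares subproblem in the other.

```python
import numpy as np

def biconvex_sdp_surrogate(M, rank, gamma, n_iters=200):
    """Minimal sketch (not the authors' code) of the biconvex surrogate:
    approximate a PSD matrix Z ~= X @ Y.T by alternating between the two
    convex subproblems of
        min_{X, Y}  ||X @ Y.T - M||_F^2 + gamma * ||X - Y||_F^2 .
    Each subproblem is an ordinary least-squares problem with a
    closed-form solution."""
    n = M.shape[0]
    rng = np.random.default_rng(0)
    X = rng.standard_normal((n, rank))
    Y = X.copy()
    I = np.eye(rank)
    for _ in range(n_iters):
        # X-subproblem: solve X @ (Y.T @ Y + gamma*I) = M @ Y + gamma*Y
        X = np.linalg.solve((Y.T @ Y + gamma * I).T, (M @ Y + gamma * Y).T).T
        # Y-subproblem: the symmetric closed form with X fixed
        Y = np.linalg.solve((X.T @ X + gamma * I).T, (M.T @ X + gamma * X).T).T
    return X, Y

# Usage: recover a synthetic rank-3 PSD matrix.
A = np.random.default_rng(1).standard_normal((50, 3))
M = A @ A.T
X, Y = biconvex_sdp_surrogate(M, rank=3, gamma=10.0)
Z = X @ Y.T
print(np.linalg.norm(Z - M) / np.linalg.norm(M))  # relative residual, should be small
print(np.linalg.norm(X - Y))                      # penalty drives X and Y together
```

Since each alternating step exactly minimizes a convex subproblem, the penalized objective decreases monotonically; the penalty term is what ties the two factors together so that $XY^\top$ stays close to a PSD matrix of the form $XX^\top$.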
