On the Discrepancy Principle for Stochastic Gradient Descent

Tim Jahn, Bangti Jin

Stochastic gradient descent (SGD) is a promising numerical method for solving large-scale inverse problems. However, its theoretical properties remain largely underexplored in the lens of classical regularization theory. In this note, we study the classical discrepancy principle, one of the most popular \textit{a posteriori} choice rules, as the stopping criterion for SGD, and prove the finite iteration termination property and the convergence of the iterate in probability as the noise level tends to zero. The theoretical results are complemented with extensive numerical experiments.

Knowledge Graph

arrow_drop_up

Comments

Sign up or login to leave a comment