Moreau envelope

The Moreau envelope (or the Moreau-Yosida regularization) M_f of a proper lower semi-continuous convex function f is a smoothed version of f. It was proposed by Jean-Jacques Moreau in 1965.{{Cite journal |last=Moreau |first=J. J. |date=1965 |title=Proximité et dualité dans un espace hilbertien |url=https://eudml.org/doc/87067 |journal=Bulletin de la Société Mathématique de France |volume=93 |pages=273–299 |doi=10.24033/bsmf.1625 |issn=0037-9484|doi-access=free }}

The Moreau envelope has important applications in mathematical optimization: minimizing over M_f and minimizing over f are equivalent problems in the sense that the sets of minimizers of f and M_f are the same. However, first-order optimization algorithms can be directly applied to M_f, since f may be non-differentiable while M_f is always continuously differentiable. Indeed, many proximal gradient methods can be interpreted as a gradient descent method over M_f.

Definition

The Moreau envelope of a proper lower semi-continuous convex function f from a Hilbert space \mathcal{X} to (-\infty,+\infty] is defined as{{cite journal |author=Neal Parikh and Stephen Boyd |year=2013 |title=Proximal Algorithms |url=https://web.stanford.edu/~boyd/papers/pdf/prox_algs.pdf |journal=Foundations and Trends in Optimization |volume=1 |issue=3 |pages=123–231 |access-date=2019-01-29}}

M_{\lambda f}(v) = \inf_{x\in\mathcal{X}} \left(f(x) + \frac{1}{2\lambda} \|x - v\|_2^2\right).

Given a parameter \lambda \in \mathbb{R}, the Moreau envelope of \lambda f is also called as the Moreau envelope of f with parameter \lambda.

Properties

  • The Moreau envelope can also be seen as the infimal convolution between f and (1/2)\| \cdot \|^2_2.
  • The proximal operator of a function is related to the gradient of the Moreau envelope by the following identity:

\nabla M_{\lambda f}(x) = \frac{1}{\lambda} (x - \mathrm{prox}_{\lambda f}(x)). By defining the sequence x_{k+1} = \mathrm{prox}_{\lambda f}(x_k) and using the above identity, we can interpret the proximal operator as a gradient descent algorithm over the Moreau envelope.

M_{\lambda f}(v) = \max_{p \in \mathcal X} \left( \langle p, v \rangle - \frac{\lambda}{2} \| p \|^2 - f^*(p)\right), where f^* denotes the convex conjugate of f.

Since the subdifferential of a proper, convex, lower semicontinuous function on a Hilbert space is inverse to the subdifferential of its convex conjugate, we can conclude that if p_0 \in \mathcal X is the maximizer of the above expression, then x_0 := v - \lambda p_0 is the minimizer in the primal formulation and vice versa.

  • By Hopf–Lax formula, the Moreau envelope is a viscosity solution to a Hamilton–Jacobi equation.{{Cite arXiv|last1=Heaton |first1=Howard |last2=Fung |first2=Samy Wu |last3=Osher |first3=Stanley |date=2022-10-09 |title=Global Solutions to Nonconvex Problems by Evolution of Hamilton–Jacobi PDEs |class=math.OC |eprint=2202.11014 }} Stanley Osher and co-authors used this property and Cole–Hopf transformation to derive an algorithm to compute approximations to the proximal operator of a function.{{Cite journal |last1=Osher |first1=Stanley |last2=Heaton |first2=Howard |last3=Fung |first3=Samy Wu |date=2023 |title=A Hamilton–Jacobi-based proximal operator |journal=Proceedings of the National Academy of Sciences |volume=120 |issue=14 |pages=e2220469120 |doi=10.1073/pnas.2220469120 |doi-access=free |pmid=36989305 |pmc=10083605 |arxiv=2211.12997 |bibcode=2023PNAS..12020469O }}

See also

References

{{Reflist}}