subderivative#The subgradient

{{Short description|Generalization of derivatives to real-valued functions}}

File:Subderivative illustration.png

In mathematics, subderivatives (or subgradient) generalizes the derivative to convex functions which are not necessarily differentiable. The set of subderivatives at a point is called the subdifferential at that point.Bubeck, S. (2014). Theory of Convex Optimization for Machine Learning. ArXiv, abs/1405.4980. Subderivatives arise in convex analysis, the study of convex functions, often in connection to convex optimization.

Let f:I \to \mathbb{R} be a real-valued convex function defined on an open interval of the real line. Such a function need not be differentiable at all points: For example, the absolute value function f(x)=|x| is non-differentiable when x=0. However, as seen in the graph on the right (where f(x) in blue has non-differentiable kinks similar to the absolute value function), for any x_0 in the domain of the function one can draw a line which goes through the point (x_0,f(x_0)) and which is everywhere either touching or below the graph of f. The slope of such a line is called a subderivative.

Definition

Rigorously, a subderivative of a convex function f:I \to \mathbb{R} at a point x_0 in the open interval I is a real number c such that

f(x)-f(x_0)\ge c(x-x_0)

for all x\in I. By the converse of the mean value theorem, the set of subderivatives at x_0 for a convex function is a nonempty closed interval [a,b], where a and b are the one-sided limits

a=\lim_{x\to x_0^-} \frac{f(x)-f(x_0)}{x-x_0},

b=\lim_{x\to x_0^+} \frac{f(x)-f(x_0)}{x-x_0}.

The interval [a,b] of all subderivatives is called the subdifferential of the function f at x_0, denoted by \partial f(x_0). If f is convex, then its subdifferential at any point is non-empty. Moreover, if its subdifferential at x_0 contains exactly one subderivative, then f is differentiable at x_0 and \partial f(x_0)=\{f'(x_0)\}.{{cite book |first=R. T. |last=Rockafellar |author-link=R. T. Rockafellar |title=Convex Analysis |publisher=Princeton University Press |year=1970 |isbn=0-691-08069-0 |page=242 [Theorem 25.1] }}

Example

Consider the function f(x)=|x| which is convex. Then, the subdifferential at the origin is the interval [-1,1]. The subdifferential at any point x_0<0 is the singleton set \{-1\}, while the subdifferential at any point x_0>0 is the singleton set \{1\}. This is similar to the sign function, but is not single-valued at 0, instead including all possible subderivatives.

Properties

  • A convex function f:I\to\mathbb{R} is differentiable at x_0 if and only if the subdifferential is a singleton set, which is \{f'(x_0)\}.
  • A point x_0 is a global minimum of a convex function f if and only if zero is contained in the subdifferential. For instance, in the figure above, one may draw a horizontal "subtangent line" to the graph of f at (x_0,f(x_0)). This last property is a generalization of the fact that the derivative of a function differentiable at a local minimum is zero.
  • If f and g are convex functions with subdifferentials \partial f(x) and \partial g(x) with x being the interior point of one of the functions, then the subdifferential of f + g is \partial(f + g)(x) = \partial f(x) + \partial g(x) (where the addition operator denotes the Minkowski sum). This reads as "the subdifferential of a sum is the sum of the subdifferentials."{{cite book|last1=Lemaréchal|first1=Claude|last2=Hiriart-Urruty|first2=Jean-Baptiste|title=Fundamentals of Convex Analysis|url=https://archive.org/details/fundamentalsconv00hiri|url-access=limited|date=2001|publisher=Springer-Verlag Berlin Heidelberg|isbn=978-3-642-56468-0|page=[https://archive.org/details/fundamentalsconv00hiri/page/n193 183]}}

The subgradient

The concepts of subderivative and subdifferential can be generalized to functions of several variables. If f:U\to\mathbb{R} is a real-valued convex function defined on a convex open set in the Euclidean space \mathbb{R}^n, a vector v in that space is called a subgradient at x_0\in U if for any x\in U one has that

:f(x)-f(x_0)\ge v\cdot (x-x_0),

where the dot denotes the dot product.

The set of all subgradients at x_0 is called the subdifferential at x_0 and is denoted \partial f(x_0). The subdifferential is always a nonempty convex compact set.

These concepts generalize further to convex functions f:U\to\mathbb{R} on a convex set in a locally convex space V. A functional v^* in the dual space V^* is called a subgradient at x_0 in U if for all x\in U,

:f(x)-f(x_0)\ge v^*(x-x_0).

The set of all subgradients at x_0 is called the subdifferential at x_0 and is again denoted \partial f(x_0). The subdifferential is always a convex closed set. It can be an empty set; consider for example an unbounded operator, which is convex, but has no subgradient. If f is continuous, the subdifferential is nonempty.

History

The subdifferential on convex functions was introduced by Jean Jacques Moreau and R. Tyrrell Rockafellar in the early 1960s. The generalized subdifferential for nonconvex functions was introduced by Francis H. Clarke and R. Tyrrell Rockafellar in the early 1980s.

{{cite book|last=Clarke|first=Frank H.|title=Optimization and nonsmooth analysis|url=https://archive.org/details/optimizationnons0000clar|url-access=registration|publisher=John Wiley & Sons|location=New York|year=1983|pages=xiii+308|isbn=0-471-87504-X|mr=0709590}}

See also

References

{{Reflist}}

  • {{cite book |first1=Jonathan |last1=Borwein |first2=Adrian S. |last2=Lewis |title=Convex Analysis and Nonlinear Optimization : Theory and Examples |location=New York |publisher=Springer |edition=2nd |year=2010 |isbn=978-0-387-31256-9 }}
  • {{cite book |first1=Jean-Baptiste |last1=Hiriart-Urruty |first2=Claude |last2=Lemaréchal |author-link2=Claude Lemaréchal |title=Fundamentals of Convex Analysis |publisher=Springer |year=2001 |isbn=3-540-42205-6 }}
  • {{cite book|last=Zălinescu|first=C.|title=Convex analysis in general vector spaces|publisher=World Scientific Publishing  Co., Inc|year=2002|pages=xx+367|isbn=981-238-067-1|mr=1921556}}