Functional derivative
{{Short description|Concept in calculus of variations}}
In the calculus of variations, a field of mathematical analysis, the functional derivative (or variational derivative){{harvp|Giaquinta|Hildebrandt|1996|p=18}} relates a change in a functional (a functional in this sense is a function that acts on functions) to a change in a function on which the functional depends.
In the calculus of variations, functionals are usually expressed in terms of an integral of functions, their arguments, and their derivatives. In an integrand {{math|L}} of a functional, if a function {{math|f}} is varied by adding to it another function {{math|δf}} that is arbitrarily small, and the resulting integrand is expanded in powers of {{math|δf}}, the coefficient of {{math|δf}} in the first order term is called the functional derivative.
For example, consider the functional
J[f] = \int_a^b L( \, x, f(x), f \, '(x) \, ) \, dx \ ,
where {{math|f ′(x) ≡ df/dx}}. If {{math|f}} is varied by adding to it a function {{math|δf}}, and the resulting integrand {{math|L(x, f +δf, f ′+δf ′)}} is expanded in powers of {{math|δf}}, then the change in the value of {{math|J}} to first order in {{math|δf}} can be expressed as follows (according to {{Harvp|Giaquinta|Hildebrandt|1996|p=18}}, this notation is customary in the physics literature):
\begin{align}
\delta J &= \int_a^b \left( \frac{\partial L}{\partial f} \delta f(x) + \frac{\partial L}{\partial f'} \frac{d}{dx} \delta f(x) \right) \, dx \, \\[1ex]
&= \int_a^b \left( \frac{\partial L}{\partial f} - \frac{d}{dx} \frac{\partial L}{\partial f'} \right) \delta f(x) \, dx \, + \, \frac{\partial L}{\partial f'} (b) \delta f(b) \, - \, \frac{\partial L}{\partial f'} (a) \delta f(a)
\end{align}
where the variation in the derivative, {{math|δf ′}}, was rewritten as the derivative of the variation, {{math|(δf) ′}}, and integration by parts was applied to the term containing this derivative.
Definition
In this section, the functional differential (or variation or first variation) is defined. (It is called first variation in {{harv|Giaquinta|Hildebrandt|1996|p=3}}, variation or first variation in {{harv|Courant|Hilbert|1953|p=186}}, variation or differential in {{harv|Gelfand|Fomin|2000|loc= p. 11, § 3.2}} and differential in {{harv|Parr|Yang|1989|p=246}}.) Then the functional derivative is defined in terms of the functional differential.
=Functional differential=
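Suppose {{math|B}} is a Banach space and {{math|F}} is a functional defined on {{math|B}}.
The differential of {{math|F}} at a point {{math|ρ ∈ B}} is the linear functional {{math|δF[ρ; ·]}} on {{math|B}} defined{{harvp|Gelfand|Fomin|2000|p=11}} by the condition that, for all {{math|φ ∈ B}},
F[\rho+\phi] - F[\rho]
=
\delta F [\rho; \phi] + \varepsilon \left\|\phi\right\|
where {{math|ε}} is a real number that depends on {{math|‖φ‖}} in such a way that {{math|ε → 0}} as {{math|‖φ‖ → 0}}. This means that {{math|δF[ρ; ·]}} is the Fréchet derivative of {{math|F}} at {{math|ρ}}.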
However, this notion of functional differential is so strong that it may not exist,{{harvp|Giaquinta|Hildebrandt|1996|p=10}} and in those cases a weaker notion, such as the Gateaux derivative, is preferred. In many practical cases, the functional differential is defined{{harvp|Giaquinta|Hildebrandt|1996|p=10}} as the directional derivative
\begin{align}
\delta F[\rho,\phi]
&= \lim_{\varepsilon\to 0}\frac{F[\rho+\varepsilon \phi]-F[\rho]}{\varepsilon} \\[1ex]
&= \left [ \frac{d}{d\varepsilon}F[\rho+\varepsilon \phi]\right ]_{\varepsilon=0}.
\end{align}
Note that this notion of the functional differential can even be defined without a norm.
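The directional-derivative definition can be illustrated numerically. The following Python sketch discretizes a functional on a uniform grid and estimates {{math|δF[ρ, φ]}} by a central difference in {{math|ε}}. The functional {{math|1=F[ρ] = ∫ ρ(x)<sup>2</sup> dx}} on {{math|[0, 1]}}, the choices of {{math|ρ}} and {{math|φ}}, and the helper names `integrate` and `gateaux` are assumptions made for this example only; for this functional the differential is {{math|∫ 2ρφ dx}}.

```python
import math

def integrate(values, dx):
    """Composite trapezoidal rule on a uniform grid."""
    return dx * (sum(values) - 0.5 * (values[0] + values[-1]))

def F(rho, dx):
    """Illustrative functional F[rho] = integral of rho(x)^2 over [0, 1]."""
    return integrate([r * r for r in rho], dx)

def gateaux(F, rho, phi, dx, eps=1e-4):
    """Central-difference estimate of d/d(eps) F[rho + eps*phi] at eps = 0."""
    plus = [r + eps * p for r, p in zip(rho, phi)]
    minus = [r - eps * p for r, p in zip(rho, phi)]
    return (F(plus, dx) - F(minus, dx)) / (2 * eps)

n = 1001
dx = 1.0 / (n - 1)
xs = [i * dx for i in range(n)]
rho = [math.sin(math.pi * x) for x in xs]   # point at which F is differentiated
phi = [x * (1 - x) for x in xs]             # direction of the variation

numeric = gateaux(F, rho, phi, dx)
# Analytic differential for this F: integral of 2 * rho * phi
analytic = integrate([2 * r * p for r, p in zip(rho, phi)], dx)
print(numeric, analytic)
```

Because this {{math|F}} is quadratic in {{math|ρ}}, the central difference reproduces the discretized integral up to rounding; for a general functional the two values agree only up to terms of order {{math|ε<sup>2</sup>}}.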
=Functional derivative=
In many applications, the domain of the functional {{math|F}} is a space of differentiable functions {{math|ρ}} defined on some space {{math|Ω}}, and {{math|F}} is of the form
F[\rho]
=
\int_\Omega L(x,\rho(x),D\rho(x))\,dx
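for some function {{math|L(x, ρ(x), Dρ(x))}} that may depend on {{math|x}}, the value {{math|ρ(x)}} and the derivative {{math|Dρ(x)}}.
If this is the case and, moreover, {{math|δF[ρ, φ]}} can be written as the integral of {{math|φ}} times another function (denoted {{math|δF/δρ}}),
\delta F [\rho, \phi] = \int_\Omega \frac{\delta F}{\delta \rho}(x) \ \phi(x) \ dx ,
then this function {{math|δF/δρ}} is called the functional derivative of {{math|F}} at {{math|ρ}}.{{harvp|Parr|Yang|1989|loc= p. 246, Eq. A.2}}{{harvp|Greiner|Reinhardt|1996|p=36,37}} If {{math|F}} is restricted to only certain functions {{math|ρ}} (for example, if there are some boundary conditions imposed) then {{math|φ}} is restricted to functions such that {{math|ρ + εφ}} continues to satisfy these conditions.
Heuristically, {{math|φ}} is the change in {{math|ρ}}, so we 'formally' have {{math|1=φ = δρ}}, and then this is similar in form to the total differential of a function {{math|F(ρ<sub>1</sub>, ρ<sub>2</sub>, …, ρ<sub>n</sub>)}},
dF = \sum_{i=1}^n \frac{\partial F}{\partial \rho_i} \ d\rho_i ,
where {{math|ρ<sub>1</sub>, ρ<sub>2</sub>, …, ρ<sub>n</sub>}} are independent variables.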
Comparing the last two equations, the functional derivative {{math|δF/δρ(x)}} has a role similar to that of the partial derivative {{math|∂F/∂ρ<sub>i</sub>}}, where the variable of integration {{math|x}} is like a continuous version of the summation index {{math|i}}.{{harvp|Parr|Yang|1989|p=246}} One thinks of {{math|δF/δρ}} as the gradient of {{math|F}} at the point {{math|ρ}}, so the value {{math|δF/δρ(x)}} measures how much the functional {{math|F}} will change if the function {{math|ρ}} is changed at the point {{math|x}}. Hence the formula
\int \frac{\delta F}{\delta\rho}(x) \, \phi(x) \, dx
is regarded as the directional derivative at point {{math|ρ}} in the direction of {{math|φ}}. This is analogous to vector calculus, where the inner product of a vector {{math|v}} with the gradient gives the directional derivative in the direction of {{math|v}}.
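The analogy with the gradient can be made concrete on a grid. In the Python sketch below, the functional is replaced by a function of finitely many values {{math|ρ<sub>i</sub>}}, and the functional derivative at {{math|x<sub>i</sub>}} is approximated by the partial derivative with respect to {{math|ρ<sub>i</sub>}} divided by the grid spacing. The functional {{math|1=F[ρ] = ∫ ρ(x)<sup>3</sup> dx}}, for which {{math|1=δF/δρ(x) = 3ρ(x)<sup>2</sup>}}, the grid, and the name `functional_gradient` are assumptions chosen for illustration.

```python
import math

def F(rho, dx):
    """Riemann-sum discretization of the illustrative functional F[rho] = integral of rho^3."""
    return sum(r ** 3 for r in rho) * dx

def functional_gradient(F, rho, dx, h=1e-4):
    """Approximate (delta F / delta rho)(x_i) by (1/dx) * dF/d(rho_i), via central differences."""
    grad = []
    for i in range(len(rho)):
        up = list(rho)
        down = list(rho)
        up[i] += h
        down[i] -= h
        grad.append((F(up, dx) - F(down, dx)) / (2 * h * dx))
    return grad

n = 50
dx = 1.0 / n
xs = [(i + 0.5) * dx for i in range(n)]                      # cell midpoints on [0, 1]
rho = [1.0 + 0.5 * math.sin(2 * math.pi * x) for x in xs]
phi = [math.cos(math.pi * x) for x in xs]

grad = functional_gradient(F, rho, dx)
exact = [3 * r ** 2 for r in rho]                            # 3 * rho(x)^2 at each grid point

# Directional derivative as the "inner product" of the gradient with phi
directional = sum(g * p for g, p in zip(grad, phi)) * dx
print(max(abs(g - e) for g, e in zip(grad, exact)), directional)
```

The division by `dx` is what turns the ordinary partial derivative into a density: the sum over {{math|i}} weighted by `dx` then plays the role of the integral over {{math|x}}.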
Properties
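In what follows, the notation
\frac{\delta F}{\delta\rho}(x) \equiv \frac{\delta F}{\delta\rho(x)}
is used. Like the derivative of a function, the functional derivative satisfies the following properties, where {{math|F[ρ]}} and {{math|G[ρ]}} are functionals:
- Linearity:{{harvp|Parr|Yang|1989|loc= p. 247, Eq. A.3}}
\frac{\delta(\lambda F + \mu G)[\rho]}{\delta \rho(x)} = \lambda \frac{\delta F[\rho]}{\delta \rho(x)} + \mu \frac{\delta G[\rho]}{\delta \rho(x)} ,
where {{math|λ, μ}} are constants.
- Product rule:{{harvp|Parr|Yang|1989|loc= p. 247, Eq. A.4}}
\frac{\delta(FG)[\rho]}{\delta \rho(x)} = \frac{\delta F[\rho]}{\delta \rho(x)} G[\rho] + F[\rho] \frac{\delta G[\rho]}{\delta \rho(x)} \, .
- Chain rules:
  - If {{math|F}} is a functional and {{math|G}} another functional, then{{harvp|Greiner|Reinhardt|1996|loc=p. 38, Eq. 6}}
\frac{\delta F[G[\rho]]}{\delta\rho(y)} = \int dx \, \left. \frac{\delta F[G]}{\delta G(x)} \right|_{G = G[\rho]} \cdot \ \frac{\delta G[\rho](x)}{\delta\rho(y)} \ .
  - If {{math|G}} is an ordinary differentiable function (local functional) {{math|g}}, then this reduces to{{harvp|Greiner|Reinhardt|1996|loc=p. 38, Eq. 7}}
\frac{\delta F[g(\rho)]}{\delta\rho(y)} = \frac{\delta F[g(\rho)]}{\delta g[\rho(y)]} \ \frac{dg(\rho)}{d\rho(y)} \ .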
Determining functional derivatives
For a common class of functionals, namely those that can be written as the integral of a function and its derivatives, there is a formula for determining the functional derivative. This is a generalization of the Euler–Lagrange equation: indeed, the functional derivative was introduced in physics within the derivation of the Lagrange equation of the second kind from the principle of least action in Lagrangian mechanics (18th century). The first three examples below are taken from density functional theory (20th century), the fourth from statistical mechanics (19th century).
=Formula=
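Given a functional
F[\rho] = \int f( \boldsymbol{r}, \rho(\boldsymbol{r}), \nabla\rho(\boldsymbol{r}) )\, d\boldsymbol{r} ,
and a function {{math|φ(r)}} that vanishes on the boundary of the region of integration, the definition given in the Definition section above yields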
\begin{align}
\int \frac{\delta F}{\delta\rho(\boldsymbol{r})} \, \phi(\boldsymbol{r}) \, d\boldsymbol{r}
& = \left [ \frac{d}{d\varepsilon} \int f( \boldsymbol{r}, \rho + \varepsilon \phi, \nabla\rho+\varepsilon\nabla\phi )\, d\boldsymbol{r} \right ]_{\varepsilon=0} \\
& = \int \left( \frac{\partial f}{\partial\rho} \, \phi + \frac{\partial f}{\partial\nabla\rho} \cdot \nabla\phi \right) d\boldsymbol{r} \\
& = \int \left[ \frac{\partial f}{\partial\rho} \, \phi + \nabla \cdot \left( \frac{\partial f}{\partial\nabla\rho} \, \phi \right) - \left( \nabla \cdot \frac{\partial f}{\partial\nabla\rho} \right) \phi \right] d\boldsymbol{r} \\
& = \int \left[ \frac{\partial f}{\partial\rho} \, \phi - \left( \nabla \cdot \frac{\partial f}{\partial\nabla\rho} \right) \phi \right] d\boldsymbol{r} \\
& = \int \left( \frac{\partial f}{\partial\rho} - \nabla \cdot \frac{\partial f}{\partial\nabla\rho} \right) \phi(\boldsymbol{r}) \ d\boldsymbol{r} \, .
\end{align}
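The second line is obtained using the total derivative, where {{math|∂f /∂∇ρ}} is a derivative of a scalar with respect to a vector. For a three-dimensional Cartesian coordinate system,
\frac{\partial f}{\partial\nabla\rho} = \frac{\partial f}{\partial\rho_x} \mathbf{\hat i} + \frac{\partial f}{\partial\rho_y} \mathbf{\hat j} + \frac{\partial f}{\partial\rho_z} \mathbf{\hat k} \, ,
where {{math|1=ρ<sub>x</sub> = ∂ρ/∂x}}, {{math|1=ρ<sub>y</sub> = ∂ρ/∂y}}, {{math|1=ρ<sub>z</sub> = ∂ρ/∂z}} and {{math|î}}, {{math|ĵ}}, {{math|k̂}} are unit vectors along the x, y, z axes.
The third line was obtained by use of a product rule for divergence. The fourth line was obtained using the divergence theorem and the condition that {{math|1=φ = 0}} on the boundary of the region of integration. Since {{math|φ}} is also an arbitrary function, applying the fundamental lemma of calculus of variations to the last line, the functional derivative is
\frac{\delta F}{\delta\rho(\boldsymbol{r})} = \frac{\partial f}{\partial\rho} - \nabla \cdot \frac{\partial f}{\partial\nabla\rho}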
where {{math|1=ρ = ρ(r)}} and {{math|1=f = f (r, ρ, ∇ρ)}}. This formula is for the case of the functional form given by {{math|F[ρ]}} at the beginning of this section. For other functional forms, the definition of the functional derivative can be used as the starting point for its determination. (See the example Coulomb potential energy functional.)
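The formula can be checked numerically in one dimension. The Python sketch below compares the directional derivative of a discretized functional with the integral of the formula's result against a test function that vanishes at the boundary. The functional {{math|1=F[ρ] = ∫ ½ ρ′(x)<sup>2</sup> dx}} on {{math|[0, 1]}} (so that the formula gives {{math|1=δF/δρ = −ρ″}}), the choices {{math|1=ρ(x) = x<sup>2</sup>}} and {{math|1=φ(x) = sin πx}}, and all names are assumptions made for this example.

```python
import math

n = 1000                       # number of grid cells on [0, 1]
dx = 1.0 / n
xs = [i * dx for i in range(n + 1)]

def F(rho):
    """F[rho] = integral of (1/2) rho'(x)^2, with rho' from forward differences per cell."""
    return sum(0.5 * ((rho[i + 1] - rho[i]) / dx) ** 2 for i in range(n)) * dx

def trapezoid(values):
    """Composite trapezoidal rule on the grid xs."""
    return dx * (sum(values) - 0.5 * (values[0] + values[-1]))

rho = [x * x for x in xs]                      # rho(x) = x^2, so rho''(x) = 2
phi = [math.sin(math.pi * x) for x in xs]      # vanishes at x = 0 and x = 1

# Left-hand side: directional derivative d/d(eps) F[rho + eps*phi] at eps = 0
eps = 1e-4
plus = [r + eps * p for r, p in zip(rho, phi)]
minus = [r - eps * p for r, p in zip(rho, phi)]
lhs = (F(plus) - F(minus)) / (2 * eps)

# Right-hand side: integral of (df/drho - d/dx df/drho') * phi = integral of (-2) * phi
rhs = trapezoid([-2.0 * p for p in phi])
print(lhs, rhs, -4 / math.pi)
```

Both values approximate {{math|−4/π}}. If {{math|φ}} did not vanish at the endpoints, the boundary term dropped in the fourth line of the derivation would contribute and the two sides would differ.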
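The above equation for the functional derivative can be generalized to the case that includes higher dimensions and higher-order derivatives. The functional would be
F[\rho(\boldsymbol{r})] = \int f( \boldsymbol{r}, \rho(\boldsymbol{r}), \nabla\rho(\boldsymbol{r}), \nabla^{(2)}\rho(\boldsymbol{r}), \dots, \nabla^{(N)}\rho(\boldsymbol{r}))\, d\boldsymbol{r} ,
where the vector {{math|r ∈ R<sup>n</sup>}}, and {{math|∇<sup>(i)</sup>}} is a tensor whose {{math|n<sup>i</sup>}} components are partial derivative operators of order {{math|i}},
\left [ \nabla^{(i)} \right ]_{\alpha_1 \alpha_2 \cdots \alpha_i} = \frac {\partial^{\, i}} {\partial r_{\alpha_1} \partial r_{\alpha_2} \cdots \partial r_{\alpha_i} } ,
where each of the indices {{math|α<sub>1</sub>, α<sub>2</sub>, …, α<sub>i</sub>}} can be {{math|1, 2, …, n}}.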
An analogous application of the definition of the functional derivative yields
\begin{align}
\frac{\delta F[\rho]}{\delta \rho} &{} = \frac{\partial f}{\partial\rho} - \nabla \cdot \frac{\partial f}{\partial(\nabla\rho)} + \nabla^{(2)} \cdot \frac{\partial f}{\partial\left(\nabla^{(2)}\rho\right)} + \dots + (-1)^N \nabla^{(N)} \cdot \frac{\partial f}{\partial\left(\nabla^{(N)}\rho\right)} \\
&{} = \frac{\partial f}{\partial\rho} + \sum_{i=1}^N (-1)^{i}\nabla^{(i)} \cdot \frac{\partial f}{\partial\left(\nabla^{(i)}\rho\right)} \ .
\end{align}
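In the last two equations, the {{math|n<sup>i</sup>}} components of the tensor {{math|∂f /∂(∇<sup>(i)</sup>ρ)}} are partial derivatives of {{math|f}} with respect to partial derivatives of {{math|ρ}},
\left [ \frac {\partial f} {\partial \left (\nabla^{(i)}\rho \right ) } \right ]_{\alpha_1 \alpha_2 \cdots \alpha_i} = \frac {\partial f} {\partial \rho_{\alpha_1 \alpha_2 \cdots \alpha_i} } ,
where
\rho_{\alpha_1 \alpha_2 \cdots \alpha_i} \equiv \frac {\partial^{\, i}\rho} {\partial r_{\alpha_1} \, \partial r_{\alpha_2} \cdots \partial r_{\alpha_i} } ,
and the tensor scalar product is
\nabla^{(i)} \cdot \frac{\partial f}{\partial\left(\nabla^{(i)}\rho\right)} = \sum_{\alpha_1, \alpha_2, \cdots, \alpha_i = 1}^n \ \frac {\partial^{\, i} } {\partial r_{\alpha_1} \, \partial r_{\alpha_2} \cdots \partial r_{\alpha_i} } \ \frac {\partial f} {\partial \rho_{\alpha_1 \alpha_2 \cdots \alpha_i} } \ .
For example, for the case {{math|1=n = 3}} and {{math|1=i = 2}}, the tensor scalar product is
\nabla^{(2)} \cdot \frac{\partial f}{\partial\left(\nabla^{(2)}\rho\right)} = \sum_{\alpha, \beta = 1}^3 \ \frac {\partial^{\, 2} } {\partial r_{\alpha} \, \partial r_{\beta} } \ \frac {\partial f} {\partial \rho_{\alpha \beta} } ,
where {{math|ρ<sub>αβ</sub> ≡ ∂<sup>2</sup>ρ / ∂r<sub>α</sub> ∂r<sub>β</sub>}}.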
=Examples=
==Thomas–Fermi kinetic energy functional==
The Thomas–Fermi model of 1927 used a kinetic energy functional for a noninteracting uniform electron gas in a first attempt at a density-functional theory of electronic structure:
T_\mathrm{TF}[\rho] = C_\mathrm{F} \int \rho^{5/3}(\mathbf{r}) \, d\mathbf{r} \, .
Since the integrand of {{math|T<sub>TF</sub>[ρ]}} does not involve derivatives of {{math|ρ(r)}}, the functional derivative of {{math|T<sub>TF</sub>[ρ]}} is{{harvp|Parr|Yang|1989|loc=p. 247, Eq. A.6}}
\frac{\delta T_\mathrm{TF}}{\delta \rho(\mathbf{r})}
= C_\mathrm{F} \frac{\partial \rho^{5/3}(\mathbf{r})}{\partial \rho(\mathbf{r})}
= \frac{5}{3} C_\mathrm{F} \rho^{2/3}(\mathbf{r}) \, .
==Coulomb potential energy functional==
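The electron-nucleus potential energy is
V[\rho] = \int \frac{\rho(\boldsymbol{r})}{|\boldsymbol{r}|} \ d\boldsymbol{r} .
Applying the definition of functional derivative,
\begin{align}
\int \frac{\delta V}{\delta \rho(\boldsymbol{r})} \ \phi(\boldsymbol{r}) \ d\boldsymbol{r}
& {} = \left [ \frac{d}{d\varepsilon} \int \frac{\rho(\boldsymbol{r}) + \varepsilon \phi(\boldsymbol{r})}{|\boldsymbol{r}|} \ d\boldsymbol{r} \right ]_{\varepsilon=0} \\
& {} = \int \frac {\phi(\boldsymbol{r})}{|\boldsymbol{r}|} \ d\boldsymbol{r} \, .
\end{align}
So,
\frac{\delta V}{\delta \rho(\boldsymbol{r})} = \frac{1}{|\boldsymbol{r}|} \ .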
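The classical part of the electron-electron interaction (often called the Hartree energy) is given by the Coulomb potential energy functional
J[\rho] = \frac{1}{2}\iint \frac{\rho(\boldsymbol{r}) \rho(\boldsymbol{r}')}{| \boldsymbol{r}-\boldsymbol{r}' |}\, d\boldsymbol{r} d\boldsymbol{r}' \, .
From the definition of the functional derivative,
\begin{align}
\int \frac{\delta J}{\delta\rho(\boldsymbol{r})} \phi(\boldsymbol{r})d\boldsymbol{r}
& {} = \left [ \frac {d \ }{d\varepsilon} \, J[\rho + \varepsilon\phi] \right ]_{\varepsilon = 0} \\
& {} = \left [ \frac {d \ }{d\varepsilon} \, \left ( \frac{1}{2}\iint \frac {[\rho(\boldsymbol{r}) + \varepsilon \phi(\boldsymbol{r})] \, [\rho(\boldsymbol{r}') + \varepsilon \phi(\boldsymbol{r}')] }{| \boldsymbol{r}-\boldsymbol{r}' |}\, d\boldsymbol{r} d\boldsymbol{r}' \right ) \right ]_{\varepsilon = 0} \\
& {} = \frac{1}{2}\iint \frac {\rho(\boldsymbol{r}') \phi(\boldsymbol{r}) }{| \boldsymbol{r}-\boldsymbol{r}' |}\, d\boldsymbol{r} d\boldsymbol{r}' + \frac{1}{2}\iint \frac {\rho(\boldsymbol{r}) \phi(\boldsymbol{r}') }{| \boldsymbol{r}-\boldsymbol{r}' |}\, d\boldsymbol{r} d\boldsymbol{r}' \, .
\end{align}
The first and second terms on the right-hand side of the last equation are equal, since {{math|r}} and {{math|r′}} in the second term can be interchanged without changing the value of the integral. Therefore,
\int \frac{\delta J}{\delta\rho(\boldsymbol{r})} \phi(\boldsymbol{r})d\boldsymbol{r} = \int \left ( \int \frac {\rho(\boldsymbol{r}') }{| \boldsymbol{r}-\boldsymbol{r}' |} d\boldsymbol{r}' \right ) \phi(\boldsymbol{r}) d\boldsymbol{r}
and the functional derivative of the electron-electron Coulomb potential energy functional {{math|J[ρ]}} is{{harvp|Parr|Yang|1989|loc=p. 248, Eq. A.11}}
\frac{\delta J}{\delta\rho(\boldsymbol{r})} = \int \frac {\rho(\boldsymbol{r}') }{| \boldsymbol{r}-\boldsymbol{r}' |} d\boldsymbol{r}' \, .
The second functional derivative is
\frac{\delta^2 J[\rho]}{\delta \rho(\boldsymbol{r}')\delta\rho(\boldsymbol{r})} = \frac{\partial}{\partial \rho(\boldsymbol{r}')} \left ( \frac{\rho(\boldsymbol{r}')}{| \boldsymbol{r}-\boldsymbol{r}' |} \right ) = \frac{1}{| \boldsymbol{r}-\boldsymbol{r}' |} \, .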
==von Weizsäcker kinetic energy functional==
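In 1935 von Weizsäcker proposed adding a gradient correction to the Thomas–Fermi kinetic energy functional to make it better suit a molecular electron cloud:
T_\mathrm{W}[\rho] = \frac{1}{8} \int \frac{\nabla\rho(\mathbf{r}) \cdot \nabla\rho(\mathbf{r})}{ \rho(\mathbf{r}) } d\mathbf{r} = \int t_\mathrm{W}(\mathbf{r}) \ d\mathbf{r} \, ,
where
t_\mathrm{W} \equiv \frac{1}{8} \frac{\nabla\rho \cdot \nabla\rho}{ \rho } \qquad \text{and} \ \ \rho = \rho(\mathbf{r}) \ .
Using a previously derived formula for the functional derivative,
\begin{align}
\frac{\delta T_\mathrm{W}}{\delta \rho}
& = \frac{\partial t_\mathrm{W}}{\partial \rho} - \nabla\cdot\frac{\partial t_\mathrm{W}}{\partial \nabla \rho} \\
& = -\frac{1}{8}\frac{\nabla\rho \cdot \nabla\rho}{\rho^2} - \left ( \frac {1}{4} \frac {\nabla^2\rho} {\rho} - \frac {1}{4} \frac {\nabla\rho \cdot \nabla\rho} {\rho^2} \right ) \qquad \text{where} \ \ \nabla^2 = \nabla \cdot \nabla \ ,
\end{align}
and the result is{{harvp|Parr|Yang|1989|loc= p. 247, Eq. A.9}}
\frac{\delta T_\mathrm{W}}{\delta \rho} = \frac{1}{8}\frac{\nabla\rho \cdot \nabla\rho}{\rho^2} - \frac{1}{4}\frac{\nabla^2\rho}{\rho} \ .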
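The von Weizsäcker result, {{math|(1/8) ∇ρ·∇ρ/ρ<sup>2</sup> − (1/4) ∇<sup>2</sup>ρ/ρ}}, can be sanity-checked in a one-dimensional analogue. The Python sketch below is an illustration under stated assumptions, not the three-dimensional functional itself: it takes {{math|1=T[ρ] = (1/8) ∫ ρ′<sup>2</sup>/ρ dx}} with {{math|1=ρ(x) = exp(−x<sup>2</sup>)}}, for which the formula reduces to {{math|1/2 − x<sup>2</sup>/2}}, and compares this with a finite-difference derivative of the discretized functional at one grid point. The grid, the cell-based discretization and the names `T_W` and `derivative_at` are choices made for the example.

```python
import math

dx = 0.01
n = 601
xs = [-3.0 + i * dx for i in range(n)]          # uniform grid on [-3, 3]
rho = [math.exp(-x * x) for x in xs]            # illustrative density rho(x) = exp(-x^2)

def T_W(rho):
    """1D analogue T = (1/8) * integral of rho'^2 / rho, evaluated cell by cell."""
    total = 0.0
    for i in range(len(rho) - 1):
        slope = (rho[i + 1] - rho[i]) / dx      # rho' at the cell midpoint
        mid = 0.5 * (rho[i + 1] + rho[i])       # rho at the cell midpoint
        total += slope * slope / mid
    return total * dx / 8.0

def derivative_at(j, h=1e-6):
    """(1/dx) * dT/d(rho_j) by central differences: approximates (delta T / delta rho)(x_j)."""
    up = list(rho)
    down = list(rho)
    up[j] += h
    down[j] -= h
    return (T_W(up) - T_W(down)) / (2 * h * dx)

j = 350                                          # grid point at x = 0.5
x = xs[j]
numeric = derivative_at(j)
# (1/8) rho'^2 / rho^2 - (1/4) rho'' / rho evaluated for rho = exp(-x^2)
analytic = 0.5 - 0.5 * x * x
print(numeric, analytic)
```

The two numbers should agree up to the discretization error of the grid, which is of order `dx**2` for this scheme.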
==Entropy==
The entropy of a discrete random variable is a functional of the probability mass function,
H[p(x)] = -\sum_x p(x) \log p(x) .
Thus,
\begin{align}
\sum_x \frac{\delta H}{\delta p(x)} \, \phi(x)
& {} = \left[ \frac{d}{d\varepsilon} H[p(x) + \varepsilon\phi(x)] \right]_{\varepsilon=0}\\
& {} = \left [- \, \frac{d}{d\varepsilon} \sum_x \, [p(x) + \varepsilon\phi(x)] \ \log [p(x) + \varepsilon\phi(x)] \right]_{\varepsilon=0} \\
& {} = -\sum_x \, [1+\log p(x)] \ \phi(x) \, .
\end{align}
Thus,
\frac{\delta H}{\delta p(x)} = -1-\log p(x) .
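Because the random variable is discrete, the functional derivative here is an ordinary partial derivative with respect to each value {{math|p(x)}}, and the result {{math|−1 − log p(x)}} can be checked directly. The following Python sketch does so by central differences; the three-point distribution and the helper names are assumptions made for the example. The perturbation deliberately ignores the normalization constraint {{math|1=Σ p(x) = 1}}, as the derivation above does.

```python
import math

def entropy(p):
    """Shannon entropy H[p] = -sum_x p(x) log p(x), using the natural logarithm."""
    return -sum(q * math.log(q) for q in p)

def partial(H, p, i, h=1e-6):
    """Central-difference estimate of dH/dp_i, treating the p_i as unconstrained variables."""
    up = list(p)
    down = list(p)
    up[i] += h
    down[i] -= h
    return (H(up) - H(down)) / (2 * h)

p = [0.5, 0.3, 0.2]                                  # illustrative probability mass function
numeric = [partial(entropy, p, i) for i in range(len(p))]
analytic = [-1.0 - math.log(q) for q in p]
print(numeric)
print(analytic)
```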
==Exponential==
Let
F[\varphi(x)]= e^{\int \varphi(x) g(x)dx}.
Using the delta function as a test function,
\begin{align}
\frac{\delta F[\varphi(x)]}{\delta \varphi(y)}
& {} = \lim_{\varepsilon\to 0}\frac{F[\varphi(x)+\varepsilon\delta(x-y)]-F[\varphi(x)]}{\varepsilon}\\
& {} = \lim_{\varepsilon\to 0}\frac{e^{\int (\varphi(x)+\varepsilon\delta(x-y)) g(x)dx}-e^{\int \varphi(x) g(x)dx}}{\varepsilon}\\
& {} = e^{\int \varphi(x) g(x)dx}\lim_{\varepsilon\to 0}\frac{e^{\varepsilon \int \delta(x-y) g(x)dx}-1}{\varepsilon}\\
& {} = e^{\int \varphi(x) g(x)dx}\lim_{\varepsilon\to 0}\frac{e^{\varepsilon g(y)}-1}{\varepsilon}\\
& {} = e^{\int \varphi(x) g(x)dx}g(y).
\end{align}
Thus,
\frac{\delta F[\varphi(x)]}{\delta \varphi(y)} = g(y) F[\varphi(x)].
This is particularly useful in calculating the correlation functions from the partition function in quantum field theory.
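The relation {{math|1=δF/δφ(y) = g(y) F[φ]}} for the exponential functional can be reproduced on a grid, where the delta function is replaced by a spike of height {{math|1/Δx}} in a single cell. The Python sketch below is a minimal illustration; the interval {{math|[0, 1]}}, the choices {{math|1=g(x) = cos x}} and {{math|1=φ(x) = x}}, and the name `derivative_at` are assumptions made for the example.

```python
import math

n = 200
dx = 1.0 / n
xs = [(i + 0.5) * dx for i in range(n)]     # cell midpoints on [0, 1]
g = [math.cos(x) for x in xs]               # illustrative weight function g(x)
phi = [x for x in xs]                       # illustrative argument phi(x) = x

def F(phi):
    """Discretized F[phi] = exp( integral of phi(x) g(x) dx )."""
    return math.exp(sum(p * gi for p, gi in zip(phi, g)) * dx)

def derivative_at(j, eps=1e-5):
    """Perturb phi by eps * (discrete delta at x_j), i.e. add eps/dx in cell j only."""
    up = list(phi)
    down = list(phi)
    up[j] += eps / dx
    down[j] -= eps / dx
    return (F(up) - F(down)) / (2 * eps)

j = 57
numeric = derivative_at(j)
analytic = g[j] * F(phi)                    # g(y) * F[phi] with y = xs[j]
print(numeric, analytic)
```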
==Functional derivative of a function==
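A function can be written in the form of an integral like a functional. For example,
\rho(\boldsymbol{r}) = F[\rho] = \int \rho(\boldsymbol{r}') \delta(\boldsymbol{r}-\boldsymbol{r}')\, d\boldsymbol{r}'.
Since the integrand does not depend on derivatives of {{math|ρ}}, the functional derivative of {{math|ρ(r)}} is
\frac{\delta \rho(\boldsymbol{r})}{\delta\rho(\boldsymbol{r}')} \equiv \frac{\delta F}{\delta\rho(\boldsymbol{r}')}
= \frac{\partial \ \ }{\partial \rho(\boldsymbol{r}')} \, [\rho(\boldsymbol{r}') \delta(\boldsymbol{r}-\boldsymbol{r}')]
= \delta(\boldsymbol{r}-\boldsymbol{r}').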
==Functional derivative of iterated function==
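The functional derivative of the iterated function {{math|f(f(x))}} is given by:
\frac{\delta f(f(x))}{\delta f(y)} = f'(f(x))\delta(x-y) + \delta(f(x)-y)
and
\frac{\delta f(f(f(x)))}{\delta f(y)} = f'(f(f(x)))\left(f'(f(x))\delta(x-y) + \delta(f(x)-y)\right) + \delta(f(f(x))-y)
In general:
\frac{\delta f^N(x)}{\delta f(y)} = f'( f^{N-1}(x) ) \frac{\delta f^{N-1}(x)}{\delta f(y)} + \delta( f^{N-1}(x) - y )
Putting in {{math|1=N = 0}} gives:
\frac{\delta f^{-1}(x)}{\delta f(y)} = - \frac{\delta( f^{-1}(x) - y )}{f'( f^{-1}(x) )}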
Using the delta function as a test function
In physics, it is common to use the Dirac delta function {{math|δ(x − y)}} in place of a generic test function {{math|φ(x)}}, for yielding the functional derivative at the point {{math|y}} (this is a point of the whole functional derivative as a partial derivative is a component of the gradient):{{harvp|Greiner|Reinhardt|1996|p=37}}
\frac{\delta F[\rho(x)]}{\delta \rho(y)} = \lim_{\varepsilon\to 0}\frac{F[\rho(x)+\varepsilon\delta(x-y)]-F[\rho(x)]}{\varepsilon}.
This works in cases when {{math|F[ρ(x) + εf(x)]}} formally can be expanded as a series (or at least up to first order) in {{math|ε}}. The formula is however not mathematically rigorous, since {{math|F[ρ(x) + εδ(x − y)]}} is usually not even defined.
The definition given in a previous section is based on a relationship that holds for all test functions {{math|φ(x)}}, so one might think that it should hold also when {{math|φ(x)}} is chosen to be a specific function such as the delta function. However, the latter is not a valid test function (it is not even a proper function).
In the definition, the functional derivative describes how the functional {{math|F[ρ(x)]}} changes as a result of a small change in the entire function {{math|ρ(x)}}. The particular form of the change in {{math|ρ(x)}} is not specified, but it should stretch over the whole interval on which {{math|x}} is defined. Employing the particular form of the perturbation given by the delta function has the meaning that {{math|ρ(x)}} is varied only in the point {{math|y}}. Except for this point, there is no variation in {{math|ρ(x)}}.
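One way to see in what sense the delta-function prescription can still give the right answer is to use a family of genuine test functions that concentrate at the point {{math|y}}. The Python sketch below uses normalized Gaussians of decreasing width as such a family; this choice, the functional {{math|1=F[ρ] = ∫ ρ(x)<sup>3</sup> dx}} on {{math|[0, 1]}} with {{math|1=ρ(x) = sin πx}}, and all names are assumptions made for illustration. The directional derivative along each bump is compared with the pointwise value {{math|3ρ(y)<sup>2</sup>}}.

```python
import math

n = 2000
dx = 1.0 / n
xs = [i * dx for i in range(n + 1)]
rho = [math.sin(math.pi * x) for x in xs]

def trapezoid(values):
    """Composite trapezoidal rule on the grid xs."""
    return dx * (sum(values) - 0.5 * (values[0] + values[-1]))

def F(rho):
    """Illustrative functional F[rho] = integral of rho(x)^3 over [0, 1]."""
    return trapezoid([r ** 3 for r in rho])

def directional(phi, eps=1e-4):
    """Central-difference estimate of d/d(eps) F[rho + eps*phi] at eps = 0."""
    plus = [r + eps * p for r, p in zip(rho, phi)]
    minus = [r - eps * p for r, p in zip(rho, phi)]
    return (F(plus) - F(minus)) / (2 * eps)

def bump(y, sigma):
    """Normalized Gaussian of width sigma centred at y: a smooth stand-in for delta(x - y)."""
    c = 1.0 / (sigma * math.sqrt(2 * math.pi))
    return [c * math.exp(-0.5 * ((x - y) / sigma) ** 2) for x in xs]

y = 0.3
pointwise = 3 * math.sin(math.pi * y) ** 2      # (delta F / delta rho)(y) = 3 rho(y)^2
errors = [abs(directional(bump(y, s)) - pointwise) for s in (0.1, 0.05, 0.01)]
print(errors)
```

The error shrinks as the bump narrows, which is the limiting behaviour that the formal substitution of {{math|δ(x − y)}} abbreviates.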
References
- {{cite book | last1=Courant | first1=Richard | author-link1=Richard Courant | last2=Hilbert | first2=David | author-link2=David Hilbert | title = Methods of Mathematical Physics | volume = I | edition = First English | publisher = Interscience Publishers, Inc | year = 1953 | location = New York, New York | chapter = Chapter IV. The Calculus of Variations | pages = 164–274 | isbn = 978-0471504474| mr = 0065391 | zbl = 0001.00501}}.
- {{Citation
| last1 = Frigyik
| first1 = Béla A.
| last2 = Srivastava
| first2 = Santosh
| last3 = Gupta
| first3 = Maya R.
| title = Introduction to Functional Derivatives
| place = Seattle, WA
| publisher = Department of Electrical Engineering at the University of Washington
| series = UWEE Tech Report
| volume = UWEETR-2008-0001
| date = January 2008
| pages = 7
| url = https://www.ee.washington.edu/techsite/papers/documents/UWEETR-2008-0001.pdf
| access-date = 2013-10-23
| archive-url = https://web.archive.org/web/20170217025324/https://www2.ee.washington.edu/techsite/papers/documents/UWEETR-2008-0001.pdf
| archive-date = 2017-02-17
| url-status = dead
}}.
- {{Citation
| last1 = Gelfand
| first1 = I. M.
| author-link = Israel Gelfand
| last2 = Fomin
| first2 = S. V.
| author2-link = Sergei Fomin
| title = Calculus of variations
| place = Mineola, N.Y.
| publisher = Dover Publications
| series = translated and edited by Richard A. Silverman
| orig-year = 1963
| year = 2000
| edition = Revised English
| url = http://store.doverpublications.com/0486414485.html
| isbn = 978-0486414485
| mr = 0160139
| zbl = 0127.05402
}}.
- {{Citation
| last1 = Giaquinta
| first1 = Mariano
| author-link = Mariano Giaquinta
| last2 = Hildebrandt
| first2 = Stefan
| title = Calculus of Variations 1. The Lagrangian Formalism
| place = Berlin
| publisher = Springer-Verlag
| series = Grundlehren der Mathematischen Wissenschaften
| volume = 310
| year = 1996
| edition = 1st
| isbn = 3-540-50625-X
| mr = 1368401
| zbl = 0853.49001
}}.
- {{Citation
| last1 = Greiner
| first1 = Walter
| author-link1 = Walter Greiner
| last2 = Reinhardt
| first2 = Joachim
| title = Field quantization
| place = Berlin–Heidelberg–New York
| publisher = Springer-Verlag
| series = With a foreword by D. A. Bromley
| year = 1996
| chapter = Section 2.3 – Functional derivatives
| pages = [https://archive.org/details/fieldquantizatio0000grei/page/36 36–38]
| chapter-url = https://archive.org/details/fieldquantizatio0000grei/page/36
| isbn = 3-540-59179-6
| mr = 1383589
| zbl = 0844.00006
}}.
- {{cite book |first1=R. G.|last1=Parr|first2=W.|last2=Yang| title = Density-Functional Theory of Atoms and Molecules | chapter = Appendix A, Functionals | pages = 246–254 | publisher = Oxford University Press | year = 1989 |location=New York| url = https://books.google.com/books?id=mGOpScSIwU4C&q=Density-Functional+Theory+of+Atoms+and+Molecules | isbn = 978-0195042795}}
External links
- {{springer|title=Functional derivative|id=p/f042040}}