directional derivative
{{Short description|Instantaneous rate of change of the function}}
{{refimprove section|date=October 2012|talk=Verifiability of definition}}
{{Calculus |Vector}}
In multivariable calculus, the directional derivative measures the rate at which a function changes in a particular direction at a given point.{{cn|date=November 2023}}
The directional derivative of a multivariable differentiable (scalar) function along a given vector v at a given point x intuitively represents the instantaneous rate of change of the function, moving through x with a direction specified by v.
The directional derivative of a scalar function f with respect to a vector v at a point (e.g., position) x may be denoted by any of the following:
\begin{aligned}
\nabla_{\mathbf{v}}{f}(\mathbf{x})
&=f'_\mathbf{v}(\mathbf{x})\\
&=D_\mathbf{v}f(\mathbf{x})\\
&=Df(\mathbf{x})(\mathbf{v})\\
&=\partial_\mathbf{v}f(\mathbf{x})\\
&=\mathbf{v}\cdot{\nabla f(\mathbf{x})}\\
&=\mathbf{v}\cdot \frac{\partial f(\mathbf{x})}{\partial\mathbf{x}}.\\
\end{aligned}
It therefore generalizes the notion of a partial derivative, in which the rate of change is taken along one of the curvilinear coordinate curves, all other coordinates being constant.
The directional derivative is a special case of the Gateaux derivative.
Definition
File:Directional derivative contour plot.svg of , showing the gradient vector in black, and the unit vector scaled by the directional derivative in the direction of in orange. The gradient vector is longer because the gradient points in the direction of greatest rate of increase of a function.]]
The directional derivative of a scalar function
along a vector
is the function defined by the limit{{cite book |author1=R. Wrede |author2=M.R. Spiegel | title=Advanced Calculus|edition=3rd| publisher=Schaum's Outline Series| year=2010 | isbn=978-0-07-162366-7}}
This definition is valid in a broad range of contexts, for example where the norm of a vector (and hence a unit vector) is undefined.The applicability extends to functions over spaces without a metric and to differentiable manifolds, such as in general relativity.
= For differentiable functions =
If the function f is differentiable at x, then the directional derivative exists along any unit vector v at x, and one has
where the on the right denotes the gradient, is the dot product and v is a unit vector.If the dot product is undefined, the gradient is also undefined; however, for differentiable f, the directional derivative is still defined, and a similar relation exists with the exterior derivative. This follows from defining a path and using the definition of the derivative as a limit which can be calculated along this path to get:
0
&=\lim_{t \to 0}\frac {f(x+tv)-f(x)-tDf(x)(v)} t \\
&=\lim_{t \to 0}\frac {f(x+tv)-f(x)} t - Df(x)(v) \\
&=\nabla_v f(x)-Df(x)(v).
\end{align}
Intuitively, the directional derivative of f at a point x represents the rate of change of f, in the direction of v.
= Using only direction of vector =
image:Geometrical interpretation of a directional derivative.svg
In a Euclidean space, some authorsThomas, George B. Jr.; and Finney, Ross L. (1979) Calculus and Analytic Geometry, Addison-Wesley Publ. Co., fifth edition, p. 593. define the directional derivative to be with respect to an arbitrary nonzero vector v after normalization, thus being independent of its magnitude and depending only on its direction.This typically assumes a Euclidean space – for example, a function of several variables typically has no definition of the magnitude of a vector, and hence of a unit vector.
This definition gives the rate of increase of {{math|f}} per unit of distance moved in the direction given by {{math|v}}. In this case, one has
or in case f is differentiable at x,
= Restriction to a unit vector =
In the context of a function on a Euclidean space, some texts restrict the vector v to being a unit vector. With this restriction, both the above definitions are equivalent.{{Cite book| title=Calculus : Single and multivariable.|last1=Hughes Hallett|first1=Deborah|author1-link=Deborah Hughes Hallett|last2=McCallum|first2=William G.| author2-link=William G. McCallum|last3=Gleason|first3=Andrew M.|author3-link=Andrew M. Gleason| date=2012-01-01| publisher=John wiley|isbn=9780470888612|pages=780|oclc=828768012}}
Properties
Many of the familiar properties of the ordinary derivative hold for the directional derivative. These include, for any functions f and g defined in a neighborhood of, and differentiable at, p:
- sum rule:
- constant factor rule: For any constant c,
- product rule (or Leibniz's rule):
- chain rule: If g is differentiable at p and h is differentiable at g(p), then
In differential geometry
{{see also|Tangent space#Tangent vectors as directional derivatives}}
Let {{math|M}} be a differentiable manifold and {{math|p}} a point of {{math|M}}. Suppose that {{math|f}} is a function defined in a neighborhood of {{math|p}}, and differentiable at {{math|p}}. If {{math|v}} is a tangent vector to {{math|M}} at {{math|p}}, then the directional derivative of {{math|f}} along {{math|v}}, denoted variously as {{math|df(v)}} (see Exterior derivative), (see Covariant derivative), (see Lie derivative), or (see {{section link|Tangent space|Definition via derivations}}), can be defined as follows. Let {{math|γ : [−1, 1] → M}} be a differentiable curve with {{math|1=γ(0) = p}} and {{math|1=γ′(0) = v}}. Then the directional derivative is defined by
This definition can be proven independent of the choice of {{math|γ}}, provided {{math|γ}} is selected in the prescribed manner so that {{math|1=γ(0) = p}} and {{math|1=γ′(0) = v}}.
=The Lie derivative=
The Lie derivative of a vector field along a vector field is given by the difference of two directional derivatives (with vanishing torsion):
In particular, for a scalar field , the Lie derivative reduces to the standard directional derivative:
=The Riemann tensor=
Directional derivatives are often used in introductory derivations of the Riemann curvature tensor. Consider a curved rectangle with an infinitesimal vector along one edge and along the other. We translate a covector along then and then subtract the translation along and then . Instead of building the directional derivative using partial derivatives, we use the covariant derivative. The translation operator for is thus
and for ,
The difference between the two paths is then
It can be argued{{cite book|last1=Zee|first1=A.|title=Einstein gravity in a nutshell|date=2013|publisher=Princeton University Press|location=Princeton|isbn=9780691145587|page=341}} that the noncommutativity of the covariant derivatives measures the curvature of the manifold:
where is the Riemann curvature tensor and the sign depends on the sign convention of the author.
In group theory
=Translations=
In the Poincaré algebra, we can define an infinitesimal translation operator P as
(the i ensures that P is a self-adjoint operator) For a finite displacement λ, the unitary Hilbert space representation for translations is{{cite book| last1=Weinberg|first1=Steven|title=The quantum theory of fields|date=1999|publisher=Cambridge Univ. Press| location=Cambridge [u.a.]| isbn=9780521550017|edition=Reprinted (with corr.).|url-access=registration| url=https://archive.org/details/quantumtheoryoff00stev}}
By using the above definition of the infinitesimal translation operator, we see that the finite translation operator is an exponentiated directional derivative:
This is a translation operator in the sense that it acts on multivariable functions f(x) as
{{math proof|title=Proof of the last equation
|proof=
In standard single-variable calculus, the derivative of a smooth function f(x) is defined by (for small ε)
This can be rearranged to find f(x+ε):
It follows that is a translation operator. This is instantly generalized{{cite book |last1=Zee|first1=A.|title=Einstein gravity in a nutshell|date=2013|publisher=Princeton University Press| location=Princeton| isbn=9780691145587}} to multivariable functions f(x)
Here is the directional derivative along the infinitesimal displacement ε. We have found the infinitesimal version of the translation operator:
It is evident that the group multiplication law{{cite book | last1=Cahill |first1=Kevin Cahill | title=Physical mathematics | date=2013 | publisher=Cambridge University Press | location=Cambridge | isbn=978-1107005211 | edition=Repr.}} U(g)U(f)=U(gf) takes the form
So suppose that we take the finite displacement λ and divide it into N parts (N→∞ is implied everywhere), so that λ/N=ε. In other words,
Then by applying U(ε) N times, we can construct U(λ):
We can now plug in our above expression for U(ε):
we have
And since {{math|1=U(ε)f(x) = f(x+ε)}} we have
Q.E.D.
As a technical note, this procedure is only possible because the translation group forms an Abelian subgroup (Cartan subalgebra) in the Poincaré algebra. In particular, the group multiplication law U(a)U(b) = U(a+b) should not be taken for granted. We also note that Poincaré is a connected Lie group. It is a group of transformations T(ξ) that are described by a continuous set of real parameters . The group multiplication law takes the form
Taking as the coordinates of the identity, we must have
The actual operators on the Hilbert space are represented by unitary operators U(T(ξ)). In the above notation we suppressed the T; we now write U(λ) as U(P(λ)). For a small neighborhood around the identity, the power series representation
is quite good. Suppose that U(T(ξ)) form a non-projective representation, i.e.,
The expansion of f to second power is
After expanding the representation multiplication equation and equating coefficients, we have the nontrivial condition
Since is by definition symmetric in its indices, we have the standard Lie algebra commutator:
with C the structure constant. The generators for translations are partial derivative operators, which commute:
This implies that the structure constants vanish and thus the quadratic coefficients in the f expansion vanish as well. This means that f is simply additive:
and thus for abelian groups,
Q.E.D.
}}
=Rotations=
The rotation operator also contains a directional derivative. The rotation operator for an angle θ, i.e. by an amount θ = |θ| about an axis parallel to is
Here L is the vector operator that generates SO(3):
0& 0 & 0\\
0& 0 & 1\\
0& -1 & 0
\end{pmatrix}\mathbf{i}+\begin{pmatrix}
0 &0 & -1\\
0& 0 &0 \\
1 & 0 & 0
\end{pmatrix}\mathbf{j}+\begin{pmatrix}
0&1 &0 \\
-1&0 &0 \\
0 & 0 & 0
\end{pmatrix}\mathbf{k}.
It may be shown geometrically that an infinitesimal right-handed rotation changes the position vector x by
So we would expect under infinitesimal rotation:
It follows that
Following the same exponentiation procedure as above, we arrive at the rotation operator in the position basis, which is an exponentiated directional derivative:{{cite book|last1=Shankar|first1=R. | title=Principles of quantum mechanics | date=1994|publisher=Kluwer Academic / Plenum|location=New York|isbn=9780306447907|page=318|edition=2nd}}
Normal derivative
A normal derivative is a directional derivative taken in the direction normal (that is, orthogonal) to some surface in space, or more generally along a normal vector field orthogonal to some hypersurface. See for example Neumann boundary condition. If the normal direction is denoted by , then the normal derivative of a function f is sometimes denoted as . In other notations,
In the continuum mechanics of solids
Several important results in continuum mechanics require the derivatives of vectors with respect to vectors and of tensors with respect to vectors and tensors.J. E. Marsden and T. J. R. Hughes, 2000, Mathematical Foundations of Elasticity, Dover. The directional directive provides a systematic way of finding these derivatives.
{{excerpt|Tensor derivative (continuum mechanics)|Derivatives with respect to vectors and second-order tensors|subsections=y}}
See also
- {{annotated link|Del in cylindrical and spherical coordinates}}
- {{annotated link|Differential form}}
- {{annotated link|Ehresmann connection}}
- {{annotated link|Fréchet derivative}}
- {{annotated link|Gateaux derivative}}
- {{annotated link|Generalizations of the derivative}}
- {{annotated link | Semi-differentiability}}
- {{annotated link|Hadamard derivative}}
- {{annotated link|Lie derivative}}
- {{annotated link|Material derivative}}
- {{annotated link|Structure tensor}}
- {{annotated link|Tensor derivative (continuum mechanics)}}
- {{annotated link|Total derivative}}
Notes
{{reflist|2}}
References
- {{cite book | first=F. B. | last=Hildebrand | title=Advanced Calculus for Applications| publisher=Prentice Hall | year=1976 | isbn=0-13-011189-9 }}
- {{cite book |author1=K.F. Riley |author2=M.P. Hobson |author3=S.J. Bence | title=Mathematical methods for physics and engineering|url=https://archive.org/details/mathematicalmeth00rile |url-access=registration | publisher=Cambridge University Press| year=2010 | isbn=978-0-521-86153-3}}
- {{cite journal |first=A. |last=Shapiro |title=On concepts of directional differentiability |journal=Journal of Optimization Theory and Applications |volume=66 |issue= 3|pages=477–487 |year=1990 |doi=10.1007/BF00940933 |s2cid=120253580 }}
External links
{{Commons category inline|Directional derivative}}
- [http://mathworld.wolfram.com/DirectionalDerivative.html Directional derivatives] at MathWorld.
- [http://planetmath.org/directionalderivative Directional derivative] at PlanetMath.
{{Calculus topics}}
Category:Differential calculus
Category:Differential geometry
Category:Generalizations of the derivative