Lagrange inversion theorem#Lambert W function

{{short description|Formula for the Taylor series expansion of the inverse function of an analytic function}}

In mathematical analysis, the Lagrange inversion theorem, also known as the Lagrange–Bürmann formula, gives the Taylor series expansion of the inverse function of an analytic function. Lagrange inversion is a special case of the inverse function theorem.

Statement

Suppose {{mvar|z}} is defined as a function of {{mvar|w}} by an equation of the form

:z = f(w)

where {{mvar|f}} is analytic at a point {{mvar|a}} and f'(a)\neq 0. Then it is possible to invert or solve the equation for {{mvar|w}}, expressing it in the form w=g(z) given by a power series{{cite book |editor=M. Abramowitz |editor2=I. A. Stegun |title=Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables |chapter=3.6.6. Lagrange's Expansion |place=New York |publisher=Dover |page=14 |year=1972 |url=http://people.math.sfu.ca/~cbm/aands/page_14.htm}}

: g(z) = a + \sum_{n=1}^{\infty} g_n \frac{(z - f(a))^n}{n!},

where

: g_n = \lim_{w \to a} \frac{d^{n-1}}{dw^{n-1}} \left[\left( \frac{w-a}{f(w) - f(a)} \right)^n \right].

The theorem further states that this series has a non-zero radius of convergence, i.e., g(z) represents an analytic function of {{mvar|z}} in a neighbourhood of z= f(a). This is also called reversion of series.

If the assertions about analyticity are omitted, the formula is also valid for formal power series and can be generalized in various ways: It can be formulated for functions of several variables; it can be extended to provide a ready formula for {{math|F(g(z))}} for any analytic function {{mvar|F}}; and it can be generalized to the case f'(a)=0, where the inverse {{mvar|g}} is a multivalued function.

The theorem was proved by Lagrange{{cite journal |author=Lagrange, Joseph-Louis |year=1770 |title=Nouvelle méthode pour résoudre les équations littérales par le moyen des séries |journal=Histoire de l'Académie Royale des Sciences et Belles-Lettres de Berlin |pages=251–326 |url=http://bibliothek.bbaw.de/bbaw/bibliothek-digital/digitalequellen/schriften/anzeige/index_html?band=02-hist/1768&seite:int=257}} https://archive.org/details/uvresdelagrange18natigoog/page/n13 (Note: Although Lagrange submitted this article in 1768, it was not published until 1770.) and generalized by Hans Heinrich Bürmann,Bürmann, Hans Heinrich, "Essai de calcul fonctionnaire aux constantes ad-libitum," submitted in 1796 to the Institut National de France. For a summary of this article, see: {{cite book |editor=Hindenburg, Carl Friedrich |title=Archiv der reinen und angewandten Mathematik |trans-title=Archive of pure and applied mathematics |location=Leipzig, Germany |publisher=Schäferischen Buchhandlung |year=1798 |volume=2 |chapter=Versuch einer vereinfachten Analysis; ein Auszug eines Auszuges von Herrn Bürmann |trans-chapter=Attempt at a simplified analysis; an extract of an abridgement by Mr. Bürmann |pages=495–499 |chapter-url=https://books.google.com/books?id=jj4DAAAAQAAJ&pg=495}}Bürmann, Hans Heinrich, "Formules du développement, de retour et d'integration," submitted to the Institut National de France. Bürmann's manuscript survives in the archives of the École Nationale des Ponts et Chaussées [National School of Bridges and Roads] in Paris. (See ms. 1715.)A report on Bürmann's theorem by Joseph-Louis Lagrange and Adrien-Marie Legendre appears in: [http://gallica.bnf.fr/ark:/12148/bpt6k3217h.image.f22.langFR.pagination "Rapport sur deux mémoires d'analyse du professeur Burmann,"] Mémoires de l'Institut National des Sciences et Arts: Sciences Mathématiques et Physiques, vol. 2, pages 13–17 (1799). both in the late 18th century. There is a straightforward derivation using complex analysis and contour integration;E. T. Whittaker and G. N. Watson. A Course of Modern Analysis. Cambridge University Press; 4th edition (January 2, 1927), pp. 129–130 the complex formal power series version is a consequence of knowing the formula for polynomials, so the theory of analytic functions may be applied. Actually, the machinery from analytic function theory enters only in a formal way in this proof, in that what is really needed is some property of the formal residue, and a more direct formal proof is available. In fact, the Lagrange inversion theorem has a number of additional rather different proofs, including ones using tree-counting arguments or induction.{{cite book | last1=Richard | first1=Stanley | title=Enumerative combinatorics. Volume 1. | series =Cambridge Stud. Adv. Math. | volume=49 | location=Cambridge | publisher=Cambridge University Press | year=2012 | isbn=978-1-107-60262-5 | mr=2868112 }}{{Citation |last1=Ira|first1=Gessel |date=2016 |title=Lagrange inversion |journal=Journal of Combinatorial Theory, Series A |volume=144 |language=en |pages=212–249 |doi=10.1016/j.jcta.2016.06.018 |arxiv=1609.05988|mr=3534068}}{{Citation |last1=Surya|first1=Erlang |last2=Warnke |first2=Lutz |date=2023 |title=Lagrange Inversion Formula by Induction |journal=The American Mathematical Monthly |volume=130 |issue=10 |language=en |pages=944–948 |doi=10.1080/00029890.2023.2251344 |arxiv=2305.17576|mr=4669236}}

If {{mvar|f}} is a formal power series, then the above formula does not give the coefficients of the compositional inverse series {{mvar|g}} directly in terms for the coefficients of the series {{mvar|f}}. If one can express the functions {{mvar|f}} and {{mvar|g}} in formal power series as

:f(w) = \sum_{k=0}^\infty f_k \frac{w^k}{k!} \qquad \text{and} \qquad g(z) = \sum_{k=0}^\infty g_k \frac{z^k}{k!}

with {{math|1=f0 = 0}} and {{math|f1 ≠ 0}}, then an explicit form of inverse coefficients can be given in term of Bell polynomials:Eqn (11.43), p. 437, C.A. Charalambides, Enumerative Combinatorics, Chapman & Hall / CRC, 2002

: g_n = \frac{1}{f_1^n} \sum_{k=1}^{n-1} (-1)^k n^\overline{k} B_{n-1,k}(\hat{f}_1,\hat{f}_2,\ldots,\hat{f}_{n-k}), \quad n \geq 2,

where

:\begin{align}

\hat{f}_k &= \frac{f_{k+1}}{(k+1)f_{1}}, \\

g_1 &= \frac{1}{f_{1}}, \text{ and} \\

n^{\overline{k}} &= n(n+1)\cdots (n+k-1)

\end{align}

is the rising factorial.

When {{math|1=f1 = 1}}, the last formula can be interpreted in terms of the faces of associahedra {{cite arXiv|eprint=1709.07504|class=math.CO|title=Hopf monoids and generalized permutahedra|last1=Aguiar|first1=Marcelo|last2=Ardila|first2=Federico|year=2017}}

: g_n = \sum_{F \text{ face of } K_n} (-1)^{n-\dim F} f_F , \quad n \geq 2,

where f_{F} = f_{i_{1}} \cdots f_{i_{m}} for each face F = K_{i_1} \times \cdots \times K_{i_m} of the associahedron K_n .

Example

For instance, the algebraic equation of degree {{mvar|p}}

: x^p - x + z= 0

can be solved for {{mvar|x}} by means of the Lagrange inversion formula for the function {{math|1=f(x) = xxp}}, resulting in a formal series solution

: x = \sum_{k=0}^\infty \binom{pk}{k} \frac{z^{(p-1)k+1} }{(p-1)k+1} .

By convergence tests, this series is in fact convergent for |z| \leq (p-1)p^{-p/(p-1)}, which is also the largest disk in which a local inverse to {{mvar|f}} can be defined.

Applications

=Lagrange–Bürmann formula=

There is a special case of Lagrange inversion theorem that is used in combinatorics and applies when f(w)=w/\phi(w) for some analytic \phi(w) with \phi(0)\ne 0. Take a=0 to obtain f(a)=f(0)=0. Then for the inverse g(z) (satisfying f(g(z))\equiv z), we have

:\begin{align}

g(z) &= \sum_{n=1}^{\infty} \left[ \lim_{w \to 0} \frac {d^{n-1}}{dw^{n-1}} \left(\left( \frac{w}{w/\phi(w)} \right)^n \right)\right] \frac{z^n}{n!} \\

{} &= \sum_{n=1}^{\infty} \frac{1}{n} \left[\frac{1}{(n-1)!} \lim_{w \to 0} \frac{d^{n-1}}{dw^{n-1}} (\phi(w)^n) \right] z^n,

\end{align}

which can be written alternatively as

:[z^n] g(z) = \frac{1}{n} [w^{n-1}] \phi(w)^n,

where [w^r] is an operator which extracts the coefficient of w^r in the Taylor series of a function of {{mvar|w}}.

A generalization of the formula is known as the Lagrange–Bürmann formula:

:[z^n] H (g(z)) = \frac{1}{n} [w^{n-1}] (H' (w) \phi(w)^n)

where {{math|H}} is an arbitrary analytic function.

Sometimes, the derivative {{math|{{prime|H}}(w)}} can be quite complicated. A simpler version of the formula replaces {{math|{{prime|H}}(w)}} with {{math|H(w)(1 − {{prime|φ}}(w)/φ(w))}} to get

: [z^n] H (g(z)) = [w^n] H(w) \phi(w)^{n-1} (\phi(w) - w \phi'(w)),

which involves {{math|{{prime|φ}}(w)}} instead of {{math|{{prime|H}}(w)}}.

=Lambert ''W'' function=

{{main|Lambert W function}}

The Lambert {{mvar|W}} function is the function W(z) that is implicitly defined by the equation

: W(z) e^{W(z)} = z.

We may use the theorem to compute the Taylor series of W(z) at z=0. We take f(w) = we^w and a = 0. Recognizing that

:\frac{d^n}{dx^n} e^{\alpha x} = \alpha^n e^{\alpha x},

this gives

:\begin{align}

W(z) &= \sum_{n=1}^{\infty} \left[\lim_{w \to 0} \frac{d^{n-1}}{dw^{n-1}} e^{-nw} \right] \frac{z^n}{n!} \\

{} &= \sum_{n=1}^{\infty} (-n)^{n-1} \frac{z^n}{n!} \\

{} &= z-z^2+\frac{3}{2}z^3-\frac{8}{3}z^4+O(z^5).

\end{align}

The radius of convergence of this series is e^{-1} (giving the principal branch of the Lambert function).

A series that converges for |\ln(z)-1|<\sqrt{{4+\pi^2}} (approximately 0.0655 < z < 112.63) can also be derived by series inversion. The function f(z) = W(e^z) - 1 satisfies the equation

:1 + f(z) + \ln (1 + f(z)) = z.

Then z + \ln (1 + z) can be expanded into a power series and inverted.{{cite conference |url=https://dl.acm.org/doi/pdf/10.1145/258726.258783 |title=A sequence of series for the Lambert W function |last1=Corless |first1=Robert M. |last2=Jeffrey |first2= David J.|author-link2=|last3=Knuth|first3=Donald E.|author-link3=Donald E. Knuth|date=July 1997 |book-title=Proceedings of the 1997 international symposium on Symbolic and algebraic computation |pages=197–204|doi=10.1145/258726.258783 }} This gives a series for f(z+1) = W(e^{z+1})-1\text{:}

:W(e^{1+z}) = 1 + \frac{z}{2} + \frac{z^2}{16} - \frac{z^3}{192} - \frac{z^4}{3072} + \frac{13 z^5}{61440} - O(z^6).

W(x) can be computed by substituting \ln x - 1 for {{mvar|z}} in the above series. For example, substituting {{math|−1}} for {{mvar|z}} gives the value of W(1) \approx 0.567143.

=Binary trees=

Consider{{cite book |last1=Harris|first1= John |last2=Hirst |first2= Jeffry L.| last3= Mossinghoff| first3= Michael |date=2008 |title=Combinatorics and Graph Theory |publisher= Springer |pages=185–189 |isbn=978-0387797113}} the set \mathcal{B} of unlabelled binary trees. An element of \mathcal{B} is either a leaf of size zero, or a root node with two subtrees. Denote by B_n the number of binary trees on n nodes.

Removing the root splits a binary tree into two trees of smaller size. This yields the functional equation on the generating function \textstyle B(z) = \sum_{n=0}^\infty B_n z^n\text{:}

:B(z) = 1 + z B(z)^2.

Letting C(z) = B(z) - 1, one has thus C(z) = z (C(z)+1)^2. Applying the theorem with \phi(w) = (w+1)^2 yields

: B_n = [z^n] C(z) = \frac{1}{n} [w^{n-1}] (w+1)^{2n} = \frac{1}{n} \binom{2n}{n-1} = \frac{1}{n+1} \binom{2n}{n}.

This shows that B_n is the {{mvar|n}}th Catalan number.

= Asymptotic approximation of integrals=

In the Laplace–Erdelyi theorem that gives the asymptotic approximation for Laplace-type integrals, the function inversion is taken as a crucial step.

See also

References

{{reflist|colwidth=30em}}