Brillouin and Langevin functions

{{short description|Mathematical function, used to describe magnetization}}

The Brillouin and Langevin functions are a pair of special functions that appear when studying an idealized paramagnetic material in statistical mechanics. These functions are named after French physicists Paul Langevin and Léon Brillouin who contributed to the microscopic understanding of magnetic properties of matter.

The Langevin function is derived using statistical mechanics, and describes how magnetic dipoles are alignment by an applied field.{{cite book

| author = B. D. Cullity & C. D. Graham

| year=2009

| publisher= Wiley (IEEE press)

| title = Introduction to magnetic materials (2nd ed.)

| pages = 91–99

| isbn = 978-0471-47741-9}} The Brillouin function was developed later to give an explanation that considers quantum physics.{{cite book

| author = J. R. Hook & H. F. Hall

| year=2010

| publisher= Wiley

| title = Solid State Physics (2nd ed.)

| pages = 200–206

| isbn = 978-0471-92805-8}} The Langevin function could then be a seen as a special case of the more general Brillouin function if the quantum number J would be infinite (J \rightarrow \infty ).{{cite book

| author = B. D. Cullity & C. D. Graham

| year=2009

| publisher= Wiley (IEEE press)

| title = Introduction to magnetic materials (2nd ed.)

| pages = 99–113

| isbn = 978-0471-47741-9}}

Brillouin function for paramagnetism {{anchor|Brillouin Function}}

The Brillouin functionC. Kittel, Introduction to Solid State Physics (8th ed.), pages 303-4 {{ISBN|978-0-471-41526-8}}{{Cite journal

| last = Darby

| first = M.I.

| author-link =

| title = Tables of the Brillouin function and of the related function for the spontaneous magnetization

| journal = Br. J. Appl. Phys.

| volume = 18

| issue = 10

| pages = 1415–1417

| year = 1967

| doi =10.1088/0508-3443/18/10/307

|bibcode = 1967BJAP...18.1415D }}{{Cite web

|title=General Paramagnetism

|url=https://farside.ph.utexas.edu/teaching/sm1/Thermalhtml/node81.html

|access-date=2024-12-18

}} arises when studying magnetization of an ideal paramagnet. In particular, it describes the dependency of the magnetization M on the applied magnetic field B, defined by the following equation:

:B_J(x) = \frac{2J + 1}{2J} \coth \left ( \frac{2J + 1}{2J} x \right )

- \frac{1}{2J} \coth \left ( \frac{1}{2J} x \right )

The function B_J is usually applied in the context where x is a real variable and a function of the applied field B. In this case, the function varies from -1 to 1, approaching +1 as x \to +\infty and -1 as x \to -\infty.

The total angular momentum quantum number J is a positive integer or half-integer. Considering the microscopic magnetic moments of the material. The magnetization is given by:

:M = N g \mu_{\rm B} J B_J(x)

where

  • N is the number of atoms per unit volume,
  • g the g-factor,
  • \mu_{\rm B} the Bohr magneton,
  • x is the ratio of the Zeeman energy of the magnetic moment in the external field to the thermal energy k_{\rm B} T:

::x = J\frac{g\mu_{\rm B} B}{k_{\rm B} T}

Note that in the SI system of units B given in Tesla stands for the magnetic field, B=\mu_0 H, where H is the auxiliary magnetic field given in A/m and \mu_0 is the permeability of vacuum.

:

class="toccolours collapsible collapsed" width="80%" style="text-align:left"

!Click "show" to see a derivation of this law:

A derivation of this law describing the magnetization of an ideal paramagnet is as follows. Let z be the direction of the magnetic field. The z-component of the angular momentum of each magnetic moment (a.k.a. the azimuthal quantum number) can take on one of the 2J+1 possible values -J,-J+1,...,+J. Each of these has a different energy, due to the external field B: The energy associated with quantum number m is

:E_m = -mg \mu_{\rm B} B = -k_{\rm B}Txm/J

(where g is the g-factor, μB is the Bohr magneton, and x is as defined in the text above). The relative probability of each of these is given by the Boltzmann factor:

:P(m)=e^{-E_m/(k_{\rm B}T)}/Z=e^{xm/J}/Z

where Z (the partition function) is a normalization constant such that the probabilities sum to unity. Calculating Z, the result is:

:P(m) = e^{xm/J}/\left(\sum_{m'=-J}^J e^{xm'/J}\right).

All told, the expectation value of the azimuthal quantum number m is

:\langle m \rangle = (-J)\times P(-J) + \cdots + J\times P(J) = \left(\sum_{m=-J}^J m e^{xm/J}\right)/ \left(\sum_{m=-J}^J e^{xm/J}\right).

The denominator is a geometric series and the numerator is a type of arithmetico–geometric series, so the series can be explicitly summed. After some algebra, the result turns out to be

:\langle m \rangle = J B_J(x)

With N magnetic moments per unit volume, the magnetization density is

:M = Ng\mu_{\rm B}\langle m \rangle = NgJ\mu_{\rm B} B_J(x).

=High-field saturation limit {{anchor|High Field Limit}} =

When x\to\infty, the Brillouin function goes to 1. The magnetization saturates with the magnetic moments completely aligned with the applied field:

:M = N g \mu_{\rm B} J

=Low field/high-temperature limit {{anchor|High Temperature Limit}} =

{{Main|Curie's law}}

For low fields the curve appears almost linear, and could be replaced by a linear slope as in Curie's law of paramagnetism. When x \ll 1 (i.e. when x = \mu_{\rm B} B / k_{\rm B} T is small) the expression of the magnetization can be approximated by:

:M = C \cdot \frac{B}{T}

and equivalent to Curie's law with the constant given by

:C = \frac{N g^2 J(J+1) \mu_{\rm B}^2}{3k_{\rm B}} = \frac{N \mu_{\text{eff}}^2 }{3k_{\rm B}}

Using \mu_{\text{eff}}=g\sqrt{J(J+1)}\mu_{\rm B} as the effective number of Bohr magnetons.

Note that this is only valid for low fields in paramagnetism.{{Cite web

|url=https://www.tf.uni-kiel.de/matwis/amat/elmat_en/kap_4/backbone/r4_2_2.html

|title=4.2.2 Paramagnetism

|access-date=2024-12-18

}} Ferromagnetic materials still has a spontaneous magnetization at low fields (below the Curie-temperature), and the susceptibility must then instead be explained by Curie–Weiss law.

=Two-state case (spin-1/2) =

The most simple case of the Brillouin function would be the case of J=1/2, when the function simplifies to the shape of a tanh-function.{{cite book

| author = B. D. Cullity & C. D. Graham

| year=2009

| publisher= Wiley (IEEE press)

| title = Introduction to magnetic materials (2nd ed.)

| pages = 99–113 and 122

| isbn = 978-0471-47741-9}}{{Cite web

| title= Advanced Solid State Physics, Paramagnetism

|url=https://lampz.tugraz.at/~hadley/ss2/magnetism/para.php

|access-date=2024-12-18

}}{{cite web

| author = R. Feynman

| title = The Feynman Lectures on Physics, II-35 Paramagnetism and Magnetic Resonance

| publisher = Caltech

|url=https://www.feynmanlectures.caltech.edu/II_35.html }} Then written as

: M = N g \mu_B J \tanh\frac{g J \mu_B B}{k_{\rm B}T},

This could be linked to Ising's model, for a case with two possible spins: either up or down.{{Cite web |title= Introduction to Statistical Mechanics 6. Phase Transitions

| url= https://web.stanford.edu/~peastman/statmech/phasetransitions.html |access-date=2024-12-18 }} Directed in parallel or antiparallel to the applied field.

This is then equivalent to a 2-state particle: it may either align its magnetic moment with the magnetic field or against it. So the only possible values of magnetic moment are then \mu_B and -\mu_B. If so, then such a particle has only two possible energies, -\mu_B B when it is aligned with the field and +\mu_B B when it is oriented opposite to the field.

Langevin function for classical paramagnetism {{anchor|Langevin Function}}

[[File:Mplwp Langevin-function tanhx3.svg|300px|thumb|right|Langevin function (blue line), compared with

\tanh(x/3) (magenta line).]]

The Langevin function (L(x)) was named after Paul Langevin who published two papers with this function in 1905 to describe paramagnetism by statistical mechanics. Written as:

:L(x) = \coth(x) - \frac{1}{x}

It could be derived by describing how magnetic moments are aligned by a magnetic field, considering the statistical thermodynamics.{{Cite web

|title=Classical Langevin theory of paramagnetism

|url=https://www.arsdcollege.ac.in/wp-content/uploads/2020/04/lec12-Classical-Langevin-theory-of-paramagnetism.pdf

|access-date=2024-12-18

}}{{Cite web

|title= PHYSICS 4750 Physics of Modern Materials Chapter 8: Magnetic Materials

|url=http://faculty.chas.uni.edu/~shand/Physics_Modern_Materials_Lecture_Notes/PMM_Chap8_Magnetic_Notes.pdf

|access-date=2024-12-18

}}{{Cite web

|title= Chapter 6 Part 3 - Classical (Langevin) theory of paramagnetism

|url=http://www.phy.cuhk.edu.hk/~pmhui/course/phys4031/download/1/Chapter%206%20Part%203%20-%20Classical%20(Langevin)%20theory%20of%20paramagnetism%20(ignore%20quantization%20of%20z-component%20of%20magnetic%20moments).pdf

|access-date=2024-12-18

}} One derivation could be seen here:

:

class="toccolours collapsible collapsed" width="80%" style="text-align:left"

!Click "show" to see a derivation of this function:

The energy for each magnetic moment will be

:E = - \mu H\cos\theta,

where \theta is the angle between the magnetic moment and

the magnetic field (which we take to be pointing in the z

coordinate.) The corresponding partition function is

:Z = \int_0^{2\pi} d\phi \int_0^{\pi}d\theta \sin\theta e^{(\mu H \cos\theta /k_B T)}.

Here we could simplify to replace x=\mu H/k_B T

We see there is no dependence on the \phi angle, and also we can

change variables to a=\cos\theta to obtain

:Z = 2\pi \int_{-1}^ 1 da \cdot e^{xa} =

2\pi{e^x-e^{-x} \over x }=

{4\pi\sinh( x ) \over x .}

Now, the expected value of the z component of the magnetization (the other two are seen to be null (due to integration over \phi), as they should) will be given by

:\left\langle\mu_z \right\rangle = {1 \over Z} \int_0^{2\pi} d\phi \int_0^{\pi}d\theta \sin\theta \exp( x \cos\theta) \left[\mu\cos\theta\right] .

To simplify the calculation, we see this can be written as a differentiation of Z:

:\left\langle\mu_z\right\rangle = {k_B T \over Z } \frac{\partial Z}{\partial B} = k_B T \frac{\partial \ln Z}{\partial B}

Carrying out the derivation we find

:M=N\left\langle\mu_z\right\rangle = N\mu \cdot L(\mu H /k_B T),

where L is the Langevin function:

: L(x)= \coth x -{1 \over x}.

The Langevin function can also be derived as the classical limit of the Brillouin function, if the magnetic moments can be continuously aligned in the field and the quantum number J would be able to assume all values (J \to \infty). The Brillouin function is then simplified into the langevin function.

= Classical or quantum approach? =

{{See also|Bohr–Van Leeuwen theorem}}

Langevin function is often seen as the classical theory of paramagnetism, while the Brillouin function is the quantum theory of paramagnetism. When Langevin published the theory paramagnetism in 1905{{Cite journal

| title = Sur la théorie du magnétisme.

| author = Paul Langevin

| year= 1905

| journal = J. Phys. Theor. Appl.

| pages = 678–693

| volume = 4

| issue = 1

| doi = 10.1051/jphystap:019050040067800 }}{{Cite journal

| title = Magnétisme et théorie des électrons

| author = Paul Langevin

| year= 1905

| journal = Annales de chimie et de physique

| pages = 68–125

| volume = 8

| issue = 5

| url=https://fr.wikisource.org/wiki/Magn%C3%A9tisme_et_th%C3%A9orie_des_%C3%A9lectrons }} it was before the adoption of quantum physics. Meaning that Langevin only used concepts of classical physics.{{Cite journal

| title = The story of magnetism: from Heisenberg, Slater, and Stoner to Van Vleck, and the issues of exchange and correlation

| author = Navinder Singh

| year= 2018

| journal = arXiv

| pages = 1807.11291

| arxiv = 1807.11291 }}

Niels Bohr showed in his thesis that classical statistical mechanics can not be used to explain paramagnetism, and that quantum theory has to be used. This would later be known as the Bohr–Van Leeuwen theorem. The magnetic moment would later be explained in quantum theory by the Bohr magneton (\mu_B), which is used in the Brillouin function.

It could be noted that there is a difference in the approaches of Langevin and Bohr, since Langevin assumes a magnetic polarization \mu as the basis for the derivation, while Bohr start the derivation from motions of electrons and a model of an atom. Langevin is still assuming a fix magnetic dipole. This could be expressed as by J. H. Van Vleck:

"When Langevin assumed that the magnetic moment of the atom or molecule had a fixed value \mu, he was quantizing the system without realizing it ". This makes the Langevin function to be in the borderland between classical statistical mechanics and quantum theory (as either semi-classical or semi-quantum).

Langevin function for electric polarization

The Langevin function could also be used to describe electric polarization, in the specific case when the polarization is explained by orientation of (electrically polarized) dipoles.{{Cite journal

| title = Nonlinear dielectric effects in liquids: a guided tour

| journal = Journal of Physics: Condensed Matter

| author = Richert, Ranko

| date= 2017-08-09

| publisher = IOP Publishing

| pages = 363001

| issue = 36

| volume = 29

| doi = 10.1088/1361-648X/aa7cc4 | pmid = 28665294

| bibcode = 2017JPCM...29J3001R

}}{{Cite web

|url=https://www.tf.uni-kiel.de/matwis/amat/elmat_en/kap_3/backbone/r3_2_4.html

|title=3.2.4 Orientation Polarization

|access-date=2024-12-18

}} So that the electric polarization is given by:{{cite book

| author = J. R. Hook & H. F. Hall

| year=2010

| publisher= Wiley

| title = Solid State Physics (2nd ed.)

| pages = 260–262

| isbn = 978-0471-92805-8 }}

: P = P_s \cdot L(x)

but here for an electric dipole moment p and an electric field E_L (insead of the magntic equivalents), that is

: x = \frac{pE_L}{k_BT}

Simplified functions

For small values of {{math|x}}, the Langevin function can be approximated by a truncation of its Taylor series:

:

L(x) = \tfrac{1}{3} x - \tfrac{1}{45} x^3 + \tfrac{2}{945} x^5 - \tfrac{1}{4725} x^7 + \dots

The first term of this series expansion is equivalent to Curie's law, when writing it as

: L(x) \approx \frac{x}{3}

An alternative, better behaved approximation can be derived from the

Lambert's continued fraction expansion of {{math|tanh(x)}}:

:

L(x) = \frac{x}{3+\tfrac{x^2}{5+\tfrac{x^2}{7+\tfrac{x^2}{9+\ldots}}}}

For small enough {{math|x}}, both approximations are numerically better than a direct evaluation of the actual analytical expression, since the latter suffers from catastrophic cancellation for x \approx 0 where \coth(x) \approx 1/x.

Inverse Langevin function

The inverse Langevin function ({{math|L−1(x)}}) is without an explicit analytical form, but there exist several approximations.{{Cite journal

|author = Howard, R.M.

|title = Analytical approximations for the inverse Langevin function via linearization, error approximation, and iteration.

|journal = Rheol. Acta

|volume = 59

|pages = 521–544

|year = 2020

|issue = 8

|publisher = Springer

|doi = 10.1007/s00397-020-01195-8 |bibcode = 2020AcRhe..59..521H

|doi-access = free

}}

The inverse Langevin function {{math|L−1(x)}} is defined on the open interval (−1, 1). For small values of {{mvar|x}}, it can be approximated by a truncation of its Taylor series{{cite journal |title=Energy functions for rubber from microscopic potentials |last1=Johal |first1=A. S. |last2=Dunstan |first2=D. J. |journal=Journal of Applied Physics |volume=101 |issue=8 |year=2007 |pages=084917–084917–5 |doi=10.1063/1.2723870 |bibcode = 2007JAP...101h4917J |url=https://zenodo.org/record/1136026 }}

:

L^{-1}(x) = 3 x + \tfrac{9}{5} x^3 + \tfrac{297}{175} x^5 + \tfrac{1539}{875} x^7 + \dots

and by the Padé approximant

:

L^{-1}(x) = 3x \frac{35-12x^2}{35-33x^2} + O(x^7).

File:Cohen and Jedynak approximations.gif

Since this function has no closed form, it is useful to have approximations valid for arbitrary values of {{mvar|x}}. One popular approximation, valid on the whole range (−1, 1), has been published by A. Cohen:{{cite journal |title=A Padé approximant to the inverse Langevin function |last=Cohen |first=A. |journal=Rheologica Acta |volume=30 |issue=3 |pages=270–273 |year=1991 |doi=10.1007/BF00366640 |bibcode=1991AcRhe..30..270C |s2cid=95818330 }}

:

L^{-1}(x) \approx x \frac{3-x^2}{1-x^2}.

This has a maximum relative error of 4.9% at the vicinity of {{math|x {{=}} ±0.8}}. Greater accuracy can be achieved by using the formula given by R. Jedynak:{{cite journal |title=Approximation of the inverse Langevin function revisited |last=Jedynak|first=R. |journal=Rheologica Acta |volume=54|issue=1 |pages=29–39 |year=2015|doi=10.1007/s00397-014-0802-2|doi-access=free |bibcode=2015AcRhe..54...29J }}

:

L^{-1}(x) \approx x \frac{3.0-2.6x+0.7x^2}{(1-x)(1+0.1x)},

valid for {{math|x ≥ 0}}. The maximal relative error for this approximation is 1.5% at the vicinity of x = 0.85. Even greater accuracy can be achieved by using the formula given by M. Kröger:{{cite journal |title=Simple, admissible, and accurate approximants of the inverse Langevin and Brillouin functions, relevant for strong polymer deformations and flows |last=Kröger|first=M. |journal=J Non-Newton Fluid Mech |volume=223 |pages=77–87|year=2015|doi=10.1016/j.jnnfm.2015.05.007|doi-access=free|bibcode=2015JNNFM.223...77K |hdl=20.500.11850/102747|hdl-access=free}}

:

L^{-1}(x) \approx \frac{3x-x(6x^{2}+x^{4}-2x^{6})/5}{1-x^{2}}

The maximal relative error for this approximation is less than 0.28%. More accurate approximation was reported by R. Petrosyan:{{cite journal|last=Petrosyan|first=R.|title=Improved approximations for some polymer extension models|journal=Rheologica Acta|year=2016|volume=56|pages=21–26|doi = 10.1007/s00397-016-0977-9 |arxiv=1606.02519|s2cid=100350117}}

:

L^{-1}(x) \approx 3x+\frac{x^{2}}{5}\sin\left(\frac{7x}{2}\right)+\frac{x^{3}}{1-x},

valid for {{math|x ≥ 0}}. The maximal relative error for the above formula is less than 0.18%.

New approximation given by R. Jedynak,{{cite journal|last=Jedynak|first=R.|title=New facts concerning the approximation of the inverse Langevin function|journal=Journal of Non-Newtonian Fluid Mechanics|volume=249 |pages=8–25|year=2017|doi = 10.1016/j.jnnfm.2017.09.003 |bibcode=2017JNNFM.249....8J }} is the best reported approximant at complexity 11:

L^{-1}(x) \approx \frac{x (3 -1.00651x^2 -0.962251x^4+ 1.47353x^6-0.48953 x^8)}

{(1 - x) (1 + 1.01524 x)},

valid for {{math|x ≥ 0}}. Its maximum relative error is less than 0.076%.

Current state-of-the-art diagram of the approximants to the inverse Langevin function

presents the figure below. It is valid for the rational/Padé approximants,

File:Approximants to the inverse Langevin function.png

A recently published paper by R. Jedynak,{{cite journal|last=Jedynak|first=R.|title= A comprehensive study of the mathematical methods used to approximate the inverse Langevin function|journal=Mathematics and Mechanics of Solids|pages=1–25|year=2018|volume=24|issue=7|doi = 10.1177/1081286518811395|s2cid=125370646}} provides a series of the optimal approximants to the inverse Langevin function. The table below reports the results with correct asymptotic behaviors,.

Comparison of relative errors for the different optimal rational approximations, which were computed with constraints (Appendix 8 Table 1)

class="wikitable" style="text-align:center"
ComplexityOptimal approximationMaximum relative error [%]
3R_{2,1}(y)=\frac{-2 y^2+3 y}{1-y} 13
4R_{3,1}(y)=\frac{0.88 y^3-2.88 y^2+3 y}{1-y}0.95
5R_{3,2}(y)=\frac{1.1571 y^3-3.3533 y^2+3 y}{(1-y) (1-0.1962 y)} 0.56
6R_{5,1}(y)=\frac{0.756 y^5-1.383 y^4+1.5733 y^3-2.9463 y^2+3 y}{1-y} 0.16
7R_{3,4}(y)=\frac{2.14234 y^3-4.22785 y^2+3 y}{(1-y) \left(0.71716 y^3-0.41103 y^2-0.39165 y+1\right)} 0.082

Also recently, an efficient near-machine precision approximant, based on spline interpolations, has been proposed by Benítez and Montáns,{{cite journal|last1=Benítez|first1=J.M.|last2=Montáns|first2=F.J.|title= A simple and efficient numerical procedure to compute the inverse Langevin function with high accuracy|journal=Journal of Non-Newtonian Fluid Mechanics|pages=153–163|year=2018|volume=261|doi = 10.1016/j.jnnfm.2018.08.011|arxiv=1806.08068|bibcode=2018JNNFM.261..153B |s2cid=119029096}} where Matlab code is also given to generate the spline-based approximant and to compare many of the previously proposed approximants in all the function domain.

= Inverse Brillouin function =

Approximations could also be used to express the inverse Brillouin function (B_J(x)^{-1}) . Takacs{{cite journal|last1=Takacs|first1=Jeno|title=Approximations for Brillouin and its reverse function|journal=COMPEL - the International Journal for Computation and Mathematics in Electrical and Electronic Engineering|date=2016|volume=35|issue= 6|page=2095|doi=10.1108/COMPEL-06-2016-0278}} proposed the following approximation to the inverse of the Brillouin function:

:B_J(x)^{-1} = \frac{axJ^2}{1-bx^2}

where the constants a and b are defined to be

:a=\frac{0.5(1+2J)(1-0.055)}{(J-0.27)2J}+\frac{0.1}{J^2}

:b=0.8

See also

References