Richardson extrapolation

{{Short description|Sequence acceleration method in numerical analysis}}

File:Richardson extra 2d.gif

In numerical analysis, Richardson extrapolation is a sequence acceleration method used to improve the rate of convergence of a sequence of estimates of some value A^\ast = \lim_{h\to 0} A(h). In essence, given the value of A(h) for several values of h, we can estimate A^\ast by extrapolating the estimates to h=0. It is named after Lewis Fry Richardson, who introduced the technique in the early 20th century,{{cite journal

| last=Richardson | first=L. F. | author-link=Lewis Fry Richardson

| title=The approximate arithmetical solution by finite differences of physical problems including differential equations, with an application to the stresses in a masonry dam

| journal=Philosophical Transactions of the Royal Society A

| year=1911 | volume=210

| issue=459–470 | pages=307–357

| doi=10.1098/rsta.1911.0009

| doi-access=free}}{{cite journal

| last1=Richardson | first1=L. F. | author-link=Lewis Fry Richardson

| title=The deferred approach to the limit

| journal=Philosophical Transactions of the Royal Society A

| year=1927 | volume=226

| issue=636–646 | pages=299–349

| doi=10.1098/rsta.1927.0008

| last2=Gaunt

| first2=J. A.

| doi-access=free

}} though the idea was already known to Christiaan Huygens in his calculation of \pi.{{Citation|last=Brezinski|first=Claude|title=Some pioneers of extrapolation methods|date=2009-11-01|url=https://www.worldscientific.com/doi/10.1142/9789812836267_0001|work=The Birth of Numerical Analysis|pages=1–22|publisher=WORLD SCIENTIFIC|doi=10.1142/9789812836267_0001|isbn=978-981-283-625-0}} In the words of Birkhoff and Rota, "its usefulness for practical computations can hardly be overestimated."Page 126 of {{cite book | last=Birkhoff | first=Garrett | author-link=Garrett Birkhoff |author2=Gian-Carlo Rota |author2-link=Gian-Carlo Rota | title=Ordinary differential equations | publisher=John Wiley and sons | year=1978 | edition=3rd | isbn=0-471-07411-X | oclc= 4379402}}

Practical applications of Richardson extrapolation include Romberg integration, which applies Richardson extrapolation to the trapezoid rule, and the Bulirsch–Stoer algorithm for solving ordinary differential equations.

General formula

= Notation =

Let A_0(h) be an approximation of A^*(exact value) that depends on a step size {{mvar|h}} (where 0 < h < 1) with an error formula of the form

A^* = A_0(h)+a_0h^{k_0} + a_1h^{k_1} + a_2h^{k_2} + \cdots

where the a_i are unknown constants and the k_i are known constants such that h^{k_i} > h^{k_{i+1}}. Furthermore, O(h^{k_i}) represents the truncation error of the A_i(h) approximation such that A^* = A_i(h)+O(h^{k_i}). Similarly, in A^*=A_i(h)+O(h^{k_i}), the approximation A_i(h) is said to be an O(h^{k_i}) approximation.

Note that by simplifying with Big O notation, the following formulae are equivalent:

\begin{align}

A^* &= A_0(h) + a_0h^{k_0} + a_1h^{k_1} + a_2h^{k_2} + \cdots \\

A^* &= A_0(h)+ a_0h^{k_0} + O(h^{k_1}) \\

A^* &= A_0(h)+O(h^{k_0})

\end{align}

= Purpose =

Richardson extrapolation is a process that finds a better approximation of A^* by changing the error formula from A^*=A_0(h)+O(h^{k_0}) to A^* = A_1(h) + O(h^{k_1}). Therefore, by replacing A_0(h) with A_1(h) the truncation error has reduced from O(h^{k_0}) to O(h^{k_1}) for the same step size h. The general pattern occurs in which A_i(h) is a more accurate estimate than A_j(h) when i>j. By this process, we have achieved a better approximation of A^* by subtracting the largest term in the error which was O(h^{k_0}) . This process can be repeated to remove more error terms to get even better approximations.

= Process =

Using the step sizes h and h / t for some constant t, the two formulas for A^* are:

{{NumBlk||A^* = A_0(h)+ a_0h^{k_0} + a_1h^{k_1} + a_2h^{k_2} + O(h^{k_3}) |{{EquationRef|1}}}}

{{NumBlk||A^* = A_0\!\left(\frac{h}{t}\right) + a_0\left(\frac{h}{t}\right)^{k_0} + a_1\left(\frac{h}{t}\right)^{k_1} + a_2\left(\frac{h}{t}\right)^{k_2} + O(h^{k_3}) |{{EquationRef|2}}}}

To improve our approximation from O(h^{k_0}) to O(h^{k_1}) by removing the first error term, we multiply {{EquationNote|2|equation 2}} by t^{k_0} and subtract {{EquationNote|1|equation 1}} to give us

(t^{k_0}-1)A^* = \bigg[t^{k_0}A_0\left(\frac{h}{t}\right) - A_0(h)\bigg] + \bigg(t^{k_0}a_1\bigg(\frac{h}{t}\bigg)^{k_1}-a_1h^{k_1}\bigg)+ \bigg(t^{k_0}a_2\bigg(\frac{h}{t}\bigg)^{k_2}-a_2h^{k_2}\bigg) + O(h^{k_3}).

This multiplication and subtraction was performed because \big[t^{k_0}A_0\left(\frac{h}{t}\right) - A_0(h)\big] is an O(h^{k_1}) approximation of (t^{k_0}-1)A^*. We can solve our current formula for A^* to give

A^* = \frac{\bigg[t^{k_0}A_0\left(\frac{h}{t}\right) - A_0(h)\bigg]}{t^{k_0}-1}

+ \frac{\bigg(t^{k_0}a_1\bigg(\frac{h}{t}\bigg)^{k_1}-a_1h^{k_1}\bigg)}{t^{k_0}-1}

+ \frac{\bigg(t^{k_0}a_2\bigg(\frac{h}{t}\bigg)^{k_2}-a_2h^{k_2}\bigg)}{t^{k_0}-1}

+O(h^{k_3})

which can be written as A^* = A_1(h)+O(h^{k_1}) by setting

A_1(h) = \frac{t^{k_0}A_0\left(\frac{h}{t}\right) - A_0(h)}{t^{k_0}-1} .

= Recurrence relation =

A general recurrence relation can be defined for the approximations by

A_{i+1}(h) = \frac{t^{k_i}A_i\left(\frac{h}{t}\right) - A_i(h)}{t^{k_i}-1}

where k_{i+1} satisfies

A^* = A_{i+1}(h) + O(h^{k_{i+1}}) .

= Properties =

The Richardson extrapolation can be considered as a linear sequence transformation.

Additionally, the general formula can be used to estimate k_0 (leading order step size behavior of Truncation error) when neither its value nor A^* is known a priori. Such a technique can be useful for quantifying an unknown rate of convergence. Given approximations of A^* from three distinct step sizes h, h / t, and h / s, the exact relationshipA^*=\frac{t^{k_0}A_i\left(\frac{h}{t}\right) - A_i(h)}{t^{k_0}-1} + O(h^{k_1}) = \frac{s^{k_0}A_i\left(\frac{h}{s}\right) - A_i(h)}{s^{k_0}-1} + O(h^{k_1})yields an approximate relationship (please note that the notation here may cause a bit of confusion, the two O appearing in the equation above only indicates the leading order step size behavior but their explicit forms are different and hence cancelling out of the two {{math|O}} terms is only approximately valid)

A_i\left(\frac{h}{t}\right) + \frac{A_i\left(\frac{h}{t}\right) - A_i(h)}{t^{k_0}-1} \approx A_i\left(\frac{h}{s}\right) +\frac{A_i\left(\frac{h}{s}\right) - A_i(h)}{s^{k_0}-1}

which can be solved numerically to estimate k_0 for some arbitrary valid choices of h, s, and t.

As t \neq 1, if t>0 and s is chosen so that s = t^2, this approximate relation reduces to a quadratic equation in t^{k_0}, which is readily solved for k_0 in terms of h and t.

Example of Richardson extrapolation

Suppose that we wish to approximate A^*, and we have a method A(h) that depends on a small parameter h in such a way that

A(h) = A^\ast + C h^n + O(h^{n+1}).

Let us define a new function R(h,t) := \frac{ t^n A(h/t) - A(h)}{t^n-1} where h and \frac{h}{t} are two distinct step sizes.

Then

R(h, t) = \frac{ t^n ( A^* + C \left(\frac{h}{t}\right)^n + O(h^{n+1}) ) - ( A^* + C h^n + O(h^{n+1}) ) }{ t^n - 1} = A^* + O(h^{n+1}).

R(h,t) is called the Richardson extrapolation of A(h), and has a higher-order error estimate O(h^{n+1}) compared to A(h) .

Very often, it is much easier to obtain a given precision by using R(h) rather than A(h′) with a much smaller h′. Where A(h′) can cause problems due to limited precision (rounding errors) and/or due to the increasing number of calculations needed (see examples below).

Example pseudocode for Richardson extrapolation

The following pseudocode in MATLAB style demonstrates Richardson extrapolation to help solve the ODE y'(t) = -y^2, y(0) = 1 with the Trapezoidal method. In this example we halve the step size h each iteration and so in the discussion above we'd have that t = 2. The error of the Trapezoidal method can be expressed in terms of odd powers so that the error over multiple steps can be expressed in even powers; this leads us to raise t to the second power and to take powers of 4 = 2^2 = t^2 in the pseudocode. We want to find the value of y(5), which has the exact solution of \frac{1}{5 + 1} = \frac{1}{6} = 0.1666... since the exact solution of the ODE is y(t) = \frac{1}{1 + t}. This pseudocode assumes that a function called Trapezoidal(f, tStart, tEnd, h, y0) exists which attempts to compute y(tEnd) by performing the trapezoidal method on the function f, with starting point y0 and tStart and step size h.

Note that starting with too small an initial step size can potentially introduce error into the final solution. Although there are methods designed to help pick the best initial step size, one option is to start with a large step size and then to allow the Richardson extrapolation to reduce the step size each iteration until the error reaches the desired tolerance.

tStart = 0 % Starting time

tEnd = 5 % Ending time

f = -y^2 % The derivative of y, so y' = f(t, y(t)) = -y^2

% The solution to this ODE is y = 1/(1 + t)

y0 = 1 % The initial position (i.e. y0 = y(tStart) = y(0) = 1)

tolerance = 10^-11 % 10 digit accuracy is desired

% Don't allow the iteration to continue indefinitely

maxRows = 20

% Pick an initial step size

initialH = tStart - tEnd

% Were we able to find the solution to within the desired tolerance? not yet.

haveWeFoundSolution = false

h = initialH

% Create a 2D matrix of size maxRows by maxRows to hold the Richardson extrapolates

% Note that this will be a lower triangular matrix and that at most two rows are actually

% needed at any time in the computation.

A = zeroMatrix(maxRows, maxRows)

% Compute the top left element of the matrix.

% The first row of this (lower triangular) matrix has now been filled.

A(1, 1) = Trapezoidal(f, tStart, tEnd, h, y0)

% Each row of the matrix requires one call to Trapezoidal

% This loops starts by filling the second row of the matrix,

% since the first row was computed above

for i = 1 : maxRows - 1 % Starting at i = 1, iterate at most maxRows - 1 times

% Halve the previous value of h since this is the start of a new row.

h = h/2

% Starting filling row i+1 from the left by calling

% the Trapezoidal function with this new smaller step size

A(i + 1, 1) = Trapezoidal(f, tStart, tEnd, h, y0)

% Go across this current (i+1)-th row until the diagonal is reached

for j = 1 : i

% To compute A(i + 1, j + 1), which is the next Richardson extrapolate,

% use the most recently computed value (i.e. A(i + 1, j))

% and the value from the row above it (i.e. A(i, j)).

A(i + 1, j + 1) = ((4^j).*A(i + 1, j) - A(i, j))/(4^j - 1);

end

% After leaving the above inner loop, the diagonal element of row i + 1 has been computed

% This diagonal element is the latest Richardson extrapolate to be computed.

% The difference between this extrapolate and the last extrapolate of row i is a good

% indication of the error.

if (absoluteValue(A(i + 1, i + 1) - A(i, i)) < tolerance) % If the result is within tolerance

% Display the result of the Richardson extrapolation

print("y = ", A(i + 1, i + 1))

haveWeFoundSolution = true

% Done, so leave the loop

break

end

end

% If we were not able to find a solution to within the desired tolerance

if (not haveWeFoundSolution)

print("Warning: Not able to find solution to within the desired tolerance of ", tolerance);

print("The last computed extrapolate was ", A(maxRows, maxRows))

end

See also

References

{{Reflist}}

{{refbegin}}

  • Extrapolation Methods. Theory and Practice by C. Brezinski and M. Redivo Zaglia, North-Holland, 1991.
  • Ivan Dimov, Zahari Zlatev, Istvan Farago, Agnes Havasi: Richardson Extrapolation: Practical Aspects and Applications, Walter de Gruyter GmbH & Co KG, {{ISBN|9783110533002}} (2017).

{{refend}}