Aberth method

The Aberth method, or AberthEhrlich method, named after Oliver Aberth[1] and Louis W. Ehrlich,[2] is a root-finding algorithm developed in 1967 for simultaneous approximation of all the roots of a univariate polynomial.

The fundamental theorem of algebra states that for each polynomial with complex coefficients there are as many roots as the degree of the polynomial. This method converges cubically, an improvement over the Weierstrass(DurandKerner) method, another numerical algorithm that approximates all roots at once, which converges quadratically.[1][2] (However, both algorithms converge linearly at multiple zeros.[3])

Description

Let   p(x)=p_nx^n+p_{n-1}x^{n-1}+\cdots+p_1x+p_0  be a univariate polynomial of degree n with real or complex coefficients. Then there exist complex numbers z^*_1,\,z^*_2,\dots,z^*_n, the roots of p(x), that give the factorisation:

p(x)=p_n\cdot(x-z^*_1)\cdot(x-z^*_2)\cdots(x-z^*_n).

Although those numbers are unknown, upper and lower bounds for their absolute values are computable from the coefficients of the polynomial. Now one can pick n distinct numbers in the complex plane—randomly or evenly distributed—such that their absolute values are within the same bounds. (Also, if the zeros are symmetrical, the starting points must not be exactly symmetrical along the same axis, as this can prevent convergence.)[1] A set of such numbers is called an initial approximation of the set of roots of p(x). This approximation can be iteratively improved using the following procedure.

Let z_1,\dots,z_n\in\mathbb C be the current approximations of the zeros of p(x). Then offset numbers w_1,\dots,w_n\in\mathbb C are computed as

w_k=-\frac{\frac{p(z_k)}{p'(z_k)}}{1-\frac{p(z_k)}{p'(z_k)}\cdot \sum_{j\ne k}\frac1{z_k-z_j}},

where p'(z) is the polynomial derivative of p evaluated in the point z.

The next set of approximations of roots of p(x) is then   z_1+w_1,\dots,z_n+w_n  . One can measure the quality of the current approximation by the values of the polynomial or by the size of the offsets.

Conceptually, this method uses an electrostatic analogy, modeling the approximated zeros as movable negative point charges, which converge toward the true zeros, represented by fixed positive point charges.[1] A direct application of Newton's method to each approximated zero will often cause multiple starting points to incorrectly converge to the same root. The Aberth method avoids this by also modeling the repulsive effect the movable charges have on each other. In this way, when a movable charge has converged on a zero, their charges will cancel out, so that other movable charges are no longer attracted to that location, encouraging them to converge to other "unoccupied" zeros. (Stieltjes also modeled the positions of zeros of polynomials as solutions to electrostatic problems.)

Inside the formula of the Aberth method one can find elements of Newton's method and the Weierstrass(DurandKerner) method. Details for an efficient implementation, esp. on the choice of good initial approximations, can be found in Bini (1996).[3]

The updates of the roots may be executed as a simultaneous Jacobi-like iteration where first all new approximations are computed from the old approximations or as a sequential Gauss–Seidel-like iteration that uses each new approximation from the time it is computed.

A very similar method is the Newton-Maehly method. It computes the zeros one after another, but instead of an explicit deflation it divides by the already acquired linear factors on the fly. The Aberth method is like the Newton-Maehly method for computing the last root while pretending you have already found the other ones.[4]

Derivation from Newton's method

The iteration formula is the univariate Newton iteration for the function

F(x)=\frac{p(x)}{\prod_{j=0;\,j\ne k}^n(x-z_j)}

If the values z_1,\dots,z_n are already close to the roots of p(x), then the rational function F(x) is almost linear with a dominant root close to z_k and poles at z_1,\dots,z_{k-1},z_{k+1},\dots,z_n that direct the Newton iteration away from the roots of p(x) that are close to them. That is, the corresponding basins of attraction get rather small, while the root close to z_k has a wide region of attraction.

The Newton step \tfrac{F(x)}{F'(x)} in the univariate case is the reciprocal value to the logarithmic derivative

\begin{align}
       \frac{F'(x)}{F(x)}
    &= \frac{d}{dx}\ln|F(x)|\\
    &= \frac{d}{dx}\big(\ln|p(x)|-\sum_{j=0;\,j\ne k}^n\ln|x-z_j|\big)\\
    &= \frac{p'(x)}{p(x)}-\sum_{j=0;\,j\ne k}^n\frac1{x-z_j}
\end{align}

Thus, the new approximation is computed as

z_k'=z_k-\frac{F(z_k)}{F'(z_k)}=z_k-\frac1{\frac{p'(z_k)}{p(z_k)}-\sum_{j=0;\,j\ne k}^n\frac1{z_k-z_j}}\,,

which is the update formula of the AberthEhrlich method.

Literature

  1. 1 2 3 4 Aberth, Oliver (1973). "Iteration methods for finding all zeros of a polynomial simultaneously". Math. Comp. (Mathematics of Computation, Vol. 27, No. 122) 27 (122): 339–344. doi:10.2307/2005621. JSTOR 2005621. Because of the obvious analogy from electrostatics, this field may be called the field of a unit plus charge ... To avoid this, we assign a unit minus charge at each sampling point. The idea here is that when a sampling point z, is near a simple zero, the field from the minus charge at z, should counteract that from the plus charge at the zero, preventing a second sampling point from converging to this zero.
  2. 1 2 Ehrlich, Louis W. (1967). "A modified Newton method for polynomials". Comm. ACM 10 (2): 107–108. doi:10.1145/363067.363115.
  3. 1 2 Bini, Dario Andrea (1996). "Numerical computation of polynomial zeros by means of Aberth's method". Numerical Algorithms 13 (2): 179–200. doi:10.1007/BF02207694.
  4. Bauer, F.L.; Stoer, J. (1962). "Algorithm 105: Newton Maehly". Comm. ACM 5 (7): 387–388. doi:10.1145/368273.368423.

See also

This article is issued from Wikipedia - version of the Thursday, April 21, 2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.