Broyden's method

In numerical analysis, Broyden's method is a quasi-Newton method for finding roots in $k$ variables. It was originally described by C. G. Broyden in 1965.^[1]

Newton's method for solving $f (x) = 0$ uses the Jacobian matrix, $J$ , at every iteration. However, computing this Jacobian is a difficult and expensive operation. The idea behind Broyden's method is to compute the whole Jacobian only at the first iteration, and to do a rank-one update at the other iterations.

In 1979 Gay proved that when Broyden's method is applied to a linear system of size $n \times n$ , it terminates in $2 n$ steps,^[2] although like all quasi-Newton methods, it may not converge for nonlinear systems.

Description of the method

Solving single variable equation

In the secant method, we replace the first derivative $f'$ at $x n$ with the finite difference approximation:

f'(x_n) \simeq \frac{f(x_n) - f(x_{n - 1})}{x_n - x_{n - 1}} ,

and proceed similar to Newton's Method:

x_{n + 1} = x_n - \frac{1}{f'(x_n)} f(x_n)

where $n$ is the iteration index.

Solving a system of nonlinear equations

To solve a system of $k$ nonlinear equations

\mathbf f(\mathbf x) = \mathbf 0 ,

where $f$ is a vector-valued function of vector $x$ :

\mathbf x = (x_1, x_2, x_3, \dotsc, x_k)

\mathbf f(\mathbf x) = (f_1(x_1, x_2, \dotsc, x_k), f_2(x_1, x_2, \dotsc, x_k), \dotsc, f_k(x_1, x_2, \dotsc, x_k))

For such problems, Broyden gives a generalization of the one-dimensional Newton's method, replacing the derivative with the Jacobian $J$ . The Jacobian matrix is determined iteratively based on the secant equation in the finite difference approximation:

\mathbf J_n (\mathbf x_n - \mathbf x_{n - 1}) \simeq \mathbf f(\mathbf x_n) - \mathbf f(\mathbf x_{n - 1}) ,

where $n$ is the iteration index. For clarity, let us define:

\mathbf f_n = \mathbf f(\mathbf x_n) ,

\Delta \mathbf x_n = \mathbf x_n - \mathbf x_{n - 1} ,

\Delta \mathbf f_n = \mathbf f_n - \mathbf f_{n - 1} ,

so the above may be rewritten as:

\mathbf J_n \Delta \mathbf x_n \simeq \Delta \mathbf f_n .

The above equation is underdetermined when $k$ is greater than one. Broyden suggests using the current estimate of the Jacobian matrix $J n - 1$ and improving upon it by taking the solution to the secant equation that is a minimal modification to $J n - 1$ :

\mathbf J_n = \mathbf J_{n - 1} + \frac{\Delta \mathbf f_n - \mathbf J_{n - 1} \Delta \mathbf x_n}{\|\Delta \mathbf x_n\|^2} \Delta \mathbf x_n^{\mathrm T}

This minimizes the following Frobenius norm:

\|\mathbf J_n - \mathbf J_{n - 1}\|_{\mathrm f} .

We may then proceed in the Newton direction:

\mathbf x_{n + 1} = \mathbf x_n - \mathbf J_n^{-1} \mathbf f(\mathbf x_n) .

Broyden also suggested using the Sherman-Morrison formula to update directly the inverse of the Jacobian matrix:

\mathbf J_n^{-1} = \mathbf J_{n - 1}^{-1} + \frac{\Delta \mathbf x_n - \mathbf J^{-1}_{n - 1} \Delta \mathbf f_n}{\Delta \mathbf x_n^{\mathrm T} \mathbf J^{-1}_{n - 1} \Delta \mathbf f_n} \Delta \mathbf x_n^{\mathrm T} \mathbf J^{-1}_{n - 1}

This first method is commonly known as the "good Broyden's method".

A similar technique can be derived by using a slightly different modification to $J n - 1$ . This yields a second method, the so-called "bad Broyden's method" (but see^[3]):

\mathbf J_n^{-1} = \mathbf J_{n - 1}^{-1} + \frac{\Delta \mathbf x_n - \mathbf J^{-1}_{n - 1} \Delta \mathbf f_n}{\|\Delta \mathbf f_n\|^2} \Delta \mathbf f_n^{\mathrm T}

This minimizes a different Frobenius norm:

\|\mathbf J_n^{-1} - \mathbf J_{n - 1}^{-1}\|_{\mathrm f} .

Many other quasi-Newton schemes have been suggested in optimization, where one seeks a maximum or minimum by finding the root of the first derivative (gradient in multi dimensions). The Jacobian of the gradient is called Hessian and is symmetric, adding further constraints to its update.

Other members of the Broyden Class

Broyden has not only defined two methods, but a whole class of methods. Other members of this class have been added by other authors.

The Davidon–Fletcher–Powell update is the only member of this class being published before the two members defined by Broyden.^[4]
Schuberts or sparse Broyden algorithm – a modification for sparse Jacobian matrices.^[5]
Klement (2014) – uses less iterations to solve many equation systems.^[6]^[7]

References

↑ Broyden, C. G. (October 1965). "A Class of Methods for Solving Nonlinear Simultaneous Equations". Mathematics of Computation (American Mathematical Society) 19 (92): 577–593. doi:10.1090/S0025-5718-1965-0198670-6. JSTOR 2003941.
↑ Gay, D.M. (August 1979). "Some convergence properties of Broyden's method". SIAM Journal of Numerical Analysis (SIAM) 16 (4): 623–630. doi:10.1137/0716047.
↑ Kvaalen, Eric (November 1991). "A faster Broyden method". BIT Numerical Mathematics (SIAM) 31 (2): 369–372. doi:10.1007/BF01931297.
↑ Broyden, C. G. (October 1965). "A Class of Methods for Solving Nonlinear Simultaneous Equations". Mathematics of Computation (American Mathematical Society) 19 (92): 577–593. doi:10.1090/S0025-5718-1965-0198670-6. JSTOR 2003941.
↑ Schubert, L. K. (1970-01-01). "Modification of a quasi-Newton method for nonlinear equations with a sparse Jacobian". Mathematics of Computation 24 (109): 27–30. doi:10.1090/S0025-5718-1970-0258276-9. ISSN 0025-5718.
↑ Klement, Jan (2014-11-23). "On Using Quasi-Newton Algorithms of the Broyden Class for Model-to-Test Correlation". Journal of Aerospace Technology and Management 6 (4): 407–414. doi:10.5028/jatm.v6i4.373. ISSN 2175-9146.
↑ "Broyden class methods – File Exchange – MATLAB Central". www.mathworks.com. Retrieved 2016-02-04.

External links

Simple basic explanation: The story of the blind archer

Optimization: Algorithms, methods, and heuristics

Unconstrained nonlinear: Methods calling …

… functions

… and gradients

Convergence	Trust region Wolfe conditions

Quasi–Newton	BFGS and L-BFGS DFP Symmetric rank-one (SR1)

Other methods	Gauss–Newton Gradient Levenberg–Marquardt Conjugate gradient Truncated Newton

… and Hessians

Newton's method

The graph of a strictly concave quadratic function is shown in blue, with its unique maximum shown as a red dot. Below the graph appears the contours of the function: The level sets are nested ellipses.

Constrained nonlinear

General	Barrier methods Penalty methods

Differentiable	Augmented Lagrangian methods Sequential quadratic programming Successive linear programming

Convex optimization

Convex
minimization

Linear and
quadratic

Interior point	Affine scaling Ellipsoid algorithm of Khachiyan Projective algorithm of Karmarkar

Basis-Exchange	Simplex algorithm of Dantzig Revised simplex algorithm Criss-cross algorithm Principal pivoting algorithm of Lemke

Combinatorial

Paradigms

Graph
algorithms

Minimum spanning tree	Bellman–Ford Borůvka Dijkstra Floyd–Warshall Johnson Kruskal

Network flows

Metaheuristics

Evolutionary algorithm Hill climbing Local search Simulated annealing Tabu search

Categories
- Algorithms and methods
- Heuristics
Software

This article is issued from Wikipedia - version of the Friday, March 18, 2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.