Theory and Proof

for

Broyden's Method

Return to Numerical Methods - Numerical Analysis

 

Background

    Newton's method can be used to solve systems of nonlinear equations.

Theorem (Newton-Raphson Method for 2-dimensional Systems).  To solve the non-linear system  [Graphics:Images/BroydenMethodProof_gr_1.gif],  given one initial approximation  [Graphics:Images/BroydenMethodProof_gr_2.gif],  and generating a sequence  [Graphics:Images/BroydenMethodProof_gr_3.gif]  which converges to the solution  [Graphics:Images/BroydenMethodProof_gr_4.gif],  i.e.  [Graphics:Images/BroydenMethodProof_gr_5.gif].   Suppose that  [Graphics:Images/BroydenMethodProof_gr_6.gif]  has been obtained, use the following steps to obtain  [Graphics:Images/BroydenMethodProof_gr_7.gif].  

Step 1.     Evaluate the function   [Graphics:Images/BroydenMethodProof_gr_8.gif].

Step 2.  Evaluate the Jacobian   [Graphics:Images/BroydenMethodProof_gr_9.gif].  

Step 3.  Solve the linear system   [Graphics:Images/BroydenMethodProof_gr_10.gif]   for   [Graphics:Images/BroydenMethodProof_gr_11.gif].  

Step 4.  Compute the next approximation   [Graphics:Images/BroydenMethodProof_gr_12.gif].  

Proof  Nonlinear Systems  

 

More Background

    A drawback of Newton's method is the requirement that the Jacobian matrix be computed, which requires the calculation of  [Graphics:Images/BroydenMethodProof_gr_13.gif]  partial derivatives per iteration.  One obvious improvement is to use a finite difference approximation for the Jacobian matrix.  For two dimensions this is

        [Graphics:Images/BroydenMethodProof_gr_14.gif]
where h is small.  Notice that this will require  [Graphics:Images/BroydenMethodProof_gr_15.gif]  function evaluations.  

Exploration

 

Broyden's Method

    Calculation of the Jacobian matrix requires  
[Graphics:Images/BroydenMethodProof_gr_37.gif]  partial derivative evaluations and the approximate Jacobian matrix requires [Graphics:Images/BroydenMethodProof_gr_38.gif] function evaluations.  Hence, a more computationally efficient method is desired.  Broyden's method is a least change secant update method or quasi-Newton method.  Recall the one-dimensional case of Newton's method for solving  [Graphics:Images/BroydenMethodProof_gr_39.gif]. The secant method replaces the derivative  [Graphics:Images/BroydenMethodProof_gr_40.gif]  with the difference quotient  

        
[Graphics:Images/BroydenMethodProof_gr_41.gif]  

        [Graphics:Images/BroydenMethodProof_gr_42.gif]  

The extension to higher dimensions is made by using the basic property of the Jacobian  [Graphics:Images/BroydenMethodProof_gr_43.gif]  to write the equation

        [Graphics:Images/BroydenMethodProof_gr_44.gif].  

Broyden's method starts initially with the Jacobian matrix  
[Graphics:Images/BroydenMethodProof_gr_45.gif].  Then in successive iterations uses an update the approximate Jacobian with the matrix  [Graphics:Images/BroydenMethodProof_gr_46.gif]:  

        
[Graphics:Images/BroydenMethodProof_gr_47.gif]  for  [Graphics:Images/BroydenMethodProof_gr_48.gif].  

This equation is known as the quasi-Newton condition or secant condition.  Recall that two initial two values  [Graphics:Images/BroydenMethodProof_gr_49.gif]  are necessary to start the secant method.  In practice we are given one starting value [Graphics:Images/BroydenMethodProof_gr_50.gif] and compute the Jacobian  [Graphics:Images/BroydenMethodProof_gr_51.gif]  and use Newton's method to get the first approximation  [Graphics:Images/BroydenMethodProof_gr_52.gif].  The following result gives the details of Broyden's method.

 

Theorem (Broyden's Method for 2-dimensional Systems).  To solve the non-linear system  [Graphics:Images/BroydenMethodProof_gr_53.gif],  given one initial approximation  [Graphics:Images/BroydenMethodProof_gr_54.gif],  and generating a sequence  [Graphics:Images/BroydenMethodProof_gr_55.gif]  which converges to the solution  [Graphics:Images/BroydenMethodProof_gr_56.gif],  i.e.  [Graphics:Images/BroydenMethodProof_gr_57.gif].  

Step 0.     Compute the initial Jacobian matrix  

        
[Graphics:Images/BroydenMethodProof_gr_58.gif].  

Use it to compute the first approximation using Newton's method  

        [Graphics:Images/BroydenMethodProof_gr_59.gif].  

For  [Graphics:Images/BroydenMethodProof_gr_60.gif].  Suppose that  [Graphics:Images/BroydenMethodProof_gr_61.gif]  has been obtained, use the following steps to obtain  [Graphics:Images/BroydenMethodProof_gr_62.gif].  

Step 1.     Evaluate the function   [Graphics:Images/BroydenMethodProof_gr_63.gif].

Step 2.  Update the approximate Jacobian using   [Graphics:Images/BroydenMethodProof_gr_64.gif]  and   [Graphics:Images/BroydenMethodProof_gr_65.gif]

        [Graphics:Images/BroydenMethodProof_gr_66.gif]  

Step 3.  Solve the linear system   [Graphics:Images/BroydenMethodProof_gr_67.gif]   for   [Graphics:Images/BroydenMethodProof_gr_68.gif].  

Step 4.  Compute the next approximation   [Graphics:Images/BroydenMethodProof_gr_69.gif].  

Proof .

 

Improving Broyden's Method

    Matrix inversion requires  [Graphics:Images/BroydenMethodProof_gr_139.gif]  calculations.  Thus we seek a way to avoid this time consuming step.  A matrix inversion formula of Sherman and Morrison can facilitate in making a more efficient algorithm.  Except for [Graphics:Images/BroydenMethodProof_gr_140.gif], no matrix inverse or solution of a linear system computation is performed in the general step.  This is efficient when n is large.  

 

Theorem (Sherman-Morrison Formula).  If  [Graphics:Images/BroydenMethodProof_gr_141.gif]  is a nonsingular matrix and  [Graphics:Images/BroydenMethodProof_gr_142.gif]  are vectors such that  [Graphics:Images/BroydenMethodProof_gr_143.gif],  then  

        [Graphics:Images/BroydenMethodProof_gr_144.gif]  
    or
        [Graphics:Images/BroydenMethodProof_gr_145.gif].  

 

Theorem (Broyden's Method for n-dimensional Systems).  To solve the non-linear system  [Graphics:Images/BroydenMethodProof_gr_146.gif],  given one initial approximation  [Graphics:Images/BroydenMethodProof_gr_147.gif],  and generating a sequence  [Graphics:Images/BroydenMethodProof_gr_148.gif]  which converges to the solution  [Graphics:Images/BroydenMethodProof_gr_149.gif],  i.e.  [Graphics:Images/BroydenMethodProof_gr_150.gif].  Compute the initial Jacobian matrix  

        
[Graphics:Images/BroydenMethodProof_gr_151.gif].  

Use it to compute the first approximation using Newton's method  

        [Graphics:Images/BroydenMethodProof_gr_152.gif].  

For  [Graphics:Images/BroydenMethodProof_gr_153.gif].  Suppose that  [Graphics:Images/BroydenMethodProof_gr_154.gif]  has been obtained, use the following steps to obtain  [Graphics:Images/BroydenMethodProof_gr_155.gif].  

Step 1.     Evaluate the function  [Graphics:Images/BroydenMethodProof_gr_156.gif].  

Step 2.  Update the approximate Jacobian using  [Graphics:Images/BroydenMethodProof_gr_157.gif],  and  [Graphics:Images/BroydenMethodProof_gr_158.gif]

        [Graphics:Images/BroydenMethodProof_gr_159.gif].   

Step 3.  Compute  [Graphics:Images/BroydenMethodProof_gr_160.gif]  using the Sherman-Morrison formula

        [Graphics:Images/BroydenMethodProof_gr_161.gif]  

Step 4.  Compute the next approximation   

        [Graphics:Images/BroydenMethodProof_gr_162.gif].  

Remark.

 

References

    The details regarding the proof are given in the articles below.

A Class of Methods for Solving Nonlinear Simultaneous Equations  
C. G. Broyden  
Mathematics of Computation, Vol. 19, No. 92. (Oct., 1965), pp. 577-593, Jstor.  

Software Libraries for Linear Algebra Computations on High Performance Computers  
Jack J. Dongarra; David W. Walker  
SIAM Review, Vol. 37, No. 2. (Jun., 1995), pp. 151-180, Jstor.  

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

(c) John H. Mathews 2005