Theorem (Broyden's Method for 2-dimensional Systems).  To solve the non-linear system  [Graphics:Images/BroydenMethodProof_gr_53.gif],  given one initial approximation  [Graphics:Images/BroydenMethodProof_gr_54.gif],  and generating a sequence  [Graphics:Images/BroydenMethodProof_gr_55.gif]  which converges to the solution  [Graphics:Images/BroydenMethodProof_gr_56.gif],  i.e.  [Graphics:Images/BroydenMethodProof_gr_57.gif].  

Step 0.     Compute the initial Jacobian matrix  

        
[Graphics:Images/BroydenMethodProof_gr_58.gif].  

Use it to compute the first approximation using Newton's method  

        [Graphics:Images/BroydenMethodProof_gr_59.gif].  

For  [Graphics:Images/BroydenMethodProof_gr_60.gif].  Suppose that  [Graphics:Images/BroydenMethodProof_gr_61.gif]  has been obtained, use the following steps to obtain  [Graphics:Images/BroydenMethodProof_gr_62.gif].  

Step 1.     Evaluate the function   [Graphics:Images/BroydenMethodProof_gr_63.gif].

Step 2.  Update the approximate Jacobian using   [Graphics:Images/BroydenMethodProof_gr_64.gif]  and   [Graphics:Images/BroydenMethodProof_gr_65.gif]

        [Graphics:Images/BroydenMethodProof_gr_66.gif]  

Step 3.  Solve the linear system   [Graphics:Images/BroydenMethodProof_gr_67.gif]   for   [Graphics:Images/BroydenMethodProof_gr_68.gif].  

Step 4.  Compute the next approximation   [Graphics:Images/BroydenMethodProof_gr_69.gif].  

Proof .

Start Broyden's method with the initial approximation

        [Graphics:../Images/BroydenMethodProof_gr_70.gif]  

Step 0.     Use Newton's method to construct the next approximation

        [Graphics:../Images/BroydenMethodProof_gr_71.gif],  

        [Graphics:../Images/BroydenMethodProof_gr_72.gif].  

For  [Graphics:../Images/BroydenMethodProof_gr_73.gif].  Suppose that  [Graphics:../Images/BroydenMethodProof_gr_74.gif]  has been obtained, use the following steps to obtain  [Graphics:../Images/BroydenMethodProof_gr_75.gif].  

Step 1.     Evaluate the function   [Graphics:../Images/BroydenMethodProof_gr_76.gif].

Step 2.     This is the first Broyden iteration.  It uses the basic property of the Jacobian  [Graphics:../Images/BroydenMethodProof_gr_77.gif]  to write the equation

        [Graphics:../Images/BroydenMethodProof_gr_78.gif].  
        
But, instead of using the Jacobian matrix  [Graphics:../Images/BroydenMethodProof_gr_79.gif],  a matrix  [Graphics:../Images/BroydenMethodProof_gr_80.gif]  is used in the quasi-Newton condition or secant equation  

        [Graphics:../Images/BroydenMethodProof_gr_81.gif].  

    Unfortunately the secant equation does not define  [Graphics:../Images/BroydenMethodProof_gr_82.gif]  uniquely, it only shows that when a displacement in the direction of the vector  [Graphics:../Images/BroydenMethodProof_gr_83.gif]  is entered on the left of the equation, it is translated in space into a displacement direction that approximates [Graphics:../Images/BroydenMethodProof_gr_84.gif]. We have no other information to explain how displacements in other directions orthogonal to  [Graphics:../Images/BroydenMethodProof_gr_85.gif] are effected.  

    To simplify the situation, Broyden adds a requirement for the update of  [Graphics:../Images/BroydenMethodProof_gr_86.gif]  from  [Graphics:../Images/BroydenMethodProof_gr_87.gif]  for vectors  [Graphics:../Images/BroydenMethodProof_gr_88.gif]  in the orthogonal complement.  The requirement is

        [Graphics:../Images/BroydenMethodProof_gr_89.gif]

for all vectors  [Graphics:../Images/BroydenMethodProof_gr_90.gif]  that are orthogonal to  [Graphics:../Images/BroydenMethodProof_gr_91.gif] (i.e. all vectors satisfying  [Graphics:../Images/BroydenMethodProof_gr_92.gif] ).  This will have the effect of updating  [Graphics:../Images/BroydenMethodProof_gr_93.gif]  to  [Graphics:../Images/BroydenMethodProof_gr_94.gif] only in the direction  [Graphics:../Images/BroydenMethodProof_gr_95.gif].  The resulting formula is  
        
        [Graphics:../Images/BroydenMethodProof_gr_96.gif].   

Where  [Graphics:../Images/BroydenMethodProof_gr_97.gif]  is the Euclidean norm, and we could have used the calculation  [Graphics:../Images/BroydenMethodProof_gr_98.gif].

 

Justification

    The least change modification of  [Graphics:../Images/BroydenMethodProof_gr_120.gif]  to construct  [Graphics:../Images/BroydenMethodProof_gr_121.gif] is usually written as follows:

        [Graphics:../Images/BroydenMethodProof_gr_122.gif],

        [Graphics:../Images/BroydenMethodProof_gr_123.gif],  
    
        [Graphics:../Images/BroydenMethodProof_gr_124.gif].   

The conclusion of the second step is to compute [Graphics:../Images/BroydenMethodProof_gr_125.gif]  

        [Graphics:../Images/BroydenMethodProof_gr_126.gif].

The general step is:  

        [Graphics:../Images/BroydenMethodProof_gr_127.gif]

        [Graphics:../Images/BroydenMethodProof_gr_128.gif]  

        [Graphics:../Images/BroydenMethodProof_gr_129.gif]  

        [Graphics:../Images/BroydenMethodProof_gr_130.gif]

    In order to avoid finding an inverse matrix [Graphics:../Images/BroydenMethodProof_gr_131.gif], one can rewrites the last equation  as follows  

        [Graphics:../Images/BroydenMethodProof_gr_132.gif]  

Use the substitution  [Graphics:../Images/BroydenMethodProof_gr_133.gif]  and get

        [Graphics:../Images/BroydenMethodProof_gr_134.gif]  

Rewrite as a linear system to solve for  [Graphics:../Images/BroydenMethodProof_gr_135.gif]  

Step 3.  Solve the linear system   

        [Graphics:../Images/BroydenMethodProof_gr_136.gif]    for    [Graphics:../Images/BroydenMethodProof_gr_137.gif].  

Step 4.  Compute the next iteration using the formula   

        [Graphics:../Images/BroydenMethodProof_gr_138.gif].  

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

(c) John H. Mathews 2005