To solve equation
, we first consider the Taylor series
expansion of
at any point
:
data:image/s3,"s3://crabby-images/84a7c/84a7c5a2cd94da25bb22f69129ee734f2bc3bd44" alt="$\displaystyle f(x)=f(x_0)+f'(x_0)(x-x_0)+\frac{1}{2!}f''(x)(x-x_0)^2+\cdots
+\frac{1}{n!} f^{(n)}(x_0)(x-x_0)^n +\cdots$" |
(66) |
If
is linear, i.e., its slope
is a constant for any
, then the second and higher order terms are all zero, and the
equation
becomes
data:image/s3,"s3://crabby-images/be598/be598aab0aeda64dba5a59a9b84aa1d2df73a603" alt="$\displaystyle f(x)=f(x_0)+f'(x)(x-x_0)=0$" |
(67) |
Solving this equation we get the root
at which
:
data:image/s3,"s3://crabby-images/b79c4/b79c4f2e120afcb5830012185aa9ed67618b88a7" alt="$\displaystyle x^*=x=x_0-\frac{f(x_0)}{f'(x_0)}=x_0+\Delta x_0$" |
(68) |
where
is the step we need to take to
go from the initial point
to the root
:
data:image/s3,"s3://crabby-images/37808/37808edac0d7e30fa4fdf58cd4ef1c4509025aa6" alt="$\displaystyle \Delta x_0=-\frac{f(x_0)}{f'(x_0)}\;\left\{\begin{array}{ll}
<0 &...
... & \mbox{if $f(x_0)$\ and $f'(x_0)$\ are of different signs}
\end{array}\right.$" |
(69) |
If
is nonlinear, the sum of the first two terms of the Taylor
expansion is only an approximation of
, and the resulting
found above can be treated as an approximation of the root, which can
be improved iteratively to move from the initial point
towards
the root, as illustrated in the figure below:
data:image/s3,"s3://crabby-images/37420/374209316c64644d8f0b997bcefa2467f0ff1d81" alt="$\displaystyle x_{n+1}=x_n+\Delta x_n =x_n-\frac{f(x_n)}{f'(x_n)}=g(x_n),\;\;\;\;\;\;
n=0,\,1,\,2,\cdots$" |
(70) |
The Newton-Raphson method can be considered as the fixed point
iteration
based on
data:image/s3,"s3://crabby-images/cfd73/cfd73b39a067869c823ca953c2485f92cd4b2a73" alt="$\displaystyle g(x)=x-f(x)/f'(x)$" |
(71) |
The root
at which
is also the fixed point of
,
i.e.,
. For the iteration to converge,
needs to be
a contraction with
. Consider
data:image/s3,"s3://crabby-images/98a84/98a8486ed24679a64f7721eab4a3c9062071836c" alt="$\displaystyle g'(x)=\left(x-\frac{f(x)}{f'(x)}\right)'
=1-\frac{(f'(x))^2-f(x)f''(x)}{(f'(x))^2}
=\frac{f(x)f''(x)}{(f'(x))^2}$" |
(72) |
We make the following comments:
The order of convergence of the Newton-Raphson iteration can be found
based on the Taylor expansion of
at the neighborhood of the root
:
data:image/s3,"s3://crabby-images/6ecc5/6ecc59e7909684823e382c805c2cc51fc01cbda1" alt="$\displaystyle 0=f(x^*)=f(x_n)+f'(x_n)e_n+\frac{f''(x_n)}{2}e_n^2+O(e_n^3)$" |
(74) |
where
is the error at the nth step.
Substituting the Newton-Raphson's iteration
i.e.data:image/s3,"s3://crabby-images/bb321/bb32105d5c74e35a7f334f4f72aa8face9d1c6d5" alt="$\displaystyle \;\;\;\;
f(x_n)=f'(x_n)(x_n-x_{n+1})$" |
(75) |
into the equation above, we get
i.e.
data:image/s3,"s3://crabby-images/15997/15997f1e4d002d5048bf8aaaab39a50c0f2f87ed" alt="$\displaystyle e_{n+1}=-\frac{f''(x_n)}{2f'(x_n)}e_n^2+O(e_n^3)$" |
(77) |
When
all the higher order terms disappear, and the
above can be written as
data:image/s3,"s3://crabby-images/a8a44/a8a4455b74127c2d9b5a964953764bfdad24666b" alt="$\displaystyle \lim\limits_{n\rightarrow\infty}\frac{\vert e_{n+1}\vert}{\vert e_n\vert^2}
\le \frac{\vert f''(x^*)\vert}{2\vert f'(x^*)\vert}=\mu$" |
(78) |
Alternatively, we can get the Taylor expansion in terms of
:
data:image/s3,"s3://crabby-images/22cdb/22cdbf82967e9b440ef69ffa594733c028edd0f5" alt="$\displaystyle x_{n+1}=x^*-e_{n+1}=g(x_n)=g(x^*-e_n)
=g(x^*)-g'(x^*)e_n+\frac{g''(x^*)}{2}e_n^2+O(e_n^3)$" |
(79) |
Subtracting
from both sides we get:
data:image/s3,"s3://crabby-images/d8a9e/d8a9e025e3662edb357bb9282504970d673bdd29" alt="$\displaystyle e_{n+1}=g'(x^*)e_n-\frac{g''(x^*)}{2}e_n^2+O(e_n^3)$" |
(80) |
Now we find
and
:
data:image/s3,"s3://crabby-images/98a84/98a8486ed24679a64f7721eab4a3c9062071836c" alt="$\displaystyle g'(x)=\left(x-\frac{f(x)}{f'(x)}\right)'
=1-\frac{(f'(x))^2-f(x)f''(x)}{(f'(x))^2}
=\frac{f(x)f''(x)}{(f'(x))^2}$" |
(81) |
and
Evaluating these at
at which
, and substituting them
back into the expression for
above, the linear term is zero as
, i.e., the convergence is quadratic, and we get the same
result:
data:image/s3,"s3://crabby-images/9c5ae/9c5ae031cc02833b5ccee90d2aa050c5ecc8ea66" alt="$\displaystyle \lim\limits_{n\rightarrow\infty}\frac{\vert e_{n+1}\vert}{\vert e...
...e \frac{\vert f''(x^*)\vert}{2\vert f'(x^*)\vert}=\frac{\vert g''(x^*)\vert}{2}$" |
(83) |
We see that, if
, then the order of convergence of
the Newton-Raphson method is
, and the rate of convergence is
. However, if
, the convergence
is linear rather than quadratic, as shown in the example below.
Example: Consider solving the equation
data:image/s3,"s3://crabby-images/c31c4/c31c46d8948f9fde0aa7a1e0af597c9ed15c70d7" alt="$\displaystyle f(x)=x^3-4x^2+5x-2=(x-1)^2(x-2)=0$" |
(84) |
which has a repeated root
as well as a single root
. We
have
data:image/s3,"s3://crabby-images/8e303/8e3035bfb202dd3e21898bd578b07a28fa08ee01" alt="$\displaystyle f'(x)=3x^2-8x+5=(3x-5)(x-1),\;\;\;\;\;\;\;f''(x)=6x-8$" |
(85) |
Note that at the root
we have
. We further
find:
data:image/s3,"s3://crabby-images/1ad5f/1ad5f33d524dc24782beb326d1741abd4bbdd968" alt="$\displaystyle g(x)=x-\frac{f(x)}{f'(x)}=x-\frac{(x-1)^2(x-2)}{(3x-5)(x-1)}
=x-\frac{(x-1)(x-2)}{3x-5}$" |
(86) |
and
We therefore have
data:image/s3,"s3://crabby-images/19811/19811e007e385e08833579c91ef8163a3fdf60c1" alt="$\displaystyle e_{n+1}=g'(x)e_n-\frac{1}{2}g''(x)e_n^2+O(e_n^3)
\stackrel{n\righ...
...}
\left\{\begin{array}{ll}e_n/2 & x=1 \\ -g''(x)e_n^2/2 & x=2\end{array}\right.$" |
(87) |
We see the iteration converges quadratically to the single root
,
but only linearly to the repeated root
.
We consider in general a function with a repeated root at
of
multiplicity
:
data:image/s3,"s3://crabby-images/bc93b/bc93bb007b60c9e3d92829ab8f3e37433387386a" alt="$\displaystyle f(x)=(x-a)^kh(x)$" |
(88) |
its derivative is
![$\displaystyle f'(x)=k(x-a)^{k-1}h(x)+(x-a)^kh'(x)=(x-a)^{k-1}[kh(x)+(x-a)h'(x)]$](img399.svg) |
(89) |
As
, the convergence of the Newton-Raphson method to
is
linear, rather than quadratic.
In such case, we can accelerate the iteration by using a step size
:
data:image/s3,"s3://crabby-images/82a23/82a23ba2f60108938c28865778ddc026d2cfe085" alt="$\displaystyle g(x)=x-k\;\frac{f(x)}{f'(x)}$" |
(90) |
and
data:image/s3,"s3://crabby-images/8b5f5/8b5f50a0196422ff7002443b28d6276357c02eaf" alt="$\displaystyle g'(x)=1-k\;\frac{(f'(x))^2-f(x)f''(x)}{(f'(x))^2}
=1-k+k\,\frac{f(x)f''(x)}{(f'(x))^2}$" |
(91) |
Now we show that this
is zero at the repeated root
,
therefore the convergence to this root is still quadratic.
We substitute
,
, and
into the expression for
above to get (after some algebra):
At
, we get
data:image/s3,"s3://crabby-images/eaf49/eaf49a4b57bfa45a49a79e1c4dba2239cf2b45d9" alt="$\displaystyle g'(x)\bigg\vert _{x=a}=1-k+k\frac{(k-1)kh^2}{(kh)^2}=0$" |
(94) |
i.e., the convergence to the repeated root at
is no longer
linear but quadratic.
The difficulty, however, is that the multiplicity
of a root is
unknown ahead of time. If
is used blindly some root may be
skipped, and the iteration may oscillate around the real root.
Example: Consider solving
,
with a double root
and a single root
. In the following,
we compare the performance of both
and
.
- First use an initial guess
.
- When
is used, the convergence is linear around the
double root
. It takes 16 iterations to get
with
:
- When
is used, the convergence is quadratic around
the double root
. It takes only 3 iterations to get
with
:
- Next use a different initial guess
.
- If
is used, it takes 7 iterations to get
with
, the convergence is quadratic.
- If
is used, oscillation happens as shown in the
figure below. However, if a better initial guess
is used instead of
, it takes only one step to get
with
, the convergence is significantly
accelerated.