Chapter 9 Stationary Points

9.1 Definition of Stationary Points

Recall that for a univariate function y=f(x), a stationary point is a value x0 for x at which f(x0)=0. Graphically this is a point on the curve at which the tangent line is horizontal.


Now consider a function of two variables z=f(x,y).

A point (a,b) at which fx(a,b)=fy(a,b)=0 is a stationary point of f(x,y).

Calculate the stationary points of the function f(x,y)=x2+y2.

Calculating the first order partial derivatives one obtains

fx=2x,fy=2y.

So

fx=0, and fy=02x=0, and 2y=0x=0, and y=0.

Therefore f has a unique stationary point at (0,0).


Calculate the stationary points of the function f(x,y)=6x2y3x3+2y3150y.

Calculating the first order partial derivatives one obtains fx=12xy9x2,fy=6x2+6y2150. So fx=0, and fy=012xy9x2=0, and 6x2+6y2150=03x(4y3x)=0, and x2+y2=25. The equation 3x(4y3x)=0 implies either x=0 or y=34x. If x=0, the equation x2+y2=25 becomes y2=25, which has solutions y=5 and y=5. Therefore there are stationary points at (0,5) and (0,5).
Alternatively if y=34x, the equation x2+y2=25 becomes 2516x2=25, which has solutions x=4 and x=4. At x=4, one has y=34(4)=3, and at x=4, one has y=3. Therefore there are stationary points at (4,3) and (4,3).

In total there are four stationary points (0,5), (0,5), (4,3) and (4,3).


Suppose (a,b) is a stationary point of a function f(x,y). Graphically one can take two cross-sections of the surface z=f(x,y) through the planes x=a and y=b respectively. This will describes two curves, one given in the (y,z)-plane by the univariate function z=f(a,y), and the other given in the (x,z)-plane by the univariate function z=f(x,b).These curves will have stationary points, in the context of univariate functions, at y=b and x=a respectively.


Describing the surface of f using an implicit function, namely F(x,y,z)=zf(x,y)=0, allows us to calculate a normal vector as per Section 7.1. Specifically a normal is given by F=(Fx,Fy,Fz)=(fx,fy,1)=(0,0,1).

It follows that the tangent plane to z=f(x,y) is horizontal at a stationary point.


Definition 9.1.1 generalises to any multivariate function.

A stationary point of f(x1,x2,,xn) is a point such that fxi=0, for all i=1,2,n.

There are three types of stationary points:

  • Local maximum;

  • Local minimum;

  • Saddle point.

We will study each of these in turn in the following sections.

9.2 Local Maxima and Minima

Let (a,b) be a stationary point of a function of two variables f(x,y).

The value f(a,b) of f at (a,b) is a local maximum if

f(a,b)f(x,y),for all (x,y) in D,

where D is some open disc with center (a,b).


The value f(a,b) of f at (a,b) is a local minimum if

f(a,b)f(x,y),for all (x,y) in D,

where D is some open disc with center (a,b).


Any size of the disc D in Definition 8.2.1 and Definition 8.2.2 will do, no matter how small. It is also worth noting the terminology local: there may be other points at which the function is smaller than a local minimum or greater than a local maximum, but these will be a distance away from the stationary point.

Consider the function f(x,y)=x2+y2 from Example 9.1.2. We know there is a unique stationary point at (0,0). Calculate that

f(0,0)=0,f(x,y)=x2+y2>0,for (x,y)(0,0).

Therefore (0,0) is a minimum stationary point.

9.3 Saddle Points

Let (a,b) be a stationary point of a function of two variables f(x,y).

The point (a,b) is a saddle point of f if for every disc D with center (a,b) there exists

  • a point (x1,y1)D such that f(x1,y1)>f(a,b);

  • a point (x2,y2)D such that f(x2,y2)<f(a,b).



Equivalently a saddle point is a stationary point that is neither a local maximum or a local minimum.

9.4 Classification of Stationary Points

Suppose f(x,y) has a stationary point at (a,b). How can one tell if this stationary point is a local maximum, a local minimum or a saddle point?

Let f be a function with continuous second order partial derivatives. The Hessian of f is H(x,y)=fxxfyyf2xy


Let f be a function with a stationary point at (a,b), and continuous second order partial derivatives in a disc centered at (a,b). Then

  • If H(a,b)<0, then f has a saddle point at (a,b);

  • If H(a,b)>0 and fxx(a,b)>0, then f has a local minimum at (a,b);

  • If H(a,b)>0 and fxx(a,b)<0, then f has a local maximum at (a,b);

  • If H(a,b)=0, then no conclusion can be made about the classification of (a,b).

Let h,k be suitably small variables so that (a+h,b+k) is contained in the disc centered at (a,b) in which f has continuous second order partial derivatives


By Taylor’s theorem with n=1, one has f(a+h,b+k)=f(a,b)+hfx(a,b)+kfy(a,b)+R1, where R1=12(h2fxx(a+th,b+tk)+2hkfxy(a+th,b+tk)+k2fyy(a+th,b+tk)), for some t[0,1]. Substituting in that fx(a,b)=fy(a,b)=0 and rearranging, one obtains f(a+h,b+k)f(a,b)=R1. Hence the sign of R1 is key to the classification of the stationary point (a,b) since it determines whether f increases or decreases away from f. Specifically for a fixed value of h,k:

  • R1(hk)>0,f(a+h,b+k)>f(a,b);

  • R1(hk)<0,f(a+h,b+k)<f(a,b).

Rearranging:

R1=12(h2fxx+2hkfxy+k2fyy)=k22(fxx(hk)2+2fxyhk+fyy).

So R1 can be thought of a quadratic function in the single variable hk. Since we are only interested in the sign of R1, and k22 is always positive, it is enough to consider the quadratic function ~R1(hk)=fxx(hk)2+2fxyhk+fyy Quadratic functions are well-understood. In particular for a general quadratic Q(x)=ax2+bx+c, the roots of Q are governed by the discriminant ΔQ=b24ac. With this in mind, define: Δ=Δ~R1(hk)=(2fxy)24fxxfyy=4(f2xyfxxfyy).

Suppose that Δ>0. Then ~R1 has two real roots.


Specifically there are two distinct values for hk for which R1=0. At both of these roots R1 changes sign, and (a,b) is therefore a saddle point.


Alternatively suppose Δ<0, that is that ~R1 has no real roots. Then R1 will always have the same sign, be that positive or negative. Therefore (a,b) is either a local minimum or a local maximum.



Since in this case the sign of R1 does not change, it can be found by looking at a single point. Specifically when k=0, we have R1=h22fxx. Hence

  • fxx>0R1>0(a,b) is a local minimum;

  • fxx<0R1<0(a,b) is a local maximum.

Noting that Δ=4H gives the desired result.

Find and classify the stationary points of f(x,y)=6x22x3+3y2+6xy.
Equating the two first order partial derivatives of f to zero, one obtains fx=12x6x2+6y=0,fy=6y+6x=0.

Substituting () into () gives

12x6x26x=0,x(1x)=0,x=0 or x=1.

When x=0, equation () dictates y=0, and when x=1, equation () dictates y=1. Therefore f has two stationary points: (0,0) and (1,1).

Calculating the second order partial derivatives of f, one obtains:

fxx=1212x,fxy=6,fyy=6.

The Hessian of f is then given by

H(x,y)=fxxfyyf2xy=6(1212x)36=3672x.

By Theorem 9.4.2, the stationary points are classified as follows:

  • H(0,0)=36>0 and fxx=12>0, the stationary point (0,0) is a local minimum;

  • H(1,1)=36<0, the stationary point (1,1) is a saddle point.