Chapter 9 Stationary Points
9.1 Definition of Stationary Points
Recall that for a univariate function y=f(x), a stationary point is a value x0 for x at which f′(x0)=0. Graphically this is a point on the curve at which the tangent line is horizontal.

Now consider a function of two variables z=f(x,y).
A point (a,b) at which fx(a,b)=fy(a,b)=0 is a stationary point of f(x,y).
Calculate the stationary points of the function f(x,y)=x2+y2.
Calculating the first order partial derivatives one obtains
So
Therefore f has a unique stationary point at (0,0).
Calculate the stationary points of the function f(x,y)=6x2y−3x3+2y3−150y.
Calculating the first order partial derivatives one obtains
fx=12xy−9x2,fy=6x2+6y2−150.
So
fx=0, and fy=0⟺12xy−9x2=0, and 6x2+6y2−150=0⟺3x(4y−3x)=0, and x2+y2=25.
The equation 3x(4y−3x)=0 implies either x=0 or y=34x. If x=0, the equation x2+y2=25 becomes y2=25, which has solutions y=5 and y=−5. Therefore there are stationary points at (0,5) and (0,−5).
Alternatively if y=34x, the equation x2+y2=25 becomes 2516x2=25, which has solutions x=4 and x=−4. At x=4, one has y=34(4)=3, and at x=−4, one has y=−3. Therefore there are stationary points at (4,3) and (−4,−3).
In total there are four stationary points (0,5), (0,−5), (4,3) and (−4,−3).
Suppose (a,b) is a stationary point of a function f(x,y). Graphically one can take two cross-sections of the surface z=f(x,y) through the planes x=a and y=b respectively. This will describes two curves, one given in the (y,z)-plane by the univariate function z=f(a,y), and the other given in the (x,z)-plane by the univariate function z=f(x,b).These curves will have stationary points, in the context of univariate functions, at y=b and x=a respectively.

Describing the surface of f using an implicit function, namely F(x,y,z)=z−f(x,y)=0, allows us to calculate a normal vector as per Section 7.1. Specifically a normal is given by ∇F=(∂F∂x,∂F∂y,∂F∂z)=(−fx,−fy,1)=(0,0,1).
It follows that the tangent plane to z=f(x,y) is horizontal at a stationary point.

Definition 9.1.1 generalises to any multivariate function.
A stationary point of f(x1,x2,…,xn) is a point such that fxi=0, for all i=1,2,…n.
There are three types of stationary points:
Local maximum;
Local minimum;
Saddle point.
We will study each of these in turn in the following sections.
9.2 Local Maxima and Minima
Let (a,b) be a stationary point of a function of two variables f(x,y).
The value f(a,b) of f at (a,b) is a local maximum if
where D is some open disc with center (a,b).

The value f(a,b) of f at (a,b) is a local minimum if
where D is some open disc with center (a,b).

Any size of the disc D in Definition 8.2.1 and Definition 8.2.2 will do, no matter how small. It is also worth noting the terminology local: there may be other points at which the function is smaller than a local minimum or greater than a local maximum, but these will be a distance away from the stationary point.
Consider the function f(x,y)=x2+y2 from Example 9.1.2. We know there is a unique stationary point at (0,0). Calculate that
Therefore (0,0) is a minimum stationary point.
9.3 Saddle Points
Let (a,b) be a stationary point of a function of two variables f(x,y).
The point (a,b) is a saddle point of f if for every disc D with center (a,b) there exists
a point (x1,y1)∈D such that f(x1,y1)>f(a,b);
a point (x2,y2)∈D such that f(x2,y2)<f(a,b).


Equivalently a saddle point is a stationary point that is neither a local maximum or a local minimum.
9.4 Classification of Stationary Points
Suppose f(x,y) has a stationary point at (a,b). How can one tell if this stationary point is a local maximum, a local minimum or a saddle point?
Let f be a function with continuous second order partial derivatives. The Hessian of f is H(x,y)=fxxfyy−f2xy
Let f be a function with a stationary point at (a,b), and continuous second order partial derivatives in a disc centered at (a,b). Then
If H(a,b)<0, then f has a saddle point at (a,b);
If H(a,b)>0 and fxx(a,b)>0, then f has a local minimum at (a,b);
If H(a,b)>0 and fxx(a,b)<0, then f has a local maximum at (a,b);
If H(a,b)=0, then no conclusion can be made about the classification of (a,b).
Let h,k be suitably small variables so that (a+h,b+k) is contained in the disc centered at (a,b) in which f has continuous second order partial derivatives

By Taylor’s theorem with n=1, one has f(a+h,b+k)=f(a,b)+hfx(a,b)+kfy(a,b)+R1, where R1=12(h2fxx(a+th,b+tk)+2hkfxy(a+th,b+tk)+k2fyy(a+th,b+tk)), for some t∈[0,1]. Substituting in that fx(a,b)=fy(a,b)=0 and rearranging, one obtains f(a+h,b+k)−f(a,b)=R1. Hence the sign of R1 is key to the classification of the stationary point (a,b) since it determines whether f increases or decreases away from f. Specifically for a fixed value of h,k:
R1(hk)>0,⟹f(a+h,b+k)>f(a,b);
R1(hk)<0,⟹f(a+h,b+k)<f(a,b).
Rearranging:
So R1 can be thought of a quadratic function in the single variable hk. Since we are only interested in the sign of R1, and k22 is always positive, it is enough to consider the quadratic function ~R1(hk)=fxx(hk)2+2fxyhk+fyy Quadratic functions are well-understood. In particular for a general quadratic Q(x)=ax2+bx+c, the roots of Q are governed by the discriminant ΔQ=b2−4ac. With this in mind, define: Δ=Δ~R1(hk)=(2fxy)2−4fxxfyy=4(f2xy−fxxfyy).
Suppose that Δ>0. Then ~R1 has two real roots.

Specifically there are two distinct values for hk for which R1=0. At both of these roots R1 changes sign, and (a,b) is therefore a saddle point.

Alternatively suppose Δ<0, that is that ~R1 has no real roots. Then R1 will always have the same sign, be that positive or negative. Therefore (a,b) is either a local minimum or a local maximum.


Since in this case the sign of R1 does not change, it can be found by looking at a single point. Specifically when k=0, we have R1=h22fxx. Hence
fxx>0⟹R1>0⟹(a,b) is a local minimum;
fxx<0⟹R1<0⟹(a,b) is a local maximum.
Noting that Δ=−4H gives the desired result.
Find and classify the stationary points of
f(x,y)=6x2−2x3+3y2+6xy.
Equating the two first order partial derivatives of f to zero, one obtains
fx=12x−6x2+6y=0,fy=6y+6x=0.
Substituting (⋆⋆) into (⋆) gives
When x=0, equation (⋆⋆) dictates y=0, and when x=1, equation (⋆) dictates y=−1. Therefore f has two stationary points: (0,0) and (1,−1).
Calculating the second order partial derivatives of f, one obtains:
The Hessian of f is then given by
By Theorem 9.4.2, the stationary points are classified as follows:
H(0,0)=36>0 and fxx=12>0,⟹ the stationary point (0,0) is a local minimum;
H(1,−1)=−36<0,⟹ the stationary point (1,−1) is a saddle point.
