3.3 The structure of the set of solutions

In this section we will study the general structure of the set of solutions to a system of linear equations, when it has solutions at all. In the next section we will then look at methods to actually solve a system of linear equations.

Definition 3.27: (S(A,b))
Let A\in M_{m,n}(\mathbb{R}) and b\in \mathbb{R}^m, then we set S(A,b):=\{x\in \mathbb{R}^n: Ax=b\} . This is a subset of \mathbb{R}^n and consists of all the solutions to the system of linear equations Ax=b. If there are no solutions then S(A,b)=\emptyset.

One often distinguishes between two types of systems of linear equations based on their constant terms.

Definition 3.28: (Homogeneous and inhomogenous)
The system of linear equations Ax=b is called homogeneous if b=\mathbf{0}, i.e., if it is of the form Ax=\mathbf{0}. If b\ne \mathbf{0} the system is called inhomogeneous.

If the system is inhomogeneous, then it doesn’t necessarily have a solution. But for the ones which have a solution we can determine the structure of the set of solutions. The key observation is that if we have one solution, say x_0\in \mathbb{R}^n which satisfies Ax_0=b, then we can create further solutions by adding solutions of the corresponding homogeneous system, Ax=\mathbf{0}, since if Ax=\mathbf{0} A(x_0+x)=Ax_0+Ax=b+\mathbf{0}=b, and so x_0+x is another solution to the inhomogeneous system.

Theorem 3.29:

Let A\in M_{m,n}(\mathbb{R}) and b\in \mathbb{R}^m and assume there exists x_0\in \mathbb{R}^n with Ax_0=b. Then S(A,b)=\{x_0\}+S(A,\mathbf{0}):=\{x_0+x: x\in S(A,\mathbf{0})\}

Proof.

As we noticed above, if x\in S(A,\mathbf{0}), then A(x_0+x)=b, hence \{x_0\}+S(A,\mathbf{0})\subseteq S(A,b).

On the other hand, if y\in S(A,b) then A(y-x_0)=Ay-Ax_0=b-b=\mathbf{0}, and so y-x_0\in S(A,\mathbf{0}). Therefore S(A,b)\subseteq \{x_0\}+S(A,\mathbf{0}) and so S(A,b)=\{x_0\}+S(A,\mathbf{0}).

Remarks:

  • The structure of the set of solutions is often described as follows: The general solution of the inhomogeneous system Ax=b is given by a special solution x_0 to the inhomogeneous system plus a general solution to the corresponding homogeneous system Ax=\mathbf{0}.

  • The case that there is unique solution to Ax=b corresponds to S(A,\mathbf{0})=\{\mathbf{0}\}, in which case S(A,b)=\{x_0\}.

At first sight the definition of the set \{x_0\}+ S(A,\mathbf{0}) seems to depend on the choice of the particular solution x_0 to Ax_0=b. But this is not so; another choice y_0 just corresponds to a different labelling of the elements of the set.

Example 3.30:

Let us look at an example of three equations with three unknowns: \begin{aligned} 3 x +z&= 0,\\ y-z& =1,\\ 3x+y&=1. \end{aligned} This set of equations corresponds to A=\begin{pmatrix}3 & 0 & 1\\ 0 & 1 &-1\\ 3 & 1 &0\end{pmatrix}\quad\text{and}\quad b=\begin{pmatrix}0\\1\\1\end{pmatrix}. To solve this set of equations we try to simplify it: if we subtract the first equation from the third the third equation becomes y-z=1 which is identical to the second equation. Hence the initial system of three equations is equivalent to the following system of two equations: 3 x +z= 0 ,\quad y-z=1 . In the first one we can solve for x as a function of z and in the second for y as a function of z, hence \begin{equation} x=-\frac{1}{3}z ,\quad y=1+z. \tag{3.7}\end{equation} So z is arbitrary, but once z is chosen, x and y are fixed, and the set of solutions is given by S(A,b)=\{(-z/3,1+z,z): z\in \mathbb{R}\} . A similar computation for the corresponding homogeneous system of equations \begin{aligned} 3 x +z&= 0,\\ y-z& =0,\\ 3x+y&=0 \end{aligned} gives us the solutions x=-z/3, y=z, and z\in \mathbb{R} arbitrary, hence S(A,\mathbf{0})=\{(-z/3,z,z): z\in \mathbb{R}\} .

A particular solution to the inhomogeneous system is given by choosing z=0 in (3.7), i.e., x_0=(0,1,0), and then the relation S(A,b)=\{x_0\}+S(A,\mathbf{0}) can be seen directly, since for x=(-z/3,z,z)\in S(A,\mathbf{0}) we have x_0+x=(0,1,0)+(-z/3,z,z)=(-z/3,1+z,z) which was the general form of an element in S(A,b). But what happens if we choose another element of S(A,b)? Let \lambda\in \mathbb{R}, then x_{\lambda}:=(-\lambda/3,1+\lambda,\lambda) is in S(A,b) and we again have S(A,b)=\{x_{\lambda}\}+S(A,\mathbf{0}) , since x_{\lambda}+x=(-\lambda/3,1+\lambda ,\lambda)+(-z/3,z,z)=(-(\lambda+z)/3,1+(\lambda+z),(\lambda+z)). Then if z runs through \mathbb{R} we again obtain the whole set S(A,b), independent of which \lambda we chose initially. The choice of \lambda only determines the way in which we label the elements in S(A,b).

Finally we should notice that the set S(A,\mathbf{0}) is spanned by one vector, namely we have (-z/3,z,z)=z(-1/3,1,1) and hence with v=(-1/3,1,1) we have S(A,\mathbf{0})=\operatorname{span}\{v\} and S(A,b)=\{x_{\lambda}\}+\operatorname{span}\{v\} .

In the next section we will develop systematic methods to solve large systems of linear equations.