Linear maps

Solution 4.13.1

(See Exercise 4.1.) By Definition 4.9, Subsection 4.2.1, for \(A = \left ( \begin{array}{cc} a_{11} & a_{12} \\ a_{21} & a_{22} \end{array} \right )\) and \(v = \left ( \begin{array}{c} x \\ y \end{array} \right )\), we have

\[ Av = \left ( \begin{array}{c} a_{11}x+a_{12}y \\ a_{21}x+a_{22}y \end{array} \right ). \]

So we determine \(A\) by matching the target coordinates.

  1. Reflection along the \(y\)-axis sends \((x,y) \mapsto (-x,y)\), hence
\[ A = \left ( \begin{array}{cc} -1 & 0 \\ 0 & 1 \end{array} \right ). \]
  1. “Same point” means \((x,y) \mapsto (x,y)\), so
\[ A = {\mathrm{id}}_2 = \left ( \begin{array}{cc} 1 & 0 \\ 0 & 1 \end{array} \right ). \]
  1. “Always the origin” means \((x,y) \mapsto (0,0)\), so
\[ A = \left ( \begin{array}{cc} 0 & 0 \\ 0 & 0 \end{array} \right ). \]
  1. Reflection along the line \(\{(x,x)\}\) swaps coordinates: \((x,y) \mapsto (y,x)\), hence
\[ A = \left ( \begin{array}{cc} 0 & 1 \\ 1 & 0 \end{array} \right ). \]
  1. By Example 4.18, a counterclockwise rotation by angle \(r\) is given by
\[ \left ( \begin{array}{cc} \cos r & -\sin r \\ \sin r & \cos r \end{array} \right ). \]
For $r=60^\circ=\pi/3$ this gives
\[ A_{\mathrm{ccw}} = \left ( \begin{array}{cc} \frac12 & -\frac{\sqrt 3}{2} \\ \frac{\sqrt 3}{2} & \frac12 \end{array} \right ). \]
Clockwise by $60^\circ$ means $r=-\pi/3$, so
\[ A_{\mathrm{cw}} = \left ( \begin{array}{cc} \frac12 & \frac{\sqrt 3}{2} \\ -\frac{\sqrt 3}{2} & \frac12 \end{array} \right ). \]

For comparison with the \(90^\circ\) case, see also Example 4.17.

Solution 4.13.2

(See Exercise 4.2.) Using Definition 4.9, Subsection 4.2.1, for

\[ A = \left ( \begin{array}{cc} a_{11} & a_{12} \\ a_{21} & a_{22} \end{array} \right ) \]

and \(v=\left ( \begin{array}{c} x \\ y \end{array} \right )\) we have

\[ Av = \left ( \begin{array}{c} a_{11}x+a_{12}y \\ a_{21}x+a_{22}y \end{array} \right ). \]

We want

\[ Av = \left ( \begin{array}{c} -y \\ x \end{array} \right ), \]

so comparing coefficients gives \(a_{11}=0\), \(a_{12}=-1\), \(a_{21}=1\), and \(a_{22}=0.\) Hence

\[ A = \left ( \begin{array}{cc} 0 & -1 \\ 1 & 0 \end{array} \right ). \]

Geometrically, this is a counterclockwise rotation by \(90^\circ\) (equivalently, clockwise by \(270^\circ\)), compare Example 4.17.

Solution 4.13.3

(See Exercise 4.3.) By Proposition 4.43 (applied to the standard basis, cf. Example 3.59), the columns of the matrix \(A\) are exactly the vectors \(f(e_1),\dots,f(e_4)\). Hence

\[ A = \left ( \begin{array}{cccc} 1 & 0 & 0 & 13 \\ 2 & 0 & 0 & 0 \\ 3 & 7 & 0 & -1 \end{array} \right ). \]

By Definition 4.22, to determine \(\ker f\), we need to solve \(Ax=0\). Since there are so many zeros in the matrix, we do this by hand; alternatively one could also use Gaussian elimination (cf. Method 2.31). With \(x=(x_1,x_2,x_3,x_4)^T\), this system reads

\[ \begin{align*} x_1 + 13x_4 & = 0 \\ 2x_1 & = 0 \\ 3x_1 + 7x_2 - x_4 & = 0. \end{align*} \]

From \(2x_1=0\) we get \(x_1=0\), then \(x_4=0\), then \(x_2=0\); \(x_3\) is free. So

\[ \ker f = L\left(\left ( \begin{array}{c} 0 \\ 0 \\ 1 \\ 0 \end{array} \right )\right), \qquad \dim \ker f = 1. \]

To compute the image, we use Proposition 4.32:

\[ \operatorname{im} f = L\left(\left ( \begin{array}{c} 1 \\ 2 \\ 3 \end{array} \right ), \left ( \begin{array}{c} 0 \\ 0 \\ 7 \end{array} \right ), \left ( \begin{array}{c} 0 \\ 0 \\ 0 \end{array} \right ), \left ( \begin{array}{c} 13 \\ 0 \\ -1 \end{array} \right )\right). \]

Removing the zero vector, consider

\[ u_1=\left ( \begin{array}{c} 1 \\ 2 \\ 3 \end{array} \right ),\quad u_2=\left ( \begin{array}{c} 0 \\ 0 \\ 7 \end{array} \right ),\quad u_3=\left ( \begin{array}{c} 13 \\ 0 \\ -1 \end{array} \right ). \]

If \(a_1u_1+a_2u_2+a_3u_3=0\), then from the second coordinate \(2a_1=0\), hence \(a_1=0\); from the first coordinate \(13a_3=0\), hence \(a_3=0\); from the third coordinate \(7a_2=0\), hence \(a_2=0\). So \(u_1,u_2,u_3\) are linearly independent and therefore form a basis of \(\operatorname{im} f\). Instead of the above approach, one may also bring \(A\) to row-echelon form and observe that the resulting matrix has leading ones in the first, second, and fourth columns, so we again obtain that these three columns of \(A\) form a basis of \(\operatorname{im} f\) (Proposition 4.32). In particular,

\[ \dim \operatorname{im} f = 3. \]

As a check, Theorem 4.26 gives

\[ 4 = \dim {\bf R}^4 = \dim \ker f + \dim \operatorname{im} f = 1+3. \]

Solution 4.13.4

(See Exercise 4.4.) To compute the rank, we apply Method 2.31, which requires us to bring the matrix into row-echelon form using Gaussian elimination (Method 2.29). We perform the following elementary row operations (Definition 2.28).

\[ \begin{align*} A = \left ( \begin{array}{cccc} 0 & 1 & 2 & 1 \\ 1 & 1 & 1 & 0 \\ 0 & -1 & 1 & 1 \\ 1 & 1 & 4 & 2 \end{array} \right ) &\leadsto \left ( \begin{array}{cccc} 1 & 1 & 1 & 0 \\ 0 & 1 & 2 & 1 \\ 0 & -1 & 1 & 1 \\ 1 & 1 & 4 & 2 \end{array} \right ) \leadsto \left ( \begin{array}{cccc} 1 & 1 & 1 & 0 \\ 0 & 1 & 2 & 1 \\ 0 & 0 & 3 & 2 \\ 0 & 0 & 3 & 2 \end{array} \right ) \\ &\leadsto \left ( \begin{array}{cccc} 1 & 1 & 1 & 0 \\ 0 & 1 & 2 & 1 \\ 0 & 0 & 3 & 2 \\ 0 & 0 & 0 & 0 \end{array} \right ). \end{align*} \]

This row-echelon matrix has exactly 3 leading ones, so by Proposition 4.32 we get \(\operatorname{rk}(A) = 3.\)

Solution 4.13.5

(See Exercise 4.5.) From Example 4.42 we have

\[ f(e_1)=f(v_1)=(2,-1,0), \qquad f(e_2)=f(v_2)=(1,-1,1), \]

and, since \(e_3=v_2-v_3\) there,

\[ f(e_3)=f(v_2-v_3)=f(v_2)-f(v_3)=(1,-3,-1). \]

By Proposition 4.19, the matrix of \(f\) with respect to the standard basis in source and target has columns \(f(e_1),f(e_2),f(e_3)\). Hence

\[ A = \left ( \begin{array}{ccc} 2 & 1 & 1 \\ -1 & -1 & -3 \\ 0 & 1 & -1 \end{array} \right ). \]

This is exactly the matrix already identified in Example 4.42.

Solution 4.13.6

(See Exercise 4.6.) Let \(v_1=(1,1+\lambda,-1)\), \(v_2=(2,\lambda-2,\lambda+2)\). By definition, \(W_\lambda=L(v_1,v_2).\) So a basis of \(W_\lambda\) is either \(\{v_1,v_2\}\) (if they are linearly independent) or a single nonzero vector among them (if they are dependent), cf. Definition 3.46, def:basis.

To find when they are dependent, check whether \(v_2=t v_1\) for some \(t\in{\bf R}\). From the first coordinate we get \(2=t\cdot 1\), hence \(t=2\). Then the second coordinate gives \(\lambda-2=2(1+\lambda)\) which holds precisely if \(\lambda=-4\). For \(\lambda=-4\) the third coordinates also agree. Therefore:

  • If \(\lambda\neq -4\), the vectors \(v_1,v_2\) are linearly independent and therefore a basis of \(W_\lambda\). Thus \(\dim W_\lambda=2\) by Definition 3.63.

  • If \(\lambda=-4\), then \(v_2=2v_1\), so \(v_1=(1,-3,-1)\) is a basis vector of \(W_{-4}\) (or any other non-zero multiple of it, such as \(v_2)\). We have \(\dim W_{-4}=1\).

Solution 4.13.7

(See Exercise 4.7.) Let \(A = \begin{pmatrix} \alpha & 0 & 0 \\ 0 & \alpha & 1+\alpha \\ \alpha & 1 & 2 \end{pmatrix}\) with \(\alpha \in \mathbf{R}\). We determine \(\operatorname{rank}(A)\) (for all \(\alpha\)) by bringing \(A\) to row-echelon form and applying Proposition 4.32.

\[ A \leadsto \begin{pmatrix} \alpha & 0 & 0 \\ 0 & \alpha & 1+\alpha \\ 0 & 1 & 2 \end{pmatrix} \]

In order to proceed we need to distinguish whether or not \(\alpha = 0\). If \(\alpha \neq 0\), we can divide the first row by \(\alpha\) (Definition 2.28)

\[ \begin{pmatrix} 1 & 0 & 0 \\ 0 & \alpha & 1+\alpha \\ 0 & 1 & 2 \end{pmatrix} \]

Now, subtract \(\frac{1}{\alpha}\) times the second row from the third row (still for \(\alpha \neq 0\)):

\[ \begin{pmatrix} 1 & 0 & 0 \\ 0 & \alpha & 1+\alpha \\ 0 & 0 & 2 - \frac{1+\alpha}{\alpha} \end{pmatrix} \]

Given that \(\alpha \ne 0\), the rank of this matrix is 2 if \(2-\frac{1+\alpha}\alpha = 0\), i.e., if \(\alpha = 1\), and it is 3 otherwise. This finishes the analysis of the case \(\alpha \ne 0\).

If \(\alpha = 0\), the matrix reads

\[ \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 2 \\ 0 & 1 & 2 \end{pmatrix} \]

Since the 3rd row equals the 2nd one, one reads off \(\operatorname{rank}(A) = 2\) in this case.

In total we get the following result:

  • If \(\alpha = 1\) or \(\alpha = 0\), \(\operatorname{rank}(A) = 2\).

  • For all other \(\alpha \neq 0,1\), \(\operatorname{rank}(A) = 3\).

Solution 4.13.8

(See Exercise 4.9.) Let \(f : \mathbf{R}^4 \to \mathbf{R}^3\) be defined by

\[ f\left(\begin{pmatrix} x_1 \\ x_2 \\ x_3 \\ x_4 \end{pmatrix}\right) = \begin{pmatrix} 2 & -1 & 1 & 1 \\ 0 & 5 & -3 & -5 \\ 3 & -4 & 3 & 4 \end{pmatrix} \begin{pmatrix} x_1 \\ x_2 \\ x_3 \\ x_4 \end{pmatrix} = \begin{pmatrix} 2x_1 - x_2 + x_3 + x_4 \\ 5x_2 - 3x_3 - 5x_4 \\ 3x_1 - 4x_2 + 3x_3 + 4x_4 \end{pmatrix}. \]
  1. According to Definition 4.22, we need to compute the solution set of the system
\[ \begin{align*} 2x_1 - x_2 + x_3 + x_4 &= 0 \\ 5x_2 - 3x_3 - 5x_4 &= 0 \\ 3x_1 - 4x_2 + 3x_3 + 4x_4 &= 0. \end{align*} \]
We do this using Gaussian elimination (<a href="../systems-gaussian-elimination/#met-gaussian-elimination-solve" data-reference-type="ref+Label" data-reference="met:gaussian-elimination-solve">Method 2.31</a>): we bring the coefficient matrix to row echelon form:
\[ \begin{pmatrix} 2 & -1 & 1 & 1 \\ 0 & 5 & -3 & -5 \\ 3 & -4 & 3 & 4 \end{pmatrix} \xrightarrow{R_3\leftarrow 2R_3-3R_1} \begin{pmatrix} 2 & -1 & 1 & 1 \\ 0 & 5 & -3 & -5 \\ 0 & -5 & 3 & 5 \end{pmatrix} \xrightarrow{R_3\leftarrow R_3+R_2} \begin{pmatrix} 2 & -1 & 1 & 1 \\ 0 & 5 & -3 & -5 \\ 0 & 0 & 0 & 0 \end{pmatrix} \]
Divide row 2 by $5$, then eliminate the $-1$ in row 1, column 2:
\[ \begin{pmatrix} 2 & -1 & 1 & 1 \\ 0 & 1 & -\frac35 & -1 \\ 0 & 0 & 0 & 0 \end{pmatrix} \xrightarrow{R_1\leftarrow R_1+R_2} \begin{pmatrix} 2 & 0 & \frac25 & 0 \\ 0 & 1 & -\frac35 & -1 \\ 0 & 0 & 0 & 0 \end{pmatrix} \xrightarrow{R_1\leftarrow \frac12R_1} \begin{pmatrix} 1 & 0 & \frac15 & 0 \\ 0 & 1 & -\frac35 & -1 \\ 0 & 0 & 0 & 0 \end{pmatrix}. \]
Hence $x_3,x_4$ are free, and
\[ x_1=-\frac15x_3,\qquad x_2=\frac35x_3+x_4. \]
Writing $x_3=s$, $x_4=t$, we get
\[ \ker f= \left\{ \begin{pmatrix} -\frac15 s\\[2pt] \frac35 s+t\\[2pt] s\\[2pt] t \end{pmatrix} : s,t\in\mathbf R \right\} = L\!\left( \begin{pmatrix}-1\\3\\5\\0\end{pmatrix}, \begin{pmatrix}0\\1\\0\\1\end{pmatrix} \right). \]
  1. In order to solve \(f(v) = \begin{pmatrix} 1 \\ -3 \\ -3 \end{pmatrix}\), we again use Gaussian elimination:
\[ \begin{align*} 2x_1 - x_2 + x_3 + x_4 &= 1 \\ 5x_2 - 3x_3 - 5x_4 &= -3 \\ 3x_1 - 4x_2 + 3x_3 + 4x_4 &= -3 \end{align*} \]
The augmented matrix is
\[ \begin{align*} \left ( \begin{array}{cccc|c} 2 & -1 & 1 & 1 & 1 \\ 0 & 5 & -3 & -5 & -3 \\ 3 & -4 & 3 & 4 & -3 \end{array} \right ) & \xrightarrow{R_3\leftarrow 2R_3-3R_1} \left ( \begin{array}{cccc|c} 2 & -1 & 1 & 1 & 1 \\ 0 & 5 & -3 & -5 & -3 \\ 0 & -5 & 3 & 5 & -9 \end{array} \right ) \\ & \xrightarrow{R_3\leftarrow R_3+R_2} \left ( \begin{array}{cccc|c} 2 & -1 & 1 & 1 & 1 \\ 0 & 5 & -3 & -5 & -3 \\ 0 & 0 & 0 & 0 & -12 \end{array} \right ). \end{align*} \]
By <a href="../systems-gaussian-elimination/#met-gaussian-elimination-solve" data-reference-type="ref+Label" data-reference="met:gaussian-elimination-solve">Method 2.31</a>, the last row implies that there is no solution: this last row would imply $0=-12$, which is false. Therefore the system has no solution, so the preimage is the empty set:
\[ f^{-1}\!\left(\begin{pmatrix}1\\-3\\-3\end{pmatrix}\right)=\emptyset. \]

Solution 4.13.9

(See Exercise 4.8.) By Definition 4.22 \(\ker f = \{ \left ( \begin{array}{c} x_1 \\ x_2 \end{array} \right ) \in {\bf R}^2 \ | \ f(\left ( \begin{array}{c} x_1 \\ x_2 \end{array} \right )) = \left ( \begin{array}{c} 0 \\ 0 \\ 0 \end{array} \right ) \}\). I.e., the kernel consists of the solutions of the homongeneous system

\[ \begin{align*} x_1+2x_2 & = 0\\ x_2 & = 0 \\ 3x_1+5x_2 & = 0. \end{align*} \]

Solving this system gives \(x_2 = 0\), then \(x_1 = 0\). Thus, \(\ker f = \{ \left ( \begin{array}{c} 0 \\ 0 \end{array} \right ) \}\).

For the second part note that by Definition 4.20, \({\operatorname{im}\ } f\) consists precisely of those vectors \(\left ( \begin{array}{c} a \\ b \\ c \end{array} \right )\) that are of the form \(\left ( \begin{array}{c} a \\ b \\ c \end{array} \right ) = f (\left ( \begin{array}{c} x_1 \\ x_2 \end{array} \right ))\) for some \(x_1, x_2 \in {\bf R}\). Thus, the question amounts to this: do there exist \(x_1, x_2 \in {\bf R}\) with

\[ f (\left ( \begin{array}{c} x_1 \\ x_2 \end{array} \right )) = \left ( \begin{array}{c} x_1+2x_2 \\ x_2 \\ 3x_1+5x_2 \end{array} \right ) = \left ( \begin{array}{c} 1 \\ 0 \\ 3 \end{array} \right )? \]

Equivalently, this is a (inhomogeneous) linear system:

\[ \begin{align*} x_1+2x_2 & = 1\\ x_2 & = 0 \\ 3x_1+5x_2 & = 3. \end{align*} \]

One finds the solution \(x_1 = 1\), \(x_2 = 0\), i.e., \(\left ( \begin{array}{c} 1 \\ 0 \\ 3 \end{array} \right ) = f (\left ( \begin{array}{c} 1 \\ 0 \end{array} \right ))\). So the vector lies in the image.

Solution 4.13.10

(See Exercise 4.9.) By Definition 4.22, Definition 4.20, both questions are about solving linear systems. We use Gaussian elimination as in Method 2.31.

  1. To compute \(\ker f\), solve the homogeneous system
\[ \begin{align*} 2x_1-x_2+x_3+x_4 & = 0 \\ 5x_2-3x_3-5x_4 & = 0\\ 3x_1-4x_2+3x_3+4x_4 & = 0. \end{align*} \]
Its augmented matrix is
\[ \begin{align*} \left ( \begin{array}{cccc|c} 2 & -1 & 1 & 1 & 0\\ 0 & 5 & -3 & -5 & 0\\ 3 & -4 & 3 & 4 & 0 \end{array} \right ) & \xrightarrow{R_3\leftarrow 2R_3-3R_1} \left ( \begin{array}{cccc|c} 2 & -1 & 1 & 1 & 0\\ 0 & 5 & -3 & -5 & 0\\ 0 & -5 & 3 & 5 & 0 \end{array} \right ) \\ & \xrightarrow{R_3\leftarrow R_3+R_2} \left ( \begin{array}{cccc|c} 2 & -1 & 1 & 1 & 0\\ 0 & 5 & -3 & -5 & 0\\ 0 & 0 & 0 & 0 & 0 \end{array} \right ) \\ & \xrightarrow{R_2\leftarrow \frac15R_2, \;R_1\leftarrow R_1+R_2, \;R_1\leftarrow \frac12R_1} \left ( \begin{array}{cccc|c} 1 & 0 & \frac15 & 0 & 0\\ 0 & 1 & -\frac35 & -1 & 0\\ 0 & 0 & 0 & 0 & 0 \end{array} \right ). \end{align*} \]
Hence $x_3,x_4$ are free variables and $x_1=-\frac15x_3$, $x_2=\frac35x_3+x_4.$ Putting $x_3=s$, $x_4=t$, this leads to the following basis of the kernel:
\[ \ker f= \left\{ \begin{pmatrix} -\frac15 s\\[2pt] \frac35 s+t\\[2pt] s\\[2pt] t \end{pmatrix} : s,t\in\mathbf R \right\} = L\!\left( \begin{pmatrix}-1\\3\\5\\0\end{pmatrix}, \begin{pmatrix}0\\1\\0\\1\end{pmatrix} \right). \]
  1. We compute the preimage of \(\left(\begin{smallmatrix}1\\-3\\3\end{smallmatrix}\right)\) again using Gaussian elimination, applied to the augmented matrix describing the inhomogeneous system:
\[ \begin{align*} \left ( \begin{array}{cccc|c} 2 & -1 & 1 & 1 & 1\\ 0 & 5 & -3 & -5 & -3\\ 3 & -4 & 3 & 4 & 3 \end{array} \right ) & \xrightarrow{R_3\leftarrow 2R_3-3R_1} \left ( \begin{array}{cccc|c} 2 & -1 & 1 & 1 & 1\\ 0 & 5 & -3 & -5 & -3\\ 0 & -5 & 3 & 5 & 3 \end{array} \right ) \\ & \xrightarrow{R_3\leftarrow R_3+R_2} \left ( \begin{array}{cccc|c} 2 & -1 & 1 & 1 & 1\\ 0 & 5 & -3 & -5 & -3\\ 0 & 0 & 0 & 0 & 0 \end{array} \right ) \\ & \xrightarrow{R_2\leftarrow \frac15R_2, \;R_1\leftarrow R_1+R_2, \;R_1\leftarrow \frac12R_1} \left ( \begin{array}{cccc|c} 1 & 0 & \frac15 & 0 & \frac15\\ 0 & 1 & -\frac35 & -1 & -\frac35\\ 0 & 0 & 0 & 0 & 0 \end{array} \right ). \end{align*} \]
As before, $x_3,x_4$ are free variables
\[ x_1=\frac15-\frac15x_3,\qquad x_2=-\frac35+\frac35x_3+x_4. \]
With $x_3=s$, $x_4=t$:
\[ f^{-1}\!\left(\begin{pmatrix}1\\-3\\3\end{pmatrix}\right) = \begin{pmatrix}\frac15\\-\frac35\\0\\0\end{pmatrix} +s\begin{pmatrix}-1\\3\\5\\0\end{pmatrix} +t\begin{pmatrix}0\\1\\0\\1\end{pmatrix},\qquad s,t\in\mathbf R. \]
Hence this is an affine plane (solution set of an inhomogeneous system), so it is not a subspace (compare <a href="../maps/#rem-never-subspace">Remark 4.37</a>, <a href="../maps/#thm-solutions-inhomogeneous-system">Theorem 4.36</a>). Alternatively, one may compute this preimage by just finding *one* solution of the inhomogeneous system, such as $(\frac15,-\frac35,0,0)$. Then the preimage is the translate of the kernel by this solution, cf. <a href="../maps/#thm-solutions-inhomogeneous-system" data-reference-type="ref+Label" data-reference="thm:solutions-inhomogeneous-system">Theorem 4.36</a>.

Solution 4.13.11

(See Exercise 4.10.) The rank of \(A_t\) is the dimension of its row space, which we compute by bringing \(A_t\) to row echelon form (Proposition 4.32):

\[ \begin{align*} A_t = \left ( \begin{array}{cccc} 1 & 3 & -1 & 2 \\ 1 & 5 & 1 & 1 \\ 2 & 4 & t & 5 \end{array} \right ) & \leadsto \left ( \begin{array}{cccc} 1 & 3 & -1 & 2 \\ 0 & 2 & 2 & -1 \\ 0 & -2 & t+2 & 1 \end{array} \right ) \\ & \leadsto \left ( \begin{array}{cccc} 1 & 3 & -1 & 2 \\ 0 & 2 & 2 & -1 \\ 0 & 0 & t+4 & 0 \end{array} \right ) \\ & \leadsto \left ( \begin{array}{cccc} 1 & 3 & -1 & 2 \\ 0 & 1 & 1 & -\frac12 \\ 0 & 0 & t+4 & 0 \end{array} \right ) \end{align*} \]

If \(t\ne-4\), then we can further divide the last row by \(t+4 (\ne 0)\), and the rank is then 3. For \(t = -4\), the rank is 2.

For the last task: the rank of \(A_t\) is at most 3. Thus, \(\dim {\operatorname{im}\ } f \le 3\), and therefore by the rank-nullity theorem (Theorem 4.26)

\[ \dim \ker = \dim {\bf R}^4 - \dim {\operatorname{im}\ } f \ge 4 - 3 = 1. \]

This means that, for all \(t\), the kernel of \(f\) does not just consist of the zero vector, hence the answer to the question is no.

Solution 4.13.12

(See Exercise 4.11.) Let \(f:V\to W\) be linear and let \(U\subset V\) be a subspace.

  1. We verify that \(f(U)\) is a subspace of \(W\). By definition,
\[ f(U)=\{f(u)\mid u\in U\}\subset W. \]
We verify the subspace conditions (<a href="../spaces/#def-subspace" data-reference-type="ref+Label" data-reference="def:subspace">Definition 3.17</a>), similarly as in <a href="../maps/#prop-ker-im-subspace" data-reference-type="ref+Label" data-reference="prop:ker-im-subspace">Proposition 4.23</a>:

- $0_W\in f(U)$: since $U$ is a subspace, $0_V\in U$, and linearity gives $f(0_V)=0_W$ (<a href="../maps/#rem-linear-map-basic" data-reference-type="ref+Label" data-reference="rem:linear-map-basic">Remark 4.4</a>).

- *Closure under addition:* if $y_1,y_2\in f(U)$, then $y_1=f(u_1),y_2=f(u_2)$ for some $u_1,u_2\in U$. Since $U$ is a subspace, $u_1+u_2\in U$, and using the linearity of $f$ (<a href="../maps/#def-linear-map" data-reference-type="ref+Label" data-reference="def:linear-map">Definition 4.1</a>) we obtain
\[ y_1+y_2=f(u_1)+f(u_2)=f(u_1+u_2)\in f(U). \]
- *Closure under scalar multiplication:* if $y=f(u)\in f(U)$ and $\lambda\in\mathbf R$, then $\lambda u\in U$ and again using linearity of $f$:
\[ \lambda y=\lambda f(u)=f(\lambda u)\in f(U). \]
Hence $f(U)$ is a subspace of $W$.
  1. We prove \(\dim f(U)\le \dim U\). Suppose \((u_1,\dots,u_n)\) is a basis of \(U\). (Here we assume for simplicity that \(\dim U < \infty\); in general the proof is essentially the same, though.) Then the vectors \(f(u_1), \dots, f(u_n) \in W\) span \(f(U)\). (Indeed, any \(u \in U\) is of the form \(u = \sum_{i=1}^n a_i u_i\) for certain (unique) \(a_i \in \mathbf R\) (Proposition 3.61). Linearity of \(f\) implies \(f(u) = \sum_{i=1}^n a_i f(u_i)\), so \(f(u)\) is a linear combination of the \(f(u_1)\) etc.) Therefore, by the characterization of dimension via spanning sets (Theorem 3.68),
\[ \dim f(U)\le n=\dim U. \]

Solution 4.13.13

(See Exercise 4.12.) Let \(f:V\to W\) be linear and let \(U\subset W\) be a subspace. By Definition 4.20, \(f^{-1}(U)=\{v\in V\mid f(v)\in U\}\).

We verify the subspace conditions for \(f^{-1}(U)\) (Definition 3.17), in the same spirit as Proposition 4.23.

  • Zero vector: Since \(f\) is linear, \(f(0_V)=0_W\) (Remark 4.4). Because \(U\) is a subspace, \(0_W\in U\). Hence \(0_V\in f^{-1}(U)\).

  • Closure under addition: If \(v_1,v_2\in f^{-1}(U)\), then \(f(v_1),f(v_2)\in U\). Since \(U\) is a subspace, \(f(v_1)+f(v_2)\in U\). By linearity of \(f\) (Definition 4.1), we have

\[ f(v_1+v_2)=f(v_1)+f(v_2)\in U, \]

so \(v_1+v_2\in f^{-1}(U)\).

  • Closure under scalar multiplication: If \(v\in f^{-1}(U)\) and \(\lambda\in\mathbf R\), then \(f(v)\in U\). Since \(U\) is a subspace, \(\lambda f(v)\in U\). Again using linearity of \(f\),
\[ f(\lambda v)=\lambda f(v)\in U, \]

so \(\lambda v\in f^{-1}(U)\).

Therefore \(f^{-1}(U)\) is a subspace of \(V\).

As an aside, we note that the statement of this exercise, applied to \(U=\{0_W\}\) this shows again that \(f^{-1}(\{0_W\})=\ker f\) is a subspace, which is consistent with Proposition 4.23.

Solution 4.13.14

(See Exercise 4.13.) Let

\[ A=\begin{pmatrix} 2 & -1 & -\frac52 & 1\\ -1 & 0 & 1 & -\frac12\\ 1 & 1 & -\frac12 & \frac12\\ 0 & 2 & 1 & 0 \end{pmatrix}. \]

We directly bring \(A\) to row-echelon form by Gaussian elimination:

\[ \begin{pmatrix} 2 & -1 & -\frac52 & 1\\ -1 & 0 & 1 & -\frac12\\ 1 & 1 & -\frac12 & \frac12\\ 0 & 2 & 1 & 0 \end{pmatrix} %\xrightarrow{R_1\leftrightarrow R_2, %\;R_2\leftarrow R_2+2R_1, %\;R_3\leftarrow R_3+R_1, %\;R_3\leftarrow R_3+R_2, %\;R_4\leftarrow R_4+2R_2} \leadsto \begin{pmatrix} -1 & 0 & 1 & -\frac12\\ 0 & -1 & -\frac12 & 0\\ 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 0 \end{pmatrix} %\xrightarrow{R_1\leftarrow -R_1,\;R_2\leftarrow -R_2} \leadsto \begin{pmatrix} 1 & 0 & -1 & \frac12\\ 0 & 1 & \frac12 & 0\\ 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 0 \end{pmatrix}. \]

Hence \(\operatorname{rk}(A)=2\) and \(\dim\ker f=4-2=2\) (by Proposition 4.32 and Theorem 4.26).

For \(x=(x_1,x_2,x_3,x_4)^T\), the reduced system is

\[ \begin{align*} x_1-x_3+\frac12x_4&=0,\\ x_2+\frac12x_3&=0. \end{align*} \]

With free variables \(x_3=s\), \(x_4=t\):

\[ x=s\begin{pmatrix}1\\-\frac12\\1\\0\end{pmatrix}+t\begin{pmatrix}-\frac12\\0\\0\\1\end{pmatrix}. \]

So one convenient basis is

\[ \ker f=L\left(\begin{pmatrix}2\\-1\\2\\0\end{pmatrix},\begin{pmatrix}-1\\0\\0\\2\end{pmatrix}\right). \]

For the image, use pivot columns (Proposition 4.32). Pivot columns are 1 and 2, so

\[ \operatorname{im} f=L(c_1,c_2),\quad c_1=\begin{pmatrix}2\\-1\\1\\0\end{pmatrix},\ c_2=\begin{pmatrix}-1\\0\\1\\2\end{pmatrix}. \]

Now compute \(\ker f\cap\operatorname{im} f\), the intersection of these two subspaces (cf. Section 3.3). A general vector in the image has the form

\[ y=ac_1+bc_2=\begin{pmatrix}2a-b\\-a\\a+b\\2b\end{pmatrix}. \]

In order to determine when this lies in the intersection, we also impose the kernel equations on \(y\):

\[ \begin{align*} y_1-y_3+\frac12y_4=(2a-b)-(a+b)+b=a-b & =0,\\ y_2+\frac12y_3=-a+\frac12(a+b)=\frac{-a+b}{2} & =0. \end{align*} \]

Thus \(a=b\), and therefore

\[ y=a(c_1+c_2)=a\begin{pmatrix}1\\-1\\2\\2\end{pmatrix}. \]

Hence

\[ \ker f\cap\operatorname{im} f=L\left(\begin{pmatrix}1\\-1\\2\\2\end{pmatrix}\right). \]

Solution 4.13.15

(See Exercise 4.14.) Let

\[ f:{\bf R}^3\to{\bf R}^2,\qquad f(x,y,z)=(2x-z,\,x+y+z). \]
  1. With respect to the standard bases, the matrix is obtained from the coefficients of \(x,y,z\) in each component (Proposition 4.43):
\[ A=\begin{pmatrix} 2&0&-1\\ 1&1&1 \end{pmatrix}. \]
  1. We bring \(A\) to row-echelon form and read off both kernel and image:
\[ \begin{pmatrix} 2&0&-1\\ 1&1&1 \end{pmatrix} \xrightarrow{R_1\leftrightarrow R_2} \begin{pmatrix} 1&1&1\\ 2&0&-1 \end{pmatrix} \xrightarrow{R_2-2R_1} \begin{pmatrix} 1&1&1\\ 0&-2&-3 \end{pmatrix} \xrightarrow{R_2\leftarrow-\frac12R_2} \begin{pmatrix} 1&1&1\\ 0&1&\frac32 \end{pmatrix} \xrightarrow{R_1-R_2} \begin{pmatrix} 1&0&-\frac12\\ 0&1&\frac32 \end{pmatrix}. \]
Hence, solving $A\begin{pmatrix}x\\y\\z\end{pmatrix}=0$, we get
\[ x-\frac12z=0,\qquad y+\frac32z=0. \]
With free parameter $z=2t$, this gives $x=t$, $y=-3t$, so
\[ \ker f=L\left(\begin{pmatrix}1\\-3\\2\end{pmatrix}\right),\qquad \dim\ker f=1. \]
For the image, the pivot columns are the first and second. Therefore, by <a href="../maps/#prop-basis-row-column-space" data-reference-type="ref+Label" data-reference="prop:basis-row-column-space">Proposition 4.32</a>, a basis of ${\operatorname{im}\,}f$ is given by the corresponding columns of the original matrix:
\[ c_1=\begin{pmatrix}2\\1\end{pmatrix},\qquad c_2=\begin{pmatrix}0\\1\end{pmatrix}. \]
Hence
\[ \operatorname{im} f=L(c_1,c_2)={\bf R}^2,\qquad \dim\operatorname{im}f=2. \]
  1. By Definition 4.20, \(f^{-1}((0,1))\) is the solution set of the linear system
\[ \begin{align*} 2x-z&=0,\\ x+y+z&=1. \end{align*} \]
One may again solve this by Gaussian elimination, or by inspection: $z=2x$, and then $y=1-3x$. With parameter $t:=x$:
\[ f^{-1}((0,1))=\left\{\begin{pmatrix}t\\1-3t\\2t\end{pmatrix}:t\in{\bf R}\right\} =\begin{pmatrix}0\\1\\0\end{pmatrix}+t\begin{pmatrix}1\\-3\\2\end{pmatrix}. \]
This is an affine line (solution set of an inhomogeneous system), not a subspace of ${\bf R}^3$ because it does not contain $0$ (compare <a href="../maps/#rem-never-subspace">Remark 4.37</a>, <a href="../maps/#thm-solutions-inhomogeneous-system">Theorem 4.36</a>).
  1. Let
\[ v_1=(0,1,2),\quad v_2=(0,-1,1),\quad v_3=(1,1,1). \]
To check they form a basis of ${\bf R}^3$, put them as rows and bring to row-echelon form:
\[ M=\begin{pmatrix} 0&1&2\\ 0&-1&1\\ 1&1&1 \end{pmatrix} \xrightarrow{R_1\leftrightarrow R_3} \begin{pmatrix} 1&1&1\\ 0&-1&1\\ 0&1&2 \end{pmatrix} \xrightarrow{R_3+R_2} \begin{pmatrix} 1&1&1\\ 0&-1&1\\ 0&0&3 \end{pmatrix}. \]
This echelon matrix has three nonzero rows, hence $\operatorname{rk}(M)=3$, for example by <a href="../maps/#cor-rows-columns-independent" data-reference-type="ref+Label" data-reference="cor:rows-columns-independent">Corollary 4.93</a>. Therefore $v_1,v_2,v_3$ are linearly independent and form a basis of ${\bf R}^3$.

The matrix of $f$ with respect to this domain basis and the standard basis in ${\bf R}^2$ has columns $f(v_1),f(v_2),f(v_3)$:
\[ \begin{align*} f(v_1)&=f(0,1,2)=(-2,3),\\ f(v_2)&=f(0,-1,1)=(-1,0),\\ f(v_3)&=f(1,1,1)=(1,3). \end{align*} \]
So the wanted matrix is
\[ \mathrm M_{\{v_1, v_2, v_3\},\{e_1, e_2\}}= \begin{pmatrix} -2&-1&1\\ 3&0&3 \end{pmatrix}. \]

Solution 4.13.16

(See Exercise 4.15.) We have \(v_1 = (1,1,0)\), and compute

\[ v_2 = \left ( \begin{array}{ccc} 2 & -1 & 0 \\ 1 & 0 & 2 \\ 0 & 2 & -1 \end{array} \right ) \left ( \begin{array}{c} 1 \\ 1 \\ 0 \end{array} \right ) = \left ( \begin{array}{c} 1 \\ 1 \\ 2 \end{array} \right ), \]
\[ v_3 = \left ( \begin{array}{ccc} 2 & -1 & 0 \\ 1 & 0 & 2 \\ 0 & 2 & -1 \end{array} \right ) \left ( \begin{array}{c} 1 \\ 1 \\ 2 \end{array} \right ) = \left ( \begin{array}{c} 1 \\ 5 \\ 0 \end{array} \right ). \]

In order to confirm that they form a basis, we apply Method 3.44 and Method 3.53 by forming the associated matrix and bringing it into row echelon form:

\[ \begin{align*} \left ( \begin{array}{ccc} 1 & 1 & 0 \\ 1 & 1 & 2 \\ 1 & 5 & 0 \end{array} \right ) & \leadsto \left ( \begin{array}{ccc} 1 & 1 & 0 \\ 0 & 1 & 2 \\ -4 & 0 & 0 \end{array} \right ) \\ & \leadsto \left ( \begin{array}{ccc} 1 & 1 & 0 \\ 0 & 1 & 2 \\ 0 & 4 & 0 \end{array} \right ) \\ & \leadsto \left ( \begin{array}{ccc} 1 & 1 & 0 \\ 0 & 1 & 2 \\ 0 & 0 & -8 \end{array} \right ) \\ & \leadsto \left ( \begin{array}{ccc} 1 & 1 & 0 \\ 0 & 1 & 2 \\ 0 & 0 & 1 \end{array} \right ). \\ \end{align*} \]

This matrix has three leading ones, so that the vectors do form a basis.

We compute \(v_4 = f(v_3) = \left ( \begin{array}{ccc} 2 & -1 & 0 \\ 1 & 0 & 2 \\ 0 & 2 & -1 \end{array} \right ) \left ( \begin{array}{c} 1 \\ 5 \\ 0 \end{array} \right ) = \left ( \begin{array}{c} -3 \\ 1 \\ 10 \end{array} \right )\). The equation \(v_4 = a_1 v_1 + a_2 v_2 + a_3 v_3\) is the linear system

\[ \begin{align*} a_1 + a_2 + a_3 & = {-3} \\ a_1 + a_2 + 5a_3 &= 1 \\ 2a_2 & = 10. \end{align*} \]

We solve this: the last equation gives \(a_2 = 5\), which leads to

\[ \begin{align*} a_1 + 5 + a_3 & = -3 \\ a_1 + 5 + 5a_3 & = 1. \end{align*} \]

Therefore

\[ \begin{align*} a_1 + a_3 & = -8 \\ a_1 + 5a_3 & = -4. \end{align*} \]

This can be solved to \(a_3 = 1\) and \(a_1 = -9\). Thus, \((a_1, a_2, a_3) = (1,5,-9)\) are the coordinates of \(v_4\) in the basis \(v_1, v_2, v_3\).

We now determine the matrix of \(f\) with respect to the basis \(v_1, v_2, v_3\) (both in the domain and the codomain of \(f\)). We therefore write each \(f(v_i)\) as a linear combination of these three vectors:

\[ \begin{align*} f(v_1) = v_2 &= 0v_1 + 1 v_2 + 0 v_3 \\ f(v_2) = v_3 & =0v_1 + 0v_2+1v_3 \\ f(v_3) = v_4 & = -9v_1 + 5v_2 + v_3. \end{align*} \]

According to Proposition 4.43, the matrix of \(f\) with respect to \(v_1, v_2, v_3\) is

\[ \left ( \begin{array}{ccc} 0 & 0 & -9 \\ 1 & 0 & 5 \\ 0 & 1 & 1 \end{array} \right ). \]

Solution 4.13.17

(See Exercise 4.16.) The map is \(f(x,y,z,t)=(-x+z,\,-y+t,\,x-y)\). Applying \(f\) to the standard basis vectors \(e_1,e_2,e_3,e_4\) of \(\mathbf{R}^4\) gives the columns of the matrix (Proposition 4.43):

\[ A = \begin{pmatrix} -1 & 0 & 1 & 0 \\ 0 & -1 & 0 & 1 \\ 1 & -1 & 0 & 0 \end{pmatrix}. \]

We compute the kernel by solving \(Av=0\) by row reduction (Method 2.31):

\[ \begin{pmatrix} -1 & 0 & 1 & 0 \\ 0 & -1 & 0 & 1 \\ 1 & -1 & 0 & 0 \end{pmatrix} \longrightarrow \begin{pmatrix} 1 & 0 & -1 & 0 \\ 0 & 1 & 0 & -1 \\ 0 & -1 & 1 & 0 \end{pmatrix} \longrightarrow \begin{pmatrix} 1 & 0 & -1 & 0 \\ 0 & 1 & 0 & -1 \\ 0 & 0 & 1 & -1 \end{pmatrix} \longrightarrow \begin{pmatrix} 1 & 0 & 0 & -1 \\ 0 & 1 & 0 & -1 \\ 0 & 0 & 1 & -1 \end{pmatrix}. \]

The free variable is \(t\); setting \(t=\lambda\) gives \(x=\lambda\), \(y=\lambda\), \(z=\lambda\), so

\[ \ker f = \operatorname{span}\!\left\{\begin{pmatrix}1\\1\\1\\1\end{pmatrix}\right\}, \qquad \dim\ker f = 1. \]

Image. By the rank–nullity theorem (Theorem 4.26), we have \(\dim\operatorname{im} f = \dim\mathbf{R}^4 - \dim\ker f = 4 - 1 = 3.\) Since \(\operatorname{im} f \subseteq \mathbf{R}^3\) and has dimension \(3\), we conclude from Theorem 3.68 that \(\operatorname{im} f = \mathbf{R}^3\). Alternatively, using Proposition 4.32, the first three columns of \(A\) above are another basis of \(\operatorname{im} f\).

Solution 4.13.18

(See Exercise 4.17.)

  1. We check the linearity of the functions.

    \(f_1\) is linear: \(f_1(x)=(x_1+x_2,\,3x_1,\,2x_2)\) is given by multiplying with the matrix \(\begin{pmatrix}1&1\\3&0\\0&2\end{pmatrix}\), so it is linear by Proposition 4.19. Alternatively, one may also check the linearity directly by applying Definition 4.1.

    \(f_2\) is not linear: \(f_2(0)=(0,1)\neq(0,0)\), violating \(f(0)=0\) (Remark 4.4).

    \(f_3\) is not linear: the first component \(x_3^2\) is not a linear function of \((x_1,x_2,x_3)\) (e.g. \(f_3(2e_3)=(4,0)\neq 2f_3(e_3)=(2,0)\)).

    The identity map \(f_4\) is linear (cf. Example 4.7).

    \(f_5\) is linear: \(f_5(x)=(x_1+x_2,\,x_2+x_3,\,x_3+x_1)\) is given by multiplication with the matrix \(\begin{pmatrix}1&1&0\\0&1&1\\1&0&1\end{pmatrix}\).

  2. We describe the images of these functions.

    For the linear map \(f_1\), we can use Proposition 4.30, according to which \(\operatorname{im} f_1 = \operatorname{span}\!\left\{\begin{pmatrix}1\\3\\0\end{pmatrix},\begin{pmatrix}1\\0\\2\end{pmatrix}\right\}\), a \(2\)-dimensional subspace of \(\mathbf{R}^3\) (the two columns are linearly independent).

    \(f_2\) is not linear, but its image is \(\{(x_3+x_1,\,3x_2+4x_1+1)\mid x\in\mathbf{R}^3\}=\mathbf{R}^2\) (e.g. vary \(x_1,x_2,x_3\) freely).

    We have \(\operatorname{im} (f_3) = \{(x_3^2,\,x_2+x_1)\mid x\in\mathbf{R}^3\} = [0,\infty)\times\mathbf{R}\) (its first component is a square, the second one is arbitrary).

    We clearly have \(\operatorname{im} f_4 = \mathbf{R}^3\).

    For \(f_5\), let \(\left ( \begin{array}{c} a \\ b \\ c \end{array} \right ) \in \mathbf{R}^3\). To decide whether it lies in the image, we solve

\[ \begin{align*} x_1+x_2 & = a \\ x_2+x_3 & = b \\ x_3+x_1 & = c. \end{align*} \]
Solving this, for example using Gaussian elimination, one finds the following solutions:
\[ x_1=\frac{a-b+c}{2}, \qquad x_2=\frac{a+b-c}{2}, \qquad x_3=\frac{-a+b+c}{2}. \]
Thus for every $\left ( \begin{array}{c} a \\ b \\ c \end{array} \right ) \in \mathbf{R}^3$ there is a solution, so $\operatorname{im} f_5=\mathbf{R}^3$.
  1. Injectivity and surjectivity (for the linear maps \(f_1,f_4,f_5\)). For the linear maps we can use the rank-nullity theorem (Theorem 4.26) and our knowledge about the image to determine injectivity. Recall from Lemma 4.25 that a linear map \(f\) is injective precisely if its kernel \(\ker f\) consists only of the zero vector, or equivalently if \(\dim \ker f = 0\).

    For \(f_1\): the domain is \(\mathbf{R}^2\), we have seen \(\dim\operatorname{im} f_1=\dim\mathbf{R}^2=2\), so \(\ker f_1=\{0\}\) by rank–nullity (Theorem 4.26), so \(f_1\) is injective. It is not surjective since \(\dim \operatorname{im} f_1 = 2 < 3 = \dim \mathbf{R}^3\).

    The identity map \(f_4\) is bijective (i.e., injective and surjective).

    The map \(f_5\) is also bijective. To see this, we again use rank-nullity, which implies \(\dim \ker f_5 = 3 - \dim \operatorname{im} (f_5) = 3 - 3 = 0\).

Solution 4.13.19

(See Exercise 4.18.)

  1. Put the vectors as rows of a matrix and row-reduce:
\[ M=\begin{pmatrix} v_1\\ v_2\\ v_3 \end{pmatrix} = \begin{pmatrix} 1&2&-3&-1\\ 3&4&4&1\\ 1&0&10&3 \end{pmatrix} \xrightarrow{R_2-3R_1,\;R_3-R_1} \begin{pmatrix} 1&2&-3&-1\\ 0&-2&13&4\\ 0&-2&13&4 \end{pmatrix} \xrightarrow{R_3-R_2} \begin{pmatrix} 1&2&-3&-1\\ 0&-2&13&4\\ 0&0&0&0 \end{pmatrix}. \]
This row-echelon matrix has two nonzero rows, so by <a href="../maps/#prop-basis-row-column-space" data-reference-type="ref+Label" data-reference="prop:basis-row-column-space">Proposition 4.32</a> we read off
\[ \dim U=2, \]
where $U=L(v_1,v_2,v_3)$. A basis of $U$ is given by the nonzero rows of the echelon form:
\[ \left\{(1,2,-3,-1),\,(0,-2,13,4)\right\}. \]
(From the row operation $R_3-R_1=R_2-3R_1$ we get
\[ v_2=2v_1+v_3, \]
so equivalently $U=L(v_1,v_3)$, i.e., $v_1$ and $v_3$ also form a basis of $U$.)
  1. We extend two independent vectors to a basis of \({\bf R}^4\). For example, take
\[ e_1=(1,0,0,0),\qquad e_2=(0,1,0,0). \]
Then the vectors $v_1,v_3,e_1,e_2$ form a basis of ${\bf R}^4$. To check their linear independence, we again form the matrix comprised of these vectors
\[ \begin{pmatrix} 1&2&-3&-1\\ 1&0&10&3\\ 1&0&0&0\\ 0&1&0&0 \end{pmatrix}, \]
One checks that the rank of this matrix is 4. Thus the rows are linearly independent, hence form a basis of ${\bf R}^4$.
  1. Yes, such an \(f:{\bf R}^4\to{\bf R}^2\) exists. Define it on the basis \(v_1,v_3,e_1,e_2\) by
\[ f(v_1)=(1,0),\qquad f(v_3)=(0,1),\qquad f(e_1)=(0,0),\qquad f(e_2)=(0,0) \]
(or any other choices for $f(e_1)$, $f(e_2)$ would also do). By <a href="../maps/#prop-linear-map-defined-on-basis" data-reference-type="ref+Label" data-reference="prop:linear-map-defined-on-basis">Proposition 4.39</a>, this determines a unique linear map $f:{\bf R}^4\to{\bf R}^2$. Since $U=L(v_1,v_3)$, we get
\[ f(U)=L(f(v_1),f(v_3))=L((1,0),(0,1))={\bf R}^2. \]
For $g:{\bf R}^4\to{\bf R}^3$, the answer is no. Indeed, the restriction
\[ g_{|U}:U\to {\bf R}^3 \]
is a linear map with image $g(U)$. By <a href="../maps/#cor-maps-dim" data-reference-type="ref+Label" data-reference="cor:maps-dim">Corollary 4.28</a>,
\[ \dim g(U)\le \dim U=2. \]
Therefore $g(U)$ cannot be all of ${\bf R}^3$, since ${\bf R}^3$ has dimension $3$.

Solution 4.13.20

(See Exercise 4.19.) We attempt to invert \(A\) by row-reducing the augmented matrix \([A\mid I]\):

\[ \begin{align*} \left ( \begin{array}{ccc|ccc} 1 & 3 & -1 & 1 & 0 & 0 \\ 2 & 1 & 5 & 0 & 1 & 0 \\ 1 & -7 & 13 & 0 & 0 & 1 \end{array} \right ) & \xrightarrow{R_2-2R_1,\;R_3-R_1} \left ( \begin{array}{ccc|ccc} 1 & 3 & -1 & 1 & 0 & 0 \\ 0 & -5 & 7 & -2 & 1 & 0 \\ 0 & -10 & 14 & -1 & 0 & 1 \end{array} \right ) \\ & \xrightarrow{R_3-2R_2} \left ( \begin{array}{ccc|ccc} 1 & 3 & -1 & 1 & 0 & 0 \\ 0 & -5 & 7 & -2 & 1 & 0 \\ 0 & 0 & 0 & 3 & -2 & 1 \end{array} \right ). \end{align*} \]

The third row of the left block is all zeros, so \(A\) does not row-reduce to the identity. Therefore \(A\) is not invertible, according to Theorem 4.80.

Solution 4.13.21

(See Exercise 4.20.) We compute \(AB\) and \(BA\) in each case, by applying Definition 4.48:

  1. For \(A=\begin{pmatrix}1&2\\2&1\end{pmatrix}\), \(B=\begin{pmatrix}0&1\\-1&1\end{pmatrix}\) we obtain \(AB=\begin{pmatrix}-2&3\\-1&3\end{pmatrix}\), as opposed to \(BA=\begin{pmatrix}2&1\\1&-1\end{pmatrix}\). Hence \(AB\neq BA\), confirming that matrix multiplication is in general not commutative, cf. Warning 4.54.

  2. For \(A=\begin{pmatrix}3&0\\0&4\end{pmatrix}\), \(B=\begin{pmatrix}-1&0\\0&2\end{pmatrix}\) we compute \(AB=\begin{pmatrix}-3&0\\0&8\end{pmatrix}\), which happens to be equal to \(BA=\begin{pmatrix}-3&0\\0&8\end{pmatrix}\).

  3. For \(A=\begin{pmatrix}1&x\\0&1\end{pmatrix}\) and \(B=\begin{pmatrix}y&1\\0&y\end{pmatrix}\) we get

\[ AB=BA=\begin{pmatrix}-3&0\\0&8\end{pmatrix}. \]
  1. For \(A=\begin{pmatrix}1&x\\0&1\end{pmatrix}\) and \(B=\begin{pmatrix}y&1\\0&y\end{pmatrix}\) we obtain
\[ AB=\begin{pmatrix}y&1+xy\\0&y\end{pmatrix},\qquad BA=\begin{pmatrix}y&xy+1\\0&y\end{pmatrix}. \]
Since $1+xy=xy+1$, we get $AB=BA$ for all $x,y\in\mathbf{R}$. The matrices $A$ and $B$ are examples of *elementary matrices*, cf. <a href="../maps/#def-elementary-matrices" data-reference-type="ref+Label" data-reference="def:elementary-matrices">Definition 4.61</a>.
  1. For \(A=\begin{pmatrix}3&0\\0&3\end{pmatrix}=3I_2\) and arbitrary \(B \in \mathrm{Mat}_{2 \times 2}\), we get
\[ AB=(3I_2)B=3B=B(3I_2)=BA. \]
So $AB=BA$ for every such $B$.
  1. Let \(A\) be arbitrary and \(B=A^2\). Then, by associativity of matrix multiplication (cf. Lemma 4.59),
\[ AB=A\,A^2=A^3=A^2\,A=BA. \]
So $AB=BA$ always.

Solution 4.13.22

(See Exercise 4.21.) We use that a matrix \(A\in \mathrm{Mat}_{2\times 2}\) determines a linear map

\[ f_A:\mathbf R^2\to \mathbf R^2,\qquad x\mapsto Ax, \]

by Proposition 4.19, and conversely that a linear map is determined by the images of a basis, by Proposition 4.43.

  1. The vectors \(e_1=(1,0)\) and \(e_2=(0,1)\) form the standard basis of \(\mathbf R^2\) (cf. Example 3.59). Hence prescribing
\[ Ae_1=\begin{pmatrix}3\\4\end{pmatrix},\qquad Ae_2=\begin{pmatrix}4\\5\end{pmatrix} \]
determines the linear map uniquely. By <a href="../maps/#prop-matrix-to-linear-map" data-reference-type="ref+Label" data-reference="prop:matrix-to-linear-map">Proposition 4.43</a>, the corresponding matrix has these vectors as its columns:
\[ A=\begin{pmatrix}3&4\\4&5\end{pmatrix}. \]
  1. Again \(e_1=(1,0)\) is prescribed, and the second condition is compatible with linearity because
\[ 2e_1=\begin{pmatrix}2\\0\end{pmatrix} \quad\text{and therefore}\quad A(2e_1)=2A(e_1)=2\begin{pmatrix}3\\4\end{pmatrix}=\begin{pmatrix}6\\8\end{pmatrix}, \]
using <a href="../maps/#def-linear-map" data-reference-type="ref+Label" data-reference="def:linear-map">Definition 4.1</a>. So such matrices do exist.

However, the matrix is not unique, because we only prescribe the image of the line $L(e_1)$, not of a full basis. The vector $e_2=(0,1)$ may be sent to any vector of $\mathbf R^2$. Thus all solutions are of the form
\[ A=\begin{pmatrix}3&a\\4&b\end{pmatrix},\qquad a,b\in \mathbf R. \]
Indeed, for such a matrix,
\[ A\begin{pmatrix}1\\0\end{pmatrix}=\begin{pmatrix}3\\4\end{pmatrix}, \qquad A\begin{pmatrix}2\\0\end{pmatrix}=2A\begin{pmatrix}1\\0\end{pmatrix}=\begin{pmatrix}6\\8\end{pmatrix}. \]
  1. Here the conditions are incompatible with linearity. As above,
\[ A\begin{pmatrix}2\\0\end{pmatrix}=A(2e_1)=2A(e_1)=2\begin{pmatrix}3\\4\end{pmatrix}=\begin{pmatrix}6\\8\end{pmatrix}, \]
again by <a href="../maps/#def-linear-map" data-reference-type="ref+Label" data-reference="def:linear-map">Definition 4.1</a>. But the exercise requires instead
\[ A\begin{pmatrix}2\\0\end{pmatrix}=\begin{pmatrix}3\\4\end{pmatrix}, \]
which is different. Therefore no such matrix exists.

Geometrically, the vector $(2,0)$ lies on the same line as $(1,0)$, and a linear map must preserve scalar multiplication along that line. So once the image of $(1,0)$ is fixed, the image of $(2,0)$ is forced to be twice as large.

Solution 4.13.23

(See Exercise 4.22.) One possible choice is

\[ A=\begin{pmatrix}1&0\\0&0\end{pmatrix}, \qquad B=\begin{pmatrix}0&0\\1&0\end{pmatrix}. \]

Both matrices are nonzero. Using the definition of matrix multiplication (Definition 4.48), we compute

\[ AB= \begin{pmatrix}1&0\\0&0\end{pmatrix} \begin{pmatrix}0&0\\1&0\end{pmatrix} = \begin{pmatrix}0&0\\0&0\end{pmatrix}. \]

Hence \(AB=0\), although \(A\neq 0\) and \(B\neq 0\).

Using that matrices give rise to linear maps (Proposition 4.19), this exercise may also be understood in terms of linear maps. Consider the linear maps \(f_A,f_B:\mathbf R^2\to\mathbf R^2\). In light of Proposition 4.51, the relation \(AB=0\) means that

\[ f_A\circ f_B=0, \]

so every vector in the image of \(f_B\) lies in the kernel of \(f_A\) (cf. Definition 4.20, Definition 4.22). In our example,

\[ \operatorname{im}(f_B)=L\left(\begin{pmatrix}0\\1\end{pmatrix}\right) \subseteq L\left(\begin{pmatrix}0\\1\end{pmatrix}\right)=\ker(f_A). \]

That is why the composition is the zero map even though neither of the two maps is zero.

Solution 4.13.24

(See Exercise 4.23.)

  1. By Definition 4.88,
\[ \left ( \begin{array}{cc} 1 & s \\ -2 & t \end{array} \right )^T = \left ( \begin{array}{cc} 1 & -2 \\ s & t \end{array} \right ). \]
The matrix is symmetric if and only if it is equal to its transpose, so the off-diagonal entries must agree. Hence $s=-2$. The entry $t$ already agrees with itself on the diagonal, so $t$ is arbitrary.
  1. Let \(A=(a_{ij})\) be any square matrix. By Lemma 4.90, transposition is compatible with addition, so
\[ (A+A^T)^T = A^T + (A^T)^T. \]
Applying again <a href="../maps/#lem-properties-transposition" data-reference-type="ref+Label" data-reference="lem:properties-transposition">Lemma 4.90</a>, we have $(A^T)^T=A$. Thus
\[ (A+A^T)^T = A^T + A = A + A^T. \]
Hence $A+A^T$ is symmetric by definition.

Solution 4.13.25

(See Exercise 4.24.) Let \(A=(a_{ij})\) and \(B=(b_{ij})\) be square matrices of the same size.

  1. By Definition 4.88, the diagonal entries of \(A^T\) are \((A^T)_{ii}=a_{ii}\) for all \(i\). By definition of the trace (Exercise 4.24), we have
\[ \operatorname{tr}(A^T)=(A^T)_{11}+\dots+(A^T)_{nn}=a_{11}+\dots+a_{nn}=\operatorname{tr}(A). \]
  1. Using the definition of matrix multiplication (Definition 4.48), the \((i,i)\)-entry of \(AB\) is
\[ (AB)_{ii}=\sum_{k=1}^n a_{ik}b_{ki}. \]
Hence
\[ \operatorname{tr}(AB)=\sum_{i=1}^n (AB)_{ii}=\sum_{i=1}^n \sum_{k=1}^n a_{ik}b_{ki}. \]
Similarly,
\[ \operatorname{tr}(BA)=\sum_{k=1}^n (BA)_{kk}=\sum_{k=1}^n \sum_{i=1}^n b_{ki}a_{ik}. \]
These are the same double sum, just with the order of summation reversed. Therefore
\[ \operatorname{tr}(AB)=\operatorname{tr}(BA). \]
This is noteworthy because one generally has $AB\ne BA$, cf. <a href="../maps/#war-not-commutative" data-reference-type="ref+Label" data-reference="war:not-commutative">Warning 4.54</a>.
  1. The trace is additive because diagonal entries add componentwise:
\[ \operatorname{tr}(A+B)=(a_{11}+b_{11})+\dots+(a_{nn}+b_{nn})=\operatorname{tr}(A)+\operatorname{tr}(B). \]
Likewise, scalar multiplication acts componentwise on the diagonal, so
\[ \operatorname{tr}(rA)=ra_{11}+\dots+ra_{nn}=r\,\operatorname{tr}(A). \]
  1. Suppose there were a matrix \(B\) with
\[ AB-BA={\mathrm{id}}. \]
Applying part (3) and then part (2), we obtain
\[ \operatorname{tr}(AB-BA)=\operatorname{tr}(AB)-\operatorname{tr}(BA)=0. \]
But
\[ \operatorname{tr}({\mathrm{id}})=1+\dots+1=n, \]
where $n$ is the size of the matrix. Hence $\operatorname{tr}(AB-BA)\ne \operatorname{tr}({\mathrm{id}})$, a contradiction. Therefore no such matrix $B$ exists.

Solution 4.13.26

(See Exercise 4.25.) Consider the fixed number \(r \in \mathbf R\). If we look for a matrix \(R\) such that for all \(2 \times 2\)-matrices \(A\) we have \(RA = rA\), then this equation applied to \(A = \mathrm{id}_2\) (the identity matrix, cf. Definition 4.57) gives

\[ R \mathrm{id}_2 = r \mathrm{id}_2 = \begin{pmatrix} r & 0 \\ 0 & r \end{pmatrix}. \]

By Lemma 4.59, the left hand side equals \(R\), so we see that we need to have \(R = \begin{pmatrix} r & 0 \\ 0 & r \end{pmatrix}\). One then confirms that the equation \(RA = rA\) holds for all \(A\). This can be done by direct computation.

Solution 4.13.27

(See Exercise 4.26.)

  1. Using Definition 4.48, we compute
\[ A^2= \begin{pmatrix} 0&a&b\\ 0&0&c\\ 0&0&0 \end{pmatrix} \begin{pmatrix} 0&a&b\\ 0&0&c\\ 0&0&0 \end{pmatrix} = \begin{pmatrix} 0&0&ac\\ 0&0&0\\ 0&0&0 \end{pmatrix}. \]
Multiplying once more, we obtain
\[ A^3=A^2A= \begin{pmatrix} 0&0&ac\\ 0&0&0\\ 0&0&0 \end{pmatrix} \begin{pmatrix} 0&a&b\\ 0&0&c\\ 0&0&0 \end{pmatrix} = \begin{pmatrix} 0&0&0\\ 0&0&0\\ 0&0&0 \end{pmatrix}=0. \]
  1. A generalization to arbitrary square matrices is the following: if \(A\) is a strictly upper triangular \(n\times n\)-matrix, meaning that all entries on and below the diagonal are zero, then
\[ A^n=0. \]
Indeed, when one multiplies strictly upper triangular matrices, the first superdiagonal can move only upward. More precisely, every entry of $A^k$ on or below the $(k-1)$-st superdiagonal is zero. After $n$ factors there is no possible nonzero position left, so all entries vanish.

Equivalently, each multiplication shifts possible nonzero entries at least one step farther above the main diagonal; after $n$ steps they have all disappeared. Thus every strictly upper triangular $n\times n$-matrix is nilpotent. In terms of linear maps one can observe that such a matrix $A$ describes a linear map $f_A$ such that $f_A(e_1) = 0$, while $f_A(e_2)$ is a linear combination of $e_1$, $f_A(e_3)$ is a linear combination of $e_1$ and $e_2$, etc. Thus, the image of $f_A$ is contained in the span of $e_1, \dots, e_{n-1}$, and the image of $f_A^2$ is contained in the span of $e_1, \dots, e_{n-2}$ etc., so that $f_A^n = 0$.

Solution 4.13.28

(See Exercise 4.27.) We write any vector \(v = (a,b) = a(1,0) + b(0,1) = ae_1 + be_2\) in terms of \(v_1, v_2\):

\[ v = \alpha v_1 + \beta v_2 = \alpha(1,-3)+\beta(2,1). \]

As an example, if \(v=(0,6)\), then \((0,6)=\alpha(1,-3)+\beta(2,1)\) gives the linear system

\[ \begin{align*} 0 & = \alpha + 2\beta \\ 6 & = -3\alpha + \beta. \end{align*} \]

This can be solved as \(\beta = \frac 67\) and \(\alpha = -\frac{12}7.\) Thus, we can write

\[ v = (0,6)_{(e_1, e_2)} = (-\frac{12}7,\frac 67)_{(v_1, v_2)}, \]

where the subscripts indicate that the coordinates are with respect to the standard basis, resp. to the basis \(v_1, v_2\).

We now determine the base change matrix. We have to write the matrix of the identity map in terms of the standard basis in the domain, and the basis \(v_1, v_2\) in the codomain:

\[ \begin{align*} e_1 = (1,0) \mapsto & {\mathrm {id}}(e_1) = (1,0) = \alpha_1 v_1 + \alpha_2 v_2 \\ e_2 = (0,1) \mapsto & {\mathrm {id}}(e_2) = (0,1) = \beta_1 v_1 + \beta_2 v_2. \end{align*} \]

The base change matrix is then the matrix

\[ \left ( \begin{array}{cc} \alpha_1 & \beta_1 \\ \alpha_2 & \beta_2 \end{array} \right ). \]

We can compute \(\alpha_1\) etc. similarly as above:

\[ \begin{align*} 1 &= \alpha_1 + 2 \alpha_2 \\ 0 &= -3\alpha_1+\alpha_2 \end{align*} \]

We solve this as \(\alpha_1 = \frac 17\), \(\alpha_2 = \frac 37\). As for \(\beta_1, \beta_2\), the relevant system is

\[ \begin{align*} 0 &= \beta_1 + 2 \beta_2 \\ 1 &= -3\beta_1+\beta_2 \end{align*} \]

whose solution is \(\beta_1 = -\frac 27\), \(\beta_2 = \frac 17\). Therefore, the base change matrix is

\[ H = \left ( \begin{array}{cc} \frac 17 & -\frac 27 \\ \frac 37 & \frac 17 \end{array} \right ). \]

We compute the coordinates of \((2,-5)\) in terms of the basis \(v_1, v_2\):

\[ H \left ( \begin{array}{c} 2 \\ -5 \end{array} \right ) = \left ( \begin{array}{cc} \frac 17 & -\frac 27 \\ \frac 37 & \frac 17 \end{array} \right ) \left ( \begin{array}{c} 2 \\ -5 \end{array} \right ) = \left ( \begin{array}{c} \frac{12}7 \\ \frac 17 \end{array} \right ). \]

Thus, the coordinates of \((2, -5)\) with respect to the basis \(v_1, v_2\) is \((\frac {12}7, \frac 17)\).

Solution 4.13.29

(See Exercise 4.28.) We have, for example,

\[ {\mathrm {id}}(v_1) = v_1 = (1,0,-1) = 1 \cdot e_1 + 0 \cdot e_2 + (-1) \cdot e_3. \]

Likewise for \(v_2\) and \(v_3\). Therefore, the base change matrix is

\[ \left ( \begin{array}{ccc} 1 & 2 & -1 \\ 0 & 1 & -1 \\ -1 & 1 & 7 \end{array} \right ). \]

Solution 4.13.30

(See Exercise 4.30.) We follow the given hint, and first compute \(H\). We have

\[ \begin{align*} v_1 \mapsto & v_1 = (1,-1) = e_1 - e_2, \\ v_2 \mapsto & v_2 =(3,-1) = 3e_1 - e_2. \end{align*} \]

Thus \(H = \left ( \begin{array}{cc} 1 & 3 \\ -1 & -1 \end{array} \right )\).

We now compute the base change matrix \(K\) from the standard basis to the basis \(\underline t = \{t_1, t_2, t_3\}\):

\[ e_1 = (1,0,0) \mapsto (1,0,0) = \alpha_1 t_1 + \alpha_2 t_2 + \alpha_3 t_3. \]

Plugging in the values of \(t_1, t_2, t_3\), this gives the linear system

\[ \begin{align*} 1 &= \alpha_1+2\alpha_2 - \alpha_3 \\ 0 & = \alpha_2 - \alpha_3 \\ 0 & = \alpha_1 + \alpha_2 - \alpha_3. \end{align*} \]

This has the solution

\[ \alpha_1 = 0, \ \alpha_2 = 1, \ \alpha_3 = 1. \]

Similarly, if instead of \(e_1 = (1,0,0)\), we consider \(e_2 = (0,1,0)\), the constants in the above system change accordingly to 0, resp. 1, resp. 0 in the three equations above. The solution is then

\[ \alpha_1 = -1, \alpha_2 = 0, \alpha_3 = -1. \]

Similarly, for \(e_3\), we obtain the solution

\[ \alpha_1 = 1, \alpha_2 = -1, \alpha_3 = -1. \]

Hence

\[ K = \left ( \begin{array}{ccc} 0 & -1 & 1 \\ 1 & 0 & -1 \\ 1 & -1 & -1 \end{array} \right ). \]

(Alternatively, using methods of later chapters, one may also compute \(K = T^{-1}\), the inverse of the matrix \(T = \left ( \begin{array}{ccc} 1 & 2 & -1 \\ 0 & 1 & -1 \\ 1 & 1 & -1 \end{array} \right )\) which is the matrix whose columns are the basis vectors \(t_1, t_2\) and \(t_3\).) We compute

\[ \begin{align*} KAH & = \left ( \begin{array}{ccc} 0 & -1 & 1 \\ 1 & 0 & -1 \\ 1 & -1 & -1 \end{array} \right ) \cdot \left ( \begin{array}{cc} 2 & 1 \\ 0 & 1 \\ -3 & 1 \end{array} \right ) \cdot \left ( \begin{array}{cc} 1 & 3 \\ -1 & -1 \end{array} \right ) \\ & = \left ( \begin{array}{ccc} 0 & -1 & 1 \\ 1 & 0 & -1 \\ 1 & -1 & -1 \end{array} \right ) \cdot \left ( \begin{array}{cc} 1 & 5 \\ -1 & -1 \\ -4 & -10 \end{array} \right ) \\ & = \left ( \begin{array}{cc} -3 & -9 \\ 5 & 15 \\ 6 & 16 \end{array} \right ). \end{align*} \]

Solution 4.13.31

(See Exercise 4.32.) We follow the solution of Exercise 4.30, see above:

\[ {\bf R}^2_{\underline v} \xrightarrow[H]{\mathrm {id}} {\bf R}^2_{\underline e} \xrightarrow[A]{f} {\bf R}^2_{\underline e} \xrightarrow[K]{\mathrm {id}} {\bf R}^2_{\underline v}. \]

We have \(H=\left ( \begin{array}{cc} 1 & 1 \\ 1 & 2 \end{array} \right )\). One computes \(K = \left ( \begin{array}{cc} 2 & -1 \\ -1 & 1 \end{array} \right )\) and then

\[ \begin{align*} KAH& = \left ( \begin{array}{cc} 2 & -1 \\ -1 & 1 \end{array} \right ) \left ( \begin{array}{cc} 6 & -1 \\ 2 & 3 \end{array} \right ) \left ( \begin{array}{cc} 1 & 1 \\ 1 & 2 \end{array} \right ) \\ & = \left ( \begin{array}{cc} 2 & -1 \\ -1 & 1 \end{array} \right ) \left ( \begin{array}{cc} 5 & 4 \\ 5 & 8 \end{array} \right ) \\ & = \left ( \begin{array}{cc} 5 & 0 \\ 0 & 4 \end{array} \right ). \end{align*} \]

Solution 4.13.32

(See Exercise 4.33.) For the given matrix \(A\), the system \(Ax =x\) is written out like so:

\[ \begin{align*} x_3 & = x_1 \\ x_2 &= x_2 \\ x_1 &= x_3. \end{align*} \]

The second equation holds for all \(x\), and the first is equivalent to the third. Therefore, the system is equivalent to the one consisting of the single equation

\[ x_1 - x_3 = 0. \]

This corresponds to the system

\[ B x = x, \]

where \(B= (1 \ 0 \ {-1})\) is the corresponding \(1 \times 3\)-matrix. This matrix has rank 1, so that the solution space is two-dimensional, given by

\[ L((1,0,1), (0,1,0)). \]

Solution 4.13.33

(See Exercise 4.34.) We write out the given system

\[ \left ( \begin{array}{ccc} 1 & 0 & 0 \\ 0 & 4 & 2 \\ 0 & 2 & 1 \end{array} \right ) \left ( \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right ) = 5 \left ( \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right ) \]

as

\[ \left [ \left ( \begin{array}{ccc} 1 & 0 & 0 \\ 0 & 4 & 2 \\ 0 & 2 & 1 \end{array} \right ) - \left ( \begin{array}{ccc} 5 & 0 & 0 \\ 0 & 5 & 0 \\ 0 & 0 & 5 \end{array} \right ) \right ] \left ( \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right ) = \left ( \begin{array}{c} 0 \\ 0 \\ 0 \end{array} \right ). \]

This simplifies to

\[ \underbrace{\left ( \begin{array}{ccc} -4 & 0 & 0 \\ 0 & -1 & 2 \\ 0 & 2 & -4 \end{array} \right )}_{=:B} \left ( \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right ) = \left ( \begin{array}{c} 0 \\ 0 \\ 0 \end{array} \right ), \]

which can be solved by bringing \(B\) into row-echelon form:

\[ B \leadsto \left ( \begin{array}{ccc} 1 & 0 & 0 \\ 0 & 1 & -2 \\ 0 & 0 & 0 \end{array} \right ). \]

Thus \(x_1 = 0\), \(x_2 = 2x_3\) and \(x_3\) is a free variable. Thus the solution space of the given system is

\[ L((0, 2, 1)). \]

Solution 4.13.34

(See Exercise 4.35.) We proceed the same way as for Exercise 4.30 (see its solution above):

\[ {\bf R}^2_{\underline v} \xrightarrow[H]{\mathrm {id}} {\bf R}^2_{\underline e} \xrightarrow[A]{f} {\bf R}^2_{\underline e} \xrightarrow[H^{-1}] {\mathrm {id}} {\bf R}^2_{\underline v}. \]

Here \(H\) is the base change matrix from \(\underline v\) to the standard basis \(\underline e = \{e_1 = (1,0), e_2 = (0,1)\}\). The base change matrix from \(\underline e\) to \(\underline v\) is then \(H^{-1}\). From the given vectors we have \(H = \left ( \begin{array}{cc} 2 & 0 \\ 1 & 1 \end{array} \right )\). We compute the inverse \(H^{-1}\) using Theorem 4.80:

\[ \begin{align*} \left ( \begin{array}{cc|cc} 2 & 0 & 1 & 0 \\ 1 & 1 & 0 & 1 \end{array} \right ) & \leadsto \left ( \begin{array}{cc|cc} 1 & 1 & 0 & 1 \\ 2 & 0 & 1 & 0 \end{array} \right ) \leadsto \left ( \begin{array}{cc|cc} 1 & 1 & 0 & 1 \\ 0 & -2 & 1 & -2 \end{array} \right ) \\ & \leadsto \left ( \begin{array}{cc|cc} 1 & 1 & 0 & 1 \\ 0 & 1 & -\frac 12 & 1 \end{array} \right ) \leadsto \left ( \begin{array}{cc|cc} 1 & 0 & \frac 12 & 0 \\ 0 & 1 & -\frac12 & 1 \end{array} \right ) \\ & = ( {\mathrm {id}} \ | \ H^{-1}). \end{align*} \]

Thus, \(H^{-1} = \left ( \begin{array}{cc} \frac12 & 0 \\ -\frac12 & 1 \end{array} \right )\). Therefore the requested matrix of \(f\) is

\[ \begin{align*} H^{-1}AH & = \left ( \begin{array}{cc} \frac12 & 0 \\ -\frac12 & 1 \end{array} \right ) \left ( \begin{array}{cc} 3 & 0 \\ 8 & -1 \end{array} \right ) \left ( \begin{array}{cc} 2 & 0 \\ 1 & 1 \end{array} \right ) = \left ( \begin{array}{cc} \frac12 & 0 \\ -\frac12 & 1 \end{array} \right ) \left ( \begin{array}{cc} 6 & 0 \\ 15 & - \end{array} \right )1 \\ & = \left ( \begin{array}{cc} 3 & 0 \\ 12 & -1 \end{array} \right ). \end{align*} \]

Solution 4.13.35

(See Exercise 4.37.) This can be done as for Exercise 4.35 above. The final solution is

\[ \left ( \begin{array}{ccc} -2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & -1 \end{array} \right ). \]

Solution 4.13.36

(See Exercise 4.38.) One computes this solution space as

\[ L((-1,2,3)) = L((1,-2,-3)). \]

According to Theorem 3.682., this vector \(l_1 = (1,-2,-3)\) can be completed to a basis of \({\bf R}^3\) by picking any basis \(v_1, v_2, v_3\) of \({\bf R}^3\). Then it is possible to find two of these three vectors which together with \(l_1\) will form a basis of \({\bf R}^3\). We pick the standard basis, \(v_1=e_2\), \(v_2=e_2\) and \(v_3=e_3\). We check that \(l_1, e_2, e_3\) form a basis. Indeed, the matrix whose rows are these vectors,

\[ \left ( \begin{array}{ccc} 1 & -2 & -3 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{array} \right ), \]

is a row-echelon matrix with three leading ones, so its rank is 3.

We compute the matrix of \(f\) with respect to this basis \(\underline v = \{l_1, e_2, e_3\}\):

\[ {\bf R}^3_{\underline v} \xrightarrow[H]{{\mathrm {id}}} {\bf R}^3_{\underline e} \xrightarrow[A]{f} {\bf R}^3_{\underline e} \xrightarrow[H^{-1}]{{\mathrm {id}}} {\bf R}^3_{\underline v}. \]

We have

\[ H=\left ( \begin{array}{ccc} 1 & 0 & 0 \\ -2 & 1 & 0 \\ -3 & 0 & 1 \end{array} \right ). \]

We compute the inverse using Theorem 4.80:

\[ \left ( \begin{array}{ccc|ccc} 1 & 0 & 0 & 1 & 0 & 0 \\ -2 & 1 & 0 & 0 & 1 & 0 \\ -3 & 0 & 1 & 0 & 0 & 1 \end{array} \right ) \leadsto \left ( \begin{array}{ccc|ccc} 1 & 0 & 0 & 1 & 0 & 0 \\ 0 & 1 & 0 & 2 & 1 & 0 \\ 0 & 0 & 1 & 3 & 0 & 1 \end{array} \right ) = ({\mathrm {id}} \ | \ H^{-1}). \]

Hence the matrix for \(f\) with respect to the basis \(\underline v\) is

\[ H^{-1}AH = \left ( \begin{array}{ccc} 3 & -1 & 0 \\ 0 & 0 & 1 \\ 0 & -2 & 3 \end{array} \right ). \]

Remark 4.94

The choice of the two vectors \(e_2\) and \(e_3\), in addition to \(l_1\) above, is arbitrary. To begin with, one may choose a different basis (other than the standard basis) to complete \(l_1\) to a basis. Even if one takes the standard basis, for this particular value of \(l_1\), any two of the three vectors \(e_1, e_2, e_3\) together with \(l_1\) would form a basis. The resulting base change matrix \(H\) will then be different, and also the result \(H^{-1}AH\) will be different.

Solution 4.13.37

(See Exercise 4.39.) The vectors

\[ \underline v = \{v_1 = (1,0,1), v_2 = (0,3,-1), v_3 = (0,0,1)\} \]

form a basis of \({\bf R}^3\) since the corresponding matrix \(\left ( \begin{array}{ccc} 1 & 0 & 1 \\ 0 & 3 & -1 \\ 0 & 0 & 1 \end{array} \right )\) has rank 3. Hence we can compute the matrix of \(f\) with respect to that basis as follows:

\[ \begin{align*} f(v_1) &= 0 & = 0 v_1 + 0v_2 + 0v_3 \\ f(v_2) & = v_2 & = 0v_1 + 1v_2 +0v_3 \\ f(v_3) & = (0,0,2) = 2v_3 & = 0v_1 + 0v_2 + 2 v_3. \end{align*} \]

Therefore the matrix of \(f\) with respect to that basis is

\[ A = \left ( \begin{array}{ccc} 0 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 2 \end{array} \right ). \]

As before, we compute that matrix of \(f\) with respect to the standard basis using the method above:

\[ {\bf R}^3_{\underline e} \xrightarrow[K]{{\mathrm {id}}} {\bf R}^3_{\underline v} \xrightarrow[A]{f} {\bf R}^3_{\underline v} \xrightarrow[K^{-1}]{{\mathrm {id}}} {\bf R}^3_{\underline e}. \]

The base change matrix \(K\) is easily read off:

\[ K = \left ( \begin{array}{ccc} 1 & 0 & 0 \\ 0 & 3 & 0 \\ 1 & -1 & 1 \end{array} \right ). \]

We compute the inverse \(K^{-1}\):

\[ \begin{align*} \left ( \begin{array}{ccc|ccc} 1 & 0 & 0 & 1 & 0 & 0 \\ 0 & 3 & 0 & 0 & 1 & 0 \\ 1 & -1 & 1 & 0 & 0 & 1 \end{array} \right ) & \leadsto \left ( \begin{array}{ccc|ccc} 1 & 0 & 0 & 1 & 0 & 0 \\ 0 & 3 & 0 & 0 & 1 & 0 \\ 0 & -1 & 1 & -1 & 0 & 1 \end{array} \right ) \\ & \leadsto \left ( \begin{array}{ccc|ccc} 1 & 0 & 0 & 1 & 0 & 0 \\ 0 & 1 & 0 & 0 & \frac13 & 0 \\ 0 & -1 & 1 & -1 & 0 & 1 \end{array} \right ) \\ & \leadsto \left ( \begin{array}{ccc|ccc} 1 & 0 & 0 & 1 & 0 & 0 \\ 0 & 1 & 0 & 0 & \frac13 & 0 \\ 0 & 0 & 1 & -1 & \frac13 & 1 \end{array} \right ) \\ & = ({\mathrm {id}} \ | \ K^{-1}). \end{align*} \]

Therefore the requested matrix is the product

\[ \begin{align*} KAK^{-1} & = \left ( \begin{array}{ccc} 1 & 0 & 0 \\ 0 & 3 & 0 \\ 1 & -1 & 1 \end{array} \right ) \left ( \begin{array}{ccc} 0 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 2 \end{array} \right ) \left ( \begin{array}{ccc} 1 & 0 & 0 \\ 0 & \frac13 & 0 \\ -1 & \frac13 & 1 \end{array} \right ) \\ & = \left ( \begin{array}{ccc} 1 & 0 & 0 \\ 0 & 3 & 0 \\ 1 & -1 & 1 \end{array} \right ) \left ( \begin{array}{ccc} 0 & 0 & 0 \\ 0 & \frac13 & 0 \\ -2 & \frac 23 & 2 \end{array} \right ) \\ & = \left ( \begin{array}{ccc} 0 & 0 & 0 \\ 0 & 1 & 0 \\ -2 & \frac13 & 2 \end{array} \right ). \end{align*} \]

Solution 4.13.38

(See Exercise 4.41.) It is convenient to observe

\[ 5 \left ( \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right ) = \left ( \begin{array}{ccc} 5 & 0 & 0 \\ 0 & 5 & 0 \\ 0 & 0 & 5 \end{array} \right ) 5 \left ( \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right ). \]

Thus the given system can be rewritten as

\[ \left ( \begin{array}{ccc} 1 & 0 & 0 \\ 0 & 4 & 2 \\ 0 & 2 & 1 \end{array} \right ) \left ( \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right ) = \left ( \begin{array}{ccc} 5 & 0 & 0 \\ 0 & 5 & 0 \\ 0 & 0 & 5 \end{array} \right ) \left ( \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right ). \]

By Lemma 4.59, this is the same as the system

\[ \left [ \left ( \begin{array}{ccc} 1 & 0 & 0 \\ 0 & 4 & 2 \\ 0 & 2 & 1 \end{array} \right ) - \left ( \begin{array}{ccc} 5 & 0 & 0 \\ 0 & 5 & 0 \\ 0 & 0 & 5 \end{array} \right )\right ] \left ( \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right ) = \left ( \begin{array}{c} 0 \\ 0 \\ 0 \end{array} \right ) . \]

The left hand matrix equals \(\left ( \begin{array}{ccc} -4 & 0 & 0 \\ 0 & -1 & 2 \\ 0 & 2 & -4 \end{array} \right )\). From here, one can use the standard method to solve the linear system. (As a forecast to §Chapter 5, one can note that the determinant of the latter matrix is 0, so that the matrix is not invertible. Hence the system above has non-zero solutions.)

Solution 4.13.39

(See Exercise 4.43.) Consider the vectors \(v_1 = (1,0,-1)\), \(v_2 = (1,1,0)\), \(v_3 = (1,0,-2) \in {\bf R}^3\).

We have \(f(v_1) = (3,0,-5) = v_1 + 2v_3\), \(f(v_2) = 2v_2\) and \(f(v_3) = (5,2,-5) = v_1 + 2v_2 + 2v_3\). Therefore the matrix \(A\) with respect to the basis \(\underline v := (v_1, v_2, v_3)\) in the domain and the codomain is

\[ A = \left ( \begin{array}{ccc} 1 & 0 & 1 \\ 0 & 2 & 1 \\ 2 & 0 & 2 \end{array} \right ). \]

In order to to compute the matrix \(B\) with respect to the standard basis of \({\bf R}^3\), we express the standard basis vectors \(e_1, e_2, e_3\) as linear combinations of the \(v_1, v_2, v_3\), then compute \(f(e_k)\) using the linearity of \(f\). We have

\[ \begin{align*} e_1 & = 2v_1 - v_3 \\ e_2 & = v_2 - e_1 = -2v_1 + v_2 + v_3 \\ e_3 & = v_1 - v_3. \end{align*} \]

This implies

\[ \begin{align*} f(e_1) & = 2f(v_1) - f(v_3) = (6,0,-10) - (5,2,-5) = (1, -2, -5) \\ f(e_2) & = -2f(v_1) + f(v_2) + f(v_3) = (-6,0,10) + (2,2,0) + (5,2,-5) = (1, 4, 5) \\ f(e_3) & = f(v_1) - f(v_3) = (3,0,-5) - (5,2,-5) = (-2, -2, 0). \end{align*} \]

Therefore

\[ B = \left ( \begin{array}{ccc} 1 & 1 & -2 \\ -2 & 4 & -2 \\ -5 & 5 & 0 \end{array} \right ). \]

A slightly different way to solve this is to observe that the matrix of \(f\) with respect to the basis \(\underline v\) in the source and the standard basis \(\underline e\) in the target is \(C = \left ( \begin{array}{ccc} 3 & 2 & 5 \\ 0 & 2 & 2 \\ -5 & 0 & -5 \end{array} \right )\). We consider the matrix \(H = \left ( \begin{array}{ccc} 1 & 1 & 1 \\ 0 & 1 & 0 \\ -1 & 0 & -2 \end{array} \right )\) whose columns are the three vectors \(v_i\), i.e., the base change matrix from the basis \(\underline v\) to \(\underline e\). Compute the inverse of this matrix, for example using Theorem 4.80. This gives \(H^{-1} = \left ( \begin{array}{ccc} 2 & -2 & 1 \\ 0 & 1 & 0 \\ -1 & 1 & -1 \end{array} \right )\). This is the base change matrix from \(\underline e\) to the basis \(\underline v\). Then \(B = C H^{-1}\): \({\bf R}^3_{\underline e} \xrightarrow[H^{-1}]{{\mathrm {id}}} {\bf R}^3_{\underline v} \xrightarrow[C]{f} {\bf R}^3_{\underline e}\), which confirms the above computation. Likewise \(A = H^{-1} C\), corresponding to \({\bf R}^3_{\underline v} \xrightarrow[C]{f} {\bf R}^3_{\underline e} \xrightarrow[H^{-1}]{{\mathrm {id}}} {\bf R}^3_{\underline v}\), which confirms the above computation.

The kernel and image of \(f\) can be computed by bringing \(B\) into row echelon form. The result is \(\ker f = \{(x,x,x) | x \in {\bf R}\}\), i.e., it is 1-dimensional and has as a basis vector \((1,1,1)\). We have \({\operatorname{im}\ } f = L((1,4,5),(1,-2,-5))\).

For the last part, we form the matrix

\[ \left ( \begin{array}{ccc|c} 1 & 1 & -2 & 3 \\ -2 & 4 & -2 & t \\ -5 & 5 & 0 & -5 \end{array} \right ) \leadsto \left ( \begin{array}{ccc|c} 1 & 1 & -2 & 3 \\ 0 & 6 & -6 & t+6 \\ 0 & 10 & -10 & 10 \end{array} \right ) \leadsto \left ( \begin{array}{ccc|c} 1 & 1 & -2 & 3 \\ 0 & 0 & 0 & t \\ 0 & 1 & -1 & 1 \end{array} \right ) \]

and bring it into row echelon form as shown above. We find that it has rank 2 (the same as the rank of \(B\)), if and only if \(t = 0\). and \(f^{-1}((3,0,-5)) = \{(x, x-1, x-2) | x \in {\bf R}\}\).

These computations can also be performed with any computer algebra software, such as Wolfram Alpha:

Solution 4.13.40

(See Exercise 4.44.) We form the matrix of \(f\) and bring it to row echelon form:

\[ \left ( \begin{array}{ccc} 1 & -1 & 2 \\ -2 & 3 & -1 \\ 0 & 1 & 3 \\ -1 & 3 & t \end{array} \right ) \leadsto \left ( \begin{array}{ccc} 1 & -1 & 2 \\ 0 & 1 & 3 \\ 0 & 1 & 3 \\ 0 & 2 & t+2 \end{array} \right ) \leadsto \left ( \begin{array}{ccc} 1 & -1 & 2 \\ 0 & 1 & 3 \\ 0 & 0 & 0 \\ 0 & 0 & t-4 \end{array} \right ) \]

The \(\dim {\operatorname{im}\ } f\) equals the rank of that matrix, which is 2 if \(t=4\) and 3 otherwise.

For no \(t \in {\bf R}\), \(f\) is surjective, since \(\dim {\operatorname{im}\ } f \le 3\), but it would need to be 4 for \(f\) to be surjective. For \(t\ne 4\) it is injective. Indeed, the rank-nullity theorem (Theorem 4.26) asserts that \(\dim \ker f = 3 - \dim {\operatorname{im}\ } f\), which is 0 for \(t \ne 4\).

For \(t=4\), we have \(\ker f = L(-5,-3,1)\) and \({\operatorname{im}\ } f = L((-1,3,1,3), (1,-2,0,-1))\).

We have \(w \notin {\operatorname{im}\ } f\) for all \(t \in {\bf R}\). This can be shown by forming the augmented matrix

\[ \left ( \begin{array}{ccc|c} 1 & -1 & 2 & 1 \\ -2 & 3 & -1 & 1 \\ 0 & 1 & 3 & 0 \\ -1 & 3 & t & 1 \end{array} \right ) \leadsto \left ( \begin{array}{ccc|c} 1 & -1 & 2 & 1 \\ 0 & 1 & 3 & 3 \\ 0 & 1 & 3 & 0 \\ -1 & 3 & t & 1 \end{array} \right ) \leadsto \left ( \begin{array}{ccc|c} 1 & -1 & 2 & 1 \\ 0 & 0 & 0 & \underline{3} \\ 0 & 1 & 3 & 0 \\ -1 & 3 & t & 1 \end{array} \right ) \]

and the underlined entry \(3\) shows \(w \notin {\operatorname{im}\ } f\).

For \(t=0\) we have seen above that \(\dim {\operatorname{im}\ } f = 3\). By Theorem 3.68, a basis \(v_1, v_2, v_3\) of \({\operatorname{im}\ } f (\subset {\bf R}^4)\) can be extended to a basis of \({\bf R}^4\), say \(v_1, v_2, v_3, v_4\). By Proposition 4.39, there is a unique linear map \(g : {\bf R}^4 \to {\bf R}^3\) such that \(g(v_i) = e_i\) (for \(i=1,2,3\), and \(e_i\) denotes the \(i\)-th standard basis vector of \({\bf R}^3\), and \(g(v_4) = 0\) (or any other vector in \({\bf R}^3\)). Then \(g(f(x))=x\) for any \(x \in {\bf R}^3\).

Solution 4.13.41

(See Exercise 4.45.) The map \(f\) is given by multiplication with the matrix, which we bring to row echelon form by elementary row operations

\[ \left ( \begin{array}{cccc} 0 & -1 & 2 & 2 \\ 1 & 0 & -1 & 2 \\ -5 & 1 & 2 & -3 \end{array} \right ) \leadsto \left ( \begin{array}{cccc} 0 & -1 & 2 & 0 \\ -3 & 6 & 0 & 4 \\ -8 & 1 & 2 & -3 \end{array} \right ) \leadsto \left ( \begin{array}{cccc} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \\ -2 & 1 & 2 & -3 \end{array} \right ) \]

It has 2 leading ones in the first two columns, so \({\operatorname{im}\ } f = L((0,2,-1,1), (-1,1,2,2))\). The third variable \(z\) is a free variable, and \(\ker f =L(-1,2,1)\).

The space \(W\) defined by the equation \(x_2 - 3x_3 = 0\) is the kernel of the matrix \(\left ( \begin{array}{c} 0 \\ 1 \\ -3 \\ 0 \end{array} \right )\), which gives (for example) three free variables \(x_1, x_3, x_4\), and accordingly a basis consisting of \((1,0,0,0), (0, 3, 1, 0), (0,0,0,1)\). We have \(\dim W = 3\).

An element in \(U = {\operatorname{im}\ } f\) is a linear combination of the first two column vectors, so of the form \((-b, 2a+b, -a+2b, a+2b)\), for arbitrary \(a, b \in {\bf R}\). This lies in \(W\) precisely if \(2a+b-3(-a+2b)=0\), i.e., if \(5a-5b=0\) or if \(a=b\). A basis of \(U \cap W\) is then given by choosing, say, \(a=1\), and \(U \cap W = L(-1,3,1,3)\).

We know \(\dim U+W = \dim U + \dim W - \dim (U \cap W) = 2 + 3 - 1 = 4\). So, for example, the three basis vectors of \(W\), together with any \(u \in U \setminus W\) will form a basis. Thus, \((1,0,0,0), (0, 3, 1, 0), (0,0,0,1), (0, 2, -1, 1)\) forms a basis of \(U+W\). (Note the last vector is not in \(W\) since its does not satisfy the equation \(x_2 - 3x_3 = 0\).)

In order to check when \((a,4,3,b)\) is in the image, we form the matrix corresponding to the linear system:

\[ \left ( \begin{array}{cc|c} 0 & -1 & a \\ 2 & 1 & 4 \\ -1 & 2 & 3 \\ 1 & 2 & b \end{array} \right ) \leadsto \left ( \begin{array}{cc|c} 0 & -1 & a \\ 0 & -3 & 4-2b \\ 0 & 4 & b+3 \\ 1 & 2 & b \end{array} \right ). \]

This shows that \(4-2b=3a\) and \(4(4-2b)+3(b+3)=0\), i.e., \(25-5b=0\), so \(b=5\) and \(a=-2\).

Solution 4.13.42

(See Exercise 4.46.) Part (1): The subspace \(W \subset \mathbf R^4\) is the solution set of a homogeneous linear system. By Definition 2.23, the coefficient matrix describing that linear system reads \(\left ( \begin{array}{cccc} 2 & -1 & -1 & 0 \\ 0 & 1 & -1 & 0 \end{array} \right )\). Thus, (after dividing the first row by 2), the leading ones in this matrix are in the first and second column, so \(x_3\) and \(x_4\) are free variables. By Method 2.31, a basis of \(W\) is given by the vectors \(w_1 = (1,1,1,0)\) and \(w_2 = (0,0,0,1)\). Hence \(\dim W = 2\).

Typical mistakes: It is important to note that \(W\) was defined as a subspace of \({\bf R}^4\). One must therefore take into account \(x_4\), even though it is not present in the linear system as written.

Another typical mistake is to assume one can read of the basis vectors of \(W\) simply as \((2,-1,-1,0)\) and \((0,1,-1,0)\). This is wrong, and can already be seen to be wrong by observing that these vectors do not even lie in \(W\)!

Part (2): To compute the subspace \(U \cap W\) we first apply Lemma 3.32 to write a general vector of \(U\) as

\[ a_1 u_1 + a_2 u_2 + a_3 u_3 = (a_1-a_2, -2a_1+a_2+3 a_3, 2a_2+a_3,a_1-a_3). \]

Recall that \(U \cap W\) consists of those vectors in \(U\) that also lie in \(W\) (Lemma 3.19), so we need to check when the above vector lies in \(W\). This happens if and only if

\[ \begin{align*} 0 & = 2(a_1-a_2)-( -2a_1+a_2+3 a_3)-(2a_2+a_3) = 4a_1 -5a_2-4a_3\\ 0 & = -2a_1+a_2+3 a_3 - (2a_2+a_3) = -2a_1-a_2+2a_3 \end{align*} \]

The solution of this system is \(a_2=0\) and \(a_1=a_3\). Therefore, the space \(U \cap W\) consists of the vectors

\[ a_1 u_1 + 0 u_2 + a_1 u_3 = a_1 (u_1+u_3). \]

Thus, a basis vector of \(U \cap W\) is \(u_1+u_3=(1,1,1,0)\). In particular \(\dim U \cap W=1\).

The three vectors \(u_1, u_2, u_3\) spanning \(U\) are linearly independent, as one sees from bringing the matrix into row echelon form:

\[ \left ( \begin{array}{cccc} 1 & -2 & 0 & 1 \\ -1 & 1 & 2 & 0 \\ 0 & 3 & 1 & -1 \end{array} \right ) \leadsto \left ( \begin{array}{cccc} 1 & -2 & 0 & 1 \\ 0 & -1 & 2 & 1 \\ 0 & 3 & 1 & -1 \end{array} \right ) \leadsto \text{etc.} \]

and seeing that there are three leading ones. Therefore, \(\dim U = 3\). Theorem 3.74 now allows to quickly compute \(\dim (U+W)=\dim U + \dim W - \dim (U \cap W) = 3+2-1=4\). By Theorem 3.68, we therefore obtain \(U+W = \mathbf R^4\). Thus the standard basis \(e_1, \dots, e_4\) is a basis of \(U+W = \mathbf R^4\).

Typical mistake: One should not mix up columns and rows. While it is in principle possible to consider a matrix whose columns are the vectors \(u_1, ...\), one then needs to perform elementary column operations. These are not discussed in these lecture notes (and are not necessary). For consistency with the methods applied in these notes, it is advisable to only write down these vectors in rows as mentioned above.

Part (3) We know that \(\dim W=2\), so if we want that \(U' \oplus W = {\bf R}^4\), we must have \(\dim U' = 2\) by Remark 3.65 Since we know that \(U \cap W = {\bf R} (1,1,1,0)\), it therefore is enough to find two linearly independent vectors \(u'_1, u'_2 \in U\) that are both independent of \(r := (1,1,1,0)\). For example, \(u'_1 = u_1\), \(u'_2 = u_2\) satisfy these requirements, as one sees by bringing

\[ \left ( \begin{array}{c} u_1 \\ u_2 \\ r \end{array} \right ) = \left ( \begin{array}{cccc} 1 & -2 & 0 & 1 \\ -1 & 1 & 2 & 0 \\ 1 & 1 & 1 & 0 \end{array} \right ) \]

into row-echelon form.

Solution 4.13.43

(See Exercise 4.47.) Part (1): Using Gaussian elimination (Method 2.29) we bring the matrix \(A\) into row echelon form (as far as possible):

\[ A = \begin{pmatrix} 1 & 2 & 1 \\ 2 & 4a^2 & -1 \\ -1 & -2 & a \end{pmatrix} \leadsto \left ( \begin{array}{ccc} 1 & 2 & 1 \\ 0 & 4a^2-4 & -3 \\ 0 & 0 & a+1 \end{array} \right ) \]

In order to proceed, we need to distinguish the case \(a=-1\) and \(a \ne -1\). In the former case the matrix reads \(\left ( \begin{array}{ccc} 1 & 2 & 1 \\ 0 & 0 & -3 \\ 0 & 0 & 0 \end{array} \right )\) and has rank 2. In the case \(a \ne -1\), we have \(a+1 \ne 0\), so we can divide the third row by \(a+1\) (cf. Definition 2.28) and get

\[ \left ( \begin{array}{ccc} 1 & 2 & 1 \\ 0 & 4a^2-4 & -3 \\ 0 & 0 & 1 \end{array} \right ) \leadsto \left ( \begin{array}{ccc} 1 & 2 & 1 \\ 0 & 4a^2-4 & 0 \\ 0 & 0 & 1 \end{array} \right ) \]

The rank of this matrix is 3 provided that \(4a^2-4 \ne 0\) and 2 if \(4a^2-4=0\), i.e., if \(a=1\). In summary, the rank is 2 if \(a = \pm 1\) and is 3 otherwise.

Typical mistake: one should not stop the analysis after the first step above.

Part (2): We apply Method 2.31 and bring the augmented matrix of the system into row echelon form:

\[ \begin{align*} \left ( \begin{array}{ccc|c} 1 & 2 & 1 & b_1 \\ 2 & 4 & -1 & b_2 \\ -1 & -2 & 1 & b_3 \end{array} \right ) & \leadsto \left ( \begin{array}{ccc|c} 1 & 2 & 1 & b_1 \\ 0 & 0 & -3 & b_2-2b_1 \\ 0 & 0 & 2 & b_3+b_1 \end{array} \right ) \\ &\leadsto \left ( \begin{array}{ccc|c} 1 & 2 & 1 & b_1 \\ 0 & 0 & -3 & b_2-2b_1 \\ 0 & 0 & 0 & 2(b_2-3b_1)+3(b_3+b_1) \end{array} \right ) \end{align*} \]

There is a solution precisely if the last entry is zero, i.e., if \(2(b_2-2b_1)+3(b_3+b_1) = 0\) or if \(-b_1 +2b_2 + 3b_3=0\).

Typical mistake: It is not legitimate to use the row echelon form of the matrix \(A\) computed in the Part (1) above, and then to append the vector \(\left ( \begin{array}{c} b_1 \\ b_2 \\ b_3 \end{array} \right )\). The row operations performed on the matrix \(A\) must be identically performed on this vector, otherwise one changes the linear system.

Part (3): We again use Gaussian elimination to solve the system (Method 2.31). One forms the augmented matrix

\[ \left ( \begin{array}{ccc|c} 1 & 2 & 1 & 2 \\ 2 & 4 & -1 & 7 \\ -1 & -2 & -1 & c \end{array} \right ) \leadsto \left ( \begin{array}{ccc|c} 1 & 2 & 1 & 2 \\ 0 & 0 & -3 & 3 \\ 0 & 0 & 0 & 2+c \end{array} \right ) \]

This shows that there is a solution if \(c=-2\). One then computes \(x_3=-1\), \(x_1=2-2x_2-x_3=3-2x_2\).

Typical mistake: Again, the row operations performed on the matrix \(A\) must also be performed on the vector \(\left ( \begin{array}{c} 2 \\ 7 \\ c \end{array} \right )\).