Linear maps
Solution 4.13.1
(See Exercise 4.1.) By Definition 4.9, Subsection 4.2.1, for \(A = \left ( \begin{array}{cc} a_{11} & a_{12} \\ a_{21} & a_{22} \end{array} \right )\) and \(v = \left ( \begin{array}{c} x \\ y \end{array} \right )\), we have
So we determine \(A\) by matching the target coordinates.
- Reflection along the \(y\)-axis sends \((x,y) \mapsto (-x,y)\), hence
- “Same point” means \((x,y) \mapsto (x,y)\), so
- “Always the origin” means \((x,y) \mapsto (0,0)\), so
- Reflection along the line \(\{(x,x)\}\) swaps coordinates: \((x,y) \mapsto (y,x)\), hence
- By Example 4.18, a counterclockwise rotation by angle \(r\) is given by
For $r=60^\circ=\pi/3$ this gives
Clockwise by $60^\circ$ means $r=-\pi/3$, so
For comparison with the \(90^\circ\) case, see also Example 4.17.
Solution 4.13.2
(See Exercise 4.2.) Using Definition 4.9, Subsection 4.2.1, for
and \(v=\left ( \begin{array}{c} x \\ y \end{array} \right )\) we have
We want
so comparing coefficients gives \(a_{11}=0\), \(a_{12}=-1\), \(a_{21}=1\), and \(a_{22}=0.\) Hence
Geometrically, this is a counterclockwise rotation by \(90^\circ\) (equivalently, clockwise by \(270^\circ\)), compare Example 4.17.
Solution 4.13.3
(See Exercise 4.3.) By Proposition 4.43 (applied to the standard basis, cf. Example 3.59), the columns of the matrix \(A\) are exactly the vectors \(f(e_1),\dots,f(e_4)\). Hence
By Definition 4.22, to determine \(\ker f\), we need to solve \(Ax=0\). Since there are so many zeros in the matrix, we do this by hand; alternatively one could also use Gaussian elimination (cf. Method 2.31). With \(x=(x_1,x_2,x_3,x_4)^T\), this system reads
From \(2x_1=0\) we get \(x_1=0\), then \(x_4=0\), then \(x_2=0\); \(x_3\) is free. So
To compute the image, we use Proposition 4.32:
Removing the zero vector, consider
If \(a_1u_1+a_2u_2+a_3u_3=0\), then from the second coordinate \(2a_1=0\), hence \(a_1=0\); from the first coordinate \(13a_3=0\), hence \(a_3=0\); from the third coordinate \(7a_2=0\), hence \(a_2=0\). So \(u_1,u_2,u_3\) are linearly independent and therefore form a basis of \(\operatorname{im} f\). Instead of the above approach, one may also bring \(A\) to row-echelon form and observe that the resulting matrix has leading ones in the first, second, and fourth columns, so we again obtain that these three columns of \(A\) form a basis of \(\operatorname{im} f\) (Proposition 4.32). In particular,
As a check, Theorem 4.26 gives
Solution 4.13.4
(See Exercise 4.4.) To compute the rank, we apply Method 2.31, which requires us to bring the matrix into row-echelon form using Gaussian elimination (Method 2.29). We perform the following elementary row operations (Definition 2.28).
This row-echelon matrix has exactly 3 leading ones, so by Proposition 4.32 we get \(\operatorname{rk}(A) = 3.\)
Solution 4.13.5
(See Exercise 4.5.) From Example 4.42 we have
and, since \(e_3=v_2-v_3\) there,
By Proposition 4.19, the matrix of \(f\) with respect to the standard basis in source and target has columns \(f(e_1),f(e_2),f(e_3)\). Hence
This is exactly the matrix already identified in Example 4.42.
Solution 4.13.6
(See Exercise 4.6.) Let \(v_1=(1,1+\lambda,-1)\), \(v_2=(2,\lambda-2,\lambda+2)\). By definition, \(W_\lambda=L(v_1,v_2).\) So a basis of \(W_\lambda\) is either \(\{v_1,v_2\}\) (if they are linearly independent) or a single nonzero vector among them (if they are dependent), cf. Definition 3.46, def:basis.
To find when they are dependent, check whether \(v_2=t v_1\) for some \(t\in{\bf R}\). From the first coordinate we get \(2=t\cdot 1\), hence \(t=2\). Then the second coordinate gives \(\lambda-2=2(1+\lambda)\) which holds precisely if \(\lambda=-4\). For \(\lambda=-4\) the third coordinates also agree. Therefore:
-
If \(\lambda\neq -4\), the vectors \(v_1,v_2\) are linearly independent and therefore a basis of \(W_\lambda\). Thus \(\dim W_\lambda=2\) by Definition 3.63.
-
If \(\lambda=-4\), then \(v_2=2v_1\), so \(v_1=(1,-3,-1)\) is a basis vector of \(W_{-4}\) (or any other non-zero multiple of it, such as \(v_2)\). We have \(\dim W_{-4}=1\).
Solution 4.13.7
(See Exercise 4.7.) Let \(A = \begin{pmatrix} \alpha & 0 & 0 \\ 0 & \alpha & 1+\alpha \\ \alpha & 1 & 2 \end{pmatrix}\) with \(\alpha \in \mathbf{R}\). We determine \(\operatorname{rank}(A)\) (for all \(\alpha\)) by bringing \(A\) to row-echelon form and applying Proposition 4.32.
In order to proceed we need to distinguish whether or not \(\alpha = 0\). If \(\alpha \neq 0\), we can divide the first row by \(\alpha\) (Definition 2.28)
Now, subtract \(\frac{1}{\alpha}\) times the second row from the third row (still for \(\alpha \neq 0\)):
Given that \(\alpha \ne 0\), the rank of this matrix is 2 if \(2-\frac{1+\alpha}\alpha = 0\), i.e., if \(\alpha = 1\), and it is 3 otherwise. This finishes the analysis of the case \(\alpha \ne 0\).
If \(\alpha = 0\), the matrix reads
Since the 3rd row equals the 2nd one, one reads off \(\operatorname{rank}(A) = 2\) in this case.
In total we get the following result:
-
If \(\alpha = 1\) or \(\alpha = 0\), \(\operatorname{rank}(A) = 2\).
-
For all other \(\alpha \neq 0,1\), \(\operatorname{rank}(A) = 3\).
Solution 4.13.8
(See Exercise 4.9.) Let \(f : \mathbf{R}^4 \to \mathbf{R}^3\) be defined by
- According to Definition 4.22, we need to compute the solution set of the system
We do this using Gaussian elimination (<a href="../systems-gaussian-elimination/#met-gaussian-elimination-solve" data-reference-type="ref+Label" data-reference="met:gaussian-elimination-solve">Method 2.31</a>): we bring the coefficient matrix to row echelon form:
Divide row 2 by $5$, then eliminate the $-1$ in row 1, column 2:
Hence $x_3,x_4$ are free, and
Writing $x_3=s$, $x_4=t$, we get
- In order to solve \(f(v) = \begin{pmatrix} 1 \\ -3 \\ -3 \end{pmatrix}\), we again use Gaussian elimination:
The augmented matrix is
By <a href="../systems-gaussian-elimination/#met-gaussian-elimination-solve" data-reference-type="ref+Label" data-reference="met:gaussian-elimination-solve">Method 2.31</a>, the last row implies that there is no solution: this last row would imply $0=-12$, which is false. Therefore the system has no solution, so the preimage is the empty set:
Solution 4.13.9
(See Exercise 4.8.) By Definition 4.22 \(\ker f = \{ \left ( \begin{array}{c} x_1 \\ x_2 \end{array} \right ) \in {\bf R}^2 \ | \ f(\left ( \begin{array}{c} x_1 \\ x_2 \end{array} \right )) = \left ( \begin{array}{c} 0 \\ 0 \\ 0 \end{array} \right ) \}\). I.e., the kernel consists of the solutions of the homongeneous system
Solving this system gives \(x_2 = 0\), then \(x_1 = 0\). Thus, \(\ker f = \{ \left ( \begin{array}{c} 0 \\ 0 \end{array} \right ) \}\).
For the second part note that by Definition 4.20, \({\operatorname{im}\ } f\) consists precisely of those vectors \(\left ( \begin{array}{c} a \\ b \\ c \end{array} \right )\) that are of the form \(\left ( \begin{array}{c} a \\ b \\ c \end{array} \right ) = f (\left ( \begin{array}{c} x_1 \\ x_2 \end{array} \right ))\) for some \(x_1, x_2 \in {\bf R}\). Thus, the question amounts to this: do there exist \(x_1, x_2 \in {\bf R}\) with
Equivalently, this is a (inhomogeneous) linear system:
One finds the solution \(x_1 = 1\), \(x_2 = 0\), i.e., \(\left ( \begin{array}{c} 1 \\ 0 \\ 3 \end{array} \right ) = f (\left ( \begin{array}{c} 1 \\ 0 \end{array} \right ))\). So the vector lies in the image.
Solution 4.13.10
(See Exercise 4.9.) By Definition 4.22, Definition 4.20, both questions are about solving linear systems. We use Gaussian elimination as in Method 2.31.
- To compute \(\ker f\), solve the homogeneous system
Its augmented matrix is
Hence $x_3,x_4$ are free variables and $x_1=-\frac15x_3$, $x_2=\frac35x_3+x_4.$ Putting $x_3=s$, $x_4=t$, this leads to the following basis of the kernel:
- We compute the preimage of \(\left(\begin{smallmatrix}1\\-3\\3\end{smallmatrix}\right)\) again using Gaussian elimination, applied to the augmented matrix describing the inhomogeneous system:
As before, $x_3,x_4$ are free variables
With $x_3=s$, $x_4=t$:
Hence this is an affine plane (solution set of an inhomogeneous system), so it is not a subspace (compare <a href="../maps/#rem-never-subspace">Remark 4.37</a>, <a href="../maps/#thm-solutions-inhomogeneous-system">Theorem 4.36</a>). Alternatively, one may compute this preimage by just finding *one* solution of the inhomogeneous system, such as $(\frac15,-\frac35,0,0)$. Then the preimage is the translate of the kernel by this solution, cf. <a href="../maps/#thm-solutions-inhomogeneous-system" data-reference-type="ref+Label" data-reference="thm:solutions-inhomogeneous-system">Theorem 4.36</a>.
Solution 4.13.11
(See Exercise 4.10.) The rank of \(A_t\) is the dimension of its row space, which we compute by bringing \(A_t\) to row echelon form (Proposition 4.32):
If \(t\ne-4\), then we can further divide the last row by \(t+4 (\ne 0)\), and the rank is then 3. For \(t = -4\), the rank is 2.
For the last task: the rank of \(A_t\) is at most 3. Thus, \(\dim {\operatorname{im}\ } f \le 3\), and therefore by the rank-nullity theorem (Theorem 4.26)
This means that, for all \(t\), the kernel of \(f\) does not just consist of the zero vector, hence the answer to the question is no.
Solution 4.13.12
(See Exercise 4.11.) Let \(f:V\to W\) be linear and let \(U\subset V\) be a subspace.
- We verify that \(f(U)\) is a subspace of \(W\). By definition,
We verify the subspace conditions (<a href="../spaces/#def-subspace" data-reference-type="ref+Label" data-reference="def:subspace">Definition 3.17</a>), similarly as in <a href="../maps/#prop-ker-im-subspace" data-reference-type="ref+Label" data-reference="prop:ker-im-subspace">Proposition 4.23</a>:
- $0_W\in f(U)$: since $U$ is a subspace, $0_V\in U$, and linearity gives $f(0_V)=0_W$ (<a href="../maps/#rem-linear-map-basic" data-reference-type="ref+Label" data-reference="rem:linear-map-basic">Remark 4.4</a>).
- *Closure under addition:* if $y_1,y_2\in f(U)$, then $y_1=f(u_1),y_2=f(u_2)$ for some $u_1,u_2\in U$. Since $U$ is a subspace, $u_1+u_2\in U$, and using the linearity of $f$ (<a href="../maps/#def-linear-map" data-reference-type="ref+Label" data-reference="def:linear-map">Definition 4.1</a>) we obtain
- *Closure under scalar multiplication:* if $y=f(u)\in f(U)$ and $\lambda\in\mathbf R$, then $\lambda u\in U$ and again using linearity of $f$:
Hence $f(U)$ is a subspace of $W$.
- We prove \(\dim f(U)\le \dim U\). Suppose \((u_1,\dots,u_n)\) is a basis of \(U\). (Here we assume for simplicity that \(\dim U < \infty\); in general the proof is essentially the same, though.) Then the vectors \(f(u_1), \dots, f(u_n) \in W\) span \(f(U)\). (Indeed, any \(u \in U\) is of the form \(u = \sum_{i=1}^n a_i u_i\) for certain (unique) \(a_i \in \mathbf R\) (Proposition 3.61). Linearity of \(f\) implies \(f(u) = \sum_{i=1}^n a_i f(u_i)\), so \(f(u)\) is a linear combination of the \(f(u_1)\) etc.) Therefore, by the characterization of dimension via spanning sets (Theorem 3.68),
Solution 4.13.13
(See Exercise 4.12.) Let \(f:V\to W\) be linear and let \(U\subset W\) be a subspace. By Definition 4.20, \(f^{-1}(U)=\{v\in V\mid f(v)\in U\}\).
We verify the subspace conditions for \(f^{-1}(U)\) (Definition 3.17), in the same spirit as Proposition 4.23.
-
Zero vector: Since \(f\) is linear, \(f(0_V)=0_W\) (Remark 4.4). Because \(U\) is a subspace, \(0_W\in U\). Hence \(0_V\in f^{-1}(U)\).
-
Closure under addition: If \(v_1,v_2\in f^{-1}(U)\), then \(f(v_1),f(v_2)\in U\). Since \(U\) is a subspace, \(f(v_1)+f(v_2)\in U\). By linearity of \(f\) (Definition 4.1), we have
so \(v_1+v_2\in f^{-1}(U)\).
- Closure under scalar multiplication: If \(v\in f^{-1}(U)\) and \(\lambda\in\mathbf R\), then \(f(v)\in U\). Since \(U\) is a subspace, \(\lambda f(v)\in U\). Again using linearity of \(f\),
so \(\lambda v\in f^{-1}(U)\).
Therefore \(f^{-1}(U)\) is a subspace of \(V\).
As an aside, we note that the statement of this exercise, applied to \(U=\{0_W\}\) this shows again that \(f^{-1}(\{0_W\})=\ker f\) is a subspace, which is consistent with Proposition 4.23.
Solution 4.13.14
(See Exercise 4.13.) Let
We directly bring \(A\) to row-echelon form by Gaussian elimination:
Hence \(\operatorname{rk}(A)=2\) and \(\dim\ker f=4-2=2\) (by Proposition 4.32 and Theorem 4.26).
For \(x=(x_1,x_2,x_3,x_4)^T\), the reduced system is
With free variables \(x_3=s\), \(x_4=t\):
So one convenient basis is
For the image, use pivot columns (Proposition 4.32). Pivot columns are 1 and 2, so
Now compute \(\ker f\cap\operatorname{im} f\), the intersection of these two subspaces (cf. Section 3.3). A general vector in the image has the form
In order to determine when this lies in the intersection, we also impose the kernel equations on \(y\):
Thus \(a=b\), and therefore
Hence
Solution 4.13.15
(See Exercise 4.14.) Let
- With respect to the standard bases, the matrix is obtained from the coefficients of \(x,y,z\) in each component (Proposition 4.43):
- We bring \(A\) to row-echelon form and read off both kernel and image:
Hence, solving $A\begin{pmatrix}x\\y\\z\end{pmatrix}=0$, we get
With free parameter $z=2t$, this gives $x=t$, $y=-3t$, so
For the image, the pivot columns are the first and second. Therefore, by <a href="../maps/#prop-basis-row-column-space" data-reference-type="ref+Label" data-reference="prop:basis-row-column-space">Proposition 4.32</a>, a basis of ${\operatorname{im}\,}f$ is given by the corresponding columns of the original matrix:
Hence
- By Definition 4.20, \(f^{-1}((0,1))\) is the solution set of the linear system
One may again solve this by Gaussian elimination, or by inspection: $z=2x$, and then $y=1-3x$. With parameter $t:=x$:
This is an affine line (solution set of an inhomogeneous system), not a subspace of ${\bf R}^3$ because it does not contain $0$ (compare <a href="../maps/#rem-never-subspace">Remark 4.37</a>, <a href="../maps/#thm-solutions-inhomogeneous-system">Theorem 4.36</a>).
- Let
To check they form a basis of ${\bf R}^3$, put them as rows and bring to row-echelon form:
This echelon matrix has three nonzero rows, hence $\operatorname{rk}(M)=3$, for example by <a href="../maps/#cor-rows-columns-independent" data-reference-type="ref+Label" data-reference="cor:rows-columns-independent">Corollary 4.93</a>. Therefore $v_1,v_2,v_3$ are linearly independent and form a basis of ${\bf R}^3$.
The matrix of $f$ with respect to this domain basis and the standard basis in ${\bf R}^2$ has columns $f(v_1),f(v_2),f(v_3)$:
So the wanted matrix is
Solution 4.13.16
(See Exercise 4.15.) We have \(v_1 = (1,1,0)\), and compute
In order to confirm that they form a basis, we apply Method 3.44 and Method 3.53 by forming the associated matrix and bringing it into row echelon form:
This matrix has three leading ones, so that the vectors do form a basis.
We compute \(v_4 = f(v_3) = \left ( \begin{array}{ccc} 2 & -1 & 0 \\ 1 & 0 & 2 \\ 0 & 2 & -1 \end{array} \right ) \left ( \begin{array}{c} 1 \\ 5 \\ 0 \end{array} \right ) = \left ( \begin{array}{c} -3 \\ 1 \\ 10 \end{array} \right )\). The equation \(v_4 = a_1 v_1 + a_2 v_2 + a_3 v_3\) is the linear system
We solve this: the last equation gives \(a_2 = 5\), which leads to
Therefore
This can be solved to \(a_3 = 1\) and \(a_1 = -9\). Thus, \((a_1, a_2, a_3) = (1,5,-9)\) are the coordinates of \(v_4\) in the basis \(v_1, v_2, v_3\).
We now determine the matrix of \(f\) with respect to the basis \(v_1, v_2, v_3\) (both in the domain and the codomain of \(f\)). We therefore write each \(f(v_i)\) as a linear combination of these three vectors:
According to Proposition 4.43, the matrix of \(f\) with respect to \(v_1, v_2, v_3\) is
Solution 4.13.17
(See Exercise 4.16.) The map is \(f(x,y,z,t)=(-x+z,\,-y+t,\,x-y)\). Applying \(f\) to the standard basis vectors \(e_1,e_2,e_3,e_4\) of \(\mathbf{R}^4\) gives the columns of the matrix (Proposition 4.43):
We compute the kernel by solving \(Av=0\) by row reduction (Method 2.31):
The free variable is \(t\); setting \(t=\lambda\) gives \(x=\lambda\), \(y=\lambda\), \(z=\lambda\), so
Image. By the rank–nullity theorem (Theorem 4.26), we have \(\dim\operatorname{im} f = \dim\mathbf{R}^4 - \dim\ker f = 4 - 1 = 3.\) Since \(\operatorname{im} f \subseteq \mathbf{R}^3\) and has dimension \(3\), we conclude from Theorem 3.68 that \(\operatorname{im} f = \mathbf{R}^3\). Alternatively, using Proposition 4.32, the first three columns of \(A\) above are another basis of \(\operatorname{im} f\).
Solution 4.13.18
(See Exercise 4.17.)
-
We check the linearity of the functions.
\(f_1\) is linear: \(f_1(x)=(x_1+x_2,\,3x_1,\,2x_2)\) is given by multiplying with the matrix \(\begin{pmatrix}1&1\\3&0\\0&2\end{pmatrix}\), so it is linear by Proposition 4.19. Alternatively, one may also check the linearity directly by applying Definition 4.1.
\(f_2\) is not linear: \(f_2(0)=(0,1)\neq(0,0)\), violating \(f(0)=0\) (Remark 4.4).
\(f_3\) is not linear: the first component \(x_3^2\) is not a linear function of \((x_1,x_2,x_3)\) (e.g. \(f_3(2e_3)=(4,0)\neq 2f_3(e_3)=(2,0)\)).
The identity map \(f_4\) is linear (cf. Example 4.7).
\(f_5\) is linear: \(f_5(x)=(x_1+x_2,\,x_2+x_3,\,x_3+x_1)\) is given by multiplication with the matrix \(\begin{pmatrix}1&1&0\\0&1&1\\1&0&1\end{pmatrix}\).
-
We describe the images of these functions.
For the linear map \(f_1\), we can use Proposition 4.30, according to which \(\operatorname{im} f_1 = \operatorname{span}\!\left\{\begin{pmatrix}1\\3\\0\end{pmatrix},\begin{pmatrix}1\\0\\2\end{pmatrix}\right\}\), a \(2\)-dimensional subspace of \(\mathbf{R}^3\) (the two columns are linearly independent).
\(f_2\) is not linear, but its image is \(\{(x_3+x_1,\,3x_2+4x_1+1)\mid x\in\mathbf{R}^3\}=\mathbf{R}^2\) (e.g. vary \(x_1,x_2,x_3\) freely).
We have \(\operatorname{im} (f_3) = \{(x_3^2,\,x_2+x_1)\mid x\in\mathbf{R}^3\} = [0,\infty)\times\mathbf{R}\) (its first component is a square, the second one is arbitrary).
We clearly have \(\operatorname{im} f_4 = \mathbf{R}^3\).
For \(f_5\), let \(\left ( \begin{array}{c} a \\ b \\ c \end{array} \right ) \in \mathbf{R}^3\). To decide whether it lies in the image, we solve
Solving this, for example using Gaussian elimination, one finds the following solutions:
Thus for every $\left ( \begin{array}{c} a \\ b \\ c \end{array} \right ) \in \mathbf{R}^3$ there is a solution, so $\operatorname{im} f_5=\mathbf{R}^3$.
-
Injectivity and surjectivity (for the linear maps \(f_1,f_4,f_5\)). For the linear maps we can use the rank-nullity theorem (Theorem 4.26) and our knowledge about the image to determine injectivity. Recall from Lemma 4.25 that a linear map \(f\) is injective precisely if its kernel \(\ker f\) consists only of the zero vector, or equivalently if \(\dim \ker f = 0\).
For \(f_1\): the domain is \(\mathbf{R}^2\), we have seen \(\dim\operatorname{im} f_1=\dim\mathbf{R}^2=2\), so \(\ker f_1=\{0\}\) by rank–nullity (Theorem 4.26), so \(f_1\) is injective. It is not surjective since \(\dim \operatorname{im} f_1 = 2 < 3 = \dim \mathbf{R}^3\).
The identity map \(f_4\) is bijective (i.e., injective and surjective).
The map \(f_5\) is also bijective. To see this, we again use rank-nullity, which implies \(\dim \ker f_5 = 3 - \dim \operatorname{im} (f_5) = 3 - 3 = 0\).
Solution 4.13.19
(See Exercise 4.18.)
- Put the vectors as rows of a matrix and row-reduce:
This row-echelon matrix has two nonzero rows, so by <a href="../maps/#prop-basis-row-column-space" data-reference-type="ref+Label" data-reference="prop:basis-row-column-space">Proposition 4.32</a> we read off
where $U=L(v_1,v_2,v_3)$. A basis of $U$ is given by the nonzero rows of the echelon form:
(From the row operation $R_3-R_1=R_2-3R_1$ we get
so equivalently $U=L(v_1,v_3)$, i.e., $v_1$ and $v_3$ also form a basis of $U$.)
- We extend two independent vectors to a basis of \({\bf R}^4\). For example, take
Then the vectors $v_1,v_3,e_1,e_2$ form a basis of ${\bf R}^4$. To check their linear independence, we again form the matrix comprised of these vectors
One checks that the rank of this matrix is 4. Thus the rows are linearly independent, hence form a basis of ${\bf R}^4$.
- Yes, such an \(f:{\bf R}^4\to{\bf R}^2\) exists. Define it on the basis \(v_1,v_3,e_1,e_2\) by
(or any other choices for $f(e_1)$, $f(e_2)$ would also do). By <a href="../maps/#prop-linear-map-defined-on-basis" data-reference-type="ref+Label" data-reference="prop:linear-map-defined-on-basis">Proposition 4.39</a>, this determines a unique linear map $f:{\bf R}^4\to{\bf R}^2$. Since $U=L(v_1,v_3)$, we get
For $g:{\bf R}^4\to{\bf R}^3$, the answer is no. Indeed, the restriction
is a linear map with image $g(U)$. By <a href="../maps/#cor-maps-dim" data-reference-type="ref+Label" data-reference="cor:maps-dim">Corollary 4.28</a>,
Therefore $g(U)$ cannot be all of ${\bf R}^3$, since ${\bf R}^3$ has dimension $3$.
Solution 4.13.20
(See Exercise 4.19.) We attempt to invert \(A\) by row-reducing the augmented matrix \([A\mid I]\):
The third row of the left block is all zeros, so \(A\) does not row-reduce to the identity. Therefore \(A\) is not invertible, according to Theorem 4.80.
Solution 4.13.21
(See Exercise 4.20.) We compute \(AB\) and \(BA\) in each case, by applying Definition 4.48:
-
For \(A=\begin{pmatrix}1&2\\2&1\end{pmatrix}\), \(B=\begin{pmatrix}0&1\\-1&1\end{pmatrix}\) we obtain \(AB=\begin{pmatrix}-2&3\\-1&3\end{pmatrix}\), as opposed to \(BA=\begin{pmatrix}2&1\\1&-1\end{pmatrix}\). Hence \(AB\neq BA\), confirming that matrix multiplication is in general not commutative, cf. Warning 4.54.
-
For \(A=\begin{pmatrix}3&0\\0&4\end{pmatrix}\), \(B=\begin{pmatrix}-1&0\\0&2\end{pmatrix}\) we compute \(AB=\begin{pmatrix}-3&0\\0&8\end{pmatrix}\), which happens to be equal to \(BA=\begin{pmatrix}-3&0\\0&8\end{pmatrix}\).
-
For \(A=\begin{pmatrix}1&x\\0&1\end{pmatrix}\) and \(B=\begin{pmatrix}y&1\\0&y\end{pmatrix}\) we get
- For \(A=\begin{pmatrix}1&x\\0&1\end{pmatrix}\) and \(B=\begin{pmatrix}y&1\\0&y\end{pmatrix}\) we obtain
Since $1+xy=xy+1$, we get $AB=BA$ for all $x,y\in\mathbf{R}$. The matrices $A$ and $B$ are examples of *elementary matrices*, cf. <a href="../maps/#def-elementary-matrices" data-reference-type="ref+Label" data-reference="def:elementary-matrices">Definition 4.61</a>.
- For \(A=\begin{pmatrix}3&0\\0&3\end{pmatrix}=3I_2\) and arbitrary \(B \in \mathrm{Mat}_{2 \times 2}\), we get
So $AB=BA$ for every such $B$.
- Let \(A\) be arbitrary and \(B=A^2\). Then, by associativity of matrix multiplication (cf. Lemma 4.59),
So $AB=BA$ always.
Solution 4.13.22
(See Exercise 4.21.) We use that a matrix \(A\in \mathrm{Mat}_{2\times 2}\) determines a linear map
by Proposition 4.19, and conversely that a linear map is determined by the images of a basis, by Proposition 4.43.
- The vectors \(e_1=(1,0)\) and \(e_2=(0,1)\) form the standard basis of \(\mathbf R^2\) (cf. Example 3.59). Hence prescribing
determines the linear map uniquely. By <a href="../maps/#prop-matrix-to-linear-map" data-reference-type="ref+Label" data-reference="prop:matrix-to-linear-map">Proposition 4.43</a>, the corresponding matrix has these vectors as its columns:
- Again \(e_1=(1,0)\) is prescribed, and the second condition is compatible with linearity because
using <a href="../maps/#def-linear-map" data-reference-type="ref+Label" data-reference="def:linear-map">Definition 4.1</a>. So such matrices do exist.
However, the matrix is not unique, because we only prescribe the image of the line $L(e_1)$, not of a full basis. The vector $e_2=(0,1)$ may be sent to any vector of $\mathbf R^2$. Thus all solutions are of the form
Indeed, for such a matrix,
- Here the conditions are incompatible with linearity. As above,
again by <a href="../maps/#def-linear-map" data-reference-type="ref+Label" data-reference="def:linear-map">Definition 4.1</a>. But the exercise requires instead
which is different. Therefore no such matrix exists.
Geometrically, the vector $(2,0)$ lies on the same line as $(1,0)$, and a linear map must preserve scalar multiplication along that line. So once the image of $(1,0)$ is fixed, the image of $(2,0)$ is forced to be twice as large.
Solution 4.13.23
(See Exercise 4.22.) One possible choice is
Both matrices are nonzero. Using the definition of matrix multiplication (Definition 4.48), we compute
Hence \(AB=0\), although \(A\neq 0\) and \(B\neq 0\).
Using that matrices give rise to linear maps (Proposition 4.19), this exercise may also be understood in terms of linear maps. Consider the linear maps \(f_A,f_B:\mathbf R^2\to\mathbf R^2\). In light of Proposition 4.51, the relation \(AB=0\) means that
so every vector in the image of \(f_B\) lies in the kernel of \(f_A\) (cf. Definition 4.20, Definition 4.22). In our example,
That is why the composition is the zero map even though neither of the two maps is zero.
Solution 4.13.24
(See Exercise 4.23.)
- By Definition 4.88,
The matrix is symmetric if and only if it is equal to its transpose, so the off-diagonal entries must agree. Hence $s=-2$. The entry $t$ already agrees with itself on the diagonal, so $t$ is arbitrary.
- Let \(A=(a_{ij})\) be any square matrix. By Lemma 4.90, transposition is compatible with addition, so
Applying again <a href="../maps/#lem-properties-transposition" data-reference-type="ref+Label" data-reference="lem:properties-transposition">Lemma 4.90</a>, we have $(A^T)^T=A$. Thus
Hence $A+A^T$ is symmetric by definition.
Solution 4.13.25
(See Exercise 4.24.) Let \(A=(a_{ij})\) and \(B=(b_{ij})\) be square matrices of the same size.
- By Definition 4.88, the diagonal entries of \(A^T\) are \((A^T)_{ii}=a_{ii}\) for all \(i\). By definition of the trace (Exercise 4.24), we have
- Using the definition of matrix multiplication (Definition 4.48), the \((i,i)\)-entry of \(AB\) is
Hence
Similarly,
These are the same double sum, just with the order of summation reversed. Therefore
This is noteworthy because one generally has $AB\ne BA$, cf. <a href="../maps/#war-not-commutative" data-reference-type="ref+Label" data-reference="war:not-commutative">Warning 4.54</a>.
- The trace is additive because diagonal entries add componentwise:
Likewise, scalar multiplication acts componentwise on the diagonal, so
- Suppose there were a matrix \(B\) with
Applying part (3) and then part (2), we obtain
But
where $n$ is the size of the matrix. Hence $\operatorname{tr}(AB-BA)\ne \operatorname{tr}({\mathrm{id}})$, a contradiction. Therefore no such matrix $B$ exists.
Solution 4.13.26
(See Exercise 4.25.) Consider the fixed number \(r \in \mathbf R\). If we look for a matrix \(R\) such that for all \(2 \times 2\)-matrices \(A\) we have \(RA = rA\), then this equation applied to \(A = \mathrm{id}_2\) (the identity matrix, cf. Definition 4.57) gives
By Lemma 4.59, the left hand side equals \(R\), so we see that we need to have \(R = \begin{pmatrix} r & 0 \\ 0 & r \end{pmatrix}\). One then confirms that the equation \(RA = rA\) holds for all \(A\). This can be done by direct computation.
Solution 4.13.27
(See Exercise 4.26.)
- Using Definition 4.48, we compute
Multiplying once more, we obtain
- A generalization to arbitrary square matrices is the following: if \(A\) is a strictly upper triangular \(n\times n\)-matrix, meaning that all entries on and below the diagonal are zero, then
Indeed, when one multiplies strictly upper triangular matrices, the first superdiagonal can move only upward. More precisely, every entry of $A^k$ on or below the $(k-1)$-st superdiagonal is zero. After $n$ factors there is no possible nonzero position left, so all entries vanish.
Equivalently, each multiplication shifts possible nonzero entries at least one step farther above the main diagonal; after $n$ steps they have all disappeared. Thus every strictly upper triangular $n\times n$-matrix is nilpotent. In terms of linear maps one can observe that such a matrix $A$ describes a linear map $f_A$ such that $f_A(e_1) = 0$, while $f_A(e_2)$ is a linear combination of $e_1$, $f_A(e_3)$ is a linear combination of $e_1$ and $e_2$, etc. Thus, the image of $f_A$ is contained in the span of $e_1, \dots, e_{n-1}$, and the image of $f_A^2$ is contained in the span of $e_1, \dots, e_{n-2}$ etc., so that $f_A^n = 0$.
Solution 4.13.28
(See Exercise 4.27.) We write any vector \(v = (a,b) = a(1,0) + b(0,1) = ae_1 + be_2\) in terms of \(v_1, v_2\):
As an example, if \(v=(0,6)\), then \((0,6)=\alpha(1,-3)+\beta(2,1)\) gives the linear system
This can be solved as \(\beta = \frac 67\) and \(\alpha = -\frac{12}7.\) Thus, we can write
where the subscripts indicate that the coordinates are with respect to the standard basis, resp. to the basis \(v_1, v_2\).
We now determine the base change matrix. We have to write the matrix of the identity map in terms of the standard basis in the domain, and the basis \(v_1, v_2\) in the codomain:
The base change matrix is then the matrix
We can compute \(\alpha_1\) etc. similarly as above:
We solve this as \(\alpha_1 = \frac 17\), \(\alpha_2 = \frac 37\). As for \(\beta_1, \beta_2\), the relevant system is
whose solution is \(\beta_1 = -\frac 27\), \(\beta_2 = \frac 17\). Therefore, the base change matrix is
We compute the coordinates of \((2,-5)\) in terms of the basis \(v_1, v_2\):
Thus, the coordinates of \((2, -5)\) with respect to the basis \(v_1, v_2\) is \((\frac {12}7, \frac 17)\).
Solution 4.13.29
(See Exercise 4.28.) We have, for example,
Likewise for \(v_2\) and \(v_3\). Therefore, the base change matrix is
Solution 4.13.30
(See Exercise 4.30.) We follow the given hint, and first compute \(H\). We have
Thus \(H = \left ( \begin{array}{cc} 1 & 3 \\ -1 & -1 \end{array} \right )\).
We now compute the base change matrix \(K\) from the standard basis to the basis \(\underline t = \{t_1, t_2, t_3\}\):
Plugging in the values of \(t_1, t_2, t_3\), this gives the linear system
This has the solution
Similarly, if instead of \(e_1 = (1,0,0)\), we consider \(e_2 = (0,1,0)\), the constants in the above system change accordingly to 0, resp. 1, resp. 0 in the three equations above. The solution is then
Similarly, for \(e_3\), we obtain the solution
Hence
(Alternatively, using methods of later chapters, one may also compute \(K = T^{-1}\), the inverse of the matrix \(T = \left ( \begin{array}{ccc} 1 & 2 & -1 \\ 0 & 1 & -1 \\ 1 & 1 & -1 \end{array} \right )\) which is the matrix whose columns are the basis vectors \(t_1, t_2\) and \(t_3\).) We compute
Solution 4.13.31
(See Exercise 4.32.) We follow the solution of Exercise 4.30, see above:
We have \(H=\left ( \begin{array}{cc} 1 & 1 \\ 1 & 2 \end{array} \right )\). One computes \(K = \left ( \begin{array}{cc} 2 & -1 \\ -1 & 1 \end{array} \right )\) and then
Solution 4.13.32
(See Exercise 4.33.) For the given matrix \(A\), the system \(Ax =x\) is written out like so:
The second equation holds for all \(x\), and the first is equivalent to the third. Therefore, the system is equivalent to the one consisting of the single equation
This corresponds to the system
where \(B= (1 \ 0 \ {-1})\) is the corresponding \(1 \times 3\)-matrix. This matrix has rank 1, so that the solution space is two-dimensional, given by
Solution 4.13.33
(See Exercise 4.34.) We write out the given system
as
This simplifies to
which can be solved by bringing \(B\) into row-echelon form:
Thus \(x_1 = 0\), \(x_2 = 2x_3\) and \(x_3\) is a free variable. Thus the solution space of the given system is
Solution 4.13.34
(See Exercise 4.35.) We proceed the same way as for Exercise 4.30 (see its solution above):
Here \(H\) is the base change matrix from \(\underline v\) to the standard basis \(\underline e = \{e_1 = (1,0), e_2 = (0,1)\}\). The base change matrix from \(\underline e\) to \(\underline v\) is then \(H^{-1}\). From the given vectors we have \(H = \left ( \begin{array}{cc} 2 & 0 \\ 1 & 1 \end{array} \right )\). We compute the inverse \(H^{-1}\) using Theorem 4.80:
Thus, \(H^{-1} = \left ( \begin{array}{cc} \frac12 & 0 \\ -\frac12 & 1 \end{array} \right )\). Therefore the requested matrix of \(f\) is
Solution 4.13.35
(See Exercise 4.37.) This can be done as for Exercise 4.35 above. The final solution is
Solution 4.13.36
(See Exercise 4.38.) One computes this solution space as
According to Theorem 3.682., this vector \(l_1 = (1,-2,-3)\) can be completed to a basis of \({\bf R}^3\) by picking any basis \(v_1, v_2, v_3\) of \({\bf R}^3\). Then it is possible to find two of these three vectors which together with \(l_1\) will form a basis of \({\bf R}^3\). We pick the standard basis, \(v_1=e_2\), \(v_2=e_2\) and \(v_3=e_3\). We check that \(l_1, e_2, e_3\) form a basis. Indeed, the matrix whose rows are these vectors,
is a row-echelon matrix with three leading ones, so its rank is 3.
We compute the matrix of \(f\) with respect to this basis \(\underline v = \{l_1, e_2, e_3\}\):
We have
We compute the inverse using Theorem 4.80:
Hence the matrix for \(f\) with respect to the basis \(\underline v\) is
Remark 4.94
The choice of the two vectors \(e_2\) and \(e_3\), in addition to \(l_1\) above, is arbitrary. To begin with, one may choose a different basis (other than the standard basis) to complete \(l_1\) to a basis. Even if one takes the standard basis, for this particular value of \(l_1\), any two of the three vectors \(e_1, e_2, e_3\) together with \(l_1\) would form a basis. The resulting base change matrix \(H\) will then be different, and also the result \(H^{-1}AH\) will be different.
Solution 4.13.37
(See Exercise 4.39.) The vectors
form a basis of \({\bf R}^3\) since the corresponding matrix \(\left ( \begin{array}{ccc} 1 & 0 & 1 \\ 0 & 3 & -1 \\ 0 & 0 & 1 \end{array} \right )\) has rank 3. Hence we can compute the matrix of \(f\) with respect to that basis as follows:
Therefore the matrix of \(f\) with respect to that basis is
As before, we compute that matrix of \(f\) with respect to the standard basis using the method above:
The base change matrix \(K\) is easily read off:
We compute the inverse \(K^{-1}\):
Therefore the requested matrix is the product
Solution 4.13.38
(See Exercise 4.41.) It is convenient to observe
Thus the given system can be rewritten as
By Lemma 4.59, this is the same as the system
The left hand matrix equals \(\left ( \begin{array}{ccc} -4 & 0 & 0 \\ 0 & -1 & 2 \\ 0 & 2 & -4 \end{array} \right )\). From here, one can use the standard method to solve the linear system. (As a forecast to §Chapter 5, one can note that the determinant of the latter matrix is 0, so that the matrix is not invertible. Hence the system above has non-zero solutions.)
Solution 4.13.39
(See Exercise 4.43.) Consider the vectors \(v_1 = (1,0,-1)\), \(v_2 = (1,1,0)\), \(v_3 = (1,0,-2) \in {\bf R}^3\).
We have \(f(v_1) = (3,0,-5) = v_1 + 2v_3\), \(f(v_2) = 2v_2\) and \(f(v_3) = (5,2,-5) = v_1 + 2v_2 + 2v_3\). Therefore the matrix \(A\) with respect to the basis \(\underline v := (v_1, v_2, v_3)\) in the domain and the codomain is
In order to to compute the matrix \(B\) with respect to the standard basis of \({\bf R}^3\), we express the standard basis vectors \(e_1, e_2, e_3\) as linear combinations of the \(v_1, v_2, v_3\), then compute \(f(e_k)\) using the linearity of \(f\). We have
This implies
Therefore
A slightly different way to solve this is to observe that the matrix of \(f\) with respect to the basis \(\underline v\) in the source and the standard basis \(\underline e\) in the target is \(C = \left ( \begin{array}{ccc} 3 & 2 & 5 \\ 0 & 2 & 2 \\ -5 & 0 & -5 \end{array} \right )\). We consider the matrix \(H = \left ( \begin{array}{ccc} 1 & 1 & 1 \\ 0 & 1 & 0 \\ -1 & 0 & -2 \end{array} \right )\) whose columns are the three vectors \(v_i\), i.e., the base change matrix from the basis \(\underline v\) to \(\underline e\). Compute the inverse of this matrix, for example using Theorem 4.80. This gives \(H^{-1} = \left ( \begin{array}{ccc} 2 & -2 & 1 \\ 0 & 1 & 0 \\ -1 & 1 & -1 \end{array} \right )\). This is the base change matrix from \(\underline e\) to the basis \(\underline v\). Then \(B = C H^{-1}\): \({\bf R}^3_{\underline e} \xrightarrow[H^{-1}]{{\mathrm {id}}} {\bf R}^3_{\underline v} \xrightarrow[C]{f} {\bf R}^3_{\underline e}\), which confirms the above computation. Likewise \(A = H^{-1} C\), corresponding to \({\bf R}^3_{\underline v} \xrightarrow[C]{f} {\bf R}^3_{\underline e} \xrightarrow[H^{-1}]{{\mathrm {id}}} {\bf R}^3_{\underline v}\), which confirms the above computation.
The kernel and image of \(f\) can be computed by bringing \(B\) into row echelon form. The result is \(\ker f = \{(x,x,x) | x \in {\bf R}\}\), i.e., it is 1-dimensional and has as a basis vector \((1,1,1)\). We have \({\operatorname{im}\ } f = L((1,4,5),(1,-2,-5))\).
For the last part, we form the matrix
and bring it into row echelon form as shown above. We find that it has rank 2 (the same as the rank of \(B\)), if and only if \(t = 0\). and \(f^{-1}((3,0,-5)) = \{(x, x-1, x-2) | x \in {\bf R}\}\).
These computations can also be performed with any computer algebra software, such as Wolfram Alpha:
-
https://www.wolframalpha.com/input?i=Inverse%28%28%281%2C1%2C1%29%2C%280%2C1%2C0%29%2C%28-1%2C0%2C-2%29%29%29 (compute the inverse of \(H\))
-
https://www.wolframalpha.com/input?i=%28%283%2C2%2C5%29%2C%280%2C2%2C2%29%2C%28-5%2C0%2C-5%29%29*Inverse%28%28%281%2C1%2C1%29%2C%280%2C1%2C0%29%2C%28-1%2C0%2C-2%29%29%29 (compute \(H^{-1}C\))
-
https://www.wolframalpha.com/input?i=kernel%28%281%2C1%2C-2%29%2C%28-2%2C4%2C-2%29%2C%28-5%2C5%2C0%29%29, https://www.wolframalpha.com/input?i=column+space%28%281%2C1%2C-2%29%2C%28-2%2C4%2C-2%29%2C%28-5%2C5%2C0%29%29 (computation of kernel and image of \(f\))
-
https://www.wolframalpha.com/input?i=Solve%28%28%281%2C1%2C-2%29%2C%28-2%2C4%2C-2%29%2C%28-5%2C5%2C0%29%29*%28x%2Cy%2Cz%29%3D%283%2Ct%2C-5%29%29 (compute the preimage of a vector)
Solution 4.13.40
(See Exercise 4.44.) We form the matrix of \(f\) and bring it to row echelon form:
The \(\dim {\operatorname{im}\ } f\) equals the rank of that matrix, which is 2 if \(t=4\) and 3 otherwise.
For no \(t \in {\bf R}\), \(f\) is surjective, since \(\dim {\operatorname{im}\ } f \le 3\), but it would need to be 4 for \(f\) to be surjective. For \(t\ne 4\) it is injective. Indeed, the rank-nullity theorem (Theorem 4.26) asserts that \(\dim \ker f = 3 - \dim {\operatorname{im}\ } f\), which is 0 for \(t \ne 4\).
For \(t=4\), we have \(\ker f = L(-5,-3,1)\) and \({\operatorname{im}\ } f = L((-1,3,1,3), (1,-2,0,-1))\).
We have \(w \notin {\operatorname{im}\ } f\) for all \(t \in {\bf R}\). This can be shown by forming the augmented matrix
and the underlined entry \(3\) shows \(w \notin {\operatorname{im}\ } f\).
For \(t=0\) we have seen above that \(\dim {\operatorname{im}\ } f = 3\). By Theorem 3.68, a basis \(v_1, v_2, v_3\) of \({\operatorname{im}\ } f (\subset {\bf R}^4)\) can be extended to a basis of \({\bf R}^4\), say \(v_1, v_2, v_3, v_4\). By Proposition 4.39, there is a unique linear map \(g : {\bf R}^4 \to {\bf R}^3\) such that \(g(v_i) = e_i\) (for \(i=1,2,3\), and \(e_i\) denotes the \(i\)-th standard basis vector of \({\bf R}^3\), and \(g(v_4) = 0\) (or any other vector in \({\bf R}^3\)). Then \(g(f(x))=x\) for any \(x \in {\bf R}^3\).
Solution 4.13.41
(See Exercise 4.45.) The map \(f\) is given by multiplication with the matrix, which we bring to row echelon form by elementary row operations
It has 2 leading ones in the first two columns, so \({\operatorname{im}\ } f = L((0,2,-1,1), (-1,1,2,2))\). The third variable \(z\) is a free variable, and \(\ker f =L(-1,2,1)\).
The space \(W\) defined by the equation \(x_2 - 3x_3 = 0\) is the kernel of the matrix \(\left ( \begin{array}{c} 0 \\ 1 \\ -3 \\ 0 \end{array} \right )\), which gives (for example) three free variables \(x_1, x_3, x_4\), and accordingly a basis consisting of \((1,0,0,0), (0, 3, 1, 0), (0,0,0,1)\). We have \(\dim W = 3\).
An element in \(U = {\operatorname{im}\ } f\) is a linear combination of the first two column vectors, so of the form \((-b, 2a+b, -a+2b, a+2b)\), for arbitrary \(a, b \in {\bf R}\). This lies in \(W\) precisely if \(2a+b-3(-a+2b)=0\), i.e., if \(5a-5b=0\) or if \(a=b\). A basis of \(U \cap W\) is then given by choosing, say, \(a=1\), and \(U \cap W = L(-1,3,1,3)\).
We know \(\dim U+W = \dim U + \dim W - \dim (U \cap W) = 2 + 3 - 1 = 4\). So, for example, the three basis vectors of \(W\), together with any \(u \in U \setminus W\) will form a basis. Thus, \((1,0,0,0), (0, 3, 1, 0), (0,0,0,1), (0, 2, -1, 1)\) forms a basis of \(U+W\). (Note the last vector is not in \(W\) since its does not satisfy the equation \(x_2 - 3x_3 = 0\).)
In order to check when \((a,4,3,b)\) is in the image, we form the matrix corresponding to the linear system:
This shows that \(4-2b=3a\) and \(4(4-2b)+3(b+3)=0\), i.e., \(25-5b=0\), so \(b=5\) and \(a=-2\).
Solution 4.13.42
(See Exercise 4.46.) Part (1): The subspace \(W \subset \mathbf R^4\) is the solution set of a homogeneous linear system. By Definition 2.23, the coefficient matrix describing that linear system reads \(\left ( \begin{array}{cccc} 2 & -1 & -1 & 0 \\ 0 & 1 & -1 & 0 \end{array} \right )\). Thus, (after dividing the first row by 2), the leading ones in this matrix are in the first and second column, so \(x_3\) and \(x_4\) are free variables. By Method 2.31, a basis of \(W\) is given by the vectors \(w_1 = (1,1,1,0)\) and \(w_2 = (0,0,0,1)\). Hence \(\dim W = 2\).
Typical mistakes: It is important to note that \(W\) was defined as a subspace of \({\bf R}^4\). One must therefore take into account \(x_4\), even though it is not present in the linear system as written.
Another typical mistake is to assume one can read of the basis vectors of \(W\) simply as \((2,-1,-1,0)\) and \((0,1,-1,0)\). This is wrong, and can already be seen to be wrong by observing that these vectors do not even lie in \(W\)!
Part (2): To compute the subspace \(U \cap W\) we first apply Lemma 3.32 to write a general vector of \(U\) as
Recall that \(U \cap W\) consists of those vectors in \(U\) that also lie in \(W\) (Lemma 3.19), so we need to check when the above vector lies in \(W\). This happens if and only if
The solution of this system is \(a_2=0\) and \(a_1=a_3\). Therefore, the space \(U \cap W\) consists of the vectors
Thus, a basis vector of \(U \cap W\) is \(u_1+u_3=(1,1,1,0)\). In particular \(\dim U \cap W=1\).
The three vectors \(u_1, u_2, u_3\) spanning \(U\) are linearly independent, as one sees from bringing the matrix into row echelon form:
and seeing that there are three leading ones. Therefore, \(\dim U = 3\). Theorem 3.74 now allows to quickly compute \(\dim (U+W)=\dim U + \dim W - \dim (U \cap W) = 3+2-1=4\). By Theorem 3.68, we therefore obtain \(U+W = \mathbf R^4\). Thus the standard basis \(e_1, \dots, e_4\) is a basis of \(U+W = \mathbf R^4\).
Typical mistake: One should not mix up columns and rows. While it is in principle possible to consider a matrix whose columns are the vectors \(u_1, ...\), one then needs to perform elementary column operations. These are not discussed in these lecture notes (and are not necessary). For consistency with the methods applied in these notes, it is advisable to only write down these vectors in rows as mentioned above.
Part (3) We know that \(\dim W=2\), so if we want that \(U' \oplus W = {\bf R}^4\), we must have \(\dim U' = 2\) by Remark 3.65 Since we know that \(U \cap W = {\bf R} (1,1,1,0)\), it therefore is enough to find two linearly independent vectors \(u'_1, u'_2 \in U\) that are both independent of \(r := (1,1,1,0)\). For example, \(u'_1 = u_1\), \(u'_2 = u_2\) satisfy these requirements, as one sees by bringing
into row-echelon form.
Solution 4.13.43
(See Exercise 4.47.) Part (1): Using Gaussian elimination (Method 2.29) we bring the matrix \(A\) into row echelon form (as far as possible):
In order to proceed, we need to distinguish the case \(a=-1\) and \(a \ne -1\). In the former case the matrix reads \(\left ( \begin{array}{ccc} 1 & 2 & 1 \\ 0 & 0 & -3 \\ 0 & 0 & 0 \end{array} \right )\) and has rank 2. In the case \(a \ne -1\), we have \(a+1 \ne 0\), so we can divide the third row by \(a+1\) (cf. Definition 2.28) and get
The rank of this matrix is 3 provided that \(4a^2-4 \ne 0\) and 2 if \(4a^2-4=0\), i.e., if \(a=1\). In summary, the rank is 2 if \(a = \pm 1\) and is 3 otherwise.
Typical mistake: one should not stop the analysis after the first step above.
Part (2): We apply Method 2.31 and bring the augmented matrix of the system into row echelon form:
There is a solution precisely if the last entry is zero, i.e., if \(2(b_2-2b_1)+3(b_3+b_1) = 0\) or if \(-b_1 +2b_2 + 3b_3=0\).
Typical mistake: It is not legitimate to use the row echelon form of the matrix \(A\) computed in the Part (1) above, and then to append the vector \(\left ( \begin{array}{c} b_1 \\ b_2 \\ b_3 \end{array} \right )\). The row operations performed on the matrix \(A\) must be identically performed on this vector, otherwise one changes the linear system.
Part (3): We again use Gaussian elimination to solve the system (Method 2.31). One forms the augmented matrix
This shows that there is a solution if \(c=-2\). One then computes \(x_3=-1\), \(x_1=2-2x_2-x_3=3-2x_2\).
Typical mistake: Again, the row operations performed on the matrix \(A\) must also be performed on the vector \(\left ( \begin{array}{c} 2 \\ 7 \\ c \end{array} \right )\).