Skip to main content

Linear Algebra

Module 13 Change of Basis II

In this module you will learn
  • How to create change-of-basis matrices.
  • How to write a linear transformation in multiple bases.
Given a basis \(\mathcal{A}\) for \(\R^{n}\text{,}\) every vector \(\vec x\in\R^{n}\) uniquely corresponds to the list of numbers \([\vec x]_{\mathcal{A}}\) (its coordinates with respect to \(\mathcal{A}\)), and the operation of writing a vector in a basis is invertible.
If we have two bases, \(\mathcal{A}\) and \(\mathcal{B}\text{,}\) for \(\R^{n}\text{,}\) we have two equally valid ways of representing a vector in coordinates.
Figure 13.0.1.
Not only that, but there must be a function that converts between \([\vec x]_{\mathcal{A}}\) and \([\vec x]_{\mathcal{B}}\text{.}\) The function works as follows: input the list of numbers \([\vec x]_{\mathcal{A}}\text{,}\) use those numbers as coefficients of the \(\mathcal{A}\) basis vectors to get the true vector \(\vec x\text{,}\) and then find the coordinates of that vector with respect to the \(\mathcal{B}\) basis.

Example 13.0.2.

Let \(\mathcal{A}=\Set{\vec a_1,\vec a_2}\) where \(\vec a_{1}=\mat{1\\1}_{\mathcal{E}}\) and \(\vec a_{2}=\mat{1\\-1}_{\mathcal{E}}\) and let \(\mathcal{B}=\Set{\vec b_1,\vec b_2}\) where \(\vec b_{1}=\mat{2\\1}_{\mathcal{E}}\) and \(\vec b_{2}=\mat{5\\3}_{\mathcal{E}}\) be bases for \(\R^{2}\text{.}\) Given that \([\vec x]_{\mathcal{A}}=\mat{2\\-3}\text{,}\) find \([\vec x]_{\mathcal{B}}\text{.}\)

Solution.

By definition,
\begin{equation*} \vec x=2\vec a_{1}-3\vec a_{2}=2(\xhat+\yhat)-3(\xhat-\yhat)=-\xhat+5\yhat. \end{equation*}
We need to rewrite \(\vec x\) as a linear combination of \(\vec b_{1}=2\xhat+\yhat\) and \(\vec b_{2}=5\xhat+3\yhat\text{.}\) That is, we need to solve the equation
\begin{equation*} \vec x=-\xhat+5\yhat = \alpha(2\xhat+\yhat)+\beta(5\xhat+3\yhat)=(2\alpha+5\beta)\xhat+(\alpha+3\beta)\yhat. \end{equation*}
Equating the coefficients of \(\xhat\) and \(\yhat\text{,}\) we get
\begin{equation*} \left\{\begin{array}{rcrl}2\alpha&+&5\beta&=-1\\\alpha&+&3\beta&=5\end{array}\right., \end{equation*}
which has a unique solution \((\alpha,\beta)=(-28,11)\text{.}\) We conclude
\begin{equation*} [\vec x]_{\mathcal{B}}=\mat{-28\\11}. \end{equation*}
For a basis \(\mathcal{A}\text{,}\) the invertible function that takes a vector \(\vec x\) and generates the coordinates \([\vec x]_{\mathcal{A}}\) is a linear function. Therefore, for bases \(\mathcal{A}\) and \(\mathcal{B}\text{,}\) the function that converts \([\vec x]_{\mathcal{A}}\) to \([\vec x]_{\mathcal{B}}\) must have a matrix. This matrix is called the change of basis matrix.

Definition 13.0.3. Change of Basis Matrix.

Let \(\mathcal{A}\) and \(\mathcal{B}\) be bases for \(\R^{n}\text{.}\) The matrix \(M\) is called a change of basis matrix (which converts from \(\mathcal{A}\) to \(\mathcal{B}\)) if for all \(\vec x\in \R^{n}\)
\begin{equation*} M[\vec x]_{\mathcal{A}}=[\vec x]_{\mathcal{B}}. \end{equation*}
Notationally, \(\BasisChange{\mathcal{A}}{\mathcal{B}}\) stands for the change of basis matrix converting from \(\mathcal{A}\) to \(\mathcal{B}\text{,}\) and we may write \(M=\BasisChange{\mathcal{A}}{\mathcal{B}}\text{.}\)

Example 13.0.4.

Let \(\mathcal{A}=\Set{\vec a_1,\vec a_2}\) where \(\vec a_{1}=\mat{1\\1}_{\mathcal{E}}\) and \(\vec a_{2}=\mat{1\\-1}_{\mathcal{E}}\) and let \(\mathcal{B}=\Set{\vec b_1,\vec b_2}\) where \(\vec b_{1}=\mat{2\\1}_{\mathcal{E}}\) and \(\vec b_{2}=\mat{5\\3}_{\mathcal{E}}\) be bases for \(\R^{2}\text{.}\) Find the change of basis matrix \(\BasisChange{\mathcal{A}}{\mathcal{B}}\text{.}\)

Solution.

We know \(\BasisChange{\mathcal{A}}{\mathcal{B}}\) will be a \(2\times 2\) matrix and that
\begin{equation*} \BasisChange{\mathcal{A}}{\mathcal{B}}[\vec a_{1}]_{\mathcal{A}}=[\vec a_{1}]_{\mathcal{B}}\qquad \text{and}\qquad \BasisChange{\mathcal{A}}{\mathcal{B}}[\vec a_{2}]_{\mathcal{A}}=[\vec a_{2}]_{\mathcal{B}}. \end{equation*}
Therefore, we need to compute \([\vec a_{1}]_{\mathcal{B}}\) and \([\vec a_{2}]_{\mathcal{B}}\text{.}\) Repeating the procedure from the previous example, we find
\begin{equation*} [\vec a_{1}]_{\mathcal{B}}=\mat{-2\\1}\qquad \text{and}\qquad [\vec a_{2}]_{\mathcal{B}}=\mat{8\\-3}, \end{equation*}
and so
\begin{equation*} \BasisChange{\mathcal{A}}{\mathcal{B}}=\mat{-2&8\\1&-3}. \end{equation*}
We can now enhance our diagram from earlier.
Figure 13.0.5.
The notation \(\BasisChange{\mathcal{A}}{\mathcal{B}}\) for the matrix that changes from the \(\mathcal{A}\) basis to the \(\mathcal{B}\) basis is suggestive. Suppose we have another basis \(\mathcal{C}\text{.}\) We can obtain \(\BasisChange{\mathcal{A}}{\mathcal{C}}\) by multiplying \(\BasisChange{\mathcal{A}}{\mathcal{B}}\) on the left by \(\BasisChange{\mathcal{B}}{\mathcal{C}}\text{.}\) That is,
\begin{equation*} \BasisChange{\mathcal{A}}{\mathcal{C}}=\BasisChange{\mathcal{B}}{\mathcal{C}}\BasisChange{\mathcal{A}}{\mathcal{B}}. \end{equation*}
The backwards arrow “\(\leftarrow\)” in the change-of-basis matrix notation comes because when we multiply a vector and a matrix, the matrix is always to the left of the vector. So,
\begin{equation*} [\vec x]_{\mathcal{C}}= \BasisChange{\mathcal{A}}{\mathcal{C}}[\vec x]_{\mathcal{A}}=\BasisChange{\mathcal{B}}{\mathcal{C}}\BasisChange{\mathcal{A}}{\mathcal{B}}[\vec x]_{\mathcal{A}}. \end{equation*}
As such, the notation for the change of basis matrix chains, allowing you to figure out what’s going on without too much trouble.

Section 13.1 Change of Basis Matrix in Detail

Let \(\mathcal{A}\) and \(\mathcal{B}\) be bases for \(\R^{n}\) and \(M=\BasisChange{\mathcal{A}}{\mathcal{B}}\) be the matrix that changes from the \(\mathcal{A}\) to the \(\mathcal{B}\) basis. Since we can change vectors back from \(\mathcal{B}\) to \(\mathcal{A}\text{,}\) we know \(M\) is invertible and
\begin{equation*} M^{-1}=\BasisChange{\mathcal{B}}{\mathcal{A}}. \end{equation*}
Just playing with notation, we see
\begin{equation*} M^{-1}M=\BasisChange{\mathcal{B}}{\mathcal{A}}\BasisChange{\mathcal{A}}{\mathcal{B}}=\BasisChange{\mathcal{A}}{\mathcal{A}}= I\qquad MM^{-1}=\BasisChange{\mathcal{A}}{\mathcal{B}}\BasisChange{\mathcal{B}}{\mathcal{A}}=\BasisChange{\mathcal{B}}{\mathcal{B}}= I, \end{equation*}
which makes sense. The matrices \(\BasisChange{\mathcal{A}}{\mathcal{A}}\) and \(\BasisChange{\mathcal{B}}{\mathcal{B}}\) take vectors and rewrite them in the same basis, which is to say, they do nothing to the vectors.
The argument above shows that every change of basis matrix is invertible. The converse is also true.

Proof.

Suppose \(M=\BasisChange{\mathcal{A}}{\mathcal{B}}\) is a change-of-basis matrix. Then
\begin{equation*} M^{-1}=\BasisChange{\mathcal{B}}{\mathcal{A}}, \end{equation*}
and so \(M\) is invertible.
Alternatively, suppose \(M=[C_{1}|C_{2}|\cdots|C_{n}]\) is an invertible \(n\times n\) matrix with columns \(C_{1}\text{,}\) …, \(C_{n}\text{.}\) Let \(\vec c_{i}=[C_{i}]_{\mathcal{E}}\text{.}\) That is, \(\vec c_{i}\) is the vector which comes from interpreting \(C_{i}\) as coordinates with respect to the standard basis.
Since \(M\) is invertible, \(\Rref(M)=I\text{,}\) and so \(\Set{\vec c_1,\ldots,\vec c_n}\) is a linearly independent set of \(n\) vectors. Therefore \(\mathcal{C}=\Set{\vec c_1,\ldots,\vec c_n}\) is a basis for \(\R^{n}\text{.}\) Now, observe
\begin{equation*} M[\vec c_{i}]_{\mathcal{C}}= C_{i}=[\vec c_{i}]_{\mathcal{E}} \end{equation*}
for \(i=1,\ldots, n\text{,}\) and so \(M=\BasisChange{\mathcal{C}}{\mathcal{E}}\) is a change-of-basis matrix.
The proof of the above theorem highlights something interesting. Let \(\mathcal{A}=\Set{\vec a_1,\ldots, \vec a_n}\) be a basis for \(\R^{n}\text{.}\) It is always the case that
\begin{equation*} [\vec a_{i}]_{\mathcal{A}}=\matc{\vdots\\0\\1\\0\\0\\\vdots} \end{equation*}
has a \(1\) in the \(i\)th position and zeros elsewhere. Now, let \(\mathcal{B}=\Set{\vec b_1,\ldots,\vec b_n}\) be another basis for \(\R^{n}\) and define the matrix \(M=\mat{[\vec a_1]_{\mathcal{B}}&[\vec a_2]_{\mathcal{B}}&\cdots&[\vec a_n]_{\mathcal{B}}}\) to be the matrix with columns \([\vec a_{1}]_{\mathcal{B}},\ldots, [\vec a_{n}]_{\mathcal{B}}\text{.}\) Since multiplying a matrix by \([\vec a_{i}]_{\mathcal{A}}\) will pick out the \(i\)th column, we have that
\begin{equation*} M[\vec a_{i}]_{\mathcal{A}}= [\vec a_{i}]_{\mathcal{B}}. \end{equation*}
In other words,
\begin{equation*} M=\BasisChange{\mathcal{A}}{\mathcal{B}}. \end{equation*}

Section 13.2 Transformations and Bases

A linear transformation \(\mathcal{T}:\R^{n}\to\R^{n}\) always has a matrix associated with it. This matrix is defined as the matrix \(M\) so that
\begin{equation*} [\mathcal{T}\vec x]_{\mathcal{E}}=M[\vec x]_{\mathcal{E}}. \end{equation*}
But, what if we swapped out \(\mathcal{E}\) for a different basis?

Definition 13.2.1. Linear Transformation in a Basis.

Let \(\mathcal{T}:\R^{n}\to\R^{n}\) be a linear transformation and let \(\mathcal{B}\) be a basis for \(\R^{n}\text{.}\) The matrix for \(\mathcal{T}\) with respect to \(\mathcal{B}\), notated \([\mathcal{T}]_{\mathcal{B}}\text{,}\) is the \(n\times n\) matrix satisfying
\begin{equation*} [\mathcal{T}\vec x]_{\mathcal{B}}= [\mathcal{T}]_{\mathcal{B}}[\vec x]_{\mathcal{B}}. \end{equation*}
In this case, we say the matrix \([\mathcal{T}]_{\mathcal{B}}\) is the representation of \(\mathcal{T}\) in the \(\mathcal{B}\) basis.
Just like there are many ways to write down coordinates for a vector—one per choice of basis—there are many ways to write down a matrix for a linear transformation. Up to this point, when we’ve said “\(M\) is a matrix for \(\mathcal{T}\)”, what we meant is “\(M=[\mathcal{T}]_{\mathcal{E}}\)”. And, like with vectors, if we talk about a matrix for a linear transformation without specifying the basis, we mean the matrix for the transformation with respect to the standard basis.

Example 13.2.2.

Let \(\mathcal{B}=\Set{\vec b_1,\vec b_2}\) where \(\vec b_{1}=\mat{2\\-3}_{\mathcal{E}}\) and \(\vec b_{2}=\mat{5\\-7}_{\mathcal{E}}\) be a basis for \(\R^{2}\) and let \(\mathcal{T}:\R^{2}\to\R^{2}\) be the transformation that stretches in the \(\vec e_{1}\) direction by a factor of \(2\text{.}\) Find \([\mathcal{T}]_{\mathcal{E}}\) and \([\mathcal{T}]_{\mathcal{B}}\text{.}\)

Solution.

Since \(\mathcal{T}\xhat=2\xhat\) and \(\mathcal{T}\yhat=\yhat\text{,}\) We know
\begin{equation*} [\mathcal{T}]_{\mathcal{E}}\mat{1\\0}=\mat{2\\0}\qquad\text{and}\qquad [\mathcal{T}]_{\mathcal{E}}\mat{0\\1}=\mat{0\\1} \end{equation*}
and so
\begin{equation*} [\mathcal{T}]_{\mathcal{E}}=\mat{2&0\\0&1}. \end{equation*}
We can find \([\mathcal{T}]_{\mathcal{B}}\) in two ways: directly from the definition, or by using change of basis matrices. First, we will work directly from the definition.
To find \([\mathcal{T}]_{\mathcal{B}}\text{,}\) we need to figure out what \(\mathcal{T}\) does to \(\vec b_{1}\) and \(\vec b_{2}\text{.}\) However, since \(\mathcal{T}\) is described in term of \(\xhat\) and \(\yhat\text{,}\) it might be easier to express \(\xhat\) and \(\yhat\) in the \(\mathcal{B}\) basis, and then analyze \(\mathcal{T}\text{.}\)
Computing,
\begin{equation*} \begin{aligned}[\xhat]_{\mathcal{B}}&= \BasisChange{\mathcal{E}}{\mathcal{B}}\mat{1\\0}= \mat{2&5\\-3&-7}^{-1}\mat{1\\0}=\mat{-7\\3}\\[0pt] [\yhat]_{\mathcal{B}}&= \BasisChange{\mathcal{E}}{\mathcal{B}}\mat{0\\1}= \mat{2&5\\-3&-7}^{-1}\mat{0\\1}=\mat{-5\\2}\end{aligned}. \end{equation*}
Now we know
\begin{equation*} \begin{aligned}[\mathcal{T}]_{\mathcal{B}}[\xhat]_{\mathcal{B}}&=[\mathcal{T}\xhat]_{\mathcal{B}}=[2\xhat]_{\mathcal{B}}=\mat{-14\\6}\\[0pt] [\mathcal{T}]_{\mathcal{B}}[\yhat]_{\mathcal{B}}&=[\mathcal{T}\yhat]_{\mathcal{B}}=[\yhat]_{\mathcal{B}}=\mat{-5\\2}\end{aligned}. \end{equation*}
Since \([\mathcal{T}]_{\mathcal{B}}\) is a \(2\times 2\) matrix, we can use what we know to solve for its entries, finding
\begin{equation*} [\mathcal{T}]_{\mathcal{B}}=\mat{-13&-35\\6&16}. \end{equation*}
Let’s try finding \([\mathcal{T}]_{\mathcal{B}}\) using change of basis matrices. We already know \([\mathcal{T}]_{\mathcal{E}}\text{,}\) and so
\begin{equation*} [\mathcal{T}]_{\mathcal{B}}= \BasisChange{\mathcal{E}}{\mathcal{B}}[\mathcal{T}]_{\mathcal{E}}\BasisChange{\mathcal{B}}{\mathcal{E}}. \end{equation*}
Further, we know
\begin{equation*} \BasisChange{\mathcal{B}}{\mathcal{E}}=\mat{2&5\\-3&-7}\qquad \text{and}\qquad \BasisChange{\mathcal{E}}{\mathcal{B}}=\mat{2&5\\-3&-7}^{-1}=\mat{-7&-5\\3&2}. \end{equation*}
Putting it all together,
\begin{equation*} [\mathcal{T}]_{\mathcal{B}}=\mat{-7&-5\\3&2}\mat{2&0\\0&1}\mat{2&5\\-3&-7}= \mat{-13&-35\\6&16}. \end{equation*}

Section 13.3 Similar Matrices

Just like some bases are better than others to represent particular vectors, some bases are better than others to represent a particular linear transformation.

Example 13.3.1.

Let \(\mathcal{B}=\Set{\vec b_1,\vec b_2}\) where \(\vec b_{1}=\mat{2\\-3}_{\mathcal{E}}\) and \(\vec b_{2}=\mat{5\\-7}_{\mathcal{E}}\) be a basis for \(\R^{2}\) and let \(\mathcal{S}:\R^{2}\to\R^{2}\) be the transformation that stretches in the \(\vec b_{1}=2\vec e_{1}-3\vec e_{2}\) direction by a factor of \(2\) and reflects vectors in the \(\vec b_{2}=5\xhat-7\yhat\) direction. Find \([\mathcal{S}]_{\mathcal{E}}\) and \([\mathcal{S}]_{\mathcal{B}}\text{.}\)

Solution.

In this example, \(\mathcal{S}\) is described in terms of the \(\mathcal{B}\) basis. We know
\begin{equation*} \mathcal{S}\vec b_{1}=2\vec b_{1}\qquad\text{and}\qquad\mathcal{S}\vec b_{2}=-\vec b_{2}. \end{equation*}
Therefore,
\begin{equation*} [\mathcal{S}]_{\mathcal{B}}=\mat{2&0\\0&-1}. \end{equation*}
To find \([\mathcal{S}]_{\mathcal{E}}\text{,}\) we will use change of basis matrices. Notice that
\begin{equation*} [\mathcal{S}]_{\mathcal{E}}= \BasisChange{\mathcal{B}}{\mathcal{E}}[\mathcal{S}]_{\mathcal{B}}\BasisChange{\mathcal{E}}{\mathcal{B}}, \end{equation*}
and that
\begin{equation*} \BasisChange{\mathcal{B}}{\mathcal{E}}=\mat{2&5\\-3&-7}\qquad \text{and}\qquad \BasisChange{\mathcal{E}}{\mathcal{B}}=\mat{2&5\\-3&-7}^{-1}=\mat{-7&-5\\3&2}. \end{equation*}
Therefore
\begin{equation*} [\mathcal{S}]_{\mathcal{E}}= \mat{2&5\\-3&-7}\mat{2&0\\0&-1}\mat{-7&-5\\3&2}=\mat{-43&-30\\63&44}. \end{equation*}
In the example above, \([\mathcal{S}]_{\mathcal{B}}\) is a much nicer matrix than \([\mathcal{S}]_{\mathcal{E}}\text{.}\) However, the two matrices relate to each other. After all,
\begin{equation*} [\mathcal{S}]_{\mathcal{B}}=\BasisChange{\mathcal{E}}{\mathcal{B}}[\mathcal{S}]_{\mathcal{E}}\BasisChange{\mathcal{B}}{\mathcal{E}}. \end{equation*}
In this case, we call these matrices similar
 1 
Another commonly used term is conjugate.
.

Definition 13.3.2. Similar Matrices.

The matrices \(A\) and \(B\) are called similar matrices, denoted \(A\sim B\), if \(A\) and \(B\) represent the same linear transformation but in possibly different bases. Equivalently, \(A\sim B\) if there is an invertible matrix \(X\) so that
\begin{equation*} A=XBX^{-1}. \end{equation*}
The \(X\) in the definition of similar matrices is always a change-of-basis matrix.
When studying a linear transformation, you can pick any basis to represent it in and study the resulting matrix. Different choices of basis will give you different perspectives on the linear transformation. In what’s to follow, we will work to find the “best” basis in which to study a given linear transformation
 2 
If you cannot wait, the “best” basis will turn out to be the eigen basis (provided it exists).
.

Exercises 13.4 Exercises

1.

Let
\begin{equation*} \mathcal{A}=\Set{\mat{2\\1}_{\mathcal{E}}, \mat{1\\-2}_{\mathcal{E}}}\quad\text{and}\quad \mathcal{B}=\Set{\mat{3\\-1}_{\mathcal{E}}, \mat{-2\\3}_{\mathcal{E}}}. \end{equation*}
be bases for \(\R^{2}\text{.}\) Define \(\vec x\in \R^{2}\) by \([\vec x]_{\mathcal{A}}=\mat{1\\-1}\text{.}\)
  1. Find \([\vec x]_{\mathcal{E}}\) and \([\vec x]_{\mathcal{B}}\text{.}\)
  2. Find the change of basis matrices \(\BasisChange{\mathcal{A}}{\mathcal{E}}\text{,}\) \(\BasisChange{\mathcal{E}}{\mathcal{A}}\text{,}\) \(\BasisChange{\mathcal{A}}{\mathcal{B}}\text{,}\) and \(\BasisChange{\mathcal{B}}{\mathcal{A}}\text{.}\)
Solution.
  1. Note that by definition
    \begin{equation*} A=\Set{2\vec e_1+\vec e_2, \vec e_1-2\vec e_2}. \end{equation*}
    Since \([\vec x]_{\mathcal{A}}=\mat{1\\-1}\text{,}\) it follows that
    \begin{equation*} \vec x=(2\vec e_{1}+\vec e_{2})-(\vec e_{1}-2\vec e_{2})=\vec e_{1}+3\vec e_{2}. \end{equation*}
    It thus follows that \([\vec x]_{\mathcal{E}}=\mat{1\\3}\text{.}\)
    To find \([\vec x]_{\mathcal{B}}\text{,}\) we must first express the two elements of \(\mathcal{A}\) as linear combinations of the two elements of \(\mathcal{B}\text{.}\) This involves solving two systems of linear equations:
    \begin{equation*} 2\vec e_{1}+\vec e_{2}=x_{1}(3\vec e_{1}-\vec e_{2})+x_{2}(-2\vec e_{1}+3\vec e_{2}) \end{equation*}
    and
    \begin{equation*} \vec e_{1}-2\vec e_{2}=y_{1}(3\vec e_{1}-\vec e_{2})+y_{2}(-2\vec e_{1}+3\vec e_{2}). \end{equation*}
    Written as column vectors, these are
    \begin{equation*} \mat{2\\1}=x_{1}\mat{3\\-1}+x_{2}\mat{-2\\3} \end{equation*}
    and
    \begin{equation*} \mat{1\\-2}=y_{1}\mat{3\\-1}+y_{2}\mat{-2\\3}. \end{equation*}
    Solving the systems and applying the definition, we obtain
    \begin{equation*} [\vec a_{1}]_{\mathcal{B}}=\mat{x_1\\x_2}=\frac{1}{7}\mat{8\\5},\quad [\vec a_{2}]_{\mathcal{B}}=\mat{y_1\\y_2}=\frac{1}{7}\mat{-1\\-5}. \end{equation*}
    Now we have
    \begin{equation*} [\vec x]_{\mathcal{B}}=[\vec a_{1}]_{\mathcal{B}}-[\vec a_{2}]_{\mathcal{B}}=\frac{1}{7}\mat{8\\5}-\frac{1}{7}\mat{-1\\-5}=\frac{1}{7}\mat{9\\10}. \end{equation*}
  2. By definition, \(\BasisChange{\mathcal{A}}{\mathcal{E}}\) is the matrix \(M\) satisfying
    \begin{equation*} M[\vec x]_{\mathcal{A}}=[\vec x]_{\mathcal{E}} \end{equation*}
    for all vectors \(\vec x\text{.}\) In particular, it must satisfy
    \begin{equation*} M[\vec a_{1}]_{\mathcal{A}}=[\vec a_{1}]_{\mathcal{E}},\quad M[\vec a_{2}]_{\mathcal{A}}=[\vec a_{2}]_{\mathcal{E}}. \end{equation*}
    It thus follows that \(M\) satisfies
    \begin{equation*} M\mat{1\\0}=\mat{2\\1},\quad M\mat{0\\1}=\mat{1\\-2}. \end{equation*}
    Solving for \(M\) gives
    \begin{equation*} M= \left[\begin{array}{}2 & 1 \\ 1 & -2\end{array}\right]. \end{equation*}
    To find \(\BasisChange{\mathcal{E}}{\mathcal{A}}\text{,}\) we simply need to compute \(M^{-1}\text{.}\) Following the explicit inverse formula for \(2\times2\) matrices, we see that
    \begin{equation*} \BasisChange{\mathcal{E}}{\mathcal{A}}=M^{-1}=\frac{1}{5}\left[\begin{array}{}2 & 1 \\ 1 & -2\end{array}\right]. \end{equation*}
    To compute \(\BasisChange{\mathcal{A}}{\mathcal{B}}\text{,}\) we note that
    \begin{equation*} \BasisChange{\mathcal{A}}{\mathcal{B}}=\BasisChange{\mathcal{E}}{\mathcal{B}}\BasisChange{\mathcal{A}}{\mathcal{E}}. \end{equation*}
    We compute \(\BasisChange{\mathcal{E}}{\mathcal{B}}\) readily as
    \begin{equation*} \BasisChange{\mathcal{B}}{\mathcal{E}}= \left[\begin{array}{}3 & -2 \\ -1 & 3\end{array}\right]. \end{equation*}
    Thus,
    \begin{equation*} \BasisChange{\mathcal{E}}{\mathcal{B}}=\BasisChange{\mathcal{B}}{\mathcal{E}}^{-1}=\frac{1}{7}\left[\begin{array}{}3 & 2 \\ 1 & 3\end{array}\right]. \end{equation*}
    Therefore,
    \begin{equation*} \BasisChange{\mathcal{A}}{\mathcal{B}}=\frac{1}{7}\left[\begin{array}{}3 & 2 \\ 1 & 3\end{array}\right] \left[\begin{array}{}2 & 1 \\ 1 & -2\end{array}\right]= \frac{1}{7}\left[\begin{array}{}8 & -1 \\ 5 & -5\end{array}\right]. \end{equation*}
    Taking inverse gives \(\BasisChange{\mathcal{B}}{\mathcal{A}}\text{.}\)

2.

Let
\begin{equation*} \mathcal{A}=\Set{ \mat{2\\1\\0}_{\mathcal{E}}, \mat{1\\-2\\0}_{\mathcal{E}}, \mat{0\\0\\1}_{\mathcal{E}} }\quad\text{and}\quad \mathcal{B}=\Set{\vec b_1,\vec b_2,\vec b_3 } \end{equation*}
be bases for \(\R^{3}\) where
\begin{equation*} \vec b_{1}=\mat{1\\0\\0}_{\mathcal{A}}\qquad \vec b_{2}=\mat{1\\1\\0}_{\mathcal{A}}\qquad \vec b_{3}=\mat{1\\1\\1}_{\mathcal{A}}. \end{equation*}
  1. Find the representation of \(\vec b_{1}\text{,}\) \(\vec b_{2}\text{,}\) and \(\vec b_{3}\) in the standard basis.
  2. Find the change of basis matrices \(\BasisChange{\mathcal{E}}{\mathcal{A}}\) and \(\BasisChange{\mathcal{B}}{\mathcal{E}}\text{.}\)
  3. Use \(\BasisChange{\mathcal{E}}{\mathcal{A}}\) and \(\BasisChange{\mathcal{B}}{\mathcal{E}}\) to compute \(\BasisChange{\mathcal{B}}{\mathcal{A}}\text{.}\)
Solution.
  1. By definition, we have
    \begin{equation*} \begin{aligned}[\vec b_{1}]_{\mathcal{E}}&=[\vec a_{1}]_{\mathcal{E}}=[2\vec e_{1}+\vec e_{2}]_{\mathcal{E}}\\&= \mat{2\\1\\0},\\ [\vec b_{2}]_{\mathcal{E}}&=[\vec a_{1}]_{\mathcal{E}}+[\vec a_{2}]_{\mathcal{E}}\\&=[(2\vec e_{1}+\vec e_{2})+(\vec e_{1}-2\vec e_{2})]_{\mathcal{E}}\\&=[3\vec e_{1}-\vec e_{2}]_{\mathcal{E}}\\&=\mat{3\\-1\\0},\\ [\vec b_{3}]_{\mathcal{E}}&=[\vec a_{1}]_{\mathcal{E}}+[\vec a_{2}]_{\mathcal{E}}+[\vec a_{3}]_{\mathcal{E}}\\&=[(2\vec e_{1}+\vec e_{2})+(\vec e_{1}-2\vec e_{2})+(\vec e_{3})]_{\mathcal{E}}\\&=[3\vec e_{1}-\vec e_{2}+\vec e_{3}]_{\mathcal{E}}\\&=\mat{3\\-1\\1}.\end{aligned} \end{equation*}
  2. We have that
    \begin{equation*} \BasisChange{\mathcal{A}}{\mathcal{E}}= \left[\begin{array}{}2 & 1 & 0\\ 1 & -2 & 0\\ 0 & 0 & 1\end{array}\right]. \end{equation*}
    By part (a), we have
    \begin{equation*} \BasisChange{\mathcal{B}}{\mathcal{E}}= \left[\begin{array}{}2 & 3 & 3\\ 1 & -1 & -1\\ 0 & 0 & 1\end{array}\right]. \end{equation*}
    Inverting the former, we see that
    \begin{equation*} \BasisChange{\mathcal{E}}{\mathcal{A}}=\frac{1}{5}\left[\begin{array}{}2 & 1 & 0\\ 1 & -2 & 0\\ 0 & 0 & 5\end{array}\right]. \end{equation*}
  3. We have
    \begin{equation*} \begin{aligned}\BasisChange{\mathcal{B}}{\mathcal{A}} &=\BasisChange{\mathcal{E}}{\mathcal{A}}\BasisChange{\mathcal{B}}{\mathcal{E}}\\ &=\frac{1}{5} \left[\begin{array}{}2 & 1 & 0\\ 1 & -2 & 0\\ 0 & 0 & 5\end{array}\right] \left[\begin{array}{}2 & 3 & 3\\ 1 & -1 & -1\\ 0 & 0 & 1\end{array}\right]\\ &= \left[\begin{array}{}1 & 1 & 1\\ 0 & 1 & 1\\ 0 & 0 & 1\end{array}\right].\end{aligned} \end{equation*}

3.

Let \(\mathcal{B}=\Set{\mat{1\\0}_{\mathcal{E}},\mat{1\\1}_{\mathcal{E}}}\text{.}\) For each linear transformation \(\mathcal{T}:\R^{2}\to\R^{2}\) defined below, compute \([\mathcal{T}]_{\mathcal{E}}\) and \([\mathcal{T}]_{\mathcal{B}}\text{.}\)
  1. Let \(\mathcal{T}\) be the transformation that rotates every vector counter clockwise by \(90^{\circ}\text{.}\)
  2. Let \(\mathcal{T}\) be the transformation that projects every vector onto the \(y\)-axis.
  3. Let \(\mathcal{T}\) be the transformation that doubles every vector.
  4. Let \(\mathcal{T}\) be the transformation that reflects every vector over the line \(y=x\text{.}\)
Solution.
  1. \([\mathcal{T}]_{\mathcal{E}}=\mat{0&-1\\1&0}\) and \([\mathcal{T}]_{\mathcal{B}}= \left[\begin{array}{}-1 & -2 \\ 1 & 1\end{array}\right]\)
  2. \([\mathcal{T}]_{\mathcal{E}}= \left[\begin{array}{}0 & 0 \\ 0 & 1\end{array}\right]\)
    To compute \([\mathcal{T}]_{\mathcal{B}}\text{,}\) note that \(\mathcal{T}\vec b_{1}=\mathcal{T}\vec e_{1}=\vec 0\) and \(\mathcal{T}\vec b_{2}=\mathcal{T}(\vec e_{1}+\vec e_{2})=\vec e_{2}\text{.}\) Further, \(\vec e_{2}=-\vec b_{1}+\vec b_{2}\text{.}\) Therefore, \([\mathcal{T}]_{\mathcal{B}}= \left[\begin{array}{}0 & -1 \\ 0 & 1\end{array}\right]\text{.}\)
  3. \(\displaystyle [\mathcal{T}]_{\mathcal{E}}=[\mathcal{T}]_{\mathcal{B}}= \left[\begin{array}{}2 & 0 \\ 0 & 2\end{array}\right]\)
  4. Note that \(\mathcal{T}\vec e_{1}=\vec e_{2}\) and \(\mathcal{T}\vec e_{2}=\vec e_{1}\text{,}\) so \([\mathcal{T}]_{\mathcal{E}}= \left[\begin{array}{}0 & 1 \\ 1 & 0\end{array}\right]\text{.}\) To compute \([\mathcal{T}]_{\mathcal{B}}\text{,}\) we simply observe that \(\vec b_{1}=\vec e_{1}\) and \(\vec b_{2}=\vec e_{1}+\vec e_{2}\text{.}\) Thus, \(\mathcal{T}\vec b_{1}=\vec e_{2}=-\vec b_{1}+\vec b_{2}\) and \(\mathcal{T}\vec b_{2}=\vec e_{1}+\vec e_{2}=\vec b_{2}\text{.}\) Therefore, \([\mathcal{T}]_{\mathcal{B}}= \left[\begin{array}{}-1 & 0 \\ 1 & 1\end{array}\right]\text{.}\)

4.

For each statement below, determine whether it is true or false. Justify your answer.
  1. Any invertible \(n \times n\) matrix can be viewed as a change of basis matrix.
  2. Any \(n \times n\) matrix is similar to itself.
  3. Let \(A\) be an \(m \times n\) matrix. If \(m \neq n\text{,}\) then there is no matrix that is similar to \(A\text{.}\)
  4. Any invertible \(n \times n\) matrix \(A\) is similar to \(A^{-1}\) since \(AA^{-1}=I\text{.}\)
Solution.
  1. True. A square matrix \(M\) is invertible if and only if it is a change of basis matrix.
  2. True. Any square matrix \(M\) satsifies \(M=IMI^{-1}\text{.}\)
  3. True. The definition of similarity requires the existence of an invertible matrix \(P\text{,}\) i.e. a square matrix, and a matrix \(B\) such that \(B=PAP^{-1}\text{.}\) If \(A\) is not a square matrix, then either \(PA\) or \(AP^{-1}\) is not defined, so such a matrix \(P\) cannot exist.
  4. False. For example, \(A=\left[\begin{array}{}2 & 0 \\ 0 & -1\end{array}\right]\) is not similar to its inverse.