Section6.3Similarity¶ permalink

Objectives

Learn to interpret similar matrices geometrically.
Understand the relationship between the eigenvalues, eigenvectors, and characteristic polynomials of similar matrices.
Recipe: compute in terms of for
Picture: the geometry of similar matrices.
Vocabulary: similarity.

Some matrices are easy to understand. For instance, a diagonal matrix

just scales the coordinates of a vector: The purpose of most of the rest of this chapter is to understand complicated-looking matrices by analyzing to what extent they “behave like” simple matrices. For instance, the matrix

has eigenvalues and with corresponding eigenvectors and Notice that

Using instead of the usual coordinates makes “behave” like a diagonal matrix.





Figure1The matrices and behave similarly. Click “multiply” to multiply the colored points by on the left and on the right. (We will see in Section 6.4 why the points follow hyperbolic paths.)

The other case of particular importance will be matrices that “behave” like a rotation matrix: indeed, this will be crucial for understanding Section 6.5 geometrically. See this important note.

In this section, we study in detail the situation when two matrices behave similarly with respect to different coordinate systems. In Section 6.4 and Section 6.5, we will show how to use eigenvalues and eigenvectors to find a simpler matrix that behaves like a given matrix.

Subsection6.3.1Similar Matrices

We begin with the algebraic definition of similarity.

Definition

Two matrices and are similar if there exists an invertible matrix such that

Example

As in the above example, one can show that is the only matrix that is similar to and likewise for any scalar multiple of

Similarity is unrelated to row equivalence. Any invertible matrix is row equivalent to but is the only matrix similar to For instance,

are row equivalent but not similar.

As suggested by its name, similarity is what is called an equivalence relation. This means that it satisfies the following properties.

Proposition

Let and be matrices.

Reflexivity: is similar to itself.
Symmetry: if is similar to then is similar to
Transitivity: if is similar to and is similar to then is similar to

Proof

Example

We conclude with an observation about similarity and powers of matrices.

Fact

Let Then for any we have

Proof

First note that

Next we have

The pattern is clear.

Example

Subsection6.3.2Geometry of Similar Matrices

Similarity is a very interesting construction when viewed geometrically. We will see that, roughly, similar matrices do the same thing in different coordinate systems. The reader might want to review -coordinates and nonstandard coordinate grids in Section 3.5 and well as -matrices in Section 4.7 before reading this subsection.

Recall that (by conditions 4 and 5 of the invertible matrix theorem in Section 6.1) an matrix is invertible if and only if its columns form a basis for This means we can speak of the -coordinates of a vector in where is the basis of columns of Recall from Section 4.7 that this means

Observation

If the linear map has as standard matrix and is the matrix with columns given by the basis then the -matrix of is

In other words, two -matrices and are similar if and only they represent the same linear map but expressed in different bases.

Let's now illustrate this more concretely. Suppose that The above observation gives us another way of computing for a vector in Recall that so that multiplying by means first multiplying by then by then by See this example in Section 4.4.

Recipe: Computing in terms of

Suppose that where is an invertible matrix with columns and let be the corresponding basis for Let be a vector in To compute one does the following:

Multiply by which changes to the -coordinates:
Multiply this by
Interpreting this vector as a -coordinate vector, we multiply it by to change back to the usual coordinates:

To summarize: if then and do the same thing, only in different coordinate systems.

The following example is the heart of this section.

Example

Consider the matrices

One can verify that see this example in Section 6.4. Let and the columns of and let a basis of

The matrix is diagonal: it scales the -direction by a factor of and the -direction by a factor of

To compute first we multiply by to find the -coordinates of then we multiply by then we multiply by again. For instance, let

We see from the -coordinate grid below that Therefore,
Multiplying by scales the coordinates:
Interpreting as a -coordinate vector, we multiply by to get

Of course, this vector lies at on the -coordinate grid.

Now let

We see from the -coordinate grid that Therefore,
Multiplying by scales the coordinates:
Interpreting as a -coordinate vector, we multiply by to get

This vector lies at on the -coordinate grid.

To summarize:

scales the -direction by and the -direction by
scales the -direction by and the -direction by





Figure13The geometric relationship between the similar matrices and acting on Click and drag the heads of and Study this picture until you can reliably predict where the other three vectors will be after moving one of them: this is the essence of the geometry of similar matrices.

Interactive: Another matrix similar to

Example(A matrix similar to a rotation matrix)

To summarize and generalize the previous example:

A Matrix Similar to a Rotation Matrix

Let

where is assumed invertible. Then:

rotates the plane by an angle of around the circle centred at the origin and passing through and in the direction from to
rotates the plane by an angle of around the ellipse centred at the origin and passing through and in the direction from to

Interactive: Similar matrices

Subsection6.3.3Eigenvalues of Similar Matrices

Since similar matrices behave in the same way with respect to different coordinate systems, we should expect their eigenvalues and eigenvectors to be closely related.

Fact

Similar matrices have the same characteristic polynomial.

Proof

Suppose that where are matrices. We calculate

Therefore,

Here we have used the multiplicativity property in Section 5.1 and its corollary in Section 5.1.

Since the eigenvalues of a matrix are the roots of its characteristic polynomial, we have shown:

Similar matrices have the same eigenvalues.

By this theorem in Section 6.2, similar matrices also have the same trace and determinant. Both of these observations also follow from the fact that similar matrices represent the same linear endomorphism considered with respect to different bases, and the determinant, trace, and characteristic polynomial don't depend on the choice of basis.

Note

The converse of the fact is false. Indeed, the matrices

both have characteristic polynomial but they are not similar, because the only matrix that is similar to is itself.

Given that similar matrices have the same eigenvalues, one might guess that they have the same eigenvectors as well. Upon reflection, this is not what one should expect: indeed, the eigenvectors should only match up after changing from one coordinate system to another. This is the content of the next fact, remembering that and change between the usual coordinates and the -coordinates.

Fact

Suppose that Then

The eigenvalues of / or / are the same.

Proof

Suppose that is an eigenvector of with eigenvalue so that Then

so that is an eigenvector of with eigenvalue Likewise if is an eigenvector of with eigenvalue then and we have

so that is an eigenvalue of with eigenvalue

If then takes the -eigenspace of to the -eigenspace of and takes the -eigenspace of to the -eigenspace of

Example

We continue with the above example: let

so Let and the columns of Recall that:

scales the -direction by and the -direction by
scales the -direction by and the -direction by

This means that the -axis is the -eigenspace of and the -axis is the -eigenspace of likewise, the “ -axis” is the -eigenspace of and the “ -axis” is the -eigenspace of This is consistent with the fact, as multiplication by changes into and into





Figure27The eigenspaces of are the lines through and These are the images under of the coordinate axes, which are the eigenspaces of

Interactive: Another matrix similar to

Interactive: Similar matrices

Comments, corrections or suggestions?(Free GitHub account required)

Objectives

Definition

Example

Example

Proposition

Proof

Example

Fact

Proof

Example

Observation

Recipe: Computing Ax in terms of B

Example

Interactive: Another matrix similar to B

Example(A matrix similar to a rotation matrix)

A Matrix Similar to a Rotation Matrix

Interactive: Similar 3×3 matrices

Fact

Proof

Note

Fact

Proof

Example

Interactive: Another matrix similar to B

Interactive: Similar 3×3 matrices

Recipe: Computing in terms of

Interactive: Another matrix similar to

Interactive: Similar matrices

Interactive: Another matrix similar to

Interactive: Similar matrices