Skip to main content

Section4.1Matrix Transformations

Objectives
  1. Learn to view a matrix geometrically as a function.
  2. Learn examples of matrix transformations: reflection, dilation, rotation, shear, projection.
  3. Understand the vocabulary surrounding transformations: domain, codomain, range.
  4. Understand the domain, codomain, and range of a matrix transformation.
  5. Pictures: common matrix transformations.
  6. Vocabulary: transformation / function, domain, codomain, range, identity transformation, matrix transformation.

In this section we learn to understand matrices geometrically as functions, or transformations. We briefly discuss transformations in general, then specialize to matrix transformations, which are transformations that come from matrices.

Subsection4.1.1Matrices as Functions

Informally, a function is a rule that accepts inputs and produces outputs. For instance, f(x)=x2 is a function that accepts one number x as its input, and outputs the square of that number: f(2)=4. In this subsection, we interpret matrices as functions.

Let A be a matrix with m rows and n columns. Consider the matrix equation b=Ax (we write it this way instead of Ax=b to remind the reader of the notation y=f(x) ). If we vary x, then b will also vary; in this way, we think of A as a function with independent variable x and dependent variable b.

  • The independent variable (the input) is x, which is a vector in Rn.
  • The dependent variable (the output) is b, which is a vector in Rm.

The set of all possible output vectors are the vectors b such that Ax=b has some solution; this is the same as the column space of A by this note in Section 2.4.

RnRmb=AxxAxCol(A)

Subsection4.1.2Transformations

At this point it is convenient to fix our ideas and terminology regarding functions, which we will call transformations in this book. This allows us to systematize our discussion of matrices as functions.

Definition

A transformation from Rn to Rm is a rule T that assigns to each vector x in Rn a vector T(x) in Rm.

  • Rn is called the domain of T.
  • Rm is called the codomain of T.
  • For x in Rn, the vector T(x) in Rm is the image of x under T.
  • The set of all images {T(x)|xinRn} is the range of T.

The notation T:Rn−→Rm means T is a transformation from Rn to Rm.

It may help to think of T as a “machine” that takes x as an input, and gives you T(x) as the output.

RnRmdomaincodomainTxT(x)rangeT

The points of the domain Rn are the inputs of T: this simply means that it makes sense to evaluate T on vectors with n entries, i.e., lists of n numbers. Likewise, the points of the codomain Rm are the outputs of T: this means that the result of evaluating T is always a vector with m entries.

The range of T is the set of all vectors in the codomain that actually arise as outputs of the function T, for some input. In other words, the range is all vectors b in the codomain such that T(x)=b has a solution x in the domain.

Definition

The identity transformation IdRn:RnRn is the transformation defined by the rule

IdRn(x)=xforallxinRn.

In other words, the identity transformation does not move its input vector: the output is the same as the input. Its domain and codomain are both Rn, and its range is Rn as well, since every vector in Rn is the output of itself.

Subsection4.1.3Matrix Transformations

Now we specialize the general notions and vocabulary from the previous subsection to the functions defined by matrices that we considered in the first subsection.

Definition

Let A be an m×n matrix. The matrix transformation associated to A is the transformation

T:Rn−→RmdeBnedbyT(x)=Ax.

This is the transformation that takes a vector x in Rn to the vector Ax in Rm.

If A has n columns, then it makes sense to multiply A only by vectors with n entries. This is why the domain of T(x)=Ax is Rn. If A has m rows, then Ax has m entries for any vector x in Rn; this is why the codomain of T(x)=Ax is Rm.

The definition of a matrix transformation T tells us how to evaluate T on any given vector: we multiply the input vector by a matrix. For instance, let

A=I123456J

and let T(x)=Ax be the associated matrix transformation. Then

TA123B=AA123B=I123456JA123B=I1432J.

Suppose that A has columns v1,v2,...,vn. If we multiply A by a general vector x, we get

Ax=A|||v1v2···vn|||BCGGEx1x2...xnDHHF=x1v1+x2v2+···+xnvn.

This is just a general linear combination of v1,v2,...,vn. Therefore, the outputs of T(x)=Ax are exactly the linear combinations of the columns of A: the range of T is the column space of A. See this note in Section 2.4.

Let A be an m×n matrix, and let T(x)=Ax be the associated matrix transformation.

  • The domain of T is Rn, where n is the number of columns of A.
  • The codomain of T is Rm, where m is the number of rows of A.
  • The range of T is the column space of A.

In the case of an n×n square matrix, the domain and codomain of T(x)=Ax are both Rn. In this situation, one can regard T as operating on Rn: it moves the vectors around in the same space.