Skip to main content
Logo image

Section 12.1 Matrix Groups

Subsection Some Facts from Linear Algebra

Before we study matrix groups, we must recall some basic facts from linear algebra. One of the most fundamental ideas of linear algebra is that of a linear transformation. A linear transformation or linear map T:RnRm is a map that preserves vector addition and scalar multiplication; that is, for vectors x and y in Rn and a scalar αR,
T(x+y)=T(x)+T(y)T(αy)=αT(y).
An m×n matrix with entries in R represents a linear transformation from Rn to Rm. If we write vectors x=(x1,,xn)t and y=(y1,,yn)t in Rn as column matrices, then an m×n matrix
A=(a11a12a1na21a22a2nam1am2amn)
maps the vectors to Rm linearly by matrix multiplication. Observe that if α is a real number,
A(x+y)=Ax+AyandαAx=A(αx),
where
x=(x1x2xn).
We will often abbreviate the matrix A by writing (aij).
Conversely, if T:RnRm is a linear map, we can associate a matrix A with T by considering what T does to the vectors
e1=(1,0,,0)te2=(0,1,,0)ten=(0,0,,1)t.
We can write any vector x=(x1,,xn)t as
x1e1+x2e2++xnen.
Consequently, if
T(e1)=(a11,a21,,am1)t,T(e2)=(a12,a22,,am2)t,T(en)=(a1n,a2n,,amn)t,
then
T(x)=T(x1e1+x2e2++xnen)=x1T(e1)+x2T(e2)++xnT(en)=(k=1na1kxk,,k=1namkxk)t=Ax.

Example 12.1.

If we let T:R2R2 be the map given by
T(x1,x2)=(2x1+5x2,4x1+3x2),
the axioms that T must satisfy to be a linear transformation are easily verified. The column vectors Te1=(2,4)t and Te2=(5,3)t tell us that T is given by the matrix
A=(2543).
Since we are interested in groups of matrices, we need to know which matrices have multiplicative inverses. Recall that an n×n matrix A is invertible exactly when there exists another matrix A1 such that AA1=A1A=I, where
I=(100010001)
is the n×n identity matrix. From linear algebra we know that A is invertible if and only if the determinant of A is nonzero. Sometimes an invertible matrix is said to be nonsingular.

Example 12.2.

If A is the matrix
(2153),
then the inverse of A is
A1=(3152).
We are guaranteed that A1 exists, since det(A)=2351=1 is nonzero.
Some other facts about determinants will also prove useful in the course of this chapter. Let A and B be n×n matrices. From linear algebra we have the following properties of determinants.
  • The determinant is a homomorphism into the multiplicative group of real numbers; that is, det(AB)=(detA)(detB).
  • If A is an invertible matrix, then det(A1)=1/detA.
  • If we define the transpose of a matrix A=(aij) to be At=(aji), then det(At)=detA.
  • Let T be the linear transformation associated with an n×n matrix A. Then T multiplies volumes by a factor of |detA|. In the case of R2, this means that T multiplies areas by |detA|.
Linear maps, matrices, and determinants are covered in any elementary linear algebra text; however, if you have not had a course in linear algebra, it is a straightforward process to verify these properties directly for 2×2 matrices, the case with which we are most concerned.

Subsection The General and Special Linear Groups

The set of all n×n invertible matrices forms a group called the general linear group. We will denote this group by GLn(R). The general linear group has several important subgroups. The multiplicative properties of the determinant imply that the set of matrices with determinant one is a subgroup of the general linear group. Stated another way, suppose that det(A)=1 and det(B)=1. Then det(AB)=det(A)det(B)=1 and det(A1)=1/detA=1. This subgroup is called the special linear group and is denoted by SLn(R).

Example 12.3.

Given a 2×2 matrix
A=(abcd),
the determinant of A is adbc. The group GL2(R) consists of those matrices in which adbc0. The inverse of A is
A1=1adbc(dbca).
If A is in SL2(R), then
A1=(dbca).
Geometrically, SL2(R) is the group that preserves the areas of parallelograms. Let
A=(1101)
be in SL2(R). In Figure 12.4, the unit square corresponding to the vectors x=(1,0)t and y=(0,1)t is taken by A to the parallelogram with sides (1,0)t and (1,1)t; that is, Ax=(1,0)t and Ay=(1,1)t. Notice that these two parallelograms have the same area.
A square on set of axes with the left edge a vector from the origin to (0,1) and bottom edge a vector from the origin to (1,0).
A parallelogram on set of axes with the left edge a vector from the origin to (1,1) and bottom edge a vector from the origin to (1,0).
Figure 12.4. SL2(R) acting on the unit square

Subsection The Orthogonal Group O(n)

Another subgroup of GLn(R) is the orthogonal group. A matrix A is orthogonal if A1=At. The orthogonal group consists of the set of all orthogonal matrices. We write O(n) for the n×n orthogonal group. We leave as an exercise the proof that O(n) is a subgroup of GLn(R).

Example 12.5.

The following matrices are orthogonal:
(3/54/54/53/5),(1/23/23/21/2),(1/201/21/62/61/61/31/31/3).
There is a more geometric way of viewing the group O(n). The orthogonal matrices are exactly those matrices that preserve the length of vectors. We can define the length of a vector using the Euclidean inner product, or dot product, of two vectors. The Euclidean inner product of two vectors x=(x1,,xn)t and y=(y1,,yn)t is
x,y=xty=(x1,x2,,xn)(y1y2yn)=x1y1++xnyn.
We define the length of a vector x=(x1,,xn)t to be
x=x,x=x12++xn2.
Associated with the notion of the length of a vector is the idea of the distance between two vectors. We define the distance between two vectors x and y to be xy. We leave as an exercise the proof of the following proposition about the properties of Euclidean inner products.

Example 12.7.

The vector x=(3,4)t has length 32+42=5. We can also see that the orthogonal matrix
A=(3/54/54/53/5)
preserves the length of this vector. The vector Ax=(7/5,24/5)t also has length 5.
Since det(AAt)=det(I)=1 and det(A)=det(At), the determinant of any orthogonal matrix is either 1 or 1. Consider the column vectors
aj=(a1ja2janj)
of the orthogonal matrix A=(aij). Since AAt=I, ar,as=δrs, where
δrs={1r=s0rs
is the Kronecker delta. Accordingly, column vectors of an orthogonal matrix all have length 1; and the Euclidean inner product of distinct column vectors is zero. Any set of vectors satisfying these properties is called an orthonormal set. Conversely, given an n×n matrix A whose columns form an orthonormal set, it follows that A1=At.
We say that a matrix A is distance-preserving, length-preserving, or inner product-preserving when AxAy=xy, Ax=x, or Ax,Ay=x,y, respectively. The following theorem, which characterizes the orthogonal group, says that these notions are the same.

Proof.

We have already shown (1) and (2) to be equivalent.
(2)(3).
Ax,Ay=(Ax)tAy=xtAtAy=xty=x,y.
(3)(2). First, Ax,Ay=(Ax)tAy=xtAtAy=xty. If M is any n×n matrix, then eitMejt=Mij, where Mij is the ijth entry of the matrix M. Noticing that xtAtAy=xty, let x=ei and y=ej. Then
eitAtAej=eitej.
Thus, the ijth component of AtAis 1 when i=j and 0 otherwise. In other words AtA=I, and A1=At.
(3)(4). If A is inner product-preserving, then A is distance-preserving, since
AxAy2=A(xy)2=A(xy),A(xy)=xy,xy=xy2.
(4)(5). If A is distance-preserving, then A is length-preserving. Letting y=0, we have
Ax=AxAy=xy=x.
(5)(3). We use the following identity to show that length-preserving implies inner product-preserving:
x,y=12[x+y2x2y2].
Observe that
Ax,Ay=12[Ax+Ay2Ax2Ay2]=12[A(x+y)2Ax2Ay2]=12[x+y2x2y2]=x,y.
Two side-by figures. The figure on the left is a set of axes with an arrow pointing up and right from the origin to (a,b) and the second arrow pointing down and right from the origin to a point (a, -b). The figure on the right is a set of axes with an arrow pointed up and right from the origin to (cosine theta, sine theta) and an arrow point up and left at a right angle to the first vector from the origin to (sine theta, minus cosine theta).
Two side-by figures. The figure on the left is a set of axes with an arrow pointing up and right from the origin to (a,b) and the second arrow pointing down and right from the origin to a point (a, -b). The figure on the right is a set of axes with an arrow pointed up and right from the origin to (cosine theta, sine theta) and an arrow point up and left at a right angle to the first vector from the origin to (sine theta, minus cosine theta).
Figure 12.9. O(2) acting on R2

Example 12.10.

Let us examine the orthogonal group on R2 a bit more closely. An element AO(2) is determined by its action on e1=(1,0)t and e2=(0,1)t. If Ae1=(a,b)t, then a2+b2=1, since the length of a vector must be preserved when it is multiplied by A. Since multiplication of an element of O(2) preserves length and orthogonality, Ae2=±(b,a)t. If we choose Ae2=(b,a)t, then
A=(abba)=(cosθsinθsinθcosθ),
where 0θ<2π. The matrix A rotates a vector in R2 counterclockwise about the origin by an angle of θ (Figure 12.9).
If we choose Ae2=(b,a)t, then we obtain the matrix
B=(abba)=(cosθsinθsinθcosθ).
Here, detB=1 and
B2=(1001).
A reflection about the horizontal axis is given by the matrix
C=(1001),
and B=AC (see Figure 12.9). Thus, a reflection about a line is simply a reflection about the horizontal axis followed by a rotation.
Two of the other matrix or matrix-related groups that we will consider are the special orthogonal group and the group of Euclidean motions. The special orthogonal group, SO(n), is just the intersection of O(n) and SLn(R); that is, those elements in O(n) with determinant one. The Euclidean group, E(n), can be written as ordered pairs (A,x), where A is in O(n) and x is in Rn. We define multiplication by
(A,x)(B,y)=(AB,Ay+x).
The identity of the group is (I,0); the inverse of (A,x) is (A1,A1x). In Exercise 12.4.6, you are asked to check that E(n) is indeed a group under this operation.
A set of axes with an arrow pointing up and right from the origin to a point x.
A set of axes with an arrow of the same length and same direction as in the previous diagram from a point to the right and above the origin to a point x + y.
Figure 12.11. Translations in R2