A Scholar’s Review of Lie Groups and Algebras

Prof. Barak Shoshany

https://baraksh.com/
baraksh@gmail.com

Download these notes in PDF format

Abstract

These lecture notes, prepared for the Perimeter Scholars International master’s program at Perimeter Institute, provide a detailed and self-contained introduction to Lie groups, Lie algebras, and their representations. The notes first review fundamental concepts in differential geometry and abstract algebra, define Lie groups and algebras, and discuss the relation between a Lie group and its associated Lie algebra in full mathematical detail, along with some subtleties. An exhaustive list of common matrix Lie groups, their associated Lie algebras, and their topological properties is provided, along with a detailed discussion of the Lorentz group. Representation theory is then introduced, along with some relevant concepts, followed by a thorough derivation of some of the irreducible representations of several popular Lie groups.

1 Introduction

These lecture notes are intended to provide students with some essential knowledge of Lie groups, algebras, and their representations, which are ubiquitous in theoretical physics, especially in the context of quantum mechanics, gauge theories, and quantum gravity.

The notes are structured as follows:

  • Chapter 2 provides the mathematical concepts and definitions necessary to understand Lie groups and algebras at the abstract level, and in particular the precise relation between Lie groups and their corresponding Lie algebras.

  • Chapter 3 presents an organized and comprehensive discussion of matrix Lie groups, which are the most common Lie groups encountered in theoretical physics, along with their associated algebras and topological properties.

  • Chapter 4 discusses representations of Lie groups and algebras at the abstract level, along with some relevant concepts.

  • Finally, Chapter 5 derives some of the representations of several popular matrix Lie groups: U ( 1 ) , SO ( 2 ) , SU ( 2 ) , and SO ( 3 ) .

Should the reader encounter any typos or mistakes, please report them to the author at the email address provided above.

2 Abstract Lie Groups and Algebras

2.1 Manifolds

2.1.1 Smooth Manifolds

The Earth is shaped roughly like a spheroid. Therefore it has curvature, and its topology is compact. However, a person standing on the surface of the planet and looking around will not see this curvature; their immediate surroundings will look flat. Moreover, they will not notice the compact topology; the Earth seems to keep going as far as the eye can see. They might then conclude that the entire Earth is, in fact, a flat Euclidean plane, and not a spheroid! However, this is only true locally; satellite photos, for example, reveal the global structure of the Earth.

An n -dimensional manifold, or n -manifold, generalizes this idea: it is a space which locally looks like n , although globally it may be more complicated. More precisely, an n -manifold is a topological space equipped with an atlas, which is a collection of charts. These charts, also known as coordinate charts or neighborhoods, are open sets which cover the manifold – that is, their union is the whole manifold. This means that any point on the manifold is in at least one chart. Each chart is homeomorphic to n , or in other words, there is a continuous invertible function from each chart to n .

The charts are open sets, so to cover the whole manifold they must overlap. When two charts overlap, there is a transition function which allows us to move from one chart to the other in a well-defined way. More precisely, this transition function provides a change of coordinates from one coordinate chart to another.

Generally, it is impossible to cover a whole manifold with just one chart. As a simple example, consider the circle S 1 . Points on the circle may be given by an angle θ ; however, the angles are in the interval [ 0 , 2 π ) , which is not open and thus cannot be a chart. Furthermore, we cannot take our chart to be a larger interval, such as ( - π , 2 π ) , since then some points will be described by two different values of the coordinate, and the map cannot be invertible. Instead, we must cover the circle with two or more overlapping charts, related by transition functions.

In a differentiable manifold, the transition functions are all continuously differentiable a finite number of times, that is, they belong to C k for some k . In a smooth manifold, the transition functions are smooth, that is, infinitely differentiable or C . We will assume all manifolds are smooth in these notes.

2.1.2 Tangent Spaces

A tangent space to a manifold at a point is simply the collection of vectors that are tangent to the manifold at that point. The tangent space to the manifold M at a point p M is denoted T p M . For example, the tangent space T p S 2 to the sphere S 2 at the point p S 2 is the plane orthogonal to the radial vector pointing from the origin to p . Of course, this definition relies on the fact that we can embed the sphere in a 3-dimensional ambient space. We should instead define a tangent space in an abstract way, without using an embedding.

A nice way to define the tangent space T p M is as follows. Let f , g C ( M ) be smooth functions on the manifold, f , g : M . We define a tangent vector 𝐱 at the point p M as a map which takes a smooth function f C ( M ) to a number 𝐱 [ f ] ,

𝐱 : C ( M ) , f 𝐱 [ f ] ,

and satisfies the following axioms:

  1. 1.

    𝐱 [ f + g ] = 𝐱 [ f ] + 𝐱 [ g ] ,

  2. 2.

    𝐱 [ α f ] = α 𝐱 [ f ] where α ,

  3. 3.

    𝐱 [ f g ] = 𝐱 [ f ] g + f 𝐱 [ g ] .

A map satisfying these axioms is also called a derivation. The first two axioms ensure that it is a linear map, while the third is a generalization of the familiar Leibniz rule. Furthermore, given a number α and another tangent vector 𝐲 , we define

( α 𝐱 ) [ f ] α 𝐱 [ f ] ,
( 𝐱 + 𝐲 ) [ f ] 𝐱 [ f ] + 𝐲 [ f ] .

Then it is easy to see that the tangent vectors indeed form a vector space. We call that space the tangent space to M at p , or T p M .

Now, a curve γ is a function γ : M mapping real numbers to points on the manifolds in a smooth way, that is, such that for any smooth function f C ( M ) , the composition ( f γ ) ( t ) f ( γ ( t ) ) depends smoothly on the real parameter t . The curve has a tangent vector γ ( t ) at each point γ ( t ) M . We may imagine the curve as indicating the position of a particle on the manifold at time t , in which case γ ( t ) is the velocity vector, indicating the particle’s instantaneous velocity at time t .

The meaning of the derivative γ ( t ) is intuitively clear, but to define it rigorously, we must use its action on a function f C ( M ) :

γ ( t ) : C ( M ) ,
γ ( t ) [ f ] ( f γ ) ( t ) .

One can easily check that γ ( t ) satisfies the three axioms above. Therefore, we may define the tangent space T p M as the collection of tangent vectors γ ( 0 ) to all the curves passing through p at time t = 0 , that is, curves satisfying γ ( 0 ) = p .

2.2 Lie Groups

2.2.1 Groups

Let us recall that a group G is a set of elements, along with a product, satisfying the following axioms:

  1. 1.

    Closure: For any two elements g , h G , the product g h is also in G .

  2. 2.

    Associativity: For any three elements f , g , h G , we have ( f g ) h = f ( g h ) .

  3. 3.

    Identity element 1 1 1 Usually, the identity element is labeled e . Here we instead use I , in order to avoid confusion with the exponential map. I is the standard notation for the identity matrix, and in a matrix Lie group, the identity element is indeed the identity matrix. : There exists a (unique) element I G such that, for any element g G , we have I g = g I = g .

  4. 4.

    Inverse element: For any element g G there exists a unique element g - 1 such that g g - 1 = g - 1 g = I .

If the product is commutative, g h = h g for all g , h G , we say that the group is Abelian. However, the product is usually not commutative.

Note that the product can be any binary operation, not just multiplication; for example, the integers with the addition operation, m + n for m , n , form an (Abelian) group. However, in theoretical physics, and especially in the context of representation theory, we mostly use the notation g h for the group product, since the group elements are represented as matrices, and the product is then matrix multiplication.

2.2.2 Lie Groups and Their Actions

We are now ready to define a Lie group: it is a group which is also a manifold. In other words, it is a set of elements which satisfy the group axioms, with the additional requirement that a neighborhood of each element is homeomorphic to n . Furthermore, the group product and the inversion g g - 1 are both required to be smooth maps. The dimension of the Lie group is n , the dimension of the manifold.

One of the most powerful concepts in physics is symmetries: transformations under which a particular object is invariant. Some symmetries are discrete; for example, a square is invariant under four rotations: 0 , π / 2 , π , and 3 π / 2 radians, and four flips: vertical, horizontal, and the two diagonals. This forms a discrete symmetry group with 8 elements, known as the dihedral group of order 8 and denoted D 4 . However, symmetries can also be continuous; for example, a circle is invariant under rotations by any angle. This forms a continuous symmetry group with a continuum of elements, known as the circle group, or the unitary group of degree 1, and denoted U ( 1 ) .

Lie group are generally used in physics as continuous symmetry groups, and thus their elements represent symmetries of a certain object. The group action defines how elements of the group act on the object; for example, the action of an element g U ( 1 ) on the circle rotates the circle by the angle corresponding to this element.

Given a group G and a space X , we define the action of G on X as follows. The left group action is a function G × X X , denoted for g G and x X as g x , or simply g x , such that

  1. 1.

    I x = x for all x X , where I G is the identity element, and

  2. 2.

    ( g h ) x = g ( h x ) for all g , h G and x X , that is, the action of the product g h is the same as first applying h and then applying g to the result.

Note that if X = G , that is, the group is acting on itself (via the usual group product), then these two axioms are automatically satisfied. The axioms mean that, for every g G , the left action is an invertible map from X to itself, which is exactly what we would expect from a group of transformations.

We can similarly define the right group action, a function X × G X , denoted for g G and x X as x g , or simply x g , such that x I = x and x ( g h ) = ( x g ) h . Whether the action is left or right ultimately amounts to which elements acts first when the product g h acts on x : h for a left action or g for a right action, as can be seen from the axioms ( g h ) x = g ( h x ) and x ( g h ) = ( x g ) h . Thus, if we are given a right action, we can always construct a left action from it simply by mapping x g g - 1 x . Indeed, we have that ( g h ) - 1 x = h - 1 g - 1 x , so g would still end up acting first, as would be expected from a right action. Therefore, we can limit ourselves to discuss left actions exclusively without loss of generality.

2.3 Lie Algebras

2.3.1 Algebras and Lie Algebras

An algebra is a vector space 𝔤 over a field 𝔽 (which we will take to be either or ) equipped with a binary operation [ , ] : 𝔤 × 𝔤 𝔤 satisfying the following axioms:

  • Right distributivity: For all 𝐱 , 𝐲 , 𝐳 𝔤 , [ 𝐱 + 𝐲 , 𝐳 ] = [ 𝐱 , 𝐳 ] + [ 𝐲 , 𝐳 ] .

  • Left distributivity: For all 𝐱 , 𝐲 , 𝐳 𝔤 , [ 𝐱 , 𝐲 + 𝐳 ] = [ 𝐱 , 𝐲 ] + [ 𝐱 , 𝐳 ] .

  • Compatibility with scalars: For all 𝐱 , 𝐲 𝔤 and α , β 𝔽 , [ α 𝐱 , β 𝐲 ] = α β [ 𝐱 , 𝐲 ] .

Note that the operation [ , ] is not necessarily commutative, or even associative!

In the case of a Lie algebra, the binary operator [ , ] is called the Lie bracket or commutator, and it is required to satisfy two additional axioms:

  • Alternativity: For all 𝐱 𝔤 , [ 𝐱 , 𝐱 ] = 0 .

  • The Jacobi identity: For all 𝐱 , 𝐲 , 𝐳 𝔤 , [ 𝐱 , [ 𝐲 , 𝐳 ] ] + [ 𝐲 , [ 𝐳 , 𝐱 ] ] + [ 𝐳 , [ 𝐱 , 𝐲 ] ] = 0 .

The alternativity axiom implies anti-commutativity of the bracket:

0 = [ 𝐱 + 𝐲 , 𝐱 + 𝐲 ] = [ 𝐱 , 𝐲 ] + [ 𝐲 , 𝐱 ]
[ 𝐱 , 𝐲 ] = - [ 𝐲 , 𝐱 ] .

A well-known example of a Lie algebra is the vector space 3 with the cross product × . It’s easy to check that it indeed satisfies all of the axioms above. Another well-known example is the set M ( n , 𝔽 ) of real or complex n × n matrices, with the binary operation being the matrix commutator [ A , B ] A B - B A .

2.3.2 Generators of Lie Algebras and Structure Constants

Since a Lie algebra is a vector space, it has a basis. In this context, we call the basis vectors generators. The number of generators defines the dimension of the Lie algebra. We will denote the generators 𝝉 i , where the index i goes from 1 to the number of dimensions. Now, if we take the commutator of two generators, the result is by definition another vector in the algebra, and therefore it must be a linear combination of the generators. Thus we define:

[ 𝝉 i , 𝝉 j ] f i j 𝝉 k k , (2.1)

where f i j k are called the structure constants. (Note that the Einstein summation convention is implied: any index which appears twice, once as an upper index and once as a lower index, is summed over.) Since the commutator is anti-symmetric and satisfies the Jacobi identity, the structure constants satisfy

f i j = k - f j i , k
f i j f k l l + m f j k f i l l + m f k i f j l l = m 0 .

Sometimes in physics we instead use the convention 2 2 2 To avoid confusion, in these lecture notes the imaginary unit i is written in roman typeface, while the index i is written in italics. [ 𝝉 i , 𝝉 j ] i f i j 𝝉 k k , which differs from the math convention by a factor of i . In these notes we will use the math convention, except in Subsection 5.2.5 where we will use the physics convention.

2.4 The Lie Algebra Associated to a Lie Group

Every Lie group has a Lie algebra associated to it. In this section, we will see that the Lie algebra is the tangent space to the Lie group at the identity element, and it inherits its Lie bracket from the action of tangent vectors. However, in order to define this relation rigorously, we will first have to introduce some fundamental concepts in differential geometry.

It is important to note that while every Lie group has a Lie algebra, the relation is not one-to-one; two groups can have the same algebra. There are some other subtleties involved, depending on the topology of the group. We will discuss some of them below.

In general, the notation for the Lie algebra will use the same letters designating the Lie group it is associated with, written in lowercase Fraktur font. For example, the Lie algebra of SU ( 2 ) is 𝔰 𝔲 ( 2 ) . For a general Lie group G , the associated Lie algebra is written 𝔤 . We will denote group elements with Roman font, g G , and algebra elements in bold font, 𝐱 𝔤 . This is to remind the reader that the Lie algebra elements are actually vectors in a vector space.

2.4.1 The Differential and Left-Invariant Vector Fields

Given a map ϕ : M N between a manifold M and another manifold N , the differential or pushforward of ϕ at a point p M , denoted d ϕ p , is the linear map

d ϕ p : T p M T ϕ ( p ) N ,
d ϕ p ( γ ( 0 ) ) ( ϕ γ ) ( 0 ) ,

where γ : M is a curve such that γ ( 0 ) = p . To understand what this means, recall that γ ( 0 ) is a tangent vector at p . We can map the curve γ : M to a curve ϕ γ : N , and the differential “pushes us forward” from the tangent vector γ ( 0 ) T p M at p = γ ( 0 ) M to the tangent vector ( ϕ γ ) ( 0 ) T ϕ ( p ) N at ϕ ( p ) = ( ϕ γ ) ( 0 ) N .

In Subsection 2.2.2 we defined the group action, and we noted that any group can also act on itself, with the group action being the usual group product. For each group element g G , we define a corresponding left action:

L g : G G L g ( h ) g h .

Now, let 𝐱 Γ ( T G ) be a vector field 3 3 3 T M is called the tangent bundle to the manifold M . It is simply the set of pairs ( p , 𝐱 ) such that p M and 𝐱 T p M , equipped with a projection π : T M M such that π ( p , 𝐱 ) = p . A section of a bundle is the inverse of the projection, in the sense that it is a map σ : M T M such that π ( σ ( p ) ) = p for all p M . In other words, a section assigns, to each point on the manifold, a tangent vector in the tangent space at that point. This is exactly what a vector field does, and hence, a vector field is a section of the tangent bundle. The set of sections of the tangent bundle is denoted Γ ( T M ) . on the Lie group G , that is, a function assigning a tangent vector 𝐱 ( g ) T g G to each point g G in the manifold. Given some h G , we can employ two different actions to combine it with g :

  1. 1.

    First compose g with h to get L g ( h ) = g h , and then calculate 𝐱 ( L g ( h ) ) = 𝐱 ( g h ) ;

  2. 2.

    First calculate 𝐱 ( h ) , and then push it forward from the point h to the point g h using the differential of the left action, ( d L g ) ( 𝐱 ( h ) ) .

In general, these two actions will produce difference results. The vector field 𝐱 is called left-invariant if, for all g , h G , these two actions coincide:

𝐱 ( g h ) = ( d L g ) ( 𝐱 ( h ) ) .

In other words, for a left-invariant vector field, if we calculate 𝐱 at g h , or first calculate 𝐱 at h and then push the result forward to g h , we will get the same result.

The reason we care about left-invariant vector fields is that, if 𝐱 is a left-invariant vector field, and we take h = I (the identity element), then

𝐱 ( g ) = ( d L g ) ( 𝐱 ( I ) ) .

Hence, the value of 𝐱 at any point g G is completely determined by its value at the identity. Instead of looking at an entire vector field – a tangent vector at each group element – we can just look at the field’s value at the identity, provided that it’s left-invariant.

Conversely, if we have a tangent vector at the identity element, 𝐯 T I G , we may define a vector field from it as follows:

𝐱 ( g ) ( d L g ) ( 𝐯 ) , g G .

Obviously, 𝐱 ( I ) = 𝐯 . Furthermore, we have

𝐱 ( g h ) = ( d L g h ) ( 𝐯 )
= d ( L g L h ) ( 𝐯 )
( * ) = ( d L g ) ( d L h ( 𝐯 ) )
= ( d L g ) ( 𝐱 ( h ) ) ,

where in ( * ) we used the chain rule. Hence, the vector field 𝐱 thus defined is left-invariant. We conclude that there is an isomorphism – an invertible homomorphism – between the space of left-invariant vector fields on G and the space of tangent vectors at the identity, T I G .

2.4.2 The Lie Algebra of a Lie Group

Now we can properly define the Lie algebra 𝔤 of a Lie group G : it is the set of all left-invariant vector fields on G . One can check that this set is indeed a vector space. However, we are still missing one crucial ingredient – the Lie bracket. In Subsection 2.1.2 we defined tangent vectors as derivations: linear maps satisfying the Leibniz rule. Given two vector fields 𝐱 , 𝐲 on G , we define their commutator by its action on functions f C ( G ) as follows:

[ 𝐱 , 𝐲 ] [ f ] 𝐱 [ 𝐲 [ f ] ] - 𝐲 [ 𝐱 [ f ] ] . (2.2)

Here, by 𝐱 [ 𝐲 [ f ] ] we mean: first act with the derivation 𝐲 on the function f , which produces another function (a real number at each point on the manifold), and then act with 𝐱 on the resulting function. One can check that [ 𝐱 , 𝐲 ] satisfies the axioms for a derivation, and therefore it is a vector field as well.

2.4.3 Integral Curves and Flows

Let 𝐱 Γ ( T G ) be a vector field on a manifold G , let g G be a point on the manifold, and let R be an open interval containing 0 . The integral curve of 𝐱 passing through g is a curve γ : R G such that

γ ( t ) = 𝐱 ( γ ( t ) ) , γ ( 0 ) = g . (2.3)

In other words, the curve passes through g and the tangent vector to the curve at each point is given by the value of the vector field 𝐱 at that point. If we imagine a vector field as a collection of arrows sprinkled on the manifold, then the velocity of the integral curve follows these arrows. ( 2.3 ) is an ordinary differential equation, and by the standard existence and uniqueness theorems for such equations, there exists a unique integral curve for any vector field 𝐱 passing through any point g for some interval R with 0 R .

In particular, there exists a unique maximal integral curve γ : R G , where the interval R is maximal. A vector field 𝐱 Γ ( T G ) is called complete if every maximal integral curve γ of that vector field is defined on all of , that is, γ : G . Given an integral curve γ of a left-invariant vector field 𝐱 passing through a group element g , the curve γ g L g γ is also an integral curve, due to 𝐱 being left-invariant. Using this property, we may in fact extend any integral curve to , and thus 𝐱 is complete (this is left as an exercise for the reader). Since the Lie algebra is the set of left-invariant vector fields, any vector in a Lie algebra is complete.

2.4.4 The Exponential Map

Using the fact that 𝐱 is complete, we may define a smooth map ϕ 𝐱 : × G G such that, for all g G ,

  1. 1.

    ϕ 𝐱 ( 0 , g ) = g ,

  2. 2.

    t ϕ 𝐱 ( t , g ) is an integral curve of 𝐱 passing through g .

We may collect all of these maps into a map ϕ : × G × 𝔤 G defined by ϕ ( t , g , 𝐱 ) ϕ 𝐱 ( t , g ) . This map satisfies:

  1. 1.

    ϕ ( 0 , g , 𝐱 ) = g ,

  2. 2.

    t ϕ ( t , g , 𝐱 ) is an integral curve of 𝐱 passing through g ,

  3. 3.

    ϕ ( t , g , 𝐱 ) = g ϕ ( t , I , 𝐱 ) ,

  4. 4.

    ϕ ( t , g , s 𝐱 ) = ϕ ( s t , g , 𝐱 ) ,

  5. 5.

    ϕ ( s , I , 𝐱 ) ϕ ( t , I , 𝐱 ) = ϕ ( s + t , I , 𝐱 ) .

Properties 1 and 2 follow automatically from the definition of ϕ 𝐱 . Property 3 follows from the fact that γ ( t ) g ϕ ( t , I , 𝐱 ) is an integral curve of 𝐱 with initial condition

γ ( 0 ) = g ϕ ( 0 , I , 𝐱 ) = g I = g ,

and since ϕ ( t , g , 𝐱 ) is also an integral curve of 𝐱 with the same initial condition, by the uniqueness property they must be equivalent. Properties 4 and 5 follow similarly.

By combining properties 3 and 4, we find:

ϕ ( t , g , 𝐱 ) = g ϕ ( t , I , 𝐱 ) = g ϕ ( 1 , I , t 𝐱 ) .

Therefore, the map ϕ is in fact completely determined by 𝐱 alone: if we know ϕ ( 1 , I , 𝐱 ) , then for any g and t we can easily find ϕ ( t , g , 𝐱 ) . We call this map the exponential map:

exp 𝐱 : 𝔤 G , exp 𝐱 ϕ ( 1 , I , 𝐱 ) .

Then g exp ( t 𝐱 ) is the integral curve for 𝐱 which passes through g at t = 0 . Furthermore, from property 5 we have

exp ( s 𝐱 ) exp ( t 𝐱 ) = exp ( ( s + t ) 𝐱 ) .

We may also write e 𝐱 exp 𝐱 for short. However, note that the usual identity e 𝐱 + 𝐲 = e 𝐱 e 𝐲 does not apply here, unless 𝐱 and 𝐲 are linearly dependent.

In summary, using the exponential map, we may obtain a group element exp 𝐱 G from an algebra element 𝐱 𝔤 .

3 The Matrix Lie Groups and Their Algebras

So far, we’ve dealt with abstract Lie groups and algebras. Now that we have a fairly good understanding of how they are defined, we will talk about the Lie groups that are the most common in physics: the matrix Lie groups.

Let M ( n , 𝔽 ) be the set of n × n matrices with entries from the field 𝔽 , which we will assume is either the real numbers or the complex numbers . This set is not a group with respect to matrix multiplication, since some matrices are not invertible. However, if we take only the invertible matrices, we obtain the Lie group GL ( n , 𝔽 ) , also known as the general linear group. Note that for n 2 this group is non-Abelian, since matrix multiplication is non-commutative.

The group GL ( n , 𝔽 ) and its subgroups are collectively known as the matrix Lie groups. Their associated Lie algebras are matrix Lie algebras, and the Lie bracket is the usual matrix commutator. In this chapter we will give a more down-to-earth definition of the exponential map for the case of a matrix Lie group, review some subtleties, recall the definition of a normal subgroup, and then list the most important matrix Lie groups, along with their dimensions, topological properties, and associated Lie algebras.

3.1 The Exponential Map for Matrix Lie Groups

If our Lie group G is a matrix group with a matrix Lie algebra 𝔤 , then the exponential map has a concrete definition:

e 𝐱 k = 0 𝐱 k k ! ,

where 𝐱 𝔤 , and 𝐱 k is simply the product of the matrix 𝐱 with itself k times. The identity element I G , which is the n × n identity matrix, is the exponential of the zero element 𝟎 𝔤 , which is the n × n zero matrix:

I = e 𝟎 .

The exponential of a general algebra element 𝐱 𝔤 will always result in some group element g G :

e 𝐱 = g G .

If the matrices 𝐱 𝔤 and 𝐲 𝔤 commute, then

e 𝐱 + 𝐲 = e 𝐱 e 𝐲 .

In particular, since 𝐱 commutes with - 𝐱 , we have e 𝐱 e - 𝐱 = I . Therefore, the inverse g - 1 G will be the exponential of - 𝐱 𝔤 :

g - 1 = e - 𝐱 .

Furthermore, for two scalars s , t 𝔽 , the matrices s 𝐱 and t 𝐱 commute, so

e s 𝐱 e t 𝐱 = e ( s + t ) 𝐱 .

We also have, for a scalar t and a matrix 𝐱 𝔤 ,

d d t e t 𝐱 = 𝐱 e t 𝐱 = e t 𝐱 𝐱
d d t e t 𝐱 | t = 0 = 𝐱 .

Finally, we have the important relation 4 4 4 To remember this, recall that the determinant is the product of the eigenvalues of the matrix, while the trace is the sum of the eigenvalues. In particular, for a diagonal matrix 𝐱 = diag ( λ 1 , , λ n ) , we have tr 𝐱 = i = 1 n λ i . One can easily calculate that e 𝐱 = diag ( e λ 1 , , e λ n ) , and thus det e 𝐱 = i = 1 n e λ i = e i = 1 n λ i = e tr 𝐱 . The reader is invited to prove this relation for a general (not necessarily diagonal) matrix.

det e 𝐱 = e tr 𝐱 .

Now, if G is connected and compact 5 5 5 A space is connected if it cannot be divided into two disjoint non-empty open sets. A space X is compact if every one of its open covers – collections of open subsets of X such that X is the union of those subsets, just like a manifold is covered by an atlas of charts – has a finite subcover. Intuitively, this generalizes the notion of a closed and bounded set in Euclidean space to more abstract spaces. , then the map 𝐱 e 𝐱 is surjective (onto), meaning that each group element g G has at least one element 𝐱 𝔤 such that g = e 𝐱 . This is a sufficient condition, but not a necessary one; for example, even though GL ( n , ) is non-compact, the map from the Lie algebra 𝔤 𝔩 ( n , ) to the Lie group GL ( n , ) is surjective.

Moreover, if the map 𝐱 e 𝐱 is injective (one-to-one), meaning that each group element g G has at most one element 𝐱 𝔤 such that g = e 𝐱 , then G is simply connected 6 6 6 In a simply connected space, every path between two points can be continuously deformed into any other path between the same points, or equivalently, every loop can be continuously shrunk to a point. . This is a necessary condition, but not a sufficient one; for example, even though SU ( 2 ) is simply connected, the matrix 𝐱 diag ( i π , - i π ) 𝔰 𝔲 ( 2 ) and its negative both map to the same matrix e 𝐱 = e - 𝐱 SU ( 2 ) .

3.2 Normal Subgroups

3.2.1 Subgroups and Cosets

A subgroup is a subset of elements in the group which themselves form a group. For example, consider the general linear group GL ( n , 𝔽 ) , which consists of all n × n matrices over the field 𝔽 . If we restrict to the subset of matrices with determinant 1, we get the special linear group SL ( n , 𝔽 ) . It is easy to see that the properties of closure, associativity, identity element and inverse element are all satisfied for SL ( n , 𝔽 ) , so it is indeed a group, and hence a subgroup of GL ( n , 𝔽 ) .

Now, given a group G and a subgroup H , the left coset g H and right coset H g are the sets of elements formed by acting on every single element of H with a particular element g G , either from the left or from the right:

g H { g h | h H } ,
H g { h g | h H } .

Note that if G is Abelian, then the left and right cosets are the same. Also, since ( g H ) - 1 = H g - 1 , the number of left cosets is equal to the number of right cosets.

As a simple example, consider the group ( , + ) of the integers with the addition operation. The even integers 2 are clearly a subgroup of ( , + ) . There are two possible cosets with respect to this subgroup: 2 itself (acting on every element of 2 with the identity element 0 ) and 2 + 1 , which is the subgroup of odd integers. However, note that, in general, cosets are not necessarily subgroups!

3.2.2 Definition of a Normal Subgroup

A normal subgroup N of G is a subgroup for which the left and right cosets are the same:

g N = N g , g G .

In other words, the normal subgroup N “commutes” with every element of the group G . (Obviously, if G is Abelian, every subgroup is automatically normal.) An equivalent definition is that a normal subgroup N of G is invariant under conjugation with any element of G :

g h g - 1 N , h N , g G .

As an example, consider the group GL ( n , 𝔽 ) and its subgroup SL ( n , 𝔽 ) . Given an element g GL ( n , 𝔽 ) and an element h SL ( n , 𝔽 ) , we have:

det ( g h g - 1 ) = det ( g ) det ( h ) det ( g - 1 ) = 1 ,

since det ( g - 1 ) = 1 / det ( g ) and det ( h ) = 1 . We see that g h g - 1 has determinant 1, and since it is also invertible, we conclude that it is in SL ( n , 𝔽 ) . Hence SL ( n , 𝔽 ) is a normal subgroup of GL ( n , 𝔽 ) .

3.2.3 Quotient Groups

The quotient group G / N , where N is a normal subgroup, is defined as the set of left cosets:

G / N { g N | g G } ,

where

g N { g h | h N } .

Every element g N in G / N is actually the set of all of the possible ways elements from the normal subgroup N can be applied to g from the right. In order to make this set into a group, we must also define a group product. We take the product to be:

( g 1 N ) ( g 2 N ) ( g 1 g 2 ) N ,
g 1 N , g 2 N G / N .

Here, the fact that N is normal becomes crucial. If N was not normal, then we could have had a situation where g 1 N = g 1 N and g 2 N = g 2 N for g 1 , g 2 , g 1 , g 2 G , and yet ( g 1 g 2 ) N ( g 1 g 2 ) N , which means that the product does not have a well-defined outcome! However, if N is normal, then by definition we know that N “commutes” with any element of G . Thus

( g 1 g 2 ) N = g 1 g 2 N
= g 1 N g 2
= g 1 N g 2
= N g 1 g 2
= ( g 1 g 2 ) N ,

and we have a well-defined product.

3.2.4 Semidirect Products and Simple Groups

Finally, let Q be a group isomorphic to a quotient group, Q G / N . Then we say that G is the semidirect product of Q and the normal subgroup N :

G = Q N .

Any group G possesses two trivial normal subgroups: the group { I } , consisting of only the identity element, and the group G itself. If G does not possess any non-trivial normal subgroups, we say that it is a simple group. If G is not simple, then we can break it into two smaller groups by taking the quotient with a non-trivial normal subgroup. In the example of GL ( n , 𝔽 ) and its (non-trivial) normal subgroup SL ( n , 𝔽 ) , we find

GL ( n , 𝔽 ) = 𝔽 × SL ( n , 𝔽 ) ,
𝔽 × GL ( n , 𝔽 ) / SL ( n , 𝔽 ) ,

where 𝔽 × 𝔽 \ { 0 } is the multiplicative group of the field 𝔽 .

3.3 The General and Special Linear Groups

The general linear group GL ( n , 𝔽 ) consists of all invertible n × n matrices:

GL ( n , 𝔽 ) { A M ( n , 𝔽 )
| det A 0 } .
  • Action: GL ( n , 𝔽 ) is the group of all invertible linear transformations from the vector space 𝔽 n to itself (also known as automorphisms).

  • Dimension: The dimension of GL ( n , ) is n 2 , since real n 2 parameters are needed to specify an n × n matrix. The requirement that it is invertible does not reduce the number of parameters. To see that, note that removing the matrices with det A = 0 is analogous to removing one point (the origin) from ; you still need 1 parameter to specify where on the real line you are. Similarly, GL ( n , ) has dimension 2 n 2 , since its n 2 complex entries require 2 n 2 real parameters.

  • Compactness: GL ( n , 𝔽 ) is non-compact.

  • Connectedness: GL ( n , ) is disconnected. It consists of two connected components, corresponding to positive and negative determinants respectively. To see this, note that the image of det A is × \ { 0 } , which is disconnected, since it can be divided into the two disjoint open sets ( - , 0 ) and ( 0 , + ) . Since det A is a continuous function, GL ( n , ) must also be disconnected 7 7 7 A continuous function maps connected spaces to connected spaces. . In contrast, GL ( n , ) is connected, since the image of det A is × \ { 0 } , which is connected (but not simply connected – why?)

  • Lie algebra: The associated Lie algebra 𝔤 𝔩 ( n , 𝔽 ) is simply the set M ( n , 𝔽 ) of all n × n matrices over the field 𝔽 . Indeed, for any matrix 𝐱 M ( n , 𝔽 ) , the corresponding group element e 𝐱 is automatically invertible, with its inverse being e - 𝐱 .

The special linear group SL ( n , 𝔽 ) consists of all n × n matrices with determinant 1:

SL ( n , 𝔽 ) { A M ( n , 𝔽 )
| det A = 1 } .
  • Action: SL ( n , 𝔽 ) is the group of all automorphisms of 𝔽 n that preserve volume and orientation.

  • Dimension: The dimension of SL ( n , ) is n 2 - 1 , and the dimension of SL ( n , ) is 2 ( n 2 - 1 ) .

  • Compactness: SL ( n , 𝔽 ) is still non-compact.

  • Connectedness: SL ( n , 𝔽 ) is connected, even for 𝔽 = .

  • Lie algebra: The associated Lie algebra 𝔰 𝔩 ( n , 𝔽 ) is the set of all n × n matrices over 𝔽 with vanishing trace, since det e 𝐱 = e tr 𝐱 .

Note that SL ( n , 𝔽 ) is a normal subgroup of GL ( n , 𝔽 ) , and we have

GL ( n , 𝔽 ) = 𝔽 × SL ( n , 𝔽 ) ,

where 𝔽 × 𝔽 \ { 0 } is the multiplicative group of 𝔽 . This means that we can decompose any general linear transformation in GL ( n , 𝔽 ) into a volume- and orientation-preserving transformation in SL ( n , 𝔽 ) , times an element of 𝔽 × which may change the volume and/or orientation. Or, more simply, any matrix in GL ( n , 𝔽 ) can be written as its determinant, which is a number in 𝔽 × , times a matrix in SL ( n , 𝔽 ) .

3.4 The Orthogonal and Special Orthogonal Groups

The orthogonal group O ( n ) consists of all real orthogonal n × n matrices:

O ( n ) { A M ( n , )
| A T A = I } ,

where A T is the transpose of A such that ( A T ) i j A j i .

  • Action: O ( n ) is the group of distance-preserving automorphisms of n (also known as isometries) which leave the origin invariant. Indeed, the inner product of two vectors 𝐱 , 𝐲 n is given by 𝐱 𝐲 𝐱 T 𝐲 , and if A is orthogonal, the inner product is invariant under the orthogonal transformation 𝐱 A 𝐱 , 𝐲 A 𝐲 . Hence the norm 𝐱 𝐱 𝐱 , and in particular the Euclidean distance 𝐱 - 𝐲 , is invariant under the action of O ( n ) .

  • Dimension: The dimension of O ( n ) is n ( n - 1 ) / 2 , since that is the number of real parameters needed to specify an orthogonal matrix 8 8 8 To see this, consider that a general real n × n matrix A has n 2 arbitrary parameters. The matrix A T A satisfies ( A T A ) T = A T A , so it is symmetric and thus has n ( n + 1 ) / 2 independent parameters. Hence the equation A T A = I results in n ( n + 1 ) / 2 equations for the original n 2 parameters, reducing the number of independent parameters to n ( n - 1 ) / 2 . .

  • Compactness: O ( n ) is our first example of a compact Lie group. Indeed, first recall that, by the Heine-Borel theorem, a subset of n is compact if and only if it is closed and bounded. We can identify M ( n , ) with n 2 by taking the n 2 entries of the vector in n 2 to be the matrix entries of a matrix in M ( n , ) ; for example ( a b c d ) ( a , b , c , d ) . Then O ( n ) may be similarly identified with a subset of n 2 . To see that it is closed, consider the continuous function f ( A ) A T A . Then we have O ( n ) = f - 1 ( { I } ) , that is, O ( n ) is the preimage of the closed set { I } ; therefore, since f is continuous, O ( n ) must itself be closed 9 9 9 A function f from a topological space X to a topological space Y is continuous if and only if the preimage f - 1 ( S ) { A X | f ( A ) S } of every closed set S in Y is closed in X . (Alternatively, if and only if the preimage f - 1 ( S ) of every open set S in Y is open in X .) . Furthermore, O ( n ) is bounded, since each of the rows (or columns) of an orthogonal matrix is a vector of norm 1, and thus no entry in the matrix may be larger than 1. Thus O ( n ) is compact.

  • Connectedness: O ( n ) is disconnected, since an orthogonal matrix has det ( A T A ) = ( det A ) 2 = 1 and thus det A = ± 1 ; the two connected components correspond to the two possible values of the determinant.

  • Lie algebra: The associated Lie algebra 𝔬 ( n ) is the set of real anti-symmetric n × n matrices. Indeed, if 𝐱 is anti-symmetric then 𝐱 T = - 𝐱 and thus, if A = e 𝐱 , we have A T = ( e 𝐱 ) T = e - 𝐱 = A - 1 , so A is orthogonal.

The special orthogonal group SO ( n ) consists of all real orthogonal n × n matrices with determinant 1:

SO ( n ) { A M ( n , )
| A T A = I and det A = 1 } .
  • Action: SO ( n ) is the group of isometries of n which leave the origin invariant and preserve orientation. These transformations are also known as rotations.

  • Dimension: The dimension of SO ( n ) , like that of O ( n ) , is n ( n - 1 ) / 2 . The condition det A = 1 doesn’t reduce the number of parameters. To see why, note that O ( n ) is a space of dimension n ( n - 1 ) / 2 consisting of two disjoint open sets, one with determinant + 1 and another with determinant - 1 . Just like dividing the real numbers into positive and negative numbers does not reduce the dimension of the real line, so does selecting only the component with determinant + 1 not reduce the dimension of the overall space.

  • Compactness: Like O ( n ) , SO ( n ) is compact.

  • Connectedness: SO ( n ) is connected, and it is in fact one of the two connected components of O ( n ) : the one with determinant 1.

  • Lie algebra: The associated Lie algebra 𝔰 𝔬 ( n ) is the set of real anti-symmetric n × n matrices, and it is identical to 𝔬 ( n ) . Note that since an anti-symmetric matrix always has zero trace, its exponential always has determinant 1. Thus, even though both groups have the same Lie algebra, exponentiating that algebra only allows access to the component of O ( n ) connected to the identity matrix, namely SO ( n ) . This is to be expected, since the Lie algebra is the tangent space at the identity, and the identity has determinant 1 10 10 10 Recall that, as we mentioned above, if a group is connected and compact, like SO ( n ) , then the map from the algebra to the group is surjective (onto), but this is not necessarily true for groups that are disconnected, like O ( n ) , or non-compact. .

SO ( n ) is a normal subgroup of O ( n ) , and we have

O ( n ) = 2 SO ( n ) ,

where 2 is the group { + 1 , - 1 } with the group product being multiplication. This results from the fact that any matrix in O ( n ) can be written as its determinant, which is a number in 2 , times a matrix in SO ( n ) .

3.5 The Indefinite Orthogonal and Special Orthogonal Groups

3.5.1 General p and q

Let p and q be positive integers. The indefinite orthogonal group O ( p , q ) consists of all real ( p + q ) × ( p + q ) matrices which leave a bilinear form (or metric) of signature ( p , q ) invariant:

O ( p , q ) { A M ( p + q , )
| A T η A = η } ,
η diag ( - 1 , , - 1 p , + 1 , , + 1 q ) .
  • Action: O ( p , q ) is the group of isometries of p + q which leave the origin invariant. p + q is the ( p + q ) -dimensional real vector space with the dot product 𝐱 𝐲 𝐱 T η 𝐲 , and it is invariant under the transformation 𝐱 A 𝐱 , 𝐲 A 𝐲 . Note that for p = 0 and q = n , the metric η is the n × n identity matrix, and we recover the condition for O ( n ) , A T A = I .

  • Dimension: The dimension of O ( p , q ) is n ( n - 1 ) / 2 where n = p + q .

  • Compactness: O ( p , q ) is non-compact for p , q > 0 , since the elements of the matrices are not bounded.

  • Connectedness: O ( p , q ) is disconnected and consists of four connected components, described below.

  • Lie algebra: The associated Lie algebra 𝔬 ( p , q ) is the set of real n × n matrices 𝐱 satisfying 𝐱 T = - η 𝐱 η - 1 . Indeed, we then have

    ( e 𝐱 ) T η e 𝐱 = e - η 𝐱 η - 1 η e 𝐱
    = ( η e - 𝐱 η - 1 ) η e 𝐱
    = η ,

    as required. Note that for p = 0 and q = n , this condition reduces to the anti-symmetry condition defining 𝔬 ( n ) . As for 𝔬 ( n ) , the Lie algebra only allows us access to the component connected to the identity.

The indefinite special orthogonal group SO ( p , q ) consists of all of the matrices in O ( p , q ) which have determinant 1:

SO ( p , q ) { A M ( p + q , )
| A T η A = η and det A = 1 } ,

where η is defined as before.

  • Action: It is the group of isometries of p + q which leave the origin invariant and preserve orientation.

  • Dimension: SO ( p , q ) still has dimension n ( n - 1 ) / 2 where n = p + q .

  • Compactness: Like O ( p , q ) , the group SO ( p , q ) is non-compact for p , q > 0 .

  • Connectedness: SO ( p , q ) is still disconnected for p , q > 0 , but with only two connected components, the ones where det A = 1 .

  • Lie algebra: The associated Lie algebra 𝔰 𝔬 ( p , q ) is the set of real n × n matrices 𝐱 satisfying 𝐱 T = - η 𝐱 η - 1 . It is identical to 𝔬 ( p , q ) , and only allows us access to the component of SO ( p , q ) connected to the identity.

3.5.2 p = 1 : The Lorentz Group

The special case p = 1 is the most important one in physics, since it defines the Lorentz group, with η the Minkowski metric. The group O ( 1 , n - 1 ) is the Lorentz group in n dimensions. To describe the four connected components of this group, let us consider the simplest non-trivial case, that of O ( 1 , 1 ) , the Lorentz group in 2 dimensions. The Minkowski metric is

η ( - 1 0 0 1 ) ,

and we take a general matrix A O ( 1 , 1 ) ,

A ( a b c d ) .

From the defining relation A T η A = η , we get

( a c b d ) ( - 1 0 0 1 ) ( a b c d ) =
= ( c 2 - a 2 c d - a b c d - a b d 2 - b 2 )
= ( - 1 0 0 1 ) .

Thus

a b = c d , c 2 - a 2 = - 1 , d 2 - b 2 = 1 .

These are 3 equations for 4 unknowns, so we must have one free parameter. Indeed, this is consistent with the fact that the dimension of the group is 1. Let us choose the free parameter to be a . Then we immediately get c 2 = a 2 - 1 and thus we must have a 2 1 . Furthermore, b = c d / a , and plugging that into 1 = d 2 - b 2 we obtain

1 = d 2 - c 2 d 2 a 2
= d 2 - ( a 2 - 1 ) d 2 a 2
= d 2 a 2 ,

so d = ± a . Then from b = c d / a we get b = ± c , where the sign corresponds to the same sign as that chosen for d = ± a . In conclusion, we have two sets of solutions, one with positive determinant:

b = c = ± a 2 - 1 ,
d = + a ,
det A = a d - b c = + 1 ,

and one with negative determinant:

b = ± a 2 - 1 ,
c = a 2 - 1 ,
d = - a ,
det A = a d - b c = - 1 .

In addition, the condition a 2 1 is satisfied by either a + 1 or a - 1 . We can now classify the solutions into four types, depending on the sign of a and the sign of the determinant:

  • Proper: det A = + 1 ,

  • Improper: det A = - 1 ,

  • Orthochronous: a + 1 ,

  • Non-orthochronous: a - 1 .

The four connected components of O ( 1 , 1 ) are then:

  • Proper, orthochronous transformations: det A = + 1 and a + 1 . This is the component connected to the identity matrix I = diag ( 1 , 1 ) , which is obtained for a = + 1 .

  • Improper, orthochronous transformations: det A = - 1 and a + 1 . This component includes the parity inversion matrix P diag ( 1 , - 1 ) , which is obtained for a = + 1 .

  • Improper, non-orthochronous transformations: det A = - 1 and a - 1 . This component includes the time reversal matrix T diag ( - 1 , 1 ) , which is obtained for a = - 1 .

  • Proper, non-orthochronous transformations: det A = + 1 and a - 1 . This component includes the matrix P T = diag ( - 1 , - 1 ) = - I , which is obtained for a = - 1 .

Now, by definition SO ( 1 , 1 ) contains the two proper components, both orthochronous and non-orthochronous. The subgroup of proper orthochronous transformations is denoted SO + ( 1 , 1 ) . It can be shown that this is a normal subgroup of O ( 1 , 1 ) .

Furthermore, note that we can move between the four connected components by applying P , T , or both. One can check that the elements { I , P , T , P T } form a group; in fact, this group is isomorphic to the quotient group O ( 1 , 1 ) / SO + ( 1 , 1 ) . Thus we have the semidirect product

O ( 1 , 1 ) = { I , P , T , P T } SO + ( 1 , 1 ) .

This analysis readily generalizes to the 4-dimensional Lorentz group O ( 1 , 3 ) .

3.6 The Unitary and Special Unitary Groups

The unitary group U ( n ) consists of all complex unitary n × n matrices:

U ( n ) { A M ( n , )
| A A = I } ,

where A is the conjugate transpose of A such that ( A ) i j A ¯ j i .

  • Action: U ( n ) is the group of isometries of n , with respect to the Hermitian inner product given by 𝐱 , 𝐲 𝐱 𝐲 where 𝐱 , 𝐲 n , which leave the origin invariant. It is the analogue of O ( n ) for complex spaces.

  • Dimension: The dimension of U ( n ) is n 2 . To see this, note that a general complex matrix requires 2 n 2 real parameters. The matrix A A is Hermitian, since ( A A ) = A A . The diagonal entries of a Hermitian matrix must be real, so they correspond to n real parameters, and there are k = 1 n - 1 k = n (