Octonions and the Standard Model (Part 13)
Posted by John Baez
When Lee and Yang suggested that the laws of physics might not be invariant under spatial reflection — that there’s a fundamental difference between left and right — Pauli was skeptical. In a letter to Victor Weisskopf in January 1957, he wrote:
“Ich glaube aber nicht, daß der Herrgott ein schwacher Linkshänder ist.”
(I do not believe that the Lord is a weak left-hander.)
But just two days after Pauli wrote this letter, Chien-Shiung Wu’s experiment confirmed that Lee and Yang were correct. There’s an inherent asymmetry in nature.
We can trace this back to how the ‘left-handed’ fermions and antifermions live in a different representation of the Standard Model gauge group than the right-handed ones. And when we try to build grand unified theories that take this into account, we run into the fact that while we can fit the Standard Model gauge group into in various ways, not all these ways produce the required asymmetry. There’s a way where it fits into , which does not work… and alas, this one has a nice octonionic description!
To keep things simple I’ll explain this by focusing, not on the whole Standard Model gauge group, but its subgroup . Here is a theorem proved by Will Sawin in response to a question of mine on MathOverflow:
Theorem 10. There are exactly two conjugacy classes of subgroups of that are isomorphic to . One of them has a representative that is a subgroup of , while the other does not.
I’ll describe representatives of these two subgroups; then I’ll say a bit about how they show up in physics, and then I’ll show you Sawin’s proof.
We can get both subgroups in a unified way! There’s always an inclusion
and taking double covers of each group we get a 2-1 homomorphism
In particular we have
so composing with the exceptional isomorphisms:
we get a 2-1 homomorphism
Now, there are three obvious ways to include in . There is an obvious inclusion
but there are three obvious inclusions
namely the left one:
the right one:
and the diagonal one:
Combining these with our earlier maps, we actually get a one-to-one map from to . So we get three subgroups of , all isomorphic to :
- There’s the left subgroup , which is the image of this composite homomorphism:
- There’s the diagonal subgroup , which is the image of this:
- And there’s the right subgroup , which is the image of this:
The left and right subgroups are actually conjugate, but the diagonal one is truly different! We’ll prove this by taking a certain representation of , called the Weyl spinor representation, and restricting it to those two subgroups. We’ll get inequivalent representations of . This proves the two subgroups aren’t conjugate.
This argument is also interesting for physics. When restrict to the left subgroup, we get a representation of that matches what we actually see for one generation of fermions! This is the basis of the so-called grand unified theory, which should really be called the grand unified theory.
(In fact this works not only for but for the whole Standard Model gauge group, which is larger. I’m focusing on just because it makes the story simpler.)
When we restrict the Weyl spinor representation to the diagonal subgroup, we get a representation of that is not physically correct. Unfortunately, it’s the diagonal subgroup that shows up in several papers connecting the Standard Model gauge group to the octonions. I plan to say a lot more about this later.
The left subgroup
Let’s look at the left subgroup , the image of this composite:
has a 32-dimensional unitary representation called the ‘Dirac spinor’ representation. This representation is really on the exterior algebra . It’s the direct sum of two irreducible parts, the even grades and the odd grades:
Physicists call these two irreducible representations ‘right- and left-handed Weyl spinors’, and denote them as and since they’re 16-dimensional and one is the dual of the other.
Let’s restrict the to the left subgroup and see what we get.
To do this, first we can restrict the along and get
Here is the trivial representation of , is the tautologous representation of , and is the tautologous rep of .
Then let’s finish the job by restricting this representation along . Restricting the of to gives : the sum of the tautologous representation of and the trivial representation. Restricting to the left copy of gives the tautologous representation , while restricting to this left copy gives : the sum of two copies of the trivial representation. All in all, we get this representation of :
This is what we actually see for one generation of left-handed fermions and antifermions in the Standard Model! The representation describes how the left-handed fermions in one generation transform under : 3 colors of quark and one ‘white’ lepton. The representation does the same for the left-handed antifermions. The left-handed fermions form an isospin doublet, giving us the , while the left-handed antifermions have no isospin, giving us the .
This strange lopsidedness is a fundamental feature of the Standard Model.
The right subgroup would work the same way, up to switching the words ‘left-handed’ and ‘right-handed’. And by Theorem 10, the left and right subgroups must be conjugate in , because now we’ll see one that’s not conjugate to either of these.
The diagonal subgroup
Consider the diagonal subgroup , the image of this composite:
Let’s restrict the to .
To do this, first let’s restrict the along and get
as before. Then let’s restrict this representation along . The part works as before, but what happens when we restrict or along the diagonal map ? We get . So, this is the representation of that we get:
This is not good for the Standard Model. It describes a more symmetrical universe than ours, where both left-handed fermions and antifermions transform as doublets under .
The fact that we got a different answer this time proves that and are not conjugate in . So to complete the proof of Theorem 10, we only need to prove
Every subgroup of isomorphic to is conjugate to or .
is conjugate to a subgroup of , but is not.
I’ll prove 2, and then I’ll turn you over to Will Sawin to do the rest.
Why the diagonal subgroup fits in
Every rotation of extends to a rotation of that leaves the last coordinate fixed, so we get an inclusion , which lifts to an inclusion of the double covers, . Since we have exceptional isomorphisms
it’s natural to ask how the inclusion looks in these terms. And the answer is: it’s the diagonal map! In other words, we have a commutative diagram
Now, we can easily fit this into a larger commutative diagram involving some natural maps and :
We can simplify this diagram using the isomorphism :
and then we can use our friend the inclusion :
This shows that the diagonal subgroup of is actually a subgroup of !
Why the left subgroup does not fit in
The three-fold way is a coarse classification of irreducible representations of compact Lie group. Every such representation is of one and only one of these three kinds:
1) not self-dual: not isomorphic to its dual,
2a) Self-dual and orthogonal: isomorphic to its dual via an invariant nondegenerate symmetric bilinear form, also called an orthogonal structure,
2b) Self-dual and symplectic: isomorphic to its dual via an invariant nondegenerate antisymmetric bilinear form, also called a symplectic structure.
I’ve written about how these three cases are related to the division algebras and , respectively:
- John Baez, Division algebras and quantum theory.
But we don’t need most of this here. We just need to know one fact: when is odd, every irreducible representation of , and thus every representation of this Lie group, is self-dual! In particular this is true of .
Why does this matter? Assume the left subgroup is a subgroup of . When we restrict the Weyl spinor representation of to it will be self-dual, like every representation of . Then when we restrict this representation further to it must still be self-dual, since the restriction of a self-dual representation is clearly self-dual.
However, we know this representation is
and this is not self-dual, since and but .
So, it must be that is not a subgroup of .
Proof of Theorem 10
To complete the proof of Theorem 10 we just need to see why there are just two conjugacy classes of subgroups of isomorphic to . But in fact Will Sawin proved a stronger result! He was answering this question of mine:
Define the Standard Model gauge group to be , the subgroup of consisting of block diagonal matrices with a block and then a block. (This is isomorphic to the quotient of by the subgroup of elements ) where is a 6th root of unity.)
Up to conjugacy, how many subgroups isomorphic to the Standard Model gauge group does have?
This question is relevant to grand unified theories of particle physics, as explained here:
- John C. Baez and John Huerta, The algebra of grand unified theories, Bull. Amer. Math. Soc. 47 (2010), 483-552.
This article focuses on one particular copy of in , given as follows. By definition we have an inclusion , and we also have an inclusion because for any we have an inclusion , and is simply connected so this gives a homomorphism .
However I think there is also an inclusion , studied by Krasnov:
- Kirill Krasnov, SO(9) characterisation of the Standard Model gauge group.
Composing this with , this should give another inclusion , and I believe this one is ‘truly different from’ — i.e., not conjugate to — the first one I mentioned.
So I believe my current answer to my question is “at least two”. But that’s not good enough.
Sawin’s answer relies on the 3-fold way. we embed in , we are automatically giving this group an orthogonal 10-dimensional representation, thanks to the map . We can classify the possibilities.
He writes:
There are infinitely many embeddings. However, all but one of them is “essentially the same as” the one you studied as they become equal to the one you studied on restriction to . The remaining one is the one studied by Krasnov.
I follow the strategy suggested by Kenta Suzuki.
has irreducible representations of dimensions , and higher dimensions. The -dimensional ones are dual to each other, as are the -dimensional ones, so they can’t appear. The -dimensional ones are dual to each other and can only appear together. So the only -dimensional self-dual representations of decompose as irreducibles as , , or ten s. All of these are orthogonal because the 8-dimensional representation is orthogonal. However, the ten s cannot appear because then would act trivially.
A representation of is a sum of tensor products of irreducible representations of and irreducible representations of . Restricted to , each tensor product splits into a sum of copies of the same irreducible representation. So can only act nontrivially when the same representation appears multiple times. Since the is two different -dimensional representation, only the -dimensional representation can occur twice. Thus, our 10-dimensional orthogonal representation of necessarily splits as either the -dimensional adjoint repsentation of plus a -dimensional orthogonal representation of or the -dimensional sum of standard and conjugate [i.e., dual] representations of plus a -dimensional orthogonal representation of . However, has a unique nontrivial representation of dimension and it isn’t orthgonal, so only the second case can appear. has representations of dimension of which the and -dimensional ones are symplectic and so must appear with even multiplicity in any orthogonal representation, so the only nontrivial -dimensional orthogonal ones are or .
So there are two ten-dimensional orthogonal representations of that are nontrivial on both factors, those being the sum of two different -dimensional irreducible representations of with either two copies of the two-dimensional irreducible representation of or the three-dimensional and the one-dimensional irreducible representation of . The orthogonal structure is unique up to isomorphisms, so these give two conjugacy classes of homomorphisms and thus two conjugacy classes of homomorphisms . The first one corrresponds to the embedding you studied while only the second one restricts to so indeed these are different.
To understand how to extend these to , I consider the centralizer of the representation within . Since the group is connected, this is the same as the centralizer of its Lie algebra, which is therefore the inverse image of the centralizer in . Now there is a distinction between the two examples because the example with irrep dimensions has centralizer with identity component while the example with irrep dimensions has centralizer with identity component . In the second case, the image of must be the image of times the centralizer of the image of , so this gives a unique example, which must be the one considered by Krasnov.
In the first case, we can restrict attention to a torus in . The center of maps to a one-dimensional subgroup of this torus, which can be described by a pair of integers. Explicitly, given a two-by-two-unitary matrix and a three-by-three unitary matrix with , we can map to by sending to where , and then map from to . This lifts to the spin group if and only if the determinant in is a perfect square. The determinant is so a lift exists if and only if is even.
The only possible kernel of this embedding is the scalars. The scalar maps to and so the kernel is trivial if and only if .
However, there are infinitely many integer solutions to with even (in fact, a random and even works with probability ), so this gives infinitely many examples.
- Part 1. How to define octonion multiplication using complex scalars and vectors, much as quaternion multiplication can be defined using real scalars and vectors. This description requires singling out a specific unit imaginary octonion, and it shows that octonion multiplication is invariant under .
- Part 2. A more polished way to think about octonion multiplication in terms of complex scalars and vectors, and a similar-looking way to describe it using the cross product in 7 dimensions.
- Part 3. How a lepton and a quark fit together into an octonion — at least if we only consider them as representations of , the gauge group of the strong force. Proof that the symmetries of the octonions fixing an imaginary octonion form precisely the group .
- Part 4. Introducing the exceptional Jordan algebra : the self-adjoint octonionic matrices. A result of Dubois-Violette and Todorov: the symmetries of the exceptional Jordan algebra preserving their splitting into complex scalar and vector parts and preserving a copy of the adjoint octonionic matrices form precisely the Standard Model gauge group.
- Part 5. How to think of self-adjoint octonionic matrices as vectors in 10d Minkowski spacetime, and pairs of octonions as left- or right-handed spinors.
- Part 6. The linear transformations of the exceptional Jordan algebra that preserve the determinant form the exceptional Lie group . How to compute this determinant in terms of 10-dimensional spacetime geometry: that is, scalars, vectors and left-handed spinors in 10d Minkowski spacetime.
- Part 7. How to describe the Lie group using 10-dimensional spacetime geometry. This group is built from the double cover of the Lorentz group, left-handed and right-handed spinors, and scalars in 10d Minkowski spacetime.
- Part 8. A geometrical way to see how is connected to 10d spacetime, based on the octonionic projective plane.
- Part 9. Duality in projective plane geometry, and how it lets us break the Lie group into the Lorentz group, left-handed and right-handed spinors, and scalars in 10d Minkowski spacetime.
- Part 10. Jordan algebras, their symmetry groups, their invariant structures — and how they connect quantum mechanics, special relativity and projective geometry.
- Part 11. Particle physics on the spacetime given by the exceptional Jordan algebra: a summary of work with Greg Egan and John Huerta.
- Part 12. The bioctonionic projective plane and its connections to algebra, geometry and physics.
- Part 13. Two ways to embed in , and their consequences for particle physics.

Re: Octonions and the Standard Model (Part 13)
Hi, John,
Should this “division algebras ℂ,ℂ and ℍ, respectively” be ” division algebras ℝ, ℂ and ℍ, respectively”.
David