### Octonions and the Standard Model (Part 2)

#### Posted by John Baez

My description of the octonions in Part 1 raised enough issues that I’d like to talk about it a bit more. I’ll show you a prettier formula for octonion multiplication in terms of $\mathbb{C} \oplus \mathbb{C}^3$… and also a very similar-looking formula for it in terms of $\mathbb{R} \oplus \mathbb{R}^7$.

Last time I took the usual construction of the quaternions from $\mathbb{R}^3$ and modified it to construct the octonions from $\mathbb{C}^3$. The key step was replacing the dot product on $\mathbb{R}^3$ by the inner product on $\mathbb{C}^3$, which is conjugate-linear in one slot.

Layra Idarani pointed out some other conventions that make the formula for multiplication more beautiful, so let me explain those here. This stuff deserves to be made beautiful.

First, we’ll make the inner product conjugate-linear in the *second* slot:

$\langle \vec{a} , \vec{b} \rangle = \sum_{i=1}^3 a_i \overline{b}_{i}$

It breaks my heart to do this: physicists do it the other way, so this is the convention preferred only by mathematicians who have never studied quantum mechanics. But life is cruel: sticking with the other convention would force me to do other unpleasant things.

Next, we define the **hermitian cross product** of vectors in $\mathbb{C}^3$ by

$(\vec{a} \overline{\times} \vec{b})_i = \overline{(\vec{a} \times \vec{b})_{i}}$

We saw last time that this operation is invariant under $\mathrm{SU}(3)$, though the usual cross product is not, nor is the operation of taking the conjugate of each component of a vector.

Lastly, we’ll define a funny way of multiplying vectors in $\mathbb{C}^3$ *on the right* by complex numbers:

$\vec{a} a = \overline{a} \vec{a}$

for all $a \in \mathbb{C}, \vec{a} \in \mathbb{C}^3$.

The payoff for all these contortions is a very pretty formula for octonion multiplication. We’ll take elements of $\mathbb{C} \oplus \mathbb{C}^3$ and write them like $a + \vec{a}$, where the ‘scalar part’ $a$ lives in $\mathbb{C}$ and the ‘vector part’ $\vec{a}$ lives in $\mathbb{C}^3$. And these will be our octonions!

**Theorem 2.** *If we define multiplication on $\mathbb{C} \oplus \mathbb{C}^3$ by
$(a + \vec{a})(b + \vec{b}) = a b - \langle \vec{a}, \vec{b}\rangle + a \vec{b} + \vec{a} b + \vec{a} \overline{\times} \vec{b}$
then this space becomes a 8-dimensional normed division algebra over the real numbers, since $1 \in \mathbb{C}$ acts as the multiplicative identity and
$\|(a + \vec{a})(b + \vec{b})\| = \|a + \vec{a}\| \|b + \vec{b}\|$
This normed division algebra is isomorphic to the octonions.*

I’ll prove this at the end of this post. It’s almost exactly like the proof of the theorem last time, but I want everyone to be able to check it.

The idea of making $\mathbb{C}^3$ into a right $\mathbb{C}$-module in a nontrivial way, ‘twisted by conjugation’, may seem weird. Pragmatically speaking we do this to hide the appearance of conjugation in our formula for multiplication. But it’s actually quite natural. Let me explain why.

First of all, note that if we pick any element $i \in \mathbb{O}$ with $i^2 = -1$, we get a copy of $\mathbb{C}$ sitting in $\mathbb{O}$ as a subalgebra. $\mathbb{O}$ then becomes a left $\mathbb{C}$-module by left multiplication. This is not as obvious as it sounds, since it’s saying that

$a(b x) = (a b)x$

for all $a, b \in \mathbb{C} \subseteq \mathbb{O}$ and $x \in \mathbb{O}$. The octonions are not associative! But they are **alternative** — the subalgebra generated by any two elements is associative — and in the above equation $a$ and $b$ are in the subalgebra generated by $i$, while $x$ is in the subalgebra generated by $x$. So the desired equation holds!

We can similarly make $\mathbb{O}$ into a right $\mathbb{C}$-bimodule by right multiplication. Even better, $\mathbb{O}$ is then a $\mathbb{C},\mathbb{C}$-bimodule. The reason is that

$(a x)b = a(x b)$

for all $a, b \in \mathbb{C} \subseteq \mathbb{O}$ and $x \in \mathbb{O}$, again because the octonions are alternative.

The left and right actions of $\mathbb{C}$ on the octonions are really different:

$a x \ne x a$

for most $a \in \mathbb{C}$, $x \in \mathbb{O}$, because the octonions are noncommutative. But we can say more than just this negative statement. There’s an inner product on the octonions that’s invariant under all automorphisms, so we can pick an invariant complement $V$ to $\mathbb{C} \subseteq \mathbb{O}$. We then have

$a v = v \overline{a}$

for $a \in \mathbb{C}$, $v \in V$, because orthogonal square roots of $-1$ in the octonions anticommute.

$V$ is a $\mathbb{C},\mathbb{C}$-bimodule. As a left $\mathbb{C}$-module it’s isomorphic to $\mathbb{C}^3$ . So, we can use this idea to describe $\mathbb{O}$ as $\mathbb{C} \oplus \mathbb{C}^3$, and it then becomes very natural to have $\mathbb{C}$ act on $\mathbb{C}^3$ in a ‘twisted’ way, obeying $a v = v \overline{a}$. This is exactly what we need to make $\mathbb{C}^3$ isomorphic to $V$ as a $\mathbb{C},\mathbb{C}$-bimodule!

It’s probably good to summarize this:

**Theorem 3.** *Any element $i \in \mathbb{O}$ with $i^2 = -1$ generates a subalgebra of the octonions that can be canonically identified with $\mathbb{C}$. Left and right multiplication by this copy of $\mathbb{C}$ makes $\mathbb{O}$ into a $\mathbb{C},\mathbb{C}$-bimodule. As a $\mathbb{C},\mathbb{C}$-bimodule we then have
$\mathbb{O} \cong \mathbb{C} \oplus \mathbb{C}^3$
where the left action of $\mathbb{C}$ on $\mathbb{C} \oplus \mathbb{C}^3$ is
the usual one, but the right action is given by
$(b , \vec{b})a = (a b, \overline{a}\vec{b})$
for $a,b \in \mathbb{C}, \vec{b} \in \mathbb{C}^3$.*

Finally, one more thing for today — a cute fact I’m not sure what to do with.

We can also take the octonions and split them as $\mathbb{R} \oplus \mathbb{R}^7$ where $\mathbb{R}$ contains $1$ and $\mathbb{R}^7$ contains all the ‘imaginary’ octonions: those orthogonal to $1$. We can thus write any octonion as $a + \vec{a}$ where now $a \in \mathbb{R}$ and $\vec{a} \in \mathbb{R}^7$. And now octonion multiplication looks like this:

$(a + \vec{a})(b + \vec{b}) = a b - \vec{a} \cdot \vec{b} + a \vec{b} + \vec{a} b + \vec{a} \times \vec{b}$

This formula looks almost exactly like the one in the theorem! But now $\vec{a} \cdot \vec{b}$ is defined using the usual dot product on $\mathbb{R}^7$, and $\vec{a} \times \vec{b}$ is defined using a funny cross product in 7 dimensions, which unfortunately is easiest to define *using the octonions*. So, this formula is less useful than the one in terms of $\mathbb{C} \oplus \mathbb{C}^3$ if you’re trying to get off the ground and explain the octonions from scratch.

Oh, and by the way: I wrote $\vec{a} b$ in this formula just to be cute: in this real context it’s just the same as $b \vec{a}$: no sneaky tricks with complex conjugation.

That’s all for today, except proving this:

**Theorem 2.** *If we define multiplication on $\mathbb{C} \oplus \mathbb{C}^3$ by
$(a + \vec{a})(b + \vec{b}) = a b - \langle \vec{a}, \vec{b}\rangle + a \vec{b} + \vec{a} b + \vec{a} \overline{\times} \vec{b}$
then this space becomes a 8-dimensional normed division algebra over the real numbers, since $1 \in \mathbb{C}$ acts as the multiplicative identity and
$\|(a + \vec{a})(b + \vec{b})\| = \|a + \vec{a}\| \|b + \vec{b}\|$
This normed division algebra is isomorphic to the octonions.*

**Proof.** It’s enough to show that we have a normed division algebra, since the octonions are the only 8-dimensional normed division algebra over the reals. We’ll use the norm with

$\|a + \vec{a}\|^2 = |a|^2 + \langle \vec{a} , \vec{a} \rangle$

and show that

$\|(a + \vec{a})(b + \vec{b})\| = \|a + \vec{a}\| \|b + \vec{b}\|$

We start with

$\|(a + \vec{a})(b + \vec{b})\|^2 = \|a b - \langle\vec{a}, \vec{b}\rangle + a \vec{b} + \vec{a} b + \vec{a} \overline{\times} \vec{b} \|^2$

and use the definition of the norm to break this up into two terms:

$|a b - \langle \vec{a}, \vec{b} \rangle|^2 + \| a \vec{b} + \vec{a} b + \vec{a} \overline{\times} \vec{b} \|^2$

We can expand the first term:

$|a b - \langle \vec{a}, \vec{b} \rangle|^2 = |a b|^2 - 2 \mathrm{Re}(a b \langle \vec{b} , \vec{a} \rangle) + |\langle \vec{a}, \vec{b}\rangle|^2$

We can expand the second term:

$\|a \vec{b} + \vec{a} b + \vec{a} \overline{\times} \vec{b} \|^2 =$ $\| a \vec{b} \|^2 + \|\vec{a} b\|^2 + \|\vec{a} \overline{\times} \vec{b}\|^2 + 2 \mathrm{Re} \big(a b \langle \vec{b}, \vec{a} \rangle + \overline{a} \langle \vec{b}, \vec{a} \overline{\times} \vec{b} \rangle + b \langle \vec{a}, \vec{a} \overline{\times} \vec{b} \rangle \big)$

But note that

$\langle \vec{a}, \vec{a} \overline{\times} \vec{b} \rangle = \vec{a} \cdot (\vec{a} \times \vec{b}) = 0$

by a well-known vector identity which works for complex vectors just as for real ones. Similarly $\langle \vec{b}, \vec{a} \overline{\times} \vec{b} \rangle = 0$. So, the expanded second term simplifies, and when we add it to the expanded first term the bits involving $\mathrm{Re}(a b \langle \vec{b} , \vec{a} \rangle)$ cancel out. We’re left with this:

$|a b|^2 + |\langle \vec{a}, \vec{b}\rangle|^2 + \| a \vec{b} \|^2 + \|\vec{a} b\|^2 + \|\vec{a} \overline{\times} \vec{b}\|^2$

We need to show this equals

$\begin{array}{ccl} \|a + \vec{a} \|^2 \| b + \vec{b}\|^2 &=& (|a|^2 + \|\vec{a}\|^2) (|b|^2 + \|\vec{b}\|^2) \\ \\ &=& |a|^2 |b|^2 + |a|^2 \|\vec{b}\|^2 + |b|^2 \|\vec{a}\|^2 + \|\vec{a}\|^2 \|\vec{b}\|^2 \end{array}$

Three terms match, so it’s enough to show

$|\langle \vec{a}, \vec{b}\rangle|^2 + \|\vec{a} \overline{\times} \vec{b}\|^2 = \|\vec{a}\|^2 \|\vec{b}\|^2$

This would be a familiar vector identity if we were working in $\mathbb{R}^3$ with the usual cross product instead of the mutant one. But notice, the mutant cross product is obtained from the ordinary one by componentwise complex conjugation, so

$\|\vec{a} \overline{\times} \vec{b}\|^2 = \|\vec{a} \times \vec{b}\|^2$

Thus, we just need to show

$|\langle \vec{a}, \vec{b}\rangle|^2 + \|\vec{a} \times \vec{b}\|^2 = \|\vec{a}\|^2 \|\vec{b}\|^2$

This works for $\mathbb{C}^3$ and its inner product exactly as it does for $\mathbb{R}^3$ and its dot product: you can either write out both sides using components and see they agree, or do a geometrical argument using the law of cosines and the law of sines. █

- Part 1. How to define octonion multiplication using complex scalars and vectors, much as quaternion multiplication can be defined using real scalars and vectors. This description requires singling out a specific unit imaginary octonion, and it shows that octonion multiplication is invariant under $\mathrm{SU}(3)$.
- Part 2. A more polished way to think about octonion multiplication in terms of complex scalars and vectors, and a similar-looking way to describe it using the cross product in 7 dimensions.
- Part 3. How a lepton and a quark fit together into an octonion — at least if we only consider them as representations of $\mathrm{SU}(3)$, the gauge group of the strong force. Proof that the symmetries of the octonions fixing an imaginary octonion form precisely the group $\mathrm{SU}(3)$.
- Part 4. Introducing the exceptional Jordan algebra $\mathfrak{h}_3(\mathbb{O})$: the $3 \times 3$ self-adjoint octonionic matrices. A result of Dubois-Violette and Todorov: the symmetries of the exceptional Jordan algebra preserving their splitting into complex scalar and vector parts and preserving a copy of the $2 \times 2$ adjoint octonionic matrices form precisely the Standard Model gauge group.
- Part 5. How to think of $2 \times 2$ self-adjoint octonionic matrices as vectors in 10d Minkowski spacetime, and pairs of octonions as left- or right-handed spinors.
- Part 6. The linear transformations of the exceptional Jordan algebra that preserve the determinant form the exceptional Lie group $\mathrm{E}_6$. How to compute this determinant in terms of 10-dimensional spacetime geometry: that is, scalars, vectors and left-handed spinors in 10d Minkowski spacetime.
- Part 7. How to describe the Lie group $\mathrm{E}_6$ using 10-dimensional spacetime geometry.
- Part 8. A geometrical way to see how $\mathrm{E}_6$ is connected to 10d spacetime, based on the octonionic projective plane.
- Part 9. Duality in projective plane geometry, and how it lets us break the Lie group $\mathrm{E}_6$ into the Lorentz group, left-handed and right-handed spinors, and scalars in 10d Minkowski spacetime.

## Re: Octonions and the Standard Model (Part 2)

Typo:

Also, there seems to be one too many hyphens after “alternative”.