State-Observable Duality (Part 3)
Posted by John Baez
This is the third and final episode of a little story about the foundations of quantum mechanics.
In the first episode, I reminded you of some basic facts about the real numbers , the complex numbers , and the quaternions .
In the second episode, I told you how Jordan, von Neumann and Wigner classified ‘formally real Jordan algebras’, which can serve as algebras of observables in quantum theory. Apart from the ‘spin factors’ and the Jordan algebra of self-adjoint octonionic matrices, , these come in three kinds:
- The algebra of self-adjoint real matrices with the product .
- The algebra of self-adjoint complex matrices with the product .
- The algebra of self-adjoint quaternionic matrices with the product .
In every case, even the curious exceptional cases, there is a concept of what it means for an element to be ‘positive’, and the positive elements form a cone. In this episode we’ll explore that further: we’ll meet the Koecher–Vinberg classification of convex homogeneous self-dual cones, and see how it’s really all about state-observable duality.
Last time we talked about Jordan algebras and their role in the foundations of quantum mechanics. But the formalism of Jordan algebras seems rather removed from the actual practice of physics. After all, physicists hardly ever take two observables and and form their Jordan product . As I hinted last time, it is better to think of this operation as derived from the process of squaring an observable, which is something physicists actually do. But still, I can’t help wondering: can we see the classification of finite-dimensional formally real Jordan algebras, and thus the special role of normed division algebras, as arising from some axioms more closely tied to quantum theory as physicists usually practice it?
One answer involves the duality between states and observables. To understand this, you need to know about ‘mixed states’. A ‘state’ describes your knowledge about a physical system. If you know as much as possible we call it a ‘pure state’, but more generally, you may not know as much as possible, and then we speak of ‘mixed states’. It’s usually best to consider pure states as a special case of mixed states.
How does this play out in ordinary quantum theory? If a quantum system has the Hilbert space , observables are described by self-adjoint complex matrices: elements of the Jordan algebra . But matrices of this form that are nonnegative and have trace 1 also play another role. They are called density matrices, and they describe mixed states of our quantum system. The idea is that any density matrix lets us define expectation values of observables via The map sending observables to their expectation values is real-linear. The fact that is nonnegative is equivalent to and the fact that has trace 1 is equivalent to
We might call this relationship between states and observables ‘state-observable duality’. By this, we are not merely referring to the fact that states live in the dual of the vector space of observables: that much is obvious, given that the expectation value of an observable should depend linearly on that observable. The nontrivial thing is that we can identify the vector space of observables with its dual: using the trace, which puts a real-valued inner product on the space of observables: Thus, states can be identified with certain special observables!
All this generalizes to an arbitrary finite-dimensional formally real Jordan algebra . Every such algebra automatically has an identity element. This lets us define a state on to be a linear functional that is nonnegative: and normalized: But in fact, there is a one-to-one correspondence between linear functionals on and elements of . The reason is that every finite-dimensional Jordan algebra has a trace defined so that is the trace of the linear operator ‘multiplication by ’. Such a Jordan algebra is then formally real if and only if is a real-valued inner product. So, when is a finite-dimensional formally real Jordan algebra, any linear functional can be written as for a unique element . Conversely, every element gives a linear functional by this formula. While not obvious, it is true that the linear functional is nonnegative if and only if in terms of the ordering on . More obviously, is normalized if and only if . So, states can be identified with certain special observables: namely, those observables with and .
In short: whenever the observables in our theory form a finite-dimensional formally real Jordan algebra, we have state-observable duality. But what is the physical meaning of state-observable duality? Why in the world should states correspond to special observables? A state is a way for your system to be; an observable is something you can measure about it. They seem quite different!
Here is one attempt at an answer. Every finite-dimensional formally real Jordan algebra comes equipped with a distinguished observable, the most boring one of all: the identity, . This is nonnegative, so if we normalize it, we get an observable of the special kind that corresponds to a state. This state, say , is just the normalized trace: And this state has a clear physical meaning: it is the state of maximal ignorance! It is the state where we know as little as possible about our system — or more precisely, at least in the case of ordinary complex quantum theory, the state where entropy is maximized.
For example, take , the algebra of observables of a spin- particle. Then the space of states is the so-called Bloch sphere, really a 3-dimensional ball.
The ball is convex, and for a good reason Suppose I flip a coin, don’t show you the result, and tell you “I made the particle’s state be if the coin landed heads up, and if it landed tails up”. Then the mixed state that describes your knowledge is halfway between and . More generally any convex linear combination of mixed states is another mixed state, where the probability is between and . That’s what we mean by saying the ball is convex. Indeed, for any formally real Jordan algebra, the space of states is convex!
So, on the surface of this ball are the pure states: the states where you know as much as possible. For any point on this surface, there’s a state where you know the electron’s spin points that way. At the center of the ball is the state of maximum ignorance. This corresponds to the density matrix In this state, when I ask you about the particle’s spin along any axis, all you can say is that there’s a chance that it’s pointing one way, and a chance of it pointing the other way.
Now, back to the general theory:
acts on its dual : given and a linear functional , we get a new linear functional . This captures the idea, familiar in quantum theory, that observables are also ‘operators’: they act on states. And state-observable duality means we can get any state from the state of complete ignorance by act on it with a suitable observable. After all, any state corresponds to some observable , as follows: So, we can get this state by acting on the state of maximal ignorance, , by the observable : So, we see that the correspondence between states and special observables springs from two causes. First, there is a distinguished state, the state of maximal ignorance. Second, any other state can be obtained from the state of maximal ignorance by acting on it with a suitable observable.
While these ideas raise a host of questions, they also help motivate an important theorem of Koecher and Vinberg. The idea is to axiomatize the situation we we have just described, in a way that does not mention the Jordan product in , but instead emphasizes:
- state-observable duality,
- the fact that ‘positive’ observables, namely those whose observed values are always positive, form a cone.
To find appropriate axioms, suppose is a finite-dimensional formally real Jordan algebra. Then seven facts are always true.
First, the set of positive observables is a cone: that is, implies that every positive multiple of is also in . Second, this cone is convex: if then any linear combination with also lies in . Third, it is an open set. Fourth, it is regular, meaning that if and are both in the closure , then . This last condition may seem obscure, but if we note that we see that being regular simply means a perfectly plausible assumption.
Next recall that has an inner product; this is what lets us identify linear functionals on with elements of . This also lets us define the dual cone which one can check is indeed a cone. The fifth fact about is that it is self-dual, meaning . This formalizes the notion of state-observable duality!
The sixth fact is is also homogeneous: given any two points , there is a real-linear linear transformation mapping to itself in a one-to-one and onto way, with the property that . This says that cone is highly symmetrical: no point of is any ‘better’ than any other, at least if we only consider the linear structure of the space , ignoring the Jordan product and the trace.
From another viewpoint, however, there is a very special point of , namely the identity of our Jordan algebra. And this brings us to our seventh and final fact: the cone is pointed, meaning that it is equipped with a distinguished element (in this case ). As we have seen, this element corresponds to the ‘state of complete ignorance’, at least after we normalize it.
In short: when is a finite-dimensional formally real Jordan algebra, is a pointed homogeneous self-dual regular open convex cone. All the elements are positive observables, but certain special ones, namely those with , can also be viewed as states.
In fact, there is a category of pointed homogeneous self-dual regular open convex cones, where:
- An object is a finite-dimensional real inner product space equipped with a pointed homogeneous self-dual regular open convex cone .
- A morphism from one object, say , to another, say , is a linear map preserving the inner product and mapping into .
Now for the payoff. The work of Koecher and Vinberg, nicely explained in Koecher’s Minnesota notes:
- Max Koecher, The Minnesota Notes on Jordan Algebras and Their Applications, eds. Aloys Krieg and Sebastican Walcher, Lecture Notes in Mathematics 1710, Springer, Berlin, 1999.
shows that:
Theorem: The category of pointed homogeneous self-dual regular open convex cones is equivalent to the category of finite-dimensional formally real Jordan algebras.
This means that the theorem of Jordan, von Neumann and Wigner, which we saw last time, also classifies the pointed homogeneous self-dual regular convex cones!
Theorem: Every pointed homogeneous self-dual regular open convex cones is isomorphic to a direct sum of those on this list:
- the cone of positive elements in ,
- the cone of positive elements in ,
- the cone of positive elements in ,
- the cone of positive elements in ,
- the future lightcone in .
Some of this deserves a bit of explanation. For , an element is positive if and only if the corresponding operator has for all nonzero . A similar trick works for defining positive elements of , but we do not need the details here. We say an element lies in the future lightcone if and . This of course fits in nicely with the idea that the spin factors are Minkowski spacetimes. Finally, there is an obvious notion of direct sum for Euclidean spaces with cones, where the direct sum of and is equipped with the cone
In short: self-adjoint operators on real, complex and quaternionic Hilbert spaces arise fairly naturally as observables starting from a formalism where nonnegative observables form a cone, and we insist on state-observable duality.
There is a well-developed approach to probabilistic theories that works for cones that are neither self-dual nor homogeneous: see for example the work of Howard Barnum and his coauthors. This has already allowed Barnum, Gaebler and Wilce to shed new light on the physical significance of self-duality. But perhaps if we think more about state-observable duality we can better understand its meaning… and thus the appearance of normed division algebras in quantum physics!
Finally, as Urs Schreiber pointed out, it is worth comparing state-observable duality to the ‘state-operator correspondence’. This was made popular in the context of string theory, but it really applies whenever we have a -algebra of observables, say , equipped with a state . Then the Gelfand-Naimark-Segal construction lets us build a Hilbert space on which acts, together with a distinguished unit vector called the ‘vacuum state’. The Hilbert space is built by completing a quotient of , so a dense set of vectors in come from elements of . Thus again, some observables give us states. In particular, the vacuum state comes from the element .
This is reminiscent of how in the Jordan algebra framework, the state of maximal ignorance comes from the element in the Jordan algebra of observables. But there are also some differences: for example, the Gelfand-Naimark-Segal construction requires choosing a state, and it works for infinite-dimensional -algebras, while our construction works for finite-dimensional formally real Jordan algebras, which have a canonical state: the state of maximum ignorance. Presumably both constructions are special cases of something more general.
Re: State-Observable Duality (Part 3)
I’d say if anyone is interested in exploring the state of the art of the classification of state spaces of operator algebras, a good place to start would be