Comagnitude 1
Posted by Tom Leinster
In this post and the next, I want to try out a new idea and see where it leads. It goes back to where magnitude began, which was the desire to unify elementary counting formulas like the inclusion-exclusion principle and the simple formula for the number of orbits in a free action of a group on a finite set.
To prepare the ground for comagnitude, I need to present magnitude itself in a slightly different way from usual. I won’t assume you know anything about magnitude, but if you do, watch out for something new: a connection between magnitude and entropy (ordinary, relative and conditional) that I don’t think has quite been articulated before.
The inclusion-exclusion formula tells us the cardinality of a union, which in categorical terms is a pushout. The set of orbits of a free action of a group $G$ on a finite set is the colimit of the associated functor $G \to \mathbf{FinSet}$. (Here $G$ is regarded as the one-object category whose maps are the elements of $G$.) Generally, we’re going to think about the cardinality of the colimit of a functor $X \colon \mathbf{A} \to \mathbf{FinSet}$, where $\mathbf{A}$ is a finite category.
In both the examples I just mentioned, the cardinality of the colimit is a $\mathbb{Q}$-linear combination of the cardinalities of the individual sets $X(a)$, taken over all objects $a$. For inclusion-exclusion, we take
$$\mathbf{A} = (b \leftarrow a \to c)$$
and get
$$|\operatorname{colim} X| = |X(b)| + |X(c)| - |X(a)|.$$
For a free action of a group $G$, we take $\mathbf{A} = G$. If the single object of $G$ is called $a$, then
$$|\operatorname{colim} X| = \frac{1}{|G|} |X(a)|$$
— the cardinality of the set being acted on, divided by the order of $G$.
Generally, there’s no hope of computing the cardinality of the colimit of a functor $X \colon \mathbf{A} \to \mathbf{FinSet}$ in terms of the cardinalities of the sets $X(a)$ alone. The colimit usually depends on what the functor does to morphisms, as well as objects. So in order to pull this off, we’re going to have to confine ourselves to only those functors that are especially convenient. The hope is that for any $\mathbf{A}$, there are rational numbers $w_a$ ($a \in \mathbf{A}$) such that for all “convenient” functors $X \colon \mathbf{A} \to \mathbf{FinSet}$,
$$|\operatorname{colim} X| = \sum_{a \in \mathbf{A}} w_a |X(a)|.$$
Here’s how to simultaneously figure out what the coefficients in this weighted sum must be and what “convenient” should mean.
The starting thought is that most notions of “nice enough” for set-valued functors include the representables. Since the colimit of any representable $\mathbf{A}(b, -)$ is the one-element set, requiring the last equation to hold for all representables means that
$$\sum_{a \in \mathbf{A}} w_a |\mathbf{A}(b, a)| = 1$$
for all $b \in \mathbf{A}$.
And now we’re getting somewhere! This is a system of equations with the same number of equations as unknowns $w_a$, namely, the number of objects of $\mathbf{A}$. And in this situation, there’s typically a unique solution, at least if we work over the field $\mathbb{Q}$.
A family $(w_a)_{a \in \mathbf{A}}$ of rational numbers satisfying
$$\sum_{a \in \mathbf{A}} w_a |\mathbf{A}(b, a)| = 1$$
for all $b \in \mathbf{A}$ is called a weighting on $\mathbf{A}$, and $w_a$ is called the weight of $a$. For the purposes of this post, I’ll assume there’s always exactly one weighting on $\mathbf{A}$. That’s not entirely justified: there are examples of finite categories with no weighting, and others with more than one. (See Examples 1.11 of my paper The Euler characteristic of a category, where all the stuff I’m talking about right now is worked out.) But it’s safe enough.
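To make the weighting equations concrete, here’s a small Python sketch (the `weighting` function and the encoding of $\mathbf{A}$ as a matrix of hom-set cardinalities are mine, not from the post). It solves the system $\sum_a w_a |\mathbf{A}(b, a)| = 1$ over the rationals by Gaussian elimination, and recovers the weighting $(-1, 1, 1)$ on the pushout shape.

```python
from fractions import Fraction

def weighting(zeta):
    """Solve sum_a w_a * |A(b, a)| = 1 for all objects b, over the rationals.

    `zeta` is the matrix of hom-set cardinalities, zeta[b][a] = |A(b, a)|.
    Plain Gauss-Jordan elimination; assumes the solution exists and is unique.
    """
    n = len(zeta)
    M = [[Fraction(x) for x in row] + [Fraction(1)] for row in zeta]
    for col in range(n):
        pivot = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[pivot] = M[pivot], M[col]
        M[col] = [x / M[col][col] for x in M[col]]
        for r in range(n):
            if r != col and M[r][col] != 0:
                M[r] = [x - M[r][col] * y for x, y in zip(M[r], M[col])]
    return [row[n] for row in M]

# Pushout shape (b <- a -> c), objects ordered [a, b, c]: the only
# non-identity maps are a -> b and a -> c.
zeta = [
    [1, 1, 1],   # hom-sets out of a
    [0, 1, 0],   # hom-sets out of b
    [0, 0, 1],   # hom-sets out of c
]
print(weighting(zeta))   # weights -1, 1, 1
```

For a group of order $n$ viewed as a one-object category, the same solver on the $1 \times 1$ matrix `[[n]]` returns the single weight $1/n$, matching the orbit-counting example.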
So, take the weighting $(w_a)$ on $\mathbf{A}$. By definition, the class of functors $X$ satisfying
$$|\operatorname{colim} X| = \sum_{a \in \mathbf{A}} w_a |X(a)|$$
contains the representables. But it’s easy to see that it’s also closed under coproducts. So this equation — which I’ll call the colimit formula — holds for all coproducts of representables!
Which functors are coproducts of representables? They’re sometimes called the “familially representable” functors, the idea being that a coproduct of representables is represented by a family $(a_i)_{i \in I}$ of objects. But in my last post, I called them the “free functors”. The way I put it there was that the forgetful functor
$$[\mathbf{A}, \mathbf{Set}] \to \mathbf{Set}^{\operatorname{ob}(\mathbf{A})}$$
has both a left and a right adjoint, and a functor is free if and only if it’s in the image of the left adjoint. That means
$$X \cong \coprod_{a \in \mathbf{A}} S_a \times \mathbf{A}(a, -)$$
for some family $(S_a)_{a \in \mathbf{A}}$ of sets.
Examples
When $\mathbf{A}$ is the discrete two-object category $\{a, b\}$, both weights are $1$, and the general colimit formula
$$|\operatorname{colim} X| = \sum_{a \in \mathbf{A}} w_a |X(a)|$$
reduces to something very basic:
$$|X(a) + X(b)| = |X(a)| + |X(b)|$$
— the cardinality of a coproduct is the sum of the individual cardinalities. When $\mathbf{A}$ is a discrete category, all functors $\mathbf{A} \to \mathbf{Set}$ are coproducts of representables.
When $\mathbf{A} = (b \leftarrow a \to c)$ is the shape for pushouts, a functor $X$ is a diagram
$$X(b) \leftarrow X(a) \to X(c)$$
of finite sets, and is a coproduct of representables if and only if both these maps are injections. Assume $X$ is a coproduct of representables. The weighting on $\mathbf{A}$ is $(w_a, w_b, w_c) = (-1, 1, 1)$, so the colimit formula says that the cardinality of a pushout is
$$|X(b) +_{X(a)} X(c)| = |X(b)| + |X(c)| - |X(a)|$$
— the inclusion-exclusion formula.
For a group $G$ (or more generally, a monoid), the weight of the unique object of $G$ is $1/|G|$, the reciprocal of the order. Given an action of $G$ on a finite set, the corresponding functor $G \to \mathbf{FinSet}$ is a coproduct of representables if and only if the action is free. In that case, the colimit formula says that the number of orbits is the cardinality of the set divided by $|G|$.
Finally, when $\mathbf{A} = (a \rightrightarrows b)$, the colimit formula says that the coequalizer of $X(a) \rightrightarrows X(b)$ has $|X(b)| - |X(a)|$ elements, provided that the two functions from $X(a)$ to $X(b)$ are injective with disjoint images.
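As a sanity check on the pushout example, here’s a quick computation (the encoding is my own: finite sets as Python sets, functions as dicts, pushouts computed by union-find) comparing an actual pushout’s cardinality with the inclusion-exclusion formula.

```python
def pushout_size(X, Y, Z, f, g):
    """Cardinality of the pushout of Y <-f- X -g-> Z in finite sets.

    f and g are dicts X -> Y and X -> Z; the pushout is the disjoint
    union of Y and Z with f(x) glued to g(x) for each x in X,
    computed by union-find.
    """
    parent = {('Y', y): ('Y', y) for y in Y}
    parent.update({('Z', z): ('Z', z) for z in Z})

    def find(u):
        while parent[u] != u:
            parent[u] = parent[parent[u]]
            u = parent[u]
        return u

    for x in X:
        a, b = find(('Y', f[x])), find(('Z', g[x]))
        if a != b:
            parent[a] = b
    return len({find(u) for u in parent})

# Both maps injective: the formula |Y| + |Z| - |X| applies.
X, Y, Z = {0, 1}, {0, 1, 2}, {0, 1, 2, 3}
f, g = {0: 0, 1: 1}, {0: 2, 1: 3}
print(pushout_size(X, Y, Z, f, g), len(Y) + len(Z) - len(X))  # 5 5

# Non-injective maps: the formula can fail.
f2, g2 = {0: 0, 1: 0}, {0: 0, 1: 0}
print(pushout_size(X, {0}, {0}, f2, g2))  # 1, but |Y| + |Z| - |X| = 0
```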
Aside for category theorists Under finiteness conditions, and assuming that $\mathbf{A}$ is Cauchy complete, a functor $X \colon \mathbf{A} \to \mathbf{Set}$ is a coproduct of representables if and only if it is flat with respect to the class of finite connected limits. This might seem more abstract than the original condition, but it actually gives a concrete, testable condition for $X$ to be a coproduct of representables. (See Lemma 5.2 and Definition 3.2 here.) This result enables one to see rather quickly that in all the examples above, the functors arising as coproducts of representables are exactly what I said they are.
Summary so far We’ve seen that a finite category $\mathbf{A}$ typically has a unique weighting — meaning a family $(w_a)_{a \in \mathbf{A}}$ of rational numbers satisfying
$$\sum_{a \in \mathbf{A}} w_a |\mathbf{A}(b, a)| = 1$$
for all $b \in \mathbf{A}$. And we’ve seen that when $X \colon \mathbf{A} \to \mathbf{FinSet}$ is a coproduct of representables, the “colimit formula” holds:
$$|\operatorname{colim} X| = \sum_{a \in \mathbf{A}} w_a |X(a)|.$$
Now, what if $X$ isn’t a coproduct of representables? The colimit formula won’t usually hold. But the right-hand side of this non-equation still calculates something. What is it?
Let me define the magnitude $|X|$ of a functor $X \colon \mathbf{A} \to \mathbf{FinSet}$ by
$$|X| = \sum_{a \in \mathbf{A}} w_a |X(a)|.$$
This is equal to $|\operatorname{colim} X|$ if $X$ is a coproduct of representables, but not in general.
For example, if we take a monoid $M$ acting on a finite set $S$, the magnitude of the corresponding functor is $|S|/|M|$ — the cardinality of the set divided by the order of $M$. Unless the action is free, this isn’t usually the number of orbits, and it might not even be an integer.
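A tiny illustration, with my own choice of example: the two-element group acting on a three-element set with one fixed point has magnitude $3/2$ but two orbits, whereas for a free action the two numbers agree.

```python
from fractions import Fraction

def orbits(S, action):
    """Orbits of a group action on the finite set S.

    `action` lists the permutations (as dicts) given by each group element,
    so {g[x] for g in action} is the full orbit of x.
    """
    remaining, orbs = set(S), []
    while remaining:
        x = remaining.pop()
        orb = {g[x] for g in action}
        remaining -= orb
        orbs.append(orb)
    return orbs

# Non-free: C2 acts on {0, 1, 2} by swapping 0 and 1, fixing 2.
S = {0, 1, 2}
action = [{0: 0, 1: 1, 2: 2}, {0: 1, 1: 0, 2: 2}]
print(Fraction(len(S), len(action)), len(orbits(S, action)))  # 3/2 vs 2 orbits

# Free: C2 acts on {0, 1, 2, 3} by swapping 0<->1 and 2<->3.
T = {0, 1, 2, 3}
free = [{x: x for x in T}, {0: 1, 1: 0, 2: 3, 3: 2}]
print(Fraction(len(T), len(free)), len(orbits(T, free)))  # both 2: they agree
```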
The functor $\Delta 1$ sending everything in $\mathbf{A}$ to the one-element set has magnitude
$$|\Delta 1| = \sum_{a \in \mathbf{A}} w_a,$$
which is called the magnitude or Euler characteristic $|\mathbf{A}|$ of $\mathbf{A}$. (Under finiteness hypotheses, it’s equal to the topological Euler characteristic of the classifying space of $\mathbf{A}$.) Hence the magnitude of a category is a special case of the magnitude of a set-valued functor. Conversely, one can show that the magnitude of a functor is equal to the magnitude of its category of elements.
So, the concepts of magnitude of a category (already studied extensively) and the magnitude of a set-valued functor (a term I think I’m introducing for the first time here) are each special cases of the other. But in these posts, unlike everything I’ve written about magnitude before, I’m going to give prime position to magnitude of set-valued functors rather than of categories.
Does this definition of the magnitude of a set-valued functor have sensible properties? It does!
First, if $X$ and $Y$ are isomorphic then certainly $|X| = |Y|$. Second, and a bit less trivially, magnitude is invariant under equivalence in the following sense: if we compose $X \colon \mathbf{A} \to \mathbf{FinSet}$ with an equivalence of categories $E \colon \mathbf{B} \to \mathbf{A}$ then
$$|X \circ E| = |X|.$$
In fact, this holds just as long as $E$ has a left adjoint.
The term “magnitude” suggests it should behave in a cardinality-like way, and it does that too. For example, given functors $X, Y \colon \mathbf{A} \to \mathbf{FinSet}$,
$$|X + Y| = |X| + |Y|,$$
where the left-hand side is the magnitude of the coproduct of our two functors. A bit more generally, suppose we have a diagram of functors and natural transformations
$$Y \leftarrow X \to Z$$
such that for each $a \in \mathbf{A}$, the functions
$$X(a) \to Y(a), \qquad X(a) \to Z(a)$$
are injective with disjoint images (or more abstractly, for each $a$, the corresponding pushout-shaped diagram of finite sets is a coproduct of representables). Then one can show that the magnitude of the pushout is given by
$$|Y +_X Z| = |Y| + |Z| - |X|.$$
So magnitude in this sense obeys the inclusion-exclusion formula. More generally still, there’s a similar result for any shape of colimit, but I won’t write it out here. It’s exactly what the pushout case suggests.
I hope I’ve driven home the message that although the colimit formula
$$|\operatorname{colim} X| = \sum_{a \in \mathbf{A}} w_a |X(a)|$$
for functors $X \colon \mathbf{A} \to \mathbf{FinSet}$ holds when $X$ is a coproduct of representables, it doesn’t hold for arbitrary $X$. So perhaps it’s interesting to see how much the two sides of the equation differ when $X$ isn’t a coproduct of representables. In other words, let’s look at the difference
$$|\operatorname{colim} X| - \sum_{a \in \mathbf{A}} w_a |X(a)| = |\operatorname{colim} X| - |X|.$$
It seems to me that this quantity could be interesting. However, I don’t have a strong feel for what it means yet, so for now I’ll give it a bland name: the discrepancy of $X$.
Example The discrepancy of a coequalizer
$$X(a) \overset{f}{\underset{g}{\rightrightarrows}} X(b) \to \operatorname{coeq}(f, g)$$
is
$$|\operatorname{coeq}(f, g)| - |X(b)| + |X(a)|.$$
We’ve already seen that this is $0$ if our two functions are injective with disjoint images. But in general, it looks rather like an Euler characteristic. In fact, from our coequalizer we can form a chain complex of free abelian groups
$$0 \to \mathbb{Z}X(a) \xrightarrow{\ \partial\ } \mathbb{Z}X(b) \to \mathbb{Z}\operatorname{coeq}(f, g) \to 0,$$
where $\mathbb{Z}S$ means the free abelian group on a set $S$, the map $\partial$ is induced by the difference $g - f$ of the original functions $f, g \colon X(a) \to X(b)$, and the last map is induced by the quotient function $X(b) \to \operatorname{coeq}(f, g)$. Then the discrepancy of our coequalizer is equal to the Euler characteristic of this complex.
This complex is exact except maybe at $\mathbb{Z}X(a)$. Let’s do a little check: if $f$ and $g$ are injective with disjoint images then $\partial$ is injective too, so the complex is exact everywhere, which implies that the Euler characteristic is $0$. Since the discrepancy is also $0$ in this case, we’ve just confirmed one case of the result that the discrepancy of the coequalizer is the Euler characteristic of the resulting complex.
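Here’s a small computational check of the coequalizer discrepancy (the encoding is mine, in the same style as before): it vanishes for injective maps with disjoint images, and for a pair of equal non-injective maps — where the boundary map $g - f$ is zero, so the Euler characteristic is $|X(a)| - |X(b)| + |\operatorname{coeq}|$ — the two quantities again agree.

```python
def coequalizer_size(Xa, Xb, f, g):
    """|coeq(f, g)| for f, g : Xa -> Xb given as dicts, via union-find on Xb."""
    parent = {y: y for y in Xb}

    def find(y):
        while parent[y] != y:
            parent[y] = parent[parent[y]]
            y = parent[y]
        return y

    for x in Xa:
        a, b = find(f[x]), find(g[x])
        if a != b:
            parent[a] = b
    return len({find(y) for y in Xb})

def discrepancy(Xa, Xb, f, g):
    """|coeq(f, g)| - |X(b)| + |X(a)|, the discrepancy of the coequalizer."""
    return coequalizer_size(Xa, Xb, f, g) - len(Xb) + len(Xa)

# Injective with disjoint images: discrepancy 0.
print(discrepancy({0, 1}, {0, 1, 2, 3, 4}, {0: 0, 1: 1}, {0: 2, 1: 3}))  # 0

# f = g (non-injective): coeq is all of X(b), and the discrepancy matches
# the Euler characteristic 2 - 3 + 3 = 2 of the chain complex.
print(discrepancy({0, 1}, {0, 1, 2}, {0: 0, 1: 0}, {0: 0, 1: 0}))  # 2
```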
Example For a pushout square
$$\begin{array}{ccc} X(a) & \longrightarrow & X(c) \\ \downarrow & & \downarrow \\ X(b) & \longrightarrow & P \end{array}$$
the discrepancy is
$$|P| - |X(b)| - |X(c)| + |X(a)|.$$
I don’t know what to do with this formula, but it has a certain symmetry. Has anyone come across it before?
I’ll finish by explaining the entropy connection I promised at the start. Here we go!
Ordinary Shannon entropy For this, we need to think about the Shannon entropy not just of probability measures — the usual setting — but of arbitrary measures. The sets we’re considering measures on will always be finite.
A probability measure $p$ on a finite set $\{1, \ldots, n\}$ is just an $n$-tuple $(p_1, \ldots, p_n)$ of nonnegative reals summing to $1$, and its entropy is defined by
$$H(p) = - \sum_i p_i \log p_i,$$
with the convention that $0 \log 0 = 0$. A measure $m$ on $\{1, \ldots, n\}$ is any $n$-tuple $(m_1, \ldots, m_n)$ of nonnegative reals, and the definition of entropy is extended from probability measures to arbitrary measures by homogeneity. That is, writing $M = \sum_i m_i$, the tuple $m/M$ is a probability measure, and we define
$$H(m) = M \cdot H(m/M)$$
(or $H(m) = 0$ if $M = 0$). This is the unique way of extending the definition so that
$$H(\lambda m) = \lambda H(m)$$
for all measures $m$ and all real $\lambda \geq 0$.
Let’s now consider measures $m = (m_1, \ldots, m_n)$ where each $m_i$ is an integer. Such a thing amounts to a map
$$f \colon B \to A$$
of finite sets. To see this, take $A = \{1, \ldots, n\}$, and take $f$ to be a function into $A$ whose $i$-fibre $f^{-1}(i)$ has cardinality $m_i$.
This categorification manoeuvre — upgrading from numbers to sets — brings maps into play. In particular, we get the monoid $\operatorname{End}(B)$ of all functions $B \to B$, and its submonoid $\operatorname{End}_A(B)$ consisting of the endomorphisms of $B$ over $A$. That is, the elements of $\operatorname{End}_A(B)$ are the functions $s \colon B \to B$ such that $f \circ s = f$.
The inclusion of monoids
$$\operatorname{End}_A(B) \hookrightarrow \operatorname{End}(B),$$
like any other monoid homomorphism, induces an action of the domain on the underlying set of the codomain: applying an element $s \in \operatorname{End}_A(B)$ to an element $t \in \operatorname{End}(B)$ gives a new element $s \circ t$. And this monoid action corresponds to a functor
$$\operatorname{End}_A(B) \to \mathbf{FinSet}.$$
(Or “$U(\operatorname{End}(B))$” for the underlying set if you want to be fussy, but I’ll drop the $U$s.)
Question: what’s the magnitude of this functor?
Answer: it’s
$$\frac{|\operatorname{End}(B)|}{|\operatorname{End}_A(B)|} = \frac{|B|^{|B|}}{\prod_i m_i^{m_i}}.$$
This is nothing but $e^{H(m)}$, the exponential of the entropy of our measure! So:
The exponential of entropy is a special case of magnitude.
(Sometimes it seems that the exponential of entropy is a more fundamental quantity than the entropy itself. Apart from anything else, it doesn’t depend on a choice of base. I like to call the exponential of entropy the diversity.)
At least, that’s true when the measure of each point is an integer. But from knowing the entropy of such measures, we can easily obtain the entropy of those where the measure of each point is rational, using the homogeneity property $H(\lambda m) = \lambda H(m)$. And then one can get the completely general case of real measures by extending by continuity.
The basic calculation above was described in an earlier post of mine, but at that time I didn’t see clearly what it meant.
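The count can be checked by brute force for a small measure (the example and helper names are mine): enumerate all endomorphisms of $B$, keep those over $A$, and compare the ratio of the two counts with $e^{H(m)}$.

```python
from itertools import product
from math import exp, log

def entropy(m):
    """Entropy of a measure m = (m_1, ..., m_n), extended homogeneously
    (natural logarithm, with the convention 0 log 0 = 0)."""
    M = sum(m)
    return sum(x * log(M / x) for x in m if x > 0)

# The measure m = (2, 1, 1) as a map f : B -> A with fibre sizes 2, 1, 1.
m = (2, 1, 1)
B = range(sum(m))               # |B| = 4
f = {0: 0, 1: 0, 2: 1, 3: 2}    # fibres {0, 1}, {2}, {3}

endos = list(product(B, repeat=len(B)))                         # all maps B -> B
over_A = [s for s in endos if all(f[s[b]] == f[b] for b in B)]  # f∘s = f

magnitude = len(endos) / len(over_A)   # |End(B)| / |End_A(B)| = 256/4 = 64
print(magnitude, exp(entropy(m)))      # both 64, up to floating point
```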
Relative entropy Given two probability measures $\pi$ and $\rho$ on the same finite set, you can calculate their relative entropy. It’s a measure of how surprised you’d be to observe a distribution of $\pi$ in a sample drawn from a population with distribution $\rho$. (I think of $\rho$ as the “reference” distribution.) The formula for $H(\pi \| \rho)$, the entropy of $\pi$ relative to $\rho$, is
$$H(\pi \| \rho) = \sum_i \pi_i \log \frac{\pi_i}{\rho_i}.$$
Just as for ordinary entropy, we can extend it from probability measures to arbitrary measures. We’re now dealing with two measures $\pi$ and $\rho$ on the same finite set, and I’ll assume they have the same total mass: $\sum_i \pi_i = \sum_i \rho_i$. Again, we extend homogeneously, so that
$$H(\lambda \pi \,\|\, \lambda \rho) = \lambda H(\pi \| \rho)$$
whenever $\lambda \geq 0$.
And as before, we’ll focus on measures taking integer values. A pair of measures on a finite set $A$, both taking integer values and with the same total mass, amounts to a diagram
$$A \xleftarrow{\ f\ } B \xrightarrow{\ g\ } A$$
in $\mathbf{FinSet}$. The idea is that if we write $\pi_i$ and $\rho_i$ for the cardinalities of the $i$-fibres of these two maps, then one of our measures is $\pi = (\pi_i)$ and the other is $\rho = (\rho_i)$. They have the same total mass because
$$\sum_i \pi_i = |B| = \sum_i \rho_i.$$
Calling our two maps $f, g \colon B \to A$, we can consider the set
$$\{ t \colon B \to B \mid g \circ t = f \}$$
of functions $t$ such that $g \circ t = f$. This set is acted on by the monoid
$$\{ s \colon B \to B \mid f \circ s = f \}$$
of functions $s$ such that $f \circ s = f$, by composition.
As we keep seeing, a monoid action corresponds to a functor from that monoid to $\mathbf{FinSet}$, whose magnitude we can compute. In this case, the magnitude is
$$\frac{|\{ t \mid g \circ t = f \}|}{|\{ s \mid f \circ s = f \}|} = \frac{\prod_i \rho_i^{\pi_i}}{\prod_i \pi_i^{\pi_i}},$$
which is equal to
$$\prod_i \left( \frac{\rho_i}{\pi_i} \right)^{\pi_i} = e^{-H(\pi \| \rho)}$$
— the negative exponential of relative entropy.
(We’re never going to be able to get rid of that negative and get $e^{H(\pi \| \rho)}$ itself as a magnitude, since $H(\pi \| \rho)$ can be infinite but the magnitude of a functor can’t be, at least in the context at hand.)
As for ordinary entropy, if we know relative entropy for integer-valued measures then it’s only two short steps to the definition for arbitrary measures.
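The same kind of brute-force check works here (my own small example): count the functions $t$ with $g \circ t = f$, count the monoid elements $s$ with $f \circ s = f$, and compare the ratio with $e^{-H(\pi \| \rho)}$.

```python
from itertools import product
from math import exp, log

def rel_entropy(pi, rho):
    """H(pi || rho) for two measures of equal total mass (natural log)."""
    return sum(p * log(p / r) for p, r in zip(pi, rho) if p > 0)

# Two integer measures on A = {0, 1} with total mass 3, as maps f, g : B -> A.
B = range(3)
f = {0: 0, 1: 0, 2: 1}   # fibre sizes: pi  = (2, 1)
g = {0: 0, 1: 1, 2: 1}   # fibre sizes: rho = (1, 2)

maps = list(product(B, repeat=len(B)))
acted_on = [t for t in maps if all(g[t[b]] == f[b] for b in B)]  # g∘t = f
acting = [s for s in maps if all(f[s[b]] == f[b] for b in B)]    # f∘s = f

magnitude = len(acted_on) / len(acting)              # 2/4
print(magnitude, exp(-rel_entropy((2, 1), (1, 2))))  # both 0.5
```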
Conditional entropy Whereas relative entropy takes as its input a pair of measures on the same set, conditional entropy takes as its input two measures on different sets.
It’s usually presented in terms of random variables: we have random variables $U$ and $V$ taking values in sets $A$ and $A'$ respectively (which for us will be finite), and the conditional entropy $H(V \mid U)$ is defined by
$$H(V \mid U) = \sum_{x \in A} \Pr(U = x) \, H(V \mid U = x).$$
If we write $r = (r_{x y})_{x \in A, \, y \in A'}$ for the joint distribution — so that the $r_{x y}$ are nonnegative reals summing to $1$ — then
$$H(V \mid U) = - \sum_{x, y} r_{x y} \log \frac{r_{x y}}{\sum_{y'} r_{x y'}}.$$
Here $r$ is a probability measure on $A \times A'$. But just as for ordinary and relative entropy, this definition extends to arbitrary measures on $A \times A'$, by scaling homogeneously.
To see how conditional entropy is a special case of magnitude, we follow a path that should be familiar by now.
An integer-valued measure on $A \times A'$ amounts to a map $B \to A \times A'$ of finite sets, or equivalently a diagram
$$A \leftarrow B \to A'$$
in $\mathbf{FinSet}$. In the monoid $\operatorname{End}(B)$ of all endomorphisms of $B$, we can consider the endomorphisms over $A$, or over $A'$, or most restrictively of all, over $A \times A'$. In particular, there is an inclusion of monoids
$$\operatorname{End}_{A \times A'}(B) \hookrightarrow \operatorname{End}_A(B).$$
As for ordinary entropy, this homomorphism induces an action of the domain on the underlying set of the codomain, giving a functor $\operatorname{End}_{A \times A'}(B) \to \mathbf{FinSet}$ whose magnitude we can calculate. It’s
$$\frac{|\operatorname{End}_A(B)|}{|\operatorname{End}_{A \times A'}(B)|} = \frac{\prod_x m_x^{m_x}}{\prod_{x, y} r_{x y}^{r_{x y}}},$$
where I’m using $m_x$ to mean the cardinality of the $x$-fibre of $B \to A$ and $r_{x y}$ to mean the cardinality of the $(x, y)$-fibre of $B \to A \times A'$. This is exactly the exponential of the conditional entropy.
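Once more, the claim can be verified by brute force on a small example of my own devising: count the endomorphisms over $A$ and over $A \times A'$, and compare the ratio with the exponential of the (homogeneously extended) conditional entropy.

```python
from itertools import product
from math import exp, log

# An integer measure on A x A' given by a map h : B -> A x A'.
B = range(4)
h = {0: (0, 0), 1: (0, 0), 2: (0, 1), 3: (1, 0)}   # r_00 = 2, r_01 = r_10 = 1
f = {b: h[b][0] for b in B}                        # composite B -> A; m = (3, 1)

maps = list(product(B, repeat=len(B)))
end_A = [s for s in maps if all(f[s[b]] == f[b] for b in B)]    # over A
end_AA = [s for s in maps if all(h[s[b]] == h[b] for b in B)]   # over A x A'

magnitude = len(end_A) / len(end_AA)   # 27/4 = prod m_x^m_x / prod r_xy^r_xy

# Conditional entropy of the second coordinate given the first, extended
# homogeneously: sum over cells of r_xy * log(m_x / r_xy).
r = {(0, 0): 2, (0, 1): 1, (1, 0): 1}
mx = {0: 3, 1: 1}
H_cond = sum(v * log(mx[x] / v) for (x, y), v in r.items())
print(magnitude, exp(H_cond))  # both 6.75, up to floating point
```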
Conclusion The exponentials of
- ordinary entropy
- the negative of relative entropy
- conditional entropy
all arise as special cases of magnitude — at least for integer-valued measures, but it’s routine to derive the definition for general measures from that special case.
If you know lots about entropy, you might be wondering now: what about mutual information? Is that the magnitude of some set-valued functor too?
I don’t know! Given $B \to A \times A'$, the quantity we’d like to obtain as a magnitude is
$$\frac{\prod_x m_x^{m_x} \cdot \prod_y n_y^{n_y}}{|B|^{|B|} \cdot \prod_{x, y} r_{x y}^{r_{x y}}}$$
— the negative exponential of the mutual information, where $n_y$ denotes the cardinality of the $y$-fibre of $B \to A'$.
I can’t see how to do that in any natural way.
Maybe I’ll come back to this point next time, when I’ll start on the topic that’s actually the title of these posts: comagnitude.
Re: Comagnitude 1
This last formula for mutual information you’d like to obtain as a magnitude looks a lot like the exponential of the discrepancy of a pushout square — but you’ve probably noticed that already?