Universe Enlargement
Posted by Mike Shulman
When dealing with categories of more than one size—that is, belonging to more than one set-theoretic universe—we are sometimes faced with the need to replace some given category by a version of “the same” category defined at a different size level. Usually, it is fairly obvious how to do this, but for theoretical reasons one would like a general construction that always works. One very nice way to perform such an enlargement is described in sections 3.11–3.12 of Kelly’s book, using Day convolution. But he doesn’t mention that in many cases, this general construction actually gives the same result as the naive one.
So what do I mean by “the naive construction”? The easiest case is the category of small sets, i.e. sets belonging to some universe . If is a larger universe with , then the obvious enlargement of to is the category of sets belonging to .
(In the context of two universes , I like to speak of sets in as “small,” sets in as “large,” and sets not necessarily in as “very large.” Thus is the large category of small sets, and is the very large category of large sets. In the context of a third universe with , I would say “very large” for sets in and “extremely large” for sets not necessarily in .)
Similarly, we can replace the large category of small topological spaces by the very large category of large topological spaces, the large category of small groups by the very large category of large groups, and so on. In general, if is a large category whose objects are the “models of some theory” in , then its obvious enlargement is the very large category of models of the same theory in . Note that the theory in question could be “algebraic,” as in the case of groups, but it could also be a “higher-order” theory such as is required for topological spaces—all that matters is that we know how to talk about models of the theory in an elementary topos such as or .
That approach covers pretty much any situation arising in practice, but as I said, for theoretical reasons it would be nice to have a construction that works on any large category. A very nice way to do this uses the Yoneda embedding, as described in Kelly’s book: for any large category , the very large category contains as a full subcategory. Furthermore, is -bicomplete, i.e. it has all large limits and colimits, whether or not was -complete or cocomplete, and the embedding preserves all limits that exist in .
However, it preserves hardly any colimits (since it is a free cocompletion, after all). If we want an enlargement of which preserves some class of colimits in , then we can restrict to the full subcategory of consisting of the presheaves which preserve the colimits (i.e. take them to limits in ). Let’s denote this category by . (Suggestions for better notation are welcome.) Since representable functors preserve all limits, the Yoneda embedding of factors through , and all the colimits in are preserved by this restricted embedding.
Moreover, is closed under limits in , so it is itself -complete. And as long as the (diagrams sizes of the) colimits in are at most large, then (by theorems of Day etc.) is reflective in , and hence also -cocomplete. (Usually, the only colimits we might care about in a large category are small, hence a fortiori also large. Occasionally one encounters large limits in a large category, such as in some forms of the adjoint functor theorem, but I don’t think I’ve ever seen any use of a very large limit in a large category.)
Now what does this have to do with the naive version? Suppose that is locally presentable relative to . Then there is a small cardinal and a small category with -small colimits such that is equivalent to the category of -limit–preserving functors from to . Evidently, the “naive” enlargement of is then , which is -locally-presentable, and we have a full embedding , which preserves limits and colimits since they are calculated in the same way in both categories.
Now every object of is a small colimit of objects of , and the objects of are all -presentable in and in . Thus, every object of is -presentable in , where is the size of the universe . Conversely, since is locally -presentable (relative to ), any -presentable object in it is a -small colimit of -presentable objects—so since is closed under small colimits in , it must consist exactly of the -presentable objects. However, since is locally -presentable, it is also locally -presentable, so this implies that . But is precisely , where is the class of all small colimts in .
Thus, for locally presentable categories, the general universe-enlargement agrees with the naive one: e.g. we have , , and so on. And importantly, if is the topos of small sheaves on a small site, then is the category of large sheaves on the same site—which can also be identified with the category of large sheaves on itself with its canonical topology.
For non-locally-presentable categories, however, the two enlargements are generally different. This is most obvious if is not -complete or cocomplete, in which case its naive enlargement will almost surely not be -complete or cocomplete, whereas is always both. But the two completions can also differ when is complete and cocomplete; for instance, . Of course, is not locally presentable. I would like to be able to say that locally presentable is “the weakest hypothesis” one can hope for under which the two enlargements agree. For if is -cocomplete, the generic enlargement is always -locally-presentable, being the category of -limit-preserving -valued presheaves on a -small category with -small limits (namely, ) for some -small cardinal (namely, the cardinality of ). But if is not -locally-presentable, one would be surprised if its naive enlargement were -locally presentable.
It can happen, however. Suppose that is -cocomplete and has a -small dense subcategory, but is not -locally-presentable. The existence of such a is equivalent to saying that is not a Vopenka cardinal. Then if the cocompleteness and small-dense-subcategory of are for reasons which “enlarge” to imply that its naive enlargement is -cocomplete and has a -small dense subcategory, but is a Vopenka cardinal, then the naive enlargement of will be -locally-presentable. I don’t know whether in such a case, the naive enlargement can end up being equivalent to the general construction!
In any case, though, this seems contrived enough not to be particularly interesting in practice. So it still feels to me as though local presentability is the only reasonable hypothesis under which to expect this equivalence. In other words, another reason locally presentable categories are nice is that they have a unique and well-behaved enlargement to any bigger universe.
Re: Universe Enlargement
I like your compromise on the notation for the enlargement - the ‘thick up arrow’ does give a more intuitive meaning once explained than my own ‘ordinary up arrow’ version I proposed at the nForum. Also the subscript is good.