Cantor’s Diagonal Argument






Because of its importance to the IFF metastack, we discuss, quote and paraphrase here in some detail Cantor’s diagonal argument as presented in the book Sets for Mathematics (2003) by Lawvere and Rosebrugh. As discussed in that book, in the nineteenth century George Cantor proved an important theorem using a diagonal argument, which implies the result that any set X is smaller than its power set

X < 2X = PX.

(For sets X and Y, the inequality X ≤ Y means that there exists at least one injection from X to Y, and the strict inequality X < Y means XY and that no surjection XY exists.) The two essential ingredients in Cantor’s argument are the diagonal function and a fixed point free map. As Lawvere and Rosebrugh point out,

“This diagonal argument has been traced (by philosophers) back to ancient philosophers who used something like it to mystify people with the Liar’s paradox. Cantor, however, used his method to prove positive results, namely inequalities between cardinalities. The philosopher Bertrand Russell, who was familiar with Cantor’s theorem, applied it to demonstrate the inconsistency of a system of logic proposed by the philosopher Frege; since then philosophers have referred to Cantor’s theorem as Russell’s paradox and have even used their relapse into the ancient paradox habit as a reason for their otherwise unfounded rumor that Cantor’s set theory might be inconsistent. (Combatting this rumor became one of the main preoccupations of the developers of the axiomatized set theories of Zermelo, Fraenkel, von Neumann, and Bernays [P. Suppes. Axiomatic Set Theory. Dover, 1972]. This preoccupation assumed such an importance that the use of such axiom systems for clarifying the role of abstract sets as a guide to mathematical subjects such as geometry, analysis, combinatorial topology, etc., fell into neglect for many years.)”

Definition 1. Let Y be any set. An endofunction τ : YY has an element yY as a fixed point when τ(y) = y. The endofunction τ : YY is fixed point free when no element of Y is a fixed point. The set Y has the fixed point property when no endofunction τ : YY is fixed point free; that is, every endofunction τ : YY has at least one fixed point.

Although there are objects in the category of continuous spaces with the fixed point property (Brouwer: n-dimensional unit ball), most abstract sets do not have the fixed point property. We prove the contrapositive of Cantor’s theorem.

Theorem 1. Suppose there is a set X and a function φ : X×XY whose curry φ̂ : XYX, where φ̂(a) = φ(a,–) for all a X, is surjective; that is, such that for every function f : XY there is at least one element aX such that f = φ̂(a) = φ(a,–). Then Y has the fixed point property.

Proof: Consider any endofunction τ : YY. We must show that τ has a fixed point. Using the diagonal function δX : XX×X, define the function f = δX · φ · τ : XX×XYY. In other words, for all xX, f(x) = τ(φ(x,x)). By the assumption, f = φ(a,–) for some element element a ∈ X; that is, f(x) = φ(a,x) for some element element aX and all elements xX. Hence, τ(φ(x,x)) = φ(a,x) for all xX. In particular, letting x = a, τ(φ(a,a)) = φ(a,a). Hence, φ(a,a) = φ(δX(a)) is a fixed point of τ.

Corollary 1 (Cantor). If a set Y has at least one endofunction τ : YY with no fixed points, then for every set X there is no surjection XYX.

Corollary 2. For any set X,

X < 2X = PX.

Proof: Let 2 be the set of two elements 2 = {0,1}. Logical negation ¬ : 2 → 2, defined by ¬(0) = 1 and ¬(1) = 0, has no fixed point. Hence, there is no surjection X → 2X = PX. But, the singleton map {–} : X → 2X = PX is injective.

Corollary 3. There cannot exist a “universal set” U for which every set X is a subset XU.

Proof: If so, then the inclusion XU is an injection. Hence, the exponent map 2U → 2X is a surjection. Define X = 2U to get a contradiction.

Corollary 4. The collection set of all sets is not a set.

Proof: If set were a set, then U = set = {X | X ∈ set }, the union of all sets, would be a well-defined set. But, U would clearly be a “universal set”. Hence, the assumption that set is a set leads to a contradiction.

Let us take inventory of the assumptions on sets that were needed to prove this. We have assumed that sets have: (basic) singleton maps, diagonal maps, local unions, the two-element (truth-value) set 2, logical negation; (finite limits) terminal elements, finite products; and (exponents) exponent sets, curry operators, exponent functions. What is the meaning of “local union”. A local union operator at level n maps a level n set of level n sets to a level n set (consisting of all elements in at least one of the component sets), and hence is a function : Psetnsetn, where Psetn = {Xsetn | X ∈ setn}. A local union is equivalent to an “indexed union”, which takes an level n indexed collection of level n sets {Xisetn | i ∈ Isetn} to a level n set. These notions are stronger than the ordinary “powerset union” operators : PPXPX for level n sets X ∈ setn.