MODULE 4:   Limits of Computation
Uncountable Sets

In this chapter, we would like to remind you the concepts of countable and uncountable sets, as well as the general techniques involved in countability and uncountability proofs. Even though it may seem like we are diverging from the main discussion on computation, we’ll see in future chapters that the concepts in this chapter are intimately related to the concepts of computability and uncomputability. Countable and uncountable sets, together with the diagonalization proof technique, have major applications in proving the limits of computation.

1
Comparison of Sets
Definition (Injection, surjection, and bijection)

Let XX and YY be two (possibly infinite) sets.

  • A function f:XYf: X \to Y is called injective if for any x,xXx,x' \in X such that xxx \neq x', we have f(x)f(x)f(x) \neq f(x'). We write XYX \hookrightarrow Y if there exists an injective function from XX to YY.

  • A function f:XYf: X \to Y is called surjective if for all yYy \in Y, there exists an xXx \in X such that f(x)=yf(x) = y. We write XYX \twoheadrightarrow Y if there exists a surjective function from XX to YY.

  • A function f:XYf: X \to Y is called bijective (or one-to-one correspondence) if it is both injective and surjective. We write XYX \leftrightarrow Y if there exists a bijective function from XX to YY.

Theorem (Relationships between different types of functions)

Let X,YX, Y and ZZ be three (possibly infinite) sets. Then,

  • XYX \hookrightarrow Y if and only if YXY \twoheadrightarrow X;

  • if XYX \hookrightarrow Y and YZY \hookrightarrow Z, then XZX \hookrightarrow Z;

  • XYX \leftrightarrow Y if and only if XYX \hookrightarrow Y and YXY \hookrightarrow X.

Definition (Comparison of cardinality of sets)

Let XX and YY be two sets.

  • We write X=Y|X| = |Y| if XYX \leftrightarrow Y.

  • We write XY|X| \leq |Y| (or YX|Y| \geq |X|) if XYX \hookrightarrow Y, or equivalently, if YXY \twoheadrightarrow X.

  • We write X<Y|X| < |Y| (or Y>X|Y| > |X|) if it is not the case that XY|X| \geq |Y|.

Remark (Sanity checks for comparing cardinality of sets)

Theorem (Relationships between different types of functions) justifies the use of the notation ==, \leq, \geq, << and >>. The properties we would expect to hold for this type of notation indeed do hold. For example,

  • X=Y|X| = |Y| if and only if XY|X| \leq |Y| and YX|Y| \leq |X|,

  • if XYZ|X| \leq |Y| \leq |Z|, then XZ|X| \leq |Z|,

  • if XY<Z|X| \leq |Y| < |Z|, then X<Z|X| < |Z|, and so on.

Note (Set inclusion vs cardinality)

If XYX \subseteq Y, then XY|X| \leq |Y| since the identity function that maps xXx \in X to xYx \in Y is an injection.

Proposition (S=N|\mathbb S| = |\mathbb N|)

Let S={0,1,4,9,}\mathbb S= \{0, 1, 4, 9,\ldots\} be the set of squares. Then S=N|\mathbb S| = |\mathbb N|.

Proof

The function f:NSf : \mathbb N\to \mathbb S defined as f(n)=n2f(n) = n^2 is a bijection. It is an injection since if n2=m2n^2 = m^2, then n=mn = m. And it is surjective since for every sSs \in \mathbb S, by the definition of S\mathbb S, there exists an nn such that n2=sn^2 = s.

Proposition (Z=N|\mathbb Z| = |\mathbb N|)

Let Z\mathbb Z be the set of integers. Then Z=N|\mathbb Z| = |\mathbb N|.

Proof

Since NZ\mathbb N\subseteq \mathbb Z, we have NZ|\mathbb N| \leq |\mathbb Z|. So we just need to argue ZN|\mathbb Z| \leq |\mathbb N|, i.e. there is a surjective function ff from N\mathbb N to Z\mathbb Z. For this, our strategy is to argue that we can list the elements of Z\mathbb Z such that every element eventually appears in the list. If we can do this, then we have the surjection ff that we want: we let f(n)f(n) be the nn’th element in the list. Creating such a listing of Z\mathbb Z is relatively straightforward: 0,  1,  1,  2,  2,  3,  3,  0, \; 1, \; -1, \; 2, \; -2, \; 3, \; -3, \; \ldots With this listing, the mapping ff is such that odd numbers are mapped to positive integers and even numbers are mapped to non-positive integers. The formula for ff is f(n)={n/2 if n is even,(n+1)/2 if n is odd.f(n) = \begin{cases} -n/2 & \text{ if $n$ is even,} \\ (n+1)/2 & \text{ if $n$ is odd.} \end{cases}

Proposition (Z×Z=N|\mathbb Z\times \mathbb Z| = |\mathbb N|)

Let Z×Z\mathbb Z\times \mathbb Z be the set of tuples with integer coordinates. Then Z×Z=N|\mathbb Z\times \mathbb Z| = |\mathbb N|.

Proof

We will show Z×Z=N|\mathbb Z\times \mathbb Z| = |\mathbb N| by arguing NZ×Z|\mathbb N| \leq |\mathbb Z\times \mathbb Z| and Z×ZN.|\mathbb Z\times \mathbb Z| \leq |\mathbb N|.

The former doesn’t really require much of an argument. The map n(n,0)n \mapsto (n,0) is an injection and so NZ×Z.|\mathbb N| \leq |\mathbb Z\times \mathbb Z|.

For the latter, we will show that there is a surjective function ff from N\mathbb N to Z×Z\mathbb Z\times \mathbb Z, and we will do so by arguing that we can list the elements of Z×Z\mathbb Z\times \mathbb Z such that every element eventually appears in the list. If we can do this, then we have the surjection ff that we want: we let f(n)f(n) be the nn’th element in the list.

We now describe how to list the elements of Z×Z\mathbb Z\times \mathbb Z. Consider the plot of Z×Z\mathbb Z\times \mathbb Z on a 2-dimensional grid. Starting at (0,0)(0,0) we list the elements of Z×Z\mathbb Z\times \mathbb Z using a spiral shape, as shown below.

image

(The picture shows only a small part of the spiral.) Since we have a way to list all the elements such that every element eventually appears in the list, we are done.

(Side node: It is not a requirement that we give an explicit formula for f(i)f(i). In fact, sometimes in such proofs, an explicit formula may not exist. This does not make the proof any less rigorous! The above proof is perfectly acceptable.)

Proposition (Q=N|\mathbb Q| = |\mathbb N|)

Let Q\mathbb Q be the set of rational numbers. Then Q=N|\mathbb Q| = |\mathbb N|.

Proof

We want to show Q=N|\mathbb Q| = |\mathbb N|, and it is clear that NQ|\mathbb N| \leq |\mathbb Q| (since NQ\mathbb N\subseteq \mathbb Q), so we just need to show QN|\mathbb Q| \leq |\mathbb N|. We will make use of the previous proposition to establish this.

Note that every element of Q\mathbb Q can be written as a fraction a/ba/b where a,bZa, b \in \mathbb Z. In other words, there is a surjection from Z×Z\mathbb Z\times \mathbb Z to Q\mathbb Q that maps (a,b)(a,b) to a/ba/b (if b=0b = 0, map (a,b)(a,b) to say 00). This shows that QZ×Z|\mathbb Q| \leq |\mathbb Z\times \mathbb Z|. From the previous proposition, Z×Z=N|\mathbb Z\times \mathbb Z| = |\mathbb N|. Putting things together, QN|\mathbb Q| \leq |\mathbb N|.

Proposition (Σ=N|\Sigma^*| = |\mathbb N|)

Let Σ\Sigma be a finite non-empty set. Then Σ=N|\Sigma^*| = |\mathbb N|.

Proof

Recall that Σ\Sigma^* denotes the set of all words/strings over the alphabet Σ\Sigma with finitely many symbols. We want to show NΣ|\mathbb N| \leq |\Sigma^*| and ΣN|\Sigma^*| \leq |\mathbb N|.

To show NΣ|\mathbb N| \leq |\Sigma^*|, note that the function that maps a string to its length, sss \mapsto |s|, is a surjection from Σ\Sigma^* to N\mathbb N. (Remark: When Σ=1|\Sigma| = 1, this function is a bijection.)

We will now show ΣN|\Sigma^*| \leq |\mathbb N| by presenting a way to list all the elements of Σ\Sigma^* such that eventually all the elements appear in the list. (As before, if we can present this listing, then we have a surjection f:NΣf : \mathbb N\to \Sigma^*: we let f(n)f(n) be the nn’th element in the list.)

For each k=0,1,2,k = 0,1,2,\ldots, let Σk\Sigma^k denote the set of words in Σ\Sigma^* that have length exactly kk. Note that Σk\Sigma^k is a finite set for each kk, and Σ\Sigma^* is a union of these sets: Σ=Σ0Σ1Σ2\Sigma^* = \Sigma^0 \cup \Sigma^1 \cup \Sigma^2 \cup \cdots. This gives us a way to list the elements of Σ\Sigma^* so that any element of Σ\Sigma^* eventually appears in the list. First list the elements of Σ0\Sigma^0, then list the elements of Σ1\Sigma^1, then list the elements of Σ2\Sigma^2, and so on.

Theorem (No cardinality strictly between finite and N|\mathbb N|)

If SS is an infinite set and SN|S| \leq |\mathbb N|, then S=N|S| = |\mathbb N|.

Exercise (Proof of no cardinality strictly between finite and N|\mathbb N|)

Prove the above theorem.

Solution

Assume SS is such that SN|S| \leq |\mathbb N| and SS is infinite. Since SN|S| \leq |\mathbb N|, there is an injection f:SNf : S \to \mathbb N. This ff allows us to define an ordering on SS. For s,tSs, t \in S, write s<ts < t if f(s)<f(t)f(s) < f(t). Using this ordering, we can define the order of an element sSs \in S as ord(s)={xS:f(x)<f(s)}\text{ord}(s) = |\{x \in S : f(x) < f(s)\}|.

image

We now observe that the ord\text{ord} function that we have defined is a bijection from SS to N\mathbb N. To see that this is a bijection, first note that it is a surjective function because SS is an infinite set. And it is an injective function because it is a total order. In particular, if ord(s)=ord(s)\text{ord}(s) = \text{ord}(s'), then it must be f(s)=f(s)f(s) = f(s'), and since ff is injective, this implies s=ss = s'.

Proposition (P=N|\mathbb{P}| = |\mathbb N|)

Let P={2,3,5,7,}\mathbb{P} = \{2, 3, 5, 7, \ldots\} be the set of prime numbers. Then P=N|\mathbb{P}| = |\mathbb N|.

Proof

Since PN\mathbb{P} \subseteq \mathbb N, we know PN|\mathbb{P}| \leq |\mathbb N|. We also know that there are infinitely many primes. Then by Theorem (No cardinality strictly between finite and N|\mathbb N|), P=N|\mathbb{P}| = |\mathbb N|.

(One can also prove this proposition without invoking the theorem. The function that maps nn to the nn’th prime number is a bijection.)

2
Countable Sets

In the last section, we have shown N=S=Z=P=Z×Z=Q=Σ.|\mathbb N| = |\mathbb S| = |\mathbb Z| = |\mathbb{P}| = |\mathbb Z\times \mathbb Z| = |\mathbb Q| = |\Sigma^*|. In light of Theorem (No cardinality strictly between finite and N|\mathbb N|), there are three categories of sets:

  1. finite sets,

  2. sets SS such that S=X|S| = |X|, where XX is any of the sets listed above (like N\mathbb N),

  3. all other sets (i.e. sets SS with S>X|S| > |X|, where XX is any of the sets listed above).

It makes sense to give a name for these different categories. In the literature, the first two categories combined is known as countable sets. And the third category is known as uncountable sets.

We can define a countable set as any set SS with SX|S| \leq |X|, where X=NX = \mathbb N. But the choice of N\mathbb N here, in a sense, is arbitrary. We can also choose, for instance, X=ΣX = \Sigma^*, in which case we see countability is equivalent to encodability. And arguably, encodability captures the essence of this class of sets better. That being said, in the literature, X=NX = \mathbb N is the standard choice, leading to the definition below.

Definition (Countable and uncountable sets)
  • A set SS is called countable if SN|S| \leq |\mathbb N|.

  • A set SS is called countably infinite if it is countable and infinite.

  • A set SS is called uncountable if it is not countable, i.e. S>N|S| > |\mathbb N|.

Note (Only two options for countable sets)

Theorem (No cardinality strictly between finite and N|\mathbb N|) implies that if SS is countable, there are two options: either SS is finite, or S=N|S| = |\mathbb N|.

Important Note (Countability is equivalent to encodability)

Recall that a set SS is encodable if for some alphabet Σ\Sigma, there is an injection from SS to Σ\Sigma^*, or equivalently, SΣ|S| \leq |\Sigma^*| (Definition (Encoding elements of a set)). Since N=Σ|\mathbb N| = |\Sigma^*|, we see that a set is countable if and only if it is encodable. Therefore, one can show that a set is countable by showing that it is encodable. We will call this the “CS method” for showing countability.

The standard definition of countability (SN|S| \leq |\mathbb N|) highlights the following heuristic.

  • If you can list the elements of SS in a way that every element appears in the list eventually, then SS is countable.

The encodability definition highlights another heuristic that is far more relevant in computer science.

  • If you can “write down” each element of SS using a finite number of symbols, then SS is countable.

Proposition (The set of polynomials with rational coefficients is countable)

The set of all polynomials in one variable with rational coefficients is countable.

Proof

Let Q[x]\mathbb Q[x] denote the set of all polynomials in one variable with rational coefficients. We want to show that Q[x]\mathbb Q[x] is countable. We will do so using the CS method. Let Σ={0,1,2,3,4,5,6,7,8,9,+,-,/,x}.\Sigma = \{{\texttt{0}},{\texttt{1}},{\texttt{2}},{\texttt{3}},{\texttt{4}},{\texttt{5}},{\texttt{6}},{\texttt{7}},{\texttt{8}},{\texttt{9}},{\texttt{+}},{\texttt{-}},{\texttt{/}},{\texttt{x}}\}. Then observe that every element of Q[x]\mathbb Q[x] can be written as a finite string over this alphabet. For example, 2x3-1/34x2+99/100x1+22/7{\texttt{2x3-1/34x2+99/100x1+22/7}} represents the polynomial 2x31/34x2+99/100x+22/7.2x^3 - 1/34x^2 + 99/100x + 22/7. This implies that there is a surjective map from Σ\Sigma^* to Q[x]\mathbb Q[x]. And therefore Q[x]Σ|\mathbb Q[x]| \leq |\Sigma^*|. Since Σ\Sigma^* is countable, i.e. ΣN|\Sigma^*| \leq |\mathbb N|, Q[x]\mathbb Q[x] is also countable.

Exercise (Practice with countability proofs)

Show that the following sets are countable.

  1. Z×Z×Z\mathbb Z\times \mathbb Z\times \mathbb Z.

  2. The set of all functions f:SNf: S \to \mathbb N, where SS is some fixed finite set.

Solution

Part 1: We want to show that Z×Z×Z\mathbb Z\times \mathbb Z\times \mathbb Z is countable. We use the CS method with Σ={0,1,2,3,4,5,6,7,8,9,-,$}\Sigma = \{{\texttt{0}},{\texttt{1}},{\texttt{2}},{\texttt{3}},{\texttt{4}},{\texttt{5}},{\texttt{6}},{\texttt{7}},{\texttt{8}},{\texttt{9}},{\texttt{-}},{\texttt{\$}}\}. Note that any element of Z×Z×Z\mathbb Z\times \mathbb Z\times \mathbb Z can be written uniquely as a finite word over Σ\Sigma (we use the dollar sign as a separator between the integers). As an illustration, (9234851,-1234,0)Z×Z×Z(9234851,\text{-}1234,0) \in \mathbb Z\times \mathbb Z\times \mathbb Z can be encoded as the string 9234851$-1234$0{\texttt{9234851\$-1234\$0}}. Every integer has finite length, so the string encoding is always of finite length.

Part 2: Let UU be the set of all functions f:SNf: S \to \mathbb N, where SS is a finite set. We want to show that UU is countable.

We first make an observation about the elements of UU. Take a function f:SNf: S \to \mathbb N, where SS is a finite set. Let kk be the size of SS and let s1,s2,,sks_1, s_2, \ldots, s_k be its elements. Then ff can be uniquely represented by the tuple (f(s1),f(s2),,f(sk)),(f(s_1), f(s_2), \ldots, f(s_k)), where each element of the tuple is an element from N\mathbb N.

We now show that UU is countable using the CS method with the alphabet Σ={0,1,2,3,4,5,6,7,8,9,$}\Sigma = \{{\texttt{0}},{\texttt{1}},{\texttt{2}},{\texttt{3}},{\texttt{4}},{\texttt{5}},{\texttt{6}},{\texttt{7}},{\texttt{8}},{\texttt{9}},{\texttt{\$}}\}. The observation above shows that any element of UU can be uniquely represented with a finite length string where commas are replaced with ${\texttt{\$}}. (Note that there is no need to put the opening and closing parentheses.) This suffices to conclude that UU is countable.

3
Uncountable Sets
Lemma (Diagonalization)

Let F\mathcal{F} be a set of functions f:XYf: X \to Y where Y2|Y| \geq 2. If XF|X| \geq |\mathcal{F}|, we can construct a function fD:XYf_D : X \to Y such that fD∉Ff_D \not \in \mathcal{F}.

Proof

Given a set F\mathcal{F} of functions f:XYf: X \to Y, we want to construct a function fD:XYf_D : X \to Y that is different from every fFf \in \mathcal{F}. The main idea is the following. For each fFf \in \mathcal{F}, pick a unique input xXx \in X and define fD(x)f_D(x) in a way such that it is different from f(x)f(x). Since by construction fDf_D and ff disagree on input xx, fDf_D is different from ff. And since we do this for every fFf \in \mathcal{F}, fDf_D is different from all fFf \in \mathcal{F}.

Above, it is important that we pick a unique xx for each fFf \in \mathcal{F} so that fDf_D can be defined in a consistent way. The ability to pick a unique xx for each fFf \in \mathcal{F} is equivalent to XF|X| \geq |\mathcal{F}|.

A bit more formally, since XF|X| \geq |\mathcal{F}|, there is an injection ϕ:FX\phi : \mathcal{F}\to X. Let xf=ϕ(f)x_f = \phi(f). So fff \neq f' implies xfxfx_f \neq x_{f'}. Define fDf_D such that for all fFf \in \mathcal{F}, fD(xf)f(xf)f_D(x_f) \neq f(x_f) (this is where the assumption Y2|Y| \geq 2 is used). This ensures that fDf_D is different from every fFf \in \mathcal{F}, and therefore fD∉Ff_D \not \in \mathcal{F}. (Note that our description of fDf_D leaves it underspecified, but see the remark below.)

Remark (There are many choices for the diagonal element)

When we apply the above lemma to construct an explicit fD∉Ff_D \not\in \mathcal{F}, we call that diagonalizing against the set F\mathcal{F}. And we call fDf_D a diagonal element. Typically there are several choices for the definition of fDf_D:

  • Different injections ϕ:FX\phi : \mathcal{F}\to X can lead to different diagonal elements.

  • If Y>2|Y| > 2, we have more than one choice on what value we assign to fD(xf)f_D(x_f) that makes fD(xf)f(xf)f_D(x_f) \neq f(x_f) (here xfx_f denotes ϕ(f)\phi(f)).

  • If there are elements xXx \in X not in the range of ϕ\phi, then we can define fD(x)f_D(x) any way we like.

Note (Functions in disguise)

Many mathematical objects can be easily viewed as functions. The following are some examples.

  • A set SUS \subseteq U can be viewed as a function fS:U{0,1}f_S: U \to \{0,1\}. This is called the characteristic function of the set. Recall that we made this observation in Important Note (Correspondence between decision problems and languages).

  • A sequence ss of length kk with elements from a set YY can be viewed as a function fs:{1,2,,k}Yf_s : \{1,2,\ldots,k\} \to Y, where fs(i)f_s(i) is the ii’th element of the sequence.

  • A string/word is a sequence of symbols from an alphabet Σ\Sigma. Therefore a string sΣs \in \Sigma^* with s=k|s| = k can be viewed as a function fs:{1,2,,k}Σf_s : \{1,2,\ldots,k\} \to \Sigma.

  • An infinite-length sequence ss with elements from YY can be viewed as a function fs:N+Yf_s : \mathbb N^+ \to Y.

  • Numbers can be viewed as functions since the binary representation of a number is just a string/sequence of bits (possibly infinite-length).

Important Note (Diagonalization gives an explicit construction)

Note that diagonalizing against a set F\mathcal{F} produces an explicit function fDf_D not in F\mathcal{F}. Therefore, in situations where you wish to find an explicit object that is not in a given set, you should consider if diagonalization could be applied.

Corollary (Direct corollary of diagonalization)

If F\mathcal{F} is the set of all functions f:XYf: X \to Y (and Y2|Y| \geq 2), then X<F|X| < |\mathcal{F}|.

Proof

Diagonalization tells us that whenever XF|X| \geq |\mathcal{F}|, we can construct a function fDf_D not in F\mathcal{F}. But if F\mathcal{F} denotes the set of all functions f:XYf: X \to Y, the construction of fDf_D is not possible. Therefore it must be the case that X<F|X| < |\mathcal{F}|.

Theorem (Cantor’s Theorem)

For any set XX, X<(X)|X| < |\wp(X)|.

Proof

Let F\mathcal{F} be the set of all functions f:X{0,1}f : X \to \{0,1\}, which are the characteristic functions of the subsets of XX (so f(x)=1f(x) = 1 if and only if xXx \in X). Then by Note (Functions in disguise), F=(X)|\mathcal{F}| = |\wp(X)| and the result follows from the above Corollary.

Remark (Russell’s Paradox as diagonalization)

In the famous Russell’s Paradox, we consider the set of all sets that do not contain themselves. That is, we consider D={set S:S∉S}.D = \{\text{set } S : S \not \in S\}. Then we ask whether DD is in DD or not. If DDD \in D, then by the definition of DD, D∉DD \not \in D. And if D∉DD \not \in D, again by the definition of DD, DDD \in D. Either way we get a contradiction. Therefore, the conclusion is that such a set DD should not be mathematically definable.

This paradox is also an application of diagonalization. For any set XX, we cannot diagonalize against the set of all functions f:X{0,1}f : X \to \{0,1\}. If we imagine diagonalizing against this set, where XX is the set of all sets, the natural diagonal element fDf_D that comes out of diagonalization is precisely the characteristic function of the set DD defined above.

Corollary (The power set of an infinite set is uncountable)

For any infinite set SS, (S)\wp(S) is uncountable. In particular, (N)\wp(\mathbb N) is uncountable.

Proof

Let’s break this up into two cases: (i) SS is countably infinite and (ii) SS is uncountable.

If SS is countably infinite, S=N|S| = |\mathbb N|. And by Cantor’s Theorem, N=S<(S)|\mathbb N| = |S| < |\wp(S)|, which by definition means (S)\wp(S) is uncountable.

If SS is uncountable, then N<S<(S)|\mathbb N| < |S| < |\wp(S)|, and so once again, (S)\wp(S) is uncountable.

Note (Diagonalization with countable sets)

In a common scenario where diagonalization is applied, both F\mathcal{F} and XX are countably infinite sets. So we can list the elements of F\mathcal{F} as f1,  f2,  f3,  f_1,\; f_2,\; f_3,\; \ldots as well as the elements of XX as x1,  x2,  x3,  x_1,\; x_2,\; x_3,\; \ldots Then for all ii, define fD(xi)f_D(x_i) such that fD(xi)fi(xi)f_D(x_i) \neq f_i(x_i). If Y={0,1}Y = \{0,1\}, for example, fD(xi)=not  fi(xi)f_D(x_i) = \textbf{not} \; f_i(x_i). The construction of fDf_D can be nicely visualized with a table, as shown below. Here, an entry corresponding to row fif_i and column xjx_j contains the value fi(xj)f_i(x_j).

image

By construction, the diagonal element fDf_D differs from every fif_i, i{1,2,3,}i \in \{1,2,3,\ldots\}. In particular, it differs from fif_i with respect to the input xix_i.

Below, we will apply Lemma (Diagonalization) to construct an irrational number (i.e. a number in RQ\mathbb R\setminus \mathbb Q). Even though the result may not be interesting (since there is a relatively simple proof that 2\sqrt{2} is irrational), it does illustrate the use of diagonalization nicely. Cantor used the same technique to construct an explicit transcendental number, which was a quite non-trivial and important result at the time. Instead of proving that result, we put it in the practice set for you.

Proposition (Irrational numbers exist)

There exists an irrational real number, that is, there exists a number in RQ\mathbb R\setminus \mathbb Q.

Proof

We will prove the existence of a number in RQ\mathbb R\setminus \mathbb Q (i.e. the existence of an irrational number) by constructing such a number using diagonalization. In order to simplify the proof, we will restrict our attention to numbers in the interval [0,1][0,1], so we will construct an irrational number in [0,1][0,1]. The restriction to [0,1][0,1] allows us to represent a number just by considering the fractional part.

As mentioned in Note (Functions in disguise), numbers can be viewed as functions: Every function f:N+{0,1}f : \mathbb N^+ \to \{0,1\} represents a real number in [0,1][0,1] in binary, namely 0.f(1)f(2)f(3)0.f(1)f(2)f(3)\ldots. Two different functions ff and ff' may represent the same real number, e.g. 0.100000.10000\ldots and 0.011110.01111\ldots represent the same real number. But we don’t have more than two different functions representing the same number.

Now consider the set F\mathcal{F} of all functions f:N+{0,1}f : \mathbb N^+ \to \{0,1\} representing a rational number. We will diagonalize against F\mathcal{F} to construct fDf_D not in F\mathcal{F}. This fDf_D then represents a real number that is not rational, and we are done.

In order to diagonalize against F\mathcal{F}, all we need is that N+F|\mathbb N^+| \geq |\mathcal{F}| holds. And this is indeed the case. F\mathcal{F} is countably infinite because Q\mathbb Q is countably infinite. And obviously N+\mathbb N^+ is countably infinite. This means we are in the situation outlined in Note (Diagonalization with countable sets), and N+=F|\mathbb N^+| = |\mathcal{F}|.

image

Exercise (R\mathbb R is uncountable)

Prove that R\mathbb R is uncountable.

Solution

The proof is basically the same as the proof of Proposition (Irrational numbers exist). There, we have implicitly proved that the interval [0,1][0,1] cannot be countable. To see this, note that if [0,1][0,1] is countable, then the set F\mathcal{F} of all functions f:N+{0,1}f : \mathbb N^+ \to \{0,1\} is countable. So F=N+|\mathcal{F}| = |\mathbb N^+|, which allows us to diagonalize against F\mathcal{F}, which is not possible.

Definition (Σ\Sigma^\infty)

Let Σ\Sigma be some finite alphabet. We denote by Σ\Sigma^\infty the set of all infinite length words over the alphabet Σ\Sigma.

Remark

Observe that ΣΣ=\Sigma^* \cap \Sigma^\infty = \varnothing.

Theorem ({0,1}\{{\texttt{0}},{\texttt{1}}\}^\infty is uncountable)

The set {0,1}\{{\texttt{0}},{\texttt{1}}\}^\infty is uncountable.

Proof

Using the observation in Note (Functions in disguise), an infinite-length string s{0,1}s \in \{{\texttt{0}},{\texttt{1}}\}^\infty corresponds to a function fs:N+{0,1}f_s : \mathbb N^+ \to \{{\texttt{0}}, {\texttt{1}}\}, which is the characteristic function of a set SN+S \subseteq \mathbb N^+ (where nSn \in S if and only if fs(n)=1f_s(n) = {\texttt{1}}). Therefore the set of all infinite-length strings, {0,1}\{{\texttt{0}},{\texttt{1}}\}^\infty, corresponds to the set of all subsets of N+\mathbb N^+, (N+)\wp(\mathbb N^+). That is, {0,1}(N+)\{{\texttt{0}},{\texttt{1}}\}^\infty \leftrightarrow\wp(\mathbb N^+). Using Corollary (The power set of an infinite set is uncountable) we can conclude {0,1}\{{\texttt{0}},{\texttt{1}}\}^\infty is uncountable.

Exercise (Uncountable sets are closed under supersets)

Prove that if XX is uncountable and XYX \subseteq Y, then YY is also uncountable.

Solution

We want to show that if YY is a superset of an uncountable set XX, then YY must be uncountable.

If XX is uncountable, by definition, X>N|X| > |\mathbb N|. If XYX \subseteq Y, then there is a clear injection from XX to YY (map xXx \in X to xYx \in Y), so XY|X| \leq |Y|. Combining this with X>N|X| > |\mathbb N|, we have YX>N|Y| \geq |X| > |\mathbb N|, and therefore YY is uncountable.

Important Note (Uncountability via {0,1}\{{\texttt{0}},{\texttt{1}}\}^\infty)

If we want to show that a set XX is uncountable, it suffices to show that XY|X| \geq |Y| for some uncountable set YY. Typically, a good choice for such a YY is {0,1}\{{\texttt{0}},{\texttt{1}}\}^\infty. And one strategy for establishing X{0,1}|X| \geq |\{{\texttt{0}},{\texttt{1}}\}^\infty| is to identify a subset SS of XX such that S{0,1}S \leftrightarrow\{{\texttt{0}},{\texttt{1}}\}^\infty.

Exercise (Practice with uncountability proofs)

Show that the following sets are uncountable.

  1. The set of all bijective functions from N\mathbb N to N\mathbb N.

  2. {x1x2x3{1,2}: for all n1  i=1nxi≢0mod  4}\{x_1x_2 x_3 \ldots \in \{{\texttt{1}},{\texttt{2}}\}^\infty : \text{ for all $n \geq 1$, $\; \sum_{i=1}^n x_i \not\equiv 0 \mod 4$}\}

Solution

Part 1: Let SS be the set of all bijective functions from N\mathbb N to N\mathbb N. We want to show that SS is uncountable, and we will do so by showing that {0,1}S\{{\texttt{0}},{\texttt{1}}\}^\infty \hookrightarrow S, establishing {0,1}S|\{{\texttt{0}},{\texttt{1}}\}^\infty| \leq |S|.

We now describe this injective mapping. Given x{0,1}x \in \{{\texttt{0}},{\texttt{1}}\}^\infty, we map it to a bijection fx:NNf_x :\mathbb N\to \mathbb N as follows. Let xnx_n be the nn’th bit of xx, and assume the indexing starts from 00. Then for all nNn \in \mathbb N, if xn=0,fx maps 2n to 2n and 2n+1 to 2n+1;if xn=1,fx maps 2n to 2n+1 and 2n+1 to 2n.\begin{aligned} \text{if $x_n = {\texttt{0}}$}, & \quad \text{$f_x$ maps $2n$ to $2n$ and $2n+1$ to $2n+1$;} \\ \text{if $x_n = {\texttt{1}}$}, & \quad \text{$f_x$ maps $2n$ to $2n+1$ and $2n+1$ to $2n$.}\end{aligned}

The below picture illustrates the construction of fxf_x. If xn=0x_n = {\texttt{0}}, we pick the black arrows to map 2n2n and 2n+12n+1, and if xn=1x_n = 1 we pick the red/dashed arrows to map 2n2n and 2n+12n+1.

image

Observe that for any x{0,1}x \in \{{\texttt{0}},{\texttt{1}}\}^\infty, the corresponding function fxf_x is indeed a bijection. It is also clear that if xxx \neq x', then fxfxf_{x} \neq f_{x'}. So this mapping from {0,1}\{{\texttt{0}},{\texttt{1}}\}^\infty to SS is indeed an injection. This completes the proof.

Part 2: Let S={x1x2x3{1,2}: for all n1  i=1nxi≢0mod  4}S = \{x_1x_2 x_3 \ldots \in \{{\texttt{1}},{\texttt{2}}\}^\infty : \text{ for all $n \geq 1$, $\; \sum_{i=1}^n x_i \not\equiv 0 \mod 4$}\}. We want to show that SS is uncountable, and we will do so by identifying a subset of SS that is in one-to-one correspondence with {0,1}\{{\texttt{0}},{\texttt{1}}\}^\infty.

Let a=22a = {\texttt{22}} and b=112b = {\texttt{112}}. Define the set S={1w:w{a,b}}S' = \{{\texttt{1}}w : w \in \{a,b\}^\infty\}. There are a couple of important observations (whose proofs are omitted):

  • SSS' \subseteq S.

  • The mapping f:{0,1}{a,b}f : \{{\texttt{0}},{\texttt{1}}\}^\infty \to \{a,b\}^\infty such that y1y2{0,1}y_1y_2\ldots \in \{{\texttt{0}},{\texttt{1}}\}^\infty is mapped to w1w2{a,b}w_1w_2\ldots \in \{a,b\}^\infty, where wi=aw_i = a if yi=0y_i = {\texttt{0}} and wi=bw_i = b if yi=1y_i = {\texttt{1}}, is a bijection.

These two observations imply that we have identified a subset of SS (namely SS') that is in one-to-one correspondence with {0,1}\{{\texttt{0}},{\texttt{1}}\}^\infty, which allows us to conclude that SS is uncountable.

4
Check Your Understanding
Problem
  1. For sets X,YX, Y, what are the definitions of XY|X| \leq |Y|, XY|X| \geq |Y|, X=Y|X| = |Y|, and X<Y|X| < |Y|?

  2. What is the definition of a countable set?

  3. What is the CS method for showing that a set is countable?

  4. True or false: There exists an infinite set SS such that S<N|S| < |\mathbb N|.

  5. True or false: {0,1}{0,1}=\{{\texttt{0}},{\texttt{1}}\}^* \cap \{{\texttt{0}},{\texttt{1}}\}^\infty = \varnothing.

  6. True or false: {0,1,2}=Q×Q|\{{\texttt{0}},{\texttt{1}},{\texttt{2}}\}^*| = |\mathbb Q\times \mathbb Q|.

  7. State the Diagonalization Lemma and briefly explain how it is proved.

  8. What is the connection between Diagonalization Lemma and uncountability?

  9. What is Cantor’s Theorem?

  10. True or false: ({0,1})=(({0,1}))|\wp(\{{\texttt{0}},{\texttt{1}}\}^\infty)| = |\wp(\wp(\{{\texttt{0}},{\texttt{1}}\}^\infty))|.

  11. True or false: There is a surjection from {0,1}\{{\texttt{0}},{\texttt{1}}\}^\infty to {0,1,2,3}\{{\texttt{0}},{\texttt{1}},{\texttt{2}},{\texttt{3}}\}^\infty.

  12. True or false: Let Σ\Sigma be an alphabet. The set of encodings mapping N\mathbb N to Σ\Sigma^* is countable.

  13. True or false: Let Σ={1}\Sigma = \{{\texttt{1}}\} be a unary alphabet. The set of all languages over Σ\Sigma is countable.

  14. State a technique for proving that a given set is uncountable.

5
High-Order Bits
Important Note

Here are the important things to keep in mind from this chapter.

  1. The definitions of injective, surjective and bijective functions are fundamental.

  2. The concepts of countable and uncountable sets, and their precise mathematical definitions.

  3. When it comes to showing that a set is countable, the CS method is the best choice, almost always.

  4. The Diagonalization Lemma is one of the most important and powerful techniques in all mathematics.

  5. When showing that a set is uncountable, establishing an injection from {0,1}\{0,1\}^\infty (or a surjection to {0,1}\{{\texttt{0}},{\texttt{1}}\}^\infty) is one of the best techniques you can use.

Theorem (Relationships between different types of functions)

Let \(X, Y\) and \(Z\) be three (possibly infinite) sets. Then,

  • \(X \hookrightarrow Y\) if and only if \(Y \twoheadrightarrow X\);

  • if \(X \hookrightarrow Y\) and \(Y \hookrightarrow Z\), then \(X \hookrightarrow Z\);

  • \(X \leftrightarrow Y\) if and only if \(X \hookrightarrow Y\) and \(Y \hookrightarrow X\).

See in chapter
Theorem (No cardinality strictly between finite and \(|\mathbb N|\))

If \(S\) is an infinite set and \(|S| \leq |\mathbb N|\), then \(|S| = |\mathbb N|\).

See in chapter
Theorem (No cardinality strictly between finite and \(|\mathbb N|\))

If \(S\) is an infinite set and \(|S| \leq |\mathbb N|\), then \(|S| = |\mathbb N|\).

See in chapter
Theorem (No cardinality strictly between finite and \(|\mathbb N|\))

If \(S\) is an infinite set and \(|S| \leq |\mathbb N|\), then \(|S| = |\mathbb N|\).

See in chapter
Definition (Encoding elements of a set)

Let \(A\) be a set and let \(\Sigma\) be an alphabet. An encoding of the elements of \(A\), using \(\Sigma\), is an injective function \(\text{Enc}: A \to \Sigma^*\). We denote the encoding of \(a \in A\) by \(\left \langle a \right\rangle\).

If \(w \in \Sigma^*\) is such that there is some \(a \in A\) with \(w = \left \langle a \right\rangle\), then we say \(w\) is a valid encoding of an element in \(A\).

A set that can be encoded is called encodable.

See in chapter
Important Note (Correspondence between decision problems and languages)

There is a one-to-one correspondence between decision problems and languages. Let \(f:\Sigma^* \to \{0,1\}\) be some decision problem. Now define \(L \subseteq \Sigma^*\) to be the set of all words in \(\Sigma^*\) that \(f\) maps to 1. This \(L\) is the language corresponding to the decision problem \(f\). Similarly, if you take any language \(L \subseteq \Sigma^*\), we can define the corresponding decision problem \(f:\Sigma^* \to \{0,1\}\) as \(f(w) = 1\) if and only if \(w \in L\). We consider the set of languages and the set of decision problems to be the same set of objects.

image

It is important to get used to this correspondence since we will often switch between talking about languages and talking about decision problems without explicit notice.

See in chapter
Note (Functions in disguise)

Many mathematical objects can be easily viewed as functions. The following are some examples.

  • A set \(S \subseteq U\) can be viewed as a function \(f_S: U \to \{0,1\}\). This is called the characteristic function of the set. Recall that we made this observation in Important Note (Correspondence between decision problems and languages).

  • A sequence \(s\) of length \(k\) with elements from a set \(Y\) can be viewed as a function \(f_s : \{1,2,\ldots,k\} \to Y\), where \(f_s(i)\) is the \(i\)’th element of the sequence.

  • A string/word is a sequence of symbols from an alphabet \(\Sigma\). Therefore a string \(s \in \Sigma^*\) with \(|s| = k\) can be viewed as a function \(f_s : \{1,2,\ldots,k\} \to \Sigma\).

  • An infinite-length sequence \(s\) with elements from \(Y\) can be viewed as a function \(f_s : \mathbb N^+ \to Y\).

  • Numbers can be viewed as functions since the binary representation of a number is just a string/sequence of bits (possibly infinite-length).

See in chapter
Lemma (Diagonalization)

Let \(\mathcal{F}\) be a set of functions \(f: X \to Y\) where \(|Y| \geq 2\). If \(|X| \geq |\mathcal{F}|\), we can construct a function \(f_D : X \to Y\) such that \(f_D \not \in \mathcal{F}\).

See in chapter
Note (Functions in disguise)

Many mathematical objects can be easily viewed as functions. The following are some examples.

  • A set \(S \subseteq U\) can be viewed as a function \(f_S: U \to \{0,1\}\). This is called the characteristic function of the set. Recall that we made this observation in Important Note (Correspondence between decision problems and languages).

  • A sequence \(s\) of length \(k\) with elements from a set \(Y\) can be viewed as a function \(f_s : \{1,2,\ldots,k\} \to Y\), where \(f_s(i)\) is the \(i\)’th element of the sequence.

  • A string/word is a sequence of symbols from an alphabet \(\Sigma\). Therefore a string \(s \in \Sigma^*\) with \(|s| = k\) can be viewed as a function \(f_s : \{1,2,\ldots,k\} \to \Sigma\).

  • An infinite-length sequence \(s\) with elements from \(Y\) can be viewed as a function \(f_s : \mathbb N^+ \to Y\).

  • Numbers can be viewed as functions since the binary representation of a number is just a string/sequence of bits (possibly infinite-length).

See in chapter
Note (Diagonalization with countable sets)

In a common scenario where diagonalization is applied, both \(\mathcal{F}\) and \(X\) are countably infinite sets. So we can list the elements of \(\mathcal{F}\) as \[f_1,\; f_2,\; f_3,\; \ldots\] as well as the elements of \(X\) as \[x_1,\; x_2,\; x_3,\; \ldots\] Then for all \(i\), define \(f_D(x_i)\) such that \(f_D(x_i) \neq f_i(x_i)\). If \(Y = \{0,1\}\), for example, \(f_D(x_i) = \textbf{not} \; f_i(x_i)\). The construction of \(f_D\) can be nicely visualized with a table, as shown below. Here, an entry corresponding to row \(f_i\) and column \(x_j\) contains the value \(f_i(x_j)\).

image

By construction, the diagonal element \(f_D\) differs from every \(f_i\), \(i \in \{1,2,3,\ldots\}\). In particular, it differs from \(f_i\) with respect to the input \(x_i\).

See in chapter
Proposition (Irrational numbers exist)

There exists an irrational real number, that is, there exists a number in \(\mathbb R\setminus \mathbb Q\).

See in chapter
Note (Functions in disguise)

Many mathematical objects can be easily viewed as functions. The following are some examples.

  • A set \(S \subseteq U\) can be viewed as a function \(f_S: U \to \{0,1\}\). This is called the characteristic function of the set. Recall that we made this observation in Important Note (Correspondence between decision problems and languages).

  • A sequence \(s\) of length \(k\) with elements from a set \(Y\) can be viewed as a function \(f_s : \{1,2,\ldots,k\} \to Y\), where \(f_s(i)\) is the \(i\)’th element of the sequence.

  • A string/word is a sequence of symbols from an alphabet \(\Sigma\). Therefore a string \(s \in \Sigma^*\) with \(|s| = k\) can be viewed as a function \(f_s : \{1,2,\ldots,k\} \to \Sigma\).

  • An infinite-length sequence \(s\) with elements from \(Y\) can be viewed as a function \(f_s : \mathbb N^+ \to Y\).

  • Numbers can be viewed as functions since the binary representation of a number is just a string/sequence of bits (possibly infinite-length).

See in chapter
Corollary (The power set of an infinite set is uncountable)

For any infinite set \(S\), \(\wp(S)\) is uncountable. In particular, \(\wp(\mathbb N)\) is uncountable.

See in chapter