
In this chapter, we would like to remind you the concepts of countable and uncountable sets, as well as the general techniques involved in countability and uncountability proofs. Even though it may seem like we are diverging from the main discussion on computation, we’ll see in future chapters that the concepts in this chapter are intimately related to the concepts of computability and uncomputability. Countable and uncountable sets, together with the diagonalization proof technique, have major applications in proving the limits of computation.
Let and be two (possibly infinite) sets.
A function is called injective if for any such that , we have . We write if there exists an injective function from to .
A function is called surjective if for all , there exists an such that . We write if there exists a surjective function from to .
A function is called bijective (or one-to-one correspondence) if it is both injective and surjective. We write if there exists a bijective function from to .
Let and be three (possibly infinite) sets. Then,
if and only if ;
if and , then ;
if and only if and .
Let and be two sets.
We write if .
We write (or ) if , or equivalently, if .
We write (or ) if it is not the case that .
Theorem (Relationships between different types of functions) justifies the use of the notation , , , and . The properties we would expect to hold for this type of notation indeed do hold. For example,
if and only if and ,
if , then ,
if , then , and so on.
If , then since the identity function that maps to is an injection.
The function defined as is a bijection. It is an injection since if , then . And it is surjective since for every , by the definition of , there exists an such that .
Since , we have . So we just need to argue , i.e. there is a surjective function from to . For this, our strategy is to argue that we can list the elements of such that every element eventually appears in the list. If we can do this, then we have the surjection that we want: we let be the ’th element in the list. Creating such a listing of is relatively straightforward: With this listing, the mapping is such that odd numbers are mapped to positive integers and even numbers are mapped to non-positive integers. The formula for is
We will show by arguing and
The former doesn’t really require much of an argument. The map is an injection and so
For the latter, we will show that there is a surjective function from to , and we will do so by arguing that we can list the elements of such that every element eventually appears in the list. If we can do this, then we have the surjection that we want: we let be the ’th element in the list.
We now describe how to list the elements of . Consider the plot of on a 2-dimensional grid. Starting at we list the elements of using a spiral shape, as shown below.
(The picture shows only a small part of the spiral.) Since we have a way to list all the elements such that every element eventually appears in the list, we are done.
(Side node: It is not a requirement that we give an explicit formula for . In fact, sometimes in such proofs, an explicit formula may not exist. This does not make the proof any less rigorous! The above proof is perfectly acceptable.)
We want to show , and it is clear that (since ), so we just need to show . We will make use of the previous proposition to establish this.
Note that every element of can be written as a fraction where . In other words, there is a surjection from to that maps to (if , map to say ). This shows that . From the previous proposition, . Putting things together, .
Recall that denotes the set of all words/strings over the alphabet with finitely many symbols. We want to show and .
To show , note that the function that maps a string to its length, , is a surjection from to . (Remark: When , this function is a bijection.)
We will now show by presenting a way to list all the elements of such that eventually all the elements appear in the list. (As before, if we can present this listing, then we have a surjection : we let be the ’th element in the list.)
For each , let denote the set of words in that have length exactly . Note that is a finite set for each , and is a union of these sets: . This gives us a way to list the elements of so that any element of eventually appears in the list. First list the elements of , then list the elements of , then list the elements of , and so on.
If is an infinite set and , then .
Assume is such that and is infinite. Since , there is an injection . This allows us to define an ordering on . For , write if . Using this ordering, we can define the order of an element as .
We now observe that the function that we have defined is a bijection from to . To see that this is a bijection, first note that it is a surjective function because is an infinite set. And it is an injective function because it is a total order. In particular, if , then it must be , and since is injective, this implies .
Since , we know . We also know that there are infinitely many primes. Then by Theorem (No cardinality strictly between finite and ), .
(One can also prove this proposition without invoking the theorem. The function that maps to the ’th prime number is a bijection.)
In the last section, we have shown In light of Theorem (No cardinality strictly between finite and ), there are three categories of sets:
finite sets,
sets such that , where is any of the sets listed above (like ),
all other sets (i.e. sets with , where is any of the sets listed above).
It makes sense to give a name for these different categories. In the literature, the first two categories combined is known as countable sets. And the third category is known as uncountable sets.
We can define a countable set as any set with , where . But the choice of here, in a sense, is arbitrary. We can also choose, for instance, , in which case we see countability is equivalent to encodability. And arguably, encodability captures the essence of this class of sets better. That being said, in the literature, is the standard choice, leading to the definition below.
A set is called countable if .
A set is called countably infinite if it is countable and infinite.
A set is called uncountable if it is not countable, i.e. .
Theorem (No cardinality strictly between finite and ) implies that if is countable, there are two options: either is finite, or .
Recall that a set is encodable if for some alphabet , there is an injection from to , or equivalently, (Definition (Encoding elements of a set)). Since , we see that a set is countable if and only if it is encodable. Therefore, one can show that a set is countable by showing that it is encodable. We will call this the “CS method” for showing countability.
The standard definition of countability () highlights the following heuristic.
If you can list the elements of in a way that every element appears in the list eventually, then is countable.
The encodability definition highlights another heuristic that is far more relevant in computer science.
If you can “write down” each element of using a finite number of symbols, then is countable.
The set of all polynomials in one variable with rational coefficients is countable.
Let denote the set of all polynomials in one variable with rational coefficients. We want to show that is countable. We will do so using the CS method. Let Then observe that every element of can be written as a finite string over this alphabet. For example, represents the polynomial This implies that there is a surjective map from to . And therefore . Since is countable, i.e. , is also countable.
Show that the following sets are countable.
.
The set of all functions , where is some fixed finite set.
Part 1: We want to show that is countable. We use the CS method with . Note that any element of can be written uniquely as a finite word over (we use the dollar sign as a separator between the integers). As an illustration, can be encoded as the string . Every integer has finite length, so the string encoding is always of finite length.
Part 2: Let be the set of all functions , where is a finite set. We want to show that is countable.
We first make an observation about the elements of . Take a function , where is a finite set. Let be the size of and let be its elements. Then can be uniquely represented by the tuple where each element of the tuple is an element from .
We now show that is countable using the CS method with the alphabet . The observation above shows that any element of can be uniquely represented with a finite length string where commas are replaced with . (Note that there is no need to put the opening and closing parentheses.) This suffices to conclude that is countable.
Let be a set of functions where . If , we can construct a function such that .
Given a set of functions , we want to construct a function that is different from every . The main idea is the following. For each , pick a unique input and define in a way such that it is different from . Since by construction and disagree on input , is different from . And since we do this for every , is different from all .
Above, it is important that we pick a unique for each so that can be defined in a consistent way. The ability to pick a unique for each is equivalent to .
A bit more formally, since , there is an injection . Let . So implies . Define such that for all , (this is where the assumption is used). This ensures that is different from every , and therefore . (Note that our description of leaves it underspecified, but see the remark below.)
When we apply the above lemma to construct an explicit , we call that diagonalizing against the set . And we call a diagonal element. Typically there are several choices for the definition of :
Different injections can lead to different diagonal elements.
If , we have more than one choice on what value we assign to that makes (here denotes ).
If there are elements not in the range of , then we can define any way we like.
Many mathematical objects can be easily viewed as functions. The following are some examples.
A set can be viewed as a function . This is called the characteristic function of the set. Recall that we made this observation in Important Note (Correspondence between decision problems and languages).
A sequence of length with elements from a set can be viewed as a function , where is the ’th element of the sequence.
A string/word is a sequence of symbols from an alphabet . Therefore a string with can be viewed as a function .
An infinite-length sequence with elements from can be viewed as a function .
Numbers can be viewed as functions since the binary representation of a number is just a string/sequence of bits (possibly infinite-length).
Note that diagonalizing against a set produces an explicit function not in . Therefore, in situations where you wish to find an explicit object that is not in a given set, you should consider if diagonalization could be applied.
Diagonalization tells us that whenever , we can construct a function not in . But if denotes the set of all functions , the construction of is not possible. Therefore it must be the case that .
Let be the set of all functions , which are the characteristic functions of the subsets of (so if and only if ). Then by Note (Functions in disguise), and the result follows from the above Corollary.
In the famous Russell’s Paradox, we consider the set of all sets that do not contain themselves. That is, we consider Then we ask whether is in or not. If , then by the definition of , . And if , again by the definition of , . Either way we get a contradiction. Therefore, the conclusion is that such a set should not be mathematically definable.
This paradox is also an application of diagonalization. For any set , we cannot diagonalize against the set of all functions . If we imagine diagonalizing against this set, where is the set of all sets, the natural diagonal element that comes out of diagonalization is precisely the characteristic function of the set defined above.
For any infinite set , is uncountable. In particular, is uncountable.
Let’s break this up into two cases: (i) is countably infinite and (ii) is uncountable.
If is countably infinite, . And by Cantor’s Theorem, , which by definition means is uncountable.
If is uncountable, then , and so once again, is uncountable.
In a common scenario where diagonalization is applied, both and are countably infinite sets. So we can list the elements of as as well as the elements of as Then for all , define such that . If , for example, . The construction of can be nicely visualized with a table, as shown below. Here, an entry corresponding to row and column contains the value .
By construction, the diagonal element differs from every , . In particular, it differs from with respect to the input .
Below, we will apply Lemma (Diagonalization) to construct an irrational number (i.e. a number in ). Even though the result may not be interesting (since there is a relatively simple proof that is irrational), it does illustrate the use of diagonalization nicely. Cantor used the same technique to construct an explicit transcendental number, which was a quite non-trivial and important result at the time. Instead of proving that result, we put it in the practice set for you.
There exists an irrational real number, that is, there exists a number in .
We will prove the existence of a number in (i.e. the existence of an irrational number) by constructing such a number using diagonalization. In order to simplify the proof, we will restrict our attention to numbers in the interval , so we will construct an irrational number in . The restriction to allows us to represent a number just by considering the fractional part.
As mentioned in Note (Functions in disguise), numbers can be viewed as functions: Every function represents a real number in in binary, namely . Two different functions and may represent the same real number, e.g. and represent the same real number. But we don’t have more than two different functions representing the same number.
Now consider the set of all functions representing a rational number. We will diagonalize against to construct not in . This then represents a real number that is not rational, and we are done.
In order to diagonalize against , all we need is that holds. And this is indeed the case. is countably infinite because is countably infinite. And obviously is countably infinite. This means we are in the situation outlined in Note (Diagonalization with countable sets), and .
The proof is basically the same as the proof of Proposition (Irrational numbers exist). There, we have implicitly proved that the interval cannot be countable. To see this, note that if is countable, then the set of all functions is countable. So , which allows us to diagonalize against , which is not possible.
Let be some finite alphabet. We denote by the set of all infinite length words over the alphabet .
Observe that .
Using the observation in Note (Functions in disguise), an infinite-length string corresponds to a function , which is the characteristic function of a set (where if and only if ). Therefore the set of all infinite-length strings, , corresponds to the set of all subsets of , . That is, . Using Corollary (The power set of an infinite set is uncountable) we can conclude is uncountable.
Prove that if is uncountable and , then is also uncountable.
We want to show that if is a superset of an uncountable set , then must be uncountable.
If is uncountable, by definition, . If , then there is a clear injection from to (map to ), so . Combining this with , we have , and therefore is uncountable.
If we want to show that a set is uncountable, it suffices to show that for some uncountable set . Typically, a good choice for such a is . And one strategy for establishing is to identify a subset of such that .
Show that the following sets are uncountable.
The set of all bijective functions from to .
Part 1: Let be the set of all bijective functions from to . We want to show that is uncountable, and we will do so by showing that , establishing .
We now describe this injective mapping. Given , we map it to a bijection as follows. Let be the ’th bit of , and assume the indexing starts from . Then for all ,
The below picture illustrates the construction of . If , we pick the black arrows to map and , and if we pick the red/dashed arrows to map and .
Observe that for any , the corresponding function is indeed a bijection. It is also clear that if , then . So this mapping from to is indeed an injection. This completes the proof.
Part 2: Let . We want to show that is uncountable, and we will do so by identifying a subset of that is in one-to-one correspondence with .
Let and . Define the set . There are a couple of important observations (whose proofs are omitted):
.
The mapping such that is mapped to , where if and if , is a bijection.
These two observations imply that we have identified a subset of (namely ) that is in one-to-one correspondence with , which allows us to conclude that is uncountable.
For sets , what are the definitions of , , , and ?
What is the definition of a countable set?
What is the CS method for showing that a set is countable?
True or false: There exists an infinite set such that .
True or false: .
True or false: .
State the Diagonalization Lemma and briefly explain how it is proved.
What is the connection between Diagonalization Lemma and uncountability?
What is Cantor’s Theorem?
True or false: .
True or false: There is a surjection from to .
True or false: Let be an alphabet. The set of encodings mapping to is countable.
True or false: Let be a unary alphabet. The set of all languages over is countable.
State a technique for proving that a given set is uncountable.
Here are the important things to keep in mind from this chapter.
The definitions of injective, surjective and bijective functions are fundamental.
The concepts of countable and uncountable sets, and their precise mathematical definitions.
When it comes to showing that a set is countable, the CS method is the best choice, almost always.
The Diagonalization Lemma is one of the most important and powerful techniques in all mathematics.
When showing that a set is uncountable, establishing an injection from (or a surjection to ) is one of the best techniques you can use.