Sat, 13 Oct 2012 15:44:07 GMT

Again, I gave a talk for the best mathematics club of the multiverse, this time for the pupils who do not yet study mathematics. I thought it would be nice to have something that is seldom taught, a proper introduction to "from scratch". This is what I tried to do. Keep in mind that this has very low requirements and is not thought for mathematicians, but for people who may become mathematicians. Since I still do not have anything better than MathJax, JavaScript is needed to render the formulae, sorry about that. Here is a script:

**Inroduction**

is probably the most known object in mathematics. However, only few non-mathematicians know how it is actually defined. Usually, they are seen as some "obvious", "natural" object. However, there are several fallacies in the step from to , like the fact that it is not algebraically closed (), and that . To avoid such fallacies, we should first give some axioms that we want the reals to satisfy:

**Equivalence Relations**

Equivalence relations are an extremely important concept in mathematics. They show up in almost every mathematical topic, like logic, algebra, topology. It is important to understand this concept to define , but it is also worthwile for its own sake.

An eqivalence relation is a binary relation that usually somehow resembles similarities between objects. Formally, a relation on a set is a relation which obeys the following three axioms:

(Source)

is probably the most known object in mathematics. However, only few non-mathematicians know how it is actually defined. Usually, they are seen as some "obvious", "natural" object. However, there are several fallacies in the step from to , like the fact that it is not algebraically closed (), and that . To avoid such fallacies, we should first give some axioms that we want the reals to satisfy:

- should be a
*field*: - , , , ,
- There exist additive and multiplicative neutral elements and , such that
- There exists an additive inverse for every , such that . There exists a multiplicative inverse for every such that
- on should be a
*linear partial ordering relation*: - and implies
- and implies
- should be an
*ordered field*regarding : - implies
- and implies
- is complete over :
- If for all , then there exists a maximal with for all .
- If for all , then there exists a minimal with for all .

It can be shown that these axioms are sufficient to classify a unique structure "up to isomorphy", that is, two such structures are equivalent up to "renaming" of the elements. However, this just says that if such a structure exists, it is unique. It does not show that it actually exists, which is what we will approach.

Equivalence relations are an extremely important concept in mathematics. They show up in almost every mathematical topic, like logic, algebra, topology. It is important to understand this concept to define , but it is also worthwile for its own sake.

An eqivalence relation is a binary relation that usually somehow resembles similarities between objects. Formally, a relation on a set is a relation which obeys the following three axioms:

*Reflexivity*: For all we have*Symmetry*: For all we have*Transitivity*: For all if we have and then we also have .

For every we can define its *equivalence class* .

__Lemma__: Two equivalence classes and are either equal or disjoint.

__Proof__: Every equivalence class contains at least one element, so they cannot be both disjoint and equal. Assuming they are not disjoint, they must have a common element , but then and , so by transitivity, , hence .∎

Hence, the set of equivalence classes over regarding is a *partition* of . We call this set , say "S modulo ".

Trivial examples are standard equality , in which case can be identified with in a canonical way, and the "squash"-relation that makes every two elements equal, in which case there is only one equivalence class.

Furthermore, important equivalence classes are the residue-equalities on , where means that is divisible by . Their equivalence classes are the *residue classes*, and they can be added and multiplied elementwise, forming the *residue class rings*.

**Natural Numbers**

Our bootstrapping of begins with a slightly simpler object, . First, we give axioms for something we shall call a *Peano Structure*:

A triple , where is a set, and is a function, is called a Peano Structure, if

- There is no with
- For every ,
- For every , if , then
- If , and if , then

That is called *zero*, and is called *successor function* of that peano structure.

Mathematicians call a function between two structures that preserves relevant parts of that structure an *isomorphism*. In case of Peano Structures, a function between two Peano Structures is an *isomorphism* if

- is bijective
- For all we have , and for all we have .

If such an isomorphism exists between two Peano Structures, they are called *isomorphic*. They are then essentially equal structures, in the sense that there is no mathematically meaningful difference between them, and the one structure is a result of just renaming the elements of the other.

__Lemma__: Every two peano structures are isomorphic (in one set universe)

__Proof__: Let two peano structures be given. We define a function recursively by . Therefore, the second and a part of the third axiom are already satisfied by definition.

We show that is surjective: We know that the image of is a subset of , and we know that is in this image. If is in the image of , say, for some , then we know that by definition, hence, is also in the image of . By the axioms of peano structures, this means that the image of is .

Now consider the function with . By the same argument as above, is surjective. Furthermore, since we have and from follows we have for all , and similarily for all , hence, . Therefore, is also injective, and the third axiom of isomorphism is proved.∎

What we now know is that if there is a peano structure, then there is - up to isomorphism - only one. There are several ways to show that (in a set universe) there actually is one. The common way is to classify a special set of finite sets, the *finite cardinal numbers*. In ZFC, this can be done using the well-ordering theorem, the axiom of infinity and the axiom of comprehension. But doing this is outside our scope, we just believe the following theorem:

__Theorem__: A peano structure exists (in every set universe).

And now we can take an arbitrary peano structure as our set of natural numbers, , by setting and [note that we have yet to define addition, this is just for intuition]. The first three axioms of peano structures become obvious arithmetical laws, the fourth axiom becomes the *induction axiom*. We can define addition recursively by

.

__Exercise__: Prove

We can define multiplication recursively by

The arithmetical laws follow from these definitions.

**Integers and Rational Numbers**

Now that we have defined , we want to reach out for , and it turns out that this is much easier than getting in the first place. However, for a non-mathematician, these definitions sound rather overcomplicated, but it turns out that they are much easier to handle than "less complicated" ones.

The need for negative numbers comes from the fact that subtraction is not total on , that is, for example, . Subtraction itself is just the operation of solving an additive equation, in our example. To be consistent with most of our arithmetical laws so far, it is desirable that the solution of is equal to the solution of and generally , due to cancellation.

We observe that the equations and must have the same solutions in if . Now, let us have a look at the set of pairs of natural numbers. If we define , then is an equivalence relation on : Reflexivity and symmetry are obvious. For transitivity, consider and . That is, and . Since we have equality of sums here, we may subtract without getting negative results, therefore , so and hence , which means . Now we define .

We furthermore define . We have to show that this is well-defined, which is left as an exercise.

Now, at schools you are taught that , and even in higher mathematics this is often used. With our definition, this is of course not the case, but we can identify every natural number with the equivalence class . As isomorphic structures do not need to be distinguished, we can safely redefine to be the set of these equivalence classes. The changes we would have to accomplish are below the usual abstraction level.

The structure with addition and multiplication is called a *Ring*.

Rational numbers are similar, this time we add solutions to , but we have to avoid the pathological case of . The reason is that we want the rationals to be a *Field*. Similar to how we did before, we define an equivalence relation on , namely . You are probably familiar with this representation of rational numbers, you just write instead of - we just chose an equivalence that tells when two fractions are equal, and we have .

**Cauchy Sequences**

The step from rational numbers to real numbers is a bit more complicated. It is not sufficient to add solutions of special equations to our system. However, of course, in many cases, the real numbers are just solutions of equations. If we add zeroes of all polynomials, for example, we would get (more than) the algebraic real numbers over . But there are also transcendental numbers like .

Let us first have a look at one of the first irrational numbers ever discovered: .

__Theorem:__ There is no rational solution to the equation .

__Proof:__ By monotony of we know that a positive solution must lie between and , and can especially not be integral. Now assume there was some solution with , where w.l.o.g. . Then also . But , and as cannot be factorized nontrivially, must be divisible by , therefore, for some . But then yields and by the same argument as before, we get that is divisible by , which contradicts our assumption .∎

So we know that , but still, we can define the set , and this set will be an open interval, with rational numbers approaching up to an arbitrary precision. The set is *bounded*, but has no rational bound. The set is therefore not *complete*, we say.

However, for every given we can give a rational number such that . We now touch the topic of *infinitesimal analysis*, however, do not fear, we do not have to dive in it too deeply. The crucial concept we have to define now is the Cauchy sequence:

__Definition:__ A sequence of rational numbers is called a *Cauchy sequence*, if for every we can find an such that for all we have .

That is, if we have such an , we know an such that all elements of that sequence above do not differ about more than , and since we may choose this arbitrarily small, the sequence "approximates" something.

(Source)

For some of these sequences, we can give a rational number, which they "obviously" approximate. For example, somehow, the sequence "approximates" : It gets arbitrarily close to . On the other hand, if we set , this is a cauchy sequence, but does not approximate any rational number. However, we want this to be a number - in general, we want things to which we have an intuitive arbitrarily precise "approximation" to be numbers - we want the numbers to be *complete*.

The easiest way to make sure this is the case is to just postulate that every Cauchy sequence has such an approximation (a "limit point"), but then the problem remains that we do not really know how the objects we add should look like. So we simply do the first thing that comes to your mind when you are a mathematician: Just use the Cauchy sequences themselves, and define a proper equivalence relation on them.

To do this, we first have to make a formal definition of what we mean by an "approximation", but we can give it for rational approximated numbers only, so far:

__Definition:__ A sequence *converges* towards , if for an arbitrarily small we can find an such that for all we have .

As an exercise, check whether our above statements on "approximations" hold for this definition. Now we define an equivalence relation on cauchy sequences:

__Definition:__ Two cauchy sequences and *converge towards the same point*, denoted by , if the sequence converges towards .

__Theorem:__ is an equivalence relation.

__Proof:__ Reflexivity holds trivially, since . Symmetry is trivial since . For transitivity, assume and . Now let an be given. By definition, we find an such that for all we have , and similarily, we find an such that for all we have . But then we have for all . Therefore, .∎

Let be the set of Cauchy sequences, then we define . We can embed the rational numbers into the real numbers by . Furthermore, we define addition componentwise by

same for multiplication. The simple arithmetic laws then also follow componentwise. For and , we say that they hold when they hold for infinitely many corresponding elements of the compared sequences.

Of course, we still have to show a lot of things, like showing that this is really a field with the desired axioms, and showing that addition and multiplication defined in this way are compatible with the equivalence relation. However, this is just technical, and we omit it, since we think that for people not familiar with this topic, it is already sufficiently complicated.

**Prospects**

For everyone who does not yet feel challenged enough, I have a few prospects. The first one is from set theory over the real line:

__Hypothesis (Suslin):__ Let be a set and satisfy the following conditions:

- is a
*linear ordering*: , , . - is
*dense*: If and , then there is a such that . - is
*complete*: If for some there exists an such that all are (" is bounded"), then there exists a smallest with this property (" is the supremum of in ). - Define intervals in the canonical way by . Then for every set of pairwise disjoint intervals, there exists a surjective function ("every set of pairwise disjoint intervals is countable").

Then is isomorphic to .

__Theorem:__ Suslin's Hypothesis is undecidable over standard set theory (ZFC).

Of course, we will not prove this, as it involves a lot of model theory. Let us move on with our second prospect, the p-adic numbers. If we look at the proof that is an equivalence relation, we notice that what we really needed to prove this were a few inequalities on the absolute value, namely:

The third one is called the *triangle inequality*. It is crucial for the transitivity proof. On the other hand, we might think of some other function which satisfies the same axioms. And in fact, there is a mathematical concept which generalizes this:

__Definition:__ A function is called a *metric* (or *distance function*) on if it satisfies

Note that this definition is usually made with real numbers instead of rational numbers.

Now, as above, we can define for a given metric , and it will also be an equivalence relation.

Now, let be some prime number. For every rational number , we can find an unique integer , such that , where , and neither nor are divisible by . Define for every such , and . Define .

__Lemma:__

__Proof:__ Let and with fully cancelled fractions not containing as a factor, and let w.l.o.g. . Then . Now, is not divisible by , so the fraction can only contain positive powers of . Therefore, for some . We now want , that is, , which is trivial, since .∎

__Theorem:__ is a metric.

Proof: Trivially, and . It remains to show the triangle inequality, that is, . By the above lemma, we have .∎

The set we get when identifying sequences according to this metric is called , the *-adic numbers*. It has many similarities with , and you can even do analysis on it.