It can be shown that these axioms are sufficient to classify a unique structure "up to isomorphy", that is, two such structures are equivalent up to "renaming" of the elements. However, this just says that if such a structure exists, it is unique. It does not show that it actually exists, which is what we will approach.
For every we can define its equivalence class
.
Lemma: Two equivalence classes and
are either equal or disjoint.
Proof: Every equivalence class contains at least one element, so they cannot be both disjoint and equal. Assuming they are not disjoint, they must have a common element , but then
and
, so by transitivity,
, hence
.∎
Hence, the set of equivalence classes over regarding
is a partition of
. We call this set
, say "S modulo
".
Trivial examples are standard equality , in which case
can be identified with
in a canonical way, and the "squash"-relation that makes every two elements equal, in which case there is only one equivalence class.
Furthermore, important equivalence classes are the residue-equalities on
, where
means that
is divisible by
. Their equivalence classes are the residue classes, and they can be added and multiplied elementwise, forming the residue class rings.
Natural Numbers
Our bootstrapping of begins with a slightly simpler object,
. First, we give axioms for something we shall call a Peano Structure:
A triple , where
is a set,
and
is a function, is called a Peano Structure, if
That is called zero, and
is called successor function of that peano structure.
Mathematicians call a function between two structures that preserves relevant parts of that structure an isomorphism. In case of Peano Structures, a function between two Peano Structures
is an isomorphism if
If such an isomorphism exists between two Peano Structures, they are called isomorphic. They are then essentially equal structures, in the sense that there is no mathematically meaningful difference between them, and the one structure is a result of just renaming the elements of the other.
Lemma: Every two peano structures are isomorphic (in one set universe)
Proof: Let two peano structures be given. We define a function
recursively by
. Therefore, the second and a part of the third axiom are already satisfied by definition.
We show that is surjective: We know that the image of
is a subset of
, and we know that
is in this image. If
is in the image of
, say,
for some
, then we know that
by definition, hence,
is also in the image of
. By the axioms of peano structures, this means that the image of
is
.
Now consider the function with
. By the same argument as above,
is surjective. Furthermore, since we have
and from
follows
we have
for all
, and similarily
for all
, hence,
. Therefore,
is also injective, and the third axiom of isomorphism is proved.∎
What we now know is that if there is a peano structure, then there is - up to isomorphism - only one. There are several ways to show that (in a set universe) there actually is one. The common way is to classify a special set of finite sets, the finite cardinal numbers. In ZFC, this can be done using the well-ordering theorem, the axiom of infinity and the axiom of comprehension. But doing this is outside our scope, we just believe the following theorem:
Theorem: A peano structure exists (in every set universe).
And now we can take an arbitrary peano structure as our set of natural numbers, , by setting
and
[note that we have yet to define addition, this is just for intuition]. The first three axioms of peano structures become obvious arithmetical laws, the fourth axiom becomes the induction axiom. We can define addition recursively by
.
Exercise: Prove
We can define multiplication recursively by
The arithmetical laws follow from these definitions.
Integers and Rational Numbers
Now that we have defined , we want to reach out for
, and it turns out that this is much easier than getting
in the first place. However, for a non-mathematician, these definitions sound rather overcomplicated, but it turns out that they are much easier to handle than "less complicated" ones.
The need for negative numbers comes from the fact that subtraction is not total on , that is, for example,
. Subtraction itself is just the operation of solving an additive equation,
in our example. To be consistent with most of our arithmetical laws so far, it is desirable that the solution of
is equal to the solution of
and generally
, due to cancellation.
We observe that the equations and
must have the same solutions in
if
. Now, let us have a look at the set
of pairs of natural numbers. If we define
, then
is an equivalence relation on
: Reflexivity and symmetry are obvious. For transitivity, consider
and
. That is,
and
. Since we have equality of sums here, we may subtract without getting negative results, therefore
, so
and hence
, which means
. Now we define
.
We furthermore define . We have to show that this is well-defined, which is left as an exercise.
Now, at schools you are taught that , and even in higher mathematics this is often used. With our definition, this is of course not the case, but we can identify every natural number
with the equivalence class
. As isomorphic structures do not need to be distinguished, we can safely redefine
to be the set of these equivalence classes. The changes we would have to accomplish are below the usual abstraction level.
The structure with addition and multiplication is called a Ring.
Rational numbers are similar, this time we add solutions to , but we have to avoid the pathological case of
. The reason is that we want the rationals to be a Field. Similar to how we did before, we define an equivalence relation
on
, namely
. You are probably familiar with this representation of rational numbers, you just write
instead of
- we just chose an equivalence that tells when two fractions are equal, and we have
.
Cauchy Sequences
The step from rational numbers to real numbers is a bit more complicated. It is not sufficient to add solutions of special equations to our system. However, of course, in many cases, the real numbers are just solutions of equations. If we add zeroes of all polynomials, for example, we would get (more than) the algebraic real numbers over . But there are also transcendental numbers like
.
Let us first have a look at one of the first irrational numbers ever discovered: .
Theorem: There is no rational solution to the equation .
Proof: By monotony of we know that a positive solution must lie between
and
, and can especially not be integral. Now assume there was some solution
with
, where w.l.o.g.
. Then also
. But
, and as
cannot be factorized nontrivially,
must be divisible by
, therefore,
for some
. But then
yields
and by the same argument as before, we get that
is divisible by
, which contradicts our assumption
.∎
So we know that , but still, we can define the set
, and this set will be an open interval, with rational numbers approaching
up to an arbitrary precision. The set
is bounded, but has no rational bound. The set
is therefore not complete, we say.
However, for every given we can give a rational number
such that
. We now touch the topic of infinitesimal analysis, however, do not fear, we do not have to dive in it too deeply. The crucial concept we have to define now is the Cauchy sequence:
Definition: A sequence of rational numbers is called a Cauchy sequence, if for every
we can find an
such that for all
we have
.
That is, if we have such an , we know an
such that all elements of that sequence above
do not differ about more than
, and since we may choose this
arbitrarily small, the sequence "approximates" something.
For some of these sequences, we can give a rational number, which they "obviously" approximate. For example, somehow, the sequence "approximates"
: It gets arbitrarily close to
. On the other hand, if we set
, this is a cauchy sequence, but does not approximate any rational number. However, we want this to be a number - in general, we want things to which we have an intuitive arbitrarily precise "approximation" to be numbers - we want the numbers to be complete.
The easiest way to make sure this is the case is to just postulate that every Cauchy sequence has such an approximation (a "limit point"), but then the problem remains that we do not really know how the objects we add should look like. So we simply do the first thing that comes to your mind when you are a mathematician: Just use the Cauchy sequences themselves, and define a proper equivalence relation on them.
To do this, we first have to make a formal definition of what we mean by an "approximation", but we can give it for rational approximated numbers only, so far:
Definition: A sequence converges towards
, if for an arbitrarily small
we can find an
such that for all
we have
.
As an exercise, check whether our above statements on "approximations" hold for this definition. Now we define an equivalence relation on cauchy sequences:
Definition: Two cauchy sequences and
converge towards the same point, denoted by
, if the sequence
converges towards
.
Theorem: is an equivalence relation.
Proof: Reflexivity holds trivially, since . Symmetry is trivial since
. For transitivity, assume
and
. Now let an
be given. By definition, we find an
such that for all
we have
, and similarily, we find an
such that for all
we have
. But then we have
for all
. Therefore,
.∎
Let be the set of Cauchy sequences, then we define
. We can embed the rational numbers into the real numbers by
. Furthermore, we define addition componentwise by
same for multiplication. The simple arithmetic laws then also follow componentwise. For and
, we say that they hold when they hold for infinitely many corresponding elements of the compared sequences.
Of course, we still have to show a lot of things, like showing that this is really a field with the desired axioms, and showing that addition and multiplication defined in this way are compatible with the equivalence relation. However, this is just technical, and we omit it, since we think that for people not familiar with this topic, it is already sufficiently complicated.
Prospects
For everyone who does not yet feel challenged enough, I have a few prospects. The first one is from set theory over the real line:
Hypothesis (Suslin): Let be a set and
satisfy the following conditions:
Then is isomorphic to
.
Theorem: Suslin's Hypothesis is undecidable over standard set theory (ZFC).
Of course, we will not prove this, as it involves a lot of model theory. Let us move on with our second prospect, the p-adic numbers. If we look at the proof that is an equivalence relation, we notice that what we really needed to prove this were a few inequalities on the absolute value, namely:
The third one is called the triangle inequality. It is crucial for the transitivity proof. On the other hand, we might think of some other function which satisfies the same axioms. And in fact, there is a mathematical concept which generalizes this:
Definition: A function is called a metric (or distance function) on
if it satisfies
Note that this definition is usually made with real numbers instead of rational numbers.
Now, as above, we can define for a given metric
, and it will also be an equivalence relation.
Now, let be some prime number. For every rational number
, we can find an unique integer
, such that
, where
, and neither
nor
are divisible by
. Define
for every such
, and
. Define
.
Lemma:
Proof: Let and
with fully cancelled fractions not containing
as a factor, and let w.l.o.g.
. Then
. Now,
is not divisible by
, so the fraction can only contain positive powers of
. Therefore,
for some
. We now want
, that is,
, which is trivial, since
.∎
Theorem: is a metric.
Proof: Trivially, and
. It remains to show the triangle inequality, that is,
. By the above lemma, we have
.∎
The set we get when identifying sequences according to this metric is called , the
-adic numbers. It has many similarities with
, and you can even do analysis on it.