number-theorypedantic

Created 25 Jul 2023. Last updated 19 Dec 2024

Developing the Axioms

tl;dr: A journey from the ground up in which we use axioms to build a proof that every positive integer can be uniquely prime factored.

This is part 1, where we develop the basic axioms of the integers and some definitions. In general, the level of rigor will decrease as the parts go on, so that the reader doesn't get bored to death. But it should be obvious how to fill out everything with complete rigor. Here is a glossary of math symbols.

Axioms as Properties

Over the integers, we need a set of reduced axioms from which all the known theorems can be derived using the rules of logical inference. Reduced meaning that if an axiom can be proven using other axioms, then it should not be an axiom. And philosophically, our axioms should be as simple as possible. The modern approach to axiomatic proof is for our list of axioms to be the properties that we want our system to have; but, it's not clear what these fundamental properties of the integers should be.

For example, it is well known that if a prime divides a product, then it divides one of the constituents (Euclid's Lemma). Should this be one of our fundamental properties? It certainly feels "obvious", in the sense that proving it would not get you any extra points on an olympiad question. But does it follow from some other fundamental properties? How do you even define prime? In my experience, people often try to justify Euclid's lemma by using prime factorization. The problem with this is that it feels backwards - the fact that every integer can be uniquely prime factorized is an extremely powerful result (hence the name fundamental theorem of arithmetic (FTA)), and so using it feels like overkill and may even be circular reasoning.

We could have FTA as an axiom. But if we can prove it from simpler axioms, then why bother?

How deep do we go?

At some point, we need to stop our search for rigor - otherwise we will get too far out of the math world and into philosophy. For example, what does it mean for two things to be equal? And so, we will assume some basic notions:

  • Equality is reflexive (a=aa=a), symmetric (a=ba=b means b=ab=a) and transitive (if a=ba=b and b=cb=c then a=ca=c)
  • If a=ba=b then we may substitute aa for bb and vice versa, in any expression containing them
  • Properties of logic and basic set theory
  • Order of operations

Basic properties

We are working with the integers (whole numbers, Z\mathbb{Z}) and naturals (positive whole numbers, N\mathbb{N}), under two basic operations: addition and multiplication (+,+, \cdot). More technically, it is an underlying assumption that Z\mathbb{Z} is closed under two well-defined binary operations +,+, \cdot, i.e. that adding or multiplying two integers always gives an integer. (This is not the case for division!)

Let's add the first items to our "inventory" of fundamental properties2.

  • Commutativity: the order of multiplication and addition does not matter. In symbols:
a,bZ,  ab=ba,  a+b=b+a\forall a,b \in \mathbb{Z}, \; a \cdot b = b \cdot a, \; a+b=b+a
  • Associativity: in repeated addition or multiplication, the brackets do not matter. In symbols:
a,b,cZ,  (ab)c=a(bc),\forall a,b,c \in \mathbb{Z}, \; (a \cdot b) \cdot c = a \cdot (b \cdot c),(a+b)+c=a+(b+c)(a+b)+c = a+(b+c)
  • Distributivity: multiplication is distributive over addition. In symbols:
a,b,cZ,  a(b+c)=ab+ac\forall a,b,c \in \mathbb{Z}, \; a\cdot(b+c) = a\cdot b + a \cdot c

I hope you agree that these properties seem pretty fundamental. Let's add some more:

  • Additive Identity: there exists an integer we call 00, which when added to any integer, does nothing:
0Z  s.t.  aZ,a+0=a\exists \, 0 \in \mathbb{Z} \; s.t. \; \forall a \in \mathbb{Z}, a+0=a
  • Additive Inverse: for every integer aa, there is another integer that when added to aa, gives 0.
aZ,aZ  s.t.  a+a=0\forall a \in \mathbb{Z}, \exists \, a' \in \mathbb{Z} \; s.t.\; a + a' = 0
  • Multiplicative Identity: there exists an integer we call 11, which when multiplying by any integer, does nothing:
1Z  s.t.  aZ,a1=a\exists \, 1 \in \mathbb{Z} \; s.t. \; \forall a \in \mathbb{Z}, a\cdot 1=a

Note that we don't have multiplicative inverses, because then we would have to include reciprocals of integers.

Furthermore, we can use the commutative property to extend some of the above axioms:

  • Distributivity: As well as a(b+c)=ab+aca \cdot (b+c) = a\cdot b + a \cdot c, we also have (b+c)a=ba+ca(b+c)\cdot a = b \cdot a + c \cdot a
  • Additive identity: a+0=0+a=0a+0 = 0+a = 0 instead of just a+0=aa+0=a
  • Additive inverse: a+a=a+a=0a+a'=a'+a=0 instead of just a+a=0a+a'=0
  • Multiplicative identity: a1=1a=aa\cdot 1 = 1 \cdot a = a instead of just a1=aa \cdot 1 = a

If we don't do this then whenever we cite these axioms, we would have to remember the way round we wrote it, which is utter hell.

Uniqueness

We haven't explicitly stated that 0 (the additive identity), 1 (the multiplicative identity) and additive inverses are unique. Again, this feels intuitive - if a+b = a, then b=0, right? And -1 is the additive inverse of 1, right? Maybe we should add uniqueness as an axiom.

Actually, we can prove it from what we already have. I encourage you to try and do so.

Claim.

00 is unique, i.e. the only additive identity of Z\mathbb{Z}.

Proof.

Suppose that 00 and 00' are two additive identities of Z\mathbb{Z}. We will show that 0=00' = 0.

Note that 0+0=00 + 0' = 0, by +ive id.

But also 0+0=00 + 0' = 0', by +ive id.

Thus 0=0+0=00 = 0 + 0' = 0', so 0=00 = 0', as required.

The proof for the uniqueness of 11 is completely analogous, so it is left to the reader.

Claim.

Additive inverses are unique.

Proof.

Let aa be an arbitrary integer. Let b,cb,c be two additive inverses of a. We will show that b=cb = c.

Consider (a+b)+c(a+b)+c.

One one hand:

(a+b)+c=0+c(a+b)+c = 0+c by +ive inv.

=c= c by +ive id.

On the other hand:

(a+b)+c=a+(b+c)(a+b)+c = a+(b+c) by assoc.

=a+(c+b)=a+(c+b) by comm.

=(a+c)+b=(a+c)+b by assoc.

=0+b = 0+b by +ive inv.

=b= b by +ive id.

Thus, b=(a+b)+c=cb = (a+b)+c = c, so b=cb=c, as required.

Now we can introduce negative signs as the way to refer an integer's unique additive inverse: for each integer nn, we denote its unique additive inverse as n-n. Then, we can define aba-b to be shorthand for a+(b)a+(-b), which is a nice way to avoid having to define subtraction as another operation.

Ordering of Z

What we have so far is good, but we need more. For example we haven't axiomatized the naturals yet, and what about proof techniques?

  • Z is ordered: There exists a non-empty subset N\mathbb{N} of Z\mathbb{Z} that is closed under +,+,\cdot and satisfies Trichotomy: for all aZa \in \mathbb{Z}, exactly one of aN,a=0,aNa \in \mathbb{N}, a=0, -a \in \mathbb{N} is true1.
  • Well-ordering principle: Every non-empty subset of the integers has a least element, defined as an element ee of the subset such that for all elements xx of the subset, exe\leq x.

The importance of the well-ordering principle3 cannot be understated, because it will let us finish off proofs by assuming minimality and showing a contradiction (i.e. infinite descent).

Definitions

Let's make a list of things we'll probably need to explicitly define if we want to have hope of proving FTA:

  • divisibility and primality
  • inequalities, notion of positive/negative
  • common divisors, gcd and lcm (greatest common divisor, lowest common multiple)

So, let's try to define these rigorously.

Defn.

Let a,b,c,pZa,b,c,p \in \mathbb{Z}.

  • aba \mid b ("a divides b", "b is divisible by a") if and only if ("iff") kZ  s.t.  b=ak\exists k \in \mathbb{Z} \; s.t. \; b = a \cdot k. Then aa is a "factor" or "divisor" of bb and bb is a "multiple" of aa.

  • aba-b is shorthand for a+ba + -b.

  • a>ba \gt b iff abNa-b \in \mathbb{N}.

  • a<ba \lt b iff b>ab \gt a.

  • aba \geq b iff a>ba \gt b or a=ba=b.

  • aba \leq b iff bab \geq a.

  • a>b>ca \gt b \gt c iff a>ba \gt b and b>cb \gt c; in this case bb is "strictly between" aa and cc. Vice versa for a<b<ca \lt b \lt c.

  • aa is positive iff a>0a \gt 0, and "negative" iff a<0a \lt 0.

  • pp is prime iff for all ways of writing p=uvp = u \cdot v with u,vZu,v \in \mathbb{Z}, exactly one of u,vu,v is 1 or -1.

  • aa is a gcd of b,cb,c if for all common divisors aa' of b,cb,c, aaa \geq a'.

  • aa is a lcm of b,cb,c if for all common multiples aa' of b,cb,c, aaa \leq a'

Note: in the definition of prime, mathematicians like to define another thing called a unit, which is a factor of 1; in the case of the integers, the only factors of 1 are 1 and -1 (why?).

Also, for the sake of brevity, we will skip the proofs that gcds and lcms exist and are unique, and that all common divisors divide the greatest common divisor (left as an exercise). Thus we can refer to the unique gcd of a,ba,b as gcd(a,b)gcd(a,b) or (a,b)(a,b), and the unique lcm of a,ba,b as lcm(a,b)lcm(a,b) or [a,b][a,b].

Structuring logic, building lemmas

So, what's the point of all these axioms and definitions? It means we can start to inch towards our goal by building lemmas. For example:

Lemma.

aZ,  0a=a0=0\forall a \in \mathbb{Z}, \; 0 \cdot a = a \cdot 0 = 0.

Proof.

Let aa be an arbitrary integer.

0a=(0+0)a0 \cdot a = (0 + 0) \cdot a by +ive id.

=0a+0a = 0 \cdot a + 0 \cdot a by dist.

Thus, (0a)+(0a)=(0a+0a)+(0a)(0 \cdot a) + -(0 \cdot a) = (0 \cdot a + 0 \cdot a) + -(0 \cdot a)

The left hand side equals 00, by +ive inv.

The right hand side is:

(0a+0a)+(0a)(0 \cdot a + 0 \cdot a) + -(0 \cdot a)

=0a+(0a+(0a))= 0 \cdot a + (0 \cdot a + -(0 \cdot a)) by assoc.

=0a+0= 0 \cdot a + 0 by +ive inv.

=0a= 0 \cdot a by +ive id.

Therefore, equating the RHS and LHS, we obtain 0=0a0 = 0\cdot a. Thus by comm., 0a=a0=00 \cdot a = a \cdot 0 = 0, as required.

Wow, that seemed tedious! But the point is, even though the fact that 0 times anything is 0 seems fundamental, we don't actually need it as an axiom, because we can prove it from the axioms we already have.

I will give one more lemma with full proof, so that you get the idea (referring to axioms at each step, etc). Then, I'll give the list of lemmas that can be built up, and the main ideas for how to prove them, but not the complete proofs.

Lemma.

In ZZ, if da,dbd \mid a, d\mid b then d(ar+bs)d \mid (a \cdot r+b \cdot s) for any r,sZr,s \in \mathbb{Z}

Proof.

Suppose da,dbd \mid a, d \mid b. Let r,sZr,s \in \mathbb{Z}, we will show that d(ar+bs)d \mid (a\cdot r+b\cdot s).

da    a=dkd \mid a \implies a = d \cdot k for some kZk \in \mathbb{Z}

And, db    b=djd \mid b \implies b = d \cdot j for some jZj \in \mathbb{Z}

Thus, ar+bs=(dk)r+(dj)sa\cdot r + b \cdot s = (d \cdot k) \cdot r + (d \cdot j) \cdot s

=d(kr)+(dj)s = d \cdot (k \cdot r) + (d \cdot j) \cdot s by assoc.

=d(kr)+d(js) = d \cdot (k \cdot r) + d \cdot (j \cdot s) by assoc.

=d(kr+js) = d \cdot (k \cdot r + j \cdot s) by dist.

But kr+jsk\cdot r + j \cdot s is an integer because of closure, thus xZ  s.t.  (ar+bs)=dx\exists \, x \in \mathbb{Z} \;s.t.\; (a\cdot r + b \cdot s) = d \cdot x, namely x=kr+jsx = k\cdot r + j\cdot s. Thus by defn. of "divides", d(ar+bs)d \mid (a \cdot r + b \cdot s), as required.

Lemma List

Now for the list of lemmas that can be built up. To prevent circular reasoning, if lemma A is used to prove lemma B, then A will have a lower lemma number than B. Feel free to fill out the details of each proof (it's a good exercise!).

Lemma 1.

aZ,(a)=a\forall a \in \mathbb{Z}, -(-a) = a

Proof.

Follows from a+(a)=0a + (-a) = 0 and uniqueness of +ive inv.

Lemma 2.

a,b,cZ,a=b    a+c=b+c\forall a,b,c \in \mathbb{Z}, a=b \iff a+c=b+c

Proof.

Forward direction is immediate, backwards direction follows from adding c-c to both sides.

Lemma 3.

0=0-0 = 0

Proof.

Consider 0+00 + -0; it equals both 0-0 and 00.

Lemma 4.

If da,dbd \mid a, d \mid b then d(ar+bs)d \mid (a\cdot r+b \cdot s)

Proof.

See above.

Lemma 5.

0a=a0=00 \cdot a = a \cdot 0 = 0

Proof.

See above.

Lemma 6.

a>b,b>c    a>ca\gt b,b \gt c \implies a \gt c

Proof.

If a>b,b>ca \gt b,b \gt c then aba-b and bcb-c are naturals by defn. of ">\gt". Thus (ab)+(bc)(a-b)+(b-c) is natural by closure, which simplifies and implies the result.

Lemma 7.

a=(1)a-a = (-1)\cdot a

Proof.

By Lemma 2, 0=0a0 = 0\cdot a, which equals (1+(1))a=a+(1)a(1+(-1))\cdot a = a + (-1)\cdot a. So 0=(1)a+a0 = (-1)\cdot a + a. Add a-a to both sides for the result.

Lemma 8.

a>0    aNa \gt 0 \iff a \in \mathbb{N}

Proof.

If a>0a\gt 0 then a0Na-0 \in \mathbb{N} by defn of ">\gt". But a0=a+(0)=a+0=aa-0 = a+(-0) = a+0 = a, using Lemma 3 and +ive id.

Lemma 9.

1N1 \in \mathbb{N}

Proof.

Use Trichotomy and eliminate the other two cases by contradiction.

If 1=01=0, then since N\mathbb{N} is nonempty, pick a natural xx, then we have x=1x=0x=0x = 1\cdot x = 0 \cdot x = 0 (by *ive id. and Lemma 5) So 00 is natural, which contradicts Trichotomy.

If 1-1 is natural, then so is (1)(1)(-1)\cdot(-1) by closure, but (1)(1)=(1)=1(-1)\cdot (-1) = -(-1) = 1 by Lemma 7 and Lemma 1. So -1 and 1 are both natural, contradicting Trichotomy.

Lemma 10.

(ab)=ba-(a-b) = b-a

Proof.

Use Lemma 7 and Lemma 1:

(ab)=(1)(ab)=(1)a+(1)(b)=a+(b)=a+b=ba-(a-b) \\= (-1) \cdot (a-b) \\= (-1) \cdot a + (-1) \cdot (-b) \\= -a + -(-b) \\= -a + b \\= b-a

Lemma 11.

a>b    a+c>b+ca\gt b \implies a+c\gt b+c

Proof.

Follows from Lemma 7, and that ab=(a+c)(b+c)a-b = (a+c)-(b+c).

Lemma 12.

a,bN,cZ\forall a,b \in \mathbb{N}, c \in \mathbb{Z}, if a=bca=b\cdot c then cNc \in \mathbb{N}

Proof.

Use Trichotomy.

If c=0c=0, then a=0a=0 by Lemma 2, so 00 is natural, contradicting Trichotomy.

If cN-c \in \mathbb{N} then after applying Lemma 7 (and basic axioms) we get that aN-a \in \mathbb{N}, contradicting Trichotomy.

Lemma 13.

a<ba\lt b and c<0    ac<bcc \lt 0 \implies a\cdot c \lt b \cdot c

Proof.

If a<ba\lt b then baNb-a \in \mathbb{N} and so by Lemma 8 and closure, (ba)cN(b-a)\cdot c \in \mathbb{N}. This can be rearranged to bc(ac)b\cdot c - (a \cdot c), implying the result.

Lemma 14.

There are no integers strictly between 00 and 11.

Proof.

Suppose there is an integer aa between 00 and 11. By Lemma 8, aNa \in \mathbb{N}, and so by the well-ordering principle, let ee be the smallest such integer. More precisely, we are considering the set {nNn<1}\{n \in \mathbb{N} \mid n \lt 1\}.

Now, ee<1ee \cdot e \lt 1 \cdot e (Lemma 13) and so eee \cdot e is a smaller element of the set, contradiction.

Lemma 15.

Exactly one of a<b,a=b,a>ba\lt b, a=b, a\gt b is true.

Proof.

Trichotomy on aba-b.

Lemma 16.

Exactly one of ab,a<ba\geq b, a\lt b is true.

Proof.

Follows from Lemma 15.

Lemma 17.

aZ,  aN    a1\forall a \in \mathbb{Z},\; a \in \mathbb{N} \iff a \geq 1

Proof.

Forward direction: suppose aNa \in \mathbb{N} is true and a1a \geq 1 is false, we will show a contradiction. By Lemma 16, we have a<1a\lt 1. But also a>0a\gt 0 by Lemma 8 so a is an integer strictly between 00 and 11, contradicting Lemma 14.

Backward direction: suppose a1a \geq 1, then a>1a\gt 1 or a=1a=1. If a=1a=1 then Lemma 9 finishes. If a>1a \gt 1 then since 1N1 \in \mathbb{N} (Lemma 9), we have 1>01\gt 0 (Lemma 8), so a>1,1>0a \gt 1, 1 \gt 0, so a>0a\gt 0 (Lemma 3), then Lemma 8 to finish.

Lemma 18.

ab    a>b1a \geq b \iff a \gt b-1

Proof.
  • Forward direction:

    • First case: if a>ba\gt b then abNa-b \in \mathbb{N}, thus so is (ab)+1(a-b)+1 by Lemma 9 and Closure. This can be rewritten as a(b1)a-(b-1), which implies the result by defn. of ">\gt".
    • Second case: if a=ba=b, then by Lemma 9, 0+1N0+1 \in \mathbb{N}, which equals b(b1)b-(b-1), and so b>b1b\gt b-1. Substitue for the result.
  • Backward direction: suppose a>b1a\gt b-1.

    • Then a(b1)Na-(b-1) \in \mathbb{N}, which can be rearranged to 1(ba)1-(b-a), so ba<1b-a\lt 1.
    • We must have aba \geq b, because if not, then by Lemma 16 a<ba\lt b so 0<ba0 \lt b-a (Lemma 8) so bab-a is an integer strictly between 00 and 11, contradicting Lemma 14.
Lemma 19.

m,nN,mn    mn\forall m,n \in \mathbb{N}, m \mid n \implies m \leq n

Proof.

Write n=mkn = m \cdot k.

Note that k1k \geq 1; else we would have k<1k \lt 1 (Lemma 16) and so kNk \in \mathbb{N} with k<1k \lt 1 (Lemma 12), so 0<k<10\lt k \lt 1 (Lemma 8) contradicting Lemma 14.

Thus k=1k=1 or 1<k1\lt k and we can multiply both sides of the inequality by mm (Lemma 13) to deduce the result.

Lemma 20.

n,xN\forall n,x \in \mathbb{N}, if n=xnn = x \cdot n then x=1x = 1

Proof.

First note that we can't appeal to uniqueness of the multiplicative inverse, because nn is fixed. So suppose x1x \neq 1, we'll show a contradiction.

xN    x>1x \in \mathbb{N} \implies x \gt 1 (Lemma 17 and defn. of "\geq")

    1n<xn\implies 1 \cdot n \lt x \cdot n (defn. of "<\lt" and Lemma 13)

    n<n\implies n \lt n

So nnNn -n \in \mathbb{N}, so 0N0 \in \mathbb{N}, contradicting Trichotomy.

Phew, that was a lot of lemmas! But I hope you agree they're all very fundamental.

In the next part, we'll look at proving our first important theorem - the division algorithm.


  1. I'm gonna skip over the whole "is 0 a natural number" thing. 

  2. These six properties come from the fact that Z\mathbb{Z} is a ring (mathematical structure)

  3. The well-ordering principle is actually equivalent to induction