TANT 1 - An introduction to p-adic numbers (and much more)

Hello everyone! These are notes for the first class of the course "Topics in algebra and number theory" held in Block 4 of the academic year 2017/18 at the University of Copenhagen.

This will be a course on p-adic numbers, so in this first hour I tried to give a little motivation about why one should study these objects in relation to some fundamental questions in number theory.



What do we study in number theory?

We started by asking ourselves: what is number theory? This is of course a big question and there is no hope to answer in a short time. However, one may think that number theory is first and foremost the study of the integers Z. This set can be studied as a subset of the real numbers R by using analytical methods. In this fashion we can ask questions like:
  • how many prime numbers are there? In particular, the strong version of the prime number theorem π(x)=x2dtlog(t)+O(xlog(x)) is equivalent to the Riemann hypothesis, one of the most difficult open problems in mathematics. See this paper of Conrey and this paper of Connes for an overview on different approaches to the Riemann Hypothesis;
  • how close can prime numbers be? Mayor results in this area are for instance the theorem of Green and Tao, which says that there are arbitrary long arithmetic progressions of prime numbers (see this paper of Conlon, Fox and Zhao for an exposition of the proof) and the twin primes conjecture, which asserts that there are infinitely many couples of primes whose difference is two (see this page for the current records on this conjecture);
  • how dense are certain subsets of Z? A big problem along these lines is the conjecture of Goldbach which states that every even integer grater or equal to 4 is a sum of two primes. The best known result in this area is the proof of the fact that every odd integer grater than 5 can be expressed as a sum of three primes (see this book by Harald Helfgott concerning his proof of this result).
Almost all these theorems (and many more) have an heuristic explanation in terms of probabilistic models for the natural numbers (see for instance this blog post by Terence Tao).

We have also another approach to study the integers Z, which is to study them as a ring. Recall in particular that Yoneda's lemma tells us that whenever we have a category C and we want to study an object CC we may as well study the functor Hom(,C):CopSets. Applying this to the category CRings of commutative rings with unity we see that to understand Z as a ring we need to understand all the possible maps RZ, where R is any ring.

This question as it is looks tantalizing, but in fact we shall remember that Z is also the initial object of the category CRings. Thus for every RCRings we have that RZ[{xi}]/({fj}) where {xi} is a set of variables and {fj} is a set of polynomials with integer coefficients in a finite number of the xi's. Now applying the "substitution principle" in commutative algebra (see for example this video by MathProofsable) we see that every map RZ is essentially determined by the integer solutions of a (possibly huge) set of polynomial equations with integer coefficients.

How to (not) solve Diophantine equations

Thus in order to understand Z as a ring we can, as a first step, try to solve Diophantine equations, i.e. find integer solutions to (systems of) polynomial equations f(x1,,xn)Z[x1,,xn]. This might look simpler at first but is in fact a hugely complicated problem. First of all, the famous theorem of Matiyasevich, Robinson, Davis and Putnam says that it is not possible to find an algorithm which, given any Diophantine equation, would decide whether it has a solution or not (see this survey paper by Martin Davis).

Moreover, finding solutions to Diophantine equations is tremendously difficult also for humans as these two famous examples show:
  • the equations xn+yn+zn=0 with nN3 have only solutions x,y,zZ with xyz=0. This is the notorious theorem of Wiles (and many other people) who proved a longstanding conjecture of Fermat (see this survey paper written by Henry Darmon);
  • the equations xayb1=0 with a,bN2 don't have any solution except for the case a=2 and b=3, which has the only solution (x,y)=(3,2). This is the famous conjecture of Catalan, which was proved in 2002 by Preda Mihăilescu (see this survey paper written by Yuri Bilu).
Thus, there is no hope that we will learn in this lecture how to solve all Diophantine equations! Nevertheless we can try to find some methods to solve them in some particular cases. For simplicity we will concentrate on single equations (or, if you prefer, hypersurfaces) of the form f(x1,,xn)=0. Thus given a polynomial f(x1,,xn)Z[x1,,xn] and a ring R we define the set Xf(R):={xRnf(x)=0} and we are interested in particular in the set Xf(Z).

The catch here is that this set is too complicated to be determined with an algorithm (as we have seen) but it is also too simple, i.e. it is just a discrete set with not enough structure to use powerful mathematical tools to study it.

For one thing, Z is not even a field, so we cannot even take denominators in the solutions of our equation. To put this in a different way, we have that the inclusion Xf(Z)Xf(Q) is usually strict. In order to overcome this difficulty we need to take into consideration the homogenization of the polynomial f(x1,,xn). In particular we define hf(x0,x1,,xn):=xd0f(x1x0,,xnx0)Z[x0,,xn] where d is the degree of f. Observe now that for every ring R we have an action of R× on the set Xhf(R) since hf is homogeneous. Thus we define Pf(R):=Xhf(R){0}R× and we observe that in this case Pf(Z)=Pf(Q). 

Remark This is the wrong definition in general. The right definition would be to take the R-points of the scheme Pf:=Proj(Z[x0,,xn](f(x0,,xn))) and the difference between Pf(R) and Pf(R) lies in the Picard group Pic(R) of the scheme Spec(R).

Diving low: reduction modulo primes and local-global principles

So, what do we do now? Well, as a first step we can try to understand whether Pf(Z) or not. In order to do so we can observe that if Pf(Z) then Pf(Z/pZ) for all prime numbers pZ. This gives us a nice necessary condition for Pf(Z) to be non empty which we can easily test with a computer, even by brute force. The problem of course is that we cannot test this for an infinite number of primes, and even if we could do that this wouldn't prove in general that Pf(Z). For example the equation (x2+1)(x3+3x+1)=0 has a root in Z/pZ for every prime p but does not have a root in Z (see this question on Math StackExchange). Observe that the previous polynomial is reducible over Z, because for irreducible polynomials in one variable we can indeed check that Pf(Z/pZ) for all primes pZ to be sure that Pf(Z) (see this question on Math StackExchange).

Is it time to give up already? Maybe not! Indeed, if Pf(Z) then we also have that Pf(Z/prZ) for every prime number pN and every rN. Thus we could hope for the converse to this statement to hold!
Actually, this fails to be true (look for example at the equation x2+y2+z2=0) because we are not taking into account all the primes of Z but only the finite ones. One of the big changes of mentality introduced by the French school (in particular by Weil and Chevalley in the 1940s) is that we should consider also the primes at infinity. We will discover in the following lectures where this name comes from (see here) and how important they are to compactify the spectrum Spec(OK) of the ring of integers of a number field K. In particular for our problem of determining whether or not  Pf(Z)=Pf(Q) this amounts to check also that Pf(R), which is false for f(x,y,z)=x2+y2+z2.

Thus we can try to see if it is true that Pf(Q) whenever Pf(Z/prZ)Pf(R). This holds if f(x) is an irreducible polynomial in one variable or if f(x1,,xn)=ni=1nj=1aijxixjZ[x1,,xn] is a quadratic form. This is the famous theorem of Hasse and Minkowski (see here for a proof) which could make us hope that this holds in general.

Unfortunately... it is not the case. Already for cubic equations, Selmer showed that f(x,y,z)=3x3+4y3+5z3 violates this principle, which is now known as the Hasse principle. Nevertheless, it is extremely interesting to try to determine why the Hasse principle fails for certain equations and why it holds for others.

In order to do so, we need to put together all the information that we have on the various rings Z/prZ. First of all we fix the prime p and we let rN vary. If we do so we can define the ring of p-adic integers Zp:=limrZprZ which is an integral domain with fraction field Qp:=Frac(Zp). Then we take into account all the primes at once, including the prime at infinity, by defining the ring of adeles AQ:=p(Qp:Zp)×R which is a subspace of the "naive" product pQp×R which is too big (i.e. it is not locally compact).

With these new definitions (and in more modern terms) the Hasse principle is the study of the inclusion X(Q)X(AQ) where XSpec(Z) is a projective scheme of finite type (e.g. X=Pf for a polynomial fZ[x1,,xn]). This is related with the Brauer group of the scheme X and is a theme of extremely active research (see these notes of Colliot-Thélène).

Flying up: extensions of Q and the absolute Galois group

Let's go back to our original question, namely to study the integers Z as a ring by looking at all the various sets of solutions Xf(Z) for different polynomials f(x1,,xn)Z[x1,,xn]. We saw that one way of approaching this problem is to reduce modulo (powers of) p to get a smaller set to work with. Another possible way of approaching this question could be to get a bigger and richer set to study. One way to do this is to consider finite extensions of Z, e.g. rings of integers OK of number fields K/Q. In particular if K/Q is a finite, Galois extension then Pf(OK)=Pf(K) and we have an action of the Galois group Gal(K/Q) on the set Pf(K). We can thus hope that studying this action could be easier than just studying the smaller set Pf(Q).

If we want to study all the finite extensions K/Q at once we can consider the inclusion Pf(Q)Pf(¯Q), where ¯Q is an algebraic closure of Q. We have an action of the absolute Galois group GQ:=Gal(¯Q/Q) over Pf(¯Q) and this action is lossless, i.e. Pf(¯Q)GQ=Pf(Q). Thus in order to study Pf(Q) we need to study the group GQ and its representations, i.e. the many possible ways in which it acts on different objects.

This is completely straightforward for irreducible polynomials in one variable f(x)Z[x]. Namely if Kf/Q is the splitting field of f(x) we have that Pf(¯Q)=Pf(Kf) is a finite set of d=deg(f) elements which gives us a continuous group homomorphism
GQGal(Kf/Q)SdGLd(C) where Sd is the symmetric group on d letters and the last map is given by letting Sd act on the subscripts of the canonical basis e1,,ed of Cd. This gives us a tool to study the set Pf(Q) and the extension Kf/Q by studying the complex representations of the finite group Gal(Kf/Q).

We can do this also for polynomials with many variables. If we do so we end up with a continuous map of groups GQGL(CN) because the set Pf(¯Q) would be countably infinite.
This is not so good to study because we don't have a good theory of infinite dimensional complex representations of groups. To overcome this difficulty we can observe that in the one variable case the representation comes from the geometry of Pf(¯Q)=Pf(C) which is a finite and discrete topological space.

To see this recall that for every complex manifold M with a continuous action of a group G we get continuous group representations GGL(Hjsing(M,C)) for every j=0,,2dim(M).
In particular if f(x)Z[x] is irreducible the only cohomology group of the zero dimensional manifold Pf(C)  that it makes sense to consider is H0sing(Pf(C),C)Cd where d=deg(f), and this gives us precisely the representation GQGLn(C) that we defined above.

Thus we can try now to study the higher dimensional complex manifolds Pf(C) for polynomials f(x1,,xn)Z[x1,,xn]. We face two problems:
  • Pf(¯Q)Pf(C) and thus we don't have an action of GQ on this space but an action of the much bigger group AutQ(C);
  • the action of AutQ(C) on Pf(C) is really discontinuous if we consider Pf(C) with the complex topology, and thus doesn't induce an action on cohomology.
For these two reasons (and especially for the second one) we need new cohomology groups to associate to our polynomial f(x1,,xn) which would be better related to Pf(¯Q). It was again an incredible result of the French school (and in particular of Grothendieck and Artin) to define cohomology groups Hjét(X¯Q,¯Qp) using the fields Qp for any prime number p. Here X is a scheme, a new kind of object which is defined over Z and encodes both the geometric and the arithmetic properties of the polynomial f(x1,,xn). As an example we can take X=Pf which is the canonical projective scheme associated to a polynomial. In particular there is an operation of base change which allows us to view this object over extensions of Z, e.g. over ¯Q and over C. We will write respectively X¯Q and XC.

This new cohomology theory is called "étale cohomology" because it is more "calm and rigid" than singular cohomology. In particular these groups are finite dimensional and come with a continuous action of GQ, precisely because they are associated to the base change X¯Q. Moreover, we can recover singular cohomology using étale cohomology! Namely, we get a comparison isomorphism Hjét(X¯Q,¯Qp)Hjsing(X(C),C) where X(C) has the complex topology! In this course we will see only an easy ingredient of this proof, namely the fact that we have a (non canonical) isomorphism ¯QpC.

Appendix: absolute Galois groups and class field theory

We have seen in this short note how the absolute Galois group GQ helps us to solve Diophantine equations. In particular every polynomial f(x1,,xn)Z[x1,,xn] gives us a scheme Pf which in turn gives us some continuous representations GQGL(Hjét((Pf)¯Q,¯Qp))GLhj(C) for every prime number pN and every j=0,,2n, where hj=hj(Pf):=dim(Hjét((Pf)¯Q,¯Qp)).

The group GQ is the "Holy Grail" of modern number theory. More generally to any field κ we can associate an absolute Galois group Gκ which governs the behavior of the algebraic extensions of κ. We know really little about absolute Galois groups of fields in general. For example, the inverse Galois problem asks whether every finite group is a quotient of GQ but we only know this for solvable groups (thanks to the work of Igor Shafarevich), sporadic groups (except, possibly, the Mathieu group M23 ) and not so many more. Moreover, we know how to present the group Gκ only in some special cases:
  • by definition Gκ is trivial if and only if κ is algebraically closed;
  • when κ=R then GκZ/2Z. These two first cases are almost the only cases (up to isomorphism) when Gκ is finite (see this note by Keith Conrad);
  • when κ is finite then GκlimZ/nZ because for every nN any finite field κ has a finite Galois extension of degree nN, and this extension is unique up to isomorphism (see this note by Miller);
  • when κQp is a finite extension with p2 then we have an explicit presentation of Gκ with [κ:Qp]+3 generators (see Theorem 7.5.14 of the book "Cohomology of number fields" by Neukirch, Schmidt and Wingberg).
  • when κ=K(T) is a purely transcendental one dimensional extension of an algebraically closed field K then Gκ is isomorphic to the free group on the set of elements of K. See this paper of Haran and Jarden for a short proof using algebraic methods.
However, giving a presentation of a group is not the only way we have to determine its isomophism class. A deep theorem of Tannaka roughly says that if we have a compact topological group (like Gκ ) then its isomorphism class is determined by its finite dimensional linear representations over a field (for example, over C). Thus we are back to the problem of studying the representations of Gκ which were introduced in the previous section as a tool to understand Diophantine equations!

If we want to do this then the first representations that we should study are characters, i.e. (continuous) group homomorphisms GκC×. These maps factor through the abelian part Gabκ of the group Gκ, i.e. the quotient of Gκ by the subgroup generated by commutators. Describing Gabκ is the subject of class field theory, which gives us a complete answer when κ is a number field. In particular, as we will see, we can define a ring of adeles Aκ also for general number fields. We have an inclusion κAκ and if we define Cκ:=A×κ/κ× then Artin defined a map CκGabκ which is almost an isomorphism (see this note of Poonen for the correct statement).

If κ=Q this amounts to a theorem of Kronecker and Weber which tells us that every abelian extension of Q is contained in a cyclotomic extension Q(ζn) for some nN. Modern proofs of this theorem and all the proofs of class field theory use extensively the theory of finite extensions of the p-adic numbers Qp. Understanding the character theory of GQ is thus yet another reason to study p-adic fields.

Conclusions and an important attribution

I hope to have convinced you that the p-adic numbers Qp are a fundamental tool to study the integers Z as a ring, which is perhaps the most fundamental question in number theory. They help us to package all the information that we get reducing modulo pr for various rN, they help us to study the representations of the absolute Galois group GQ (in particular, they are a fundamental tool in class field theory) and they are a crucial ingredient in the definition of étale cohomology, which gives us a big new source of representations of GQ.

Observe finally that p-adic numbers are very similar to the real numbers, which one uses to study "statistical properties" of Z. Recall that R can be constructed by "filling in the holes" of Q. The key fact here is that this holes are relative to the canonical absolute value ||:QR0. As we will see, there are many more absolute values on Q, and there is one of them canonically associated to any prime number pN. The result of "filling in the holes" which are relative to this new absolute value is the field Qp of p-adic numbers! In particular, we have an inclusion ZQp which allows us to study new "analytic" properties of Z which are relative to the prime p.

Please keep in mind that this class was based on a blog post by Alexander Youcis which is similar to this one. Here you get a little more on analytic number theory and local-global principles, but I didn't have time to treat Legendre's theorem on quadratic forms in detail.

Comments

Popular posts from this blog

A (not quite) perfect(oid) talk

TANT 2 - Absolute values