Skip to content →

Category: number theory

Conway’s big picture

Conway and Norton showed that there are exactly 171 moonshine functions and associated two arithmetic subgroups to them. We want a tool to describe these and here’s where Conway’s big picture comes in very handy. All moonshine groups are arithmetic groups, that is, they are commensurable with the modular group. Conway’s idea is to view several of these groups as point- or set-wise stabilizer subgroups of finite sets of (projective) commensurable 2-dimensional lattices.

Expanding (and partially explaining) the original moonshine observation of McKay and Thompson, John Conway and Simon Norton formulated monstrous moonshine :

To every cyclic subgroup m of the Monster M is associated a function

fm(τ)=1q+a1q+a2q2+ with q=e2πiτ and all coefficients aiZ are characters at m of a representation of M. These representations are the homogeneous components of the so called Moonshine module.

Each fm is a principal modulus for a certain genus zero congruence group commensurable with the modular group Γ=PSL2(Z). These groups are called the moonshine groups.

Conway and Norton showed that there are exactly 171 different functions fm and associated two arithmetic subgroups F(m)E(m)PSL2(R) to them (in most cases, but not all, these two groups coincide).

Whereas there is an extensive literature on subgroups of the modular group (see for instance the series of posts starting here), most moonshine groups are not contained in the modular group. So, we need a tool to describe them and here’s where Conway’s big picture comes in very handy.

All moonshine groups are arithmetic groups, that is, they are subgroups G of PSL2(R) which are commensurable with the modular group Γ=PSL2(Z) meaning that the intersection GΓ is of finite index in both G and in Γ. Conway’s idea is to view several of these groups as point- or set-wise stabilizer subgroups of finite sets of (projective) commensurable 2-dimensional lattices.

Start with a fixed two dimensional lattice L1=Ze1+Ze2=e1,e2 and we want to name all lattices of the form L=v1=ae1+be2,v2=ce1+de2 that are commensurable to L1. Again this means that the intersection LL1 is of finite index in both lattices. From this it follows immediately that all coefficients a,b,c,d are rational numbers.

It simplifies matters enormously if we do not look at lattices individually but rather at projective equivalence classes, that is  L=v1,v2L=v1,v2 if there is a rational number λQ such that  λv1=v1,λv2=v2. Further, we are of course allowed to choose a different ‘basis’ for our lattices, that is,  L=v1,v2=w1,w2 whenever  (w1,w2)=(v1,v2).γ for some γPSL2(Z).
Using both operations we can get any lattice in a specific form. For example,

12e1+3e2,e113e2=(1)3e1+18e2,6e12e2=(2)3e1+18e2,38e2=(3)338e1+919e2,e2

Here, identities (1) and (3) follow from projective equivalence and identity (2) from a base-change. In general, any lattice L commensurable to the standard lattice L1 can be rewritten uniquely as L=Me1+ghe2,e2 where M a positive rational number and with 0gh<1.

Another major feature is that one can define a symmetric hyper-distance between (equivalence classes of) such lattices. Take L=Me1+ghe2,e2 and L=Ne1+ije2,e2 and consider the matrix

DLL=[Mgh01][Nij01]1 and let α be the smallest positive rational number such that all entries of the matrix α.DLL are integers, then

δ(L,L)=det(α.DLL)N defines a symmetric hyperdistance which depends only of the equivalence classes of lattices (hyperdistance because the log of it behaves like an ordinary distance).

Conway’s big picture is the graph obtained by taking as its vertices the equivalence classes of lattices commensurable with L1 and with edges connecting any two lattices separated by a prime number hyperdistance. Here’s part of the 2-picture, that is, only depicting the edges of hyperdistance 2.



The 2-picture is an infinite 3-valent tree as there are precisely 3 classes of lattices at hyperdistance 2 from any lattice L=v1,v2 namely (the equivalence classes of) 12v1,v2 , v1,12v2 and 12(v1+v2),v2.

Similarly, for any prime hyperdistance p, the p-picture is an infinite p+1-valent tree and the big picture is the product over all these prime trees. That is, two lattices at square-free hyperdistance N=p1p2pk are two corners of a k-cell in the big picture!
(Astute readers of this blog (if such people exist…) may observe that Conway’s big picture did already appear here prominently, though in disguise. More on this another time).

The big picture presents a simple way to look at arithmetic groups and makes many facts about them visually immediate. For example, the point-stabilizer subgroup of L1 clearly is the modular group PSL2(Z). The point-stabilizer of any other lattice is a certain conjugate of the modular group inside PSL2(R). For example, the stabilizer subgroup of the lattice LN=Ne1,e2 (at hyperdistance N from L1) is the subgroup

[abNNcd] | [abcd]PSL2(Z) 

Now the intersection of these two groups is the modular subgroup Γ0(N) (consisting of those modular group element whose lower left-hand entry is divisible by N). That is, the proper way to look at this arithmetic group is as the joint stabilizer of the two lattices L1,LN. The picture makes it trivial to compute the index of this subgroup.

Consider the ball B(L1,N) with center L1 and hyper-radius N (on the left, the ball with hyper-radius 4). Then, it is easy to show that the modular group acts transitively on the boundary lattices (including the lattice LN), whence the index [Γ:Γ0(N)] is just the number of these boundary lattices. For N=4 the picture shows that there are exactly 6 of them. In general, it follows from our knowledge of all the p-trees the number of all lattices at hyperdistance N from L1 is equal to Np|N(1+1p), in accordance with the well-known index formula for these modular subgroups!

But, there are many other applications of the big picture giving a simple interpretation for the Hecke operators, an elegant proof of the Atkin-Lehner theorem on the normalizer of Γ0(N) (the whimsical source of appearances of the number 24) and of Helling’s theorem characterizing maximal arithmetical groups inside PSL2(C) as conjugates of the normalizers of Γ0(N) for square-free N.
J.H. Conway’s paper “Understanding groups like Γ0(N)” containing all this material is a must-read! Unfortunately, I do not know of an online version.

Comments closed

On2 : extending Lenstra’s list

We have seen that John Conway defined a nim-addition and nim-multiplication on the ordinal numbers in such a way that the subfield [ωωω]F2 is the algebraic closure of the field on two elements. We’ve also seen how to do actual calculations in that field provided we can determine the mystery elements αp, which are the smallest ordinals not being a p-th power of ordinals lesser than [ωωk1] if p is the k+1-th prime number.

Hendrik Lenstra came up with an effective method to compute these elements αp requiring a few computations in certain finite fields. I’ll give a rundown of his method and refer to his 1977-paper “On the algebraic closure of two” for full details.

For any ordinal α<ωωω define its degree d(α) to be the degree of minimal polynomial for α over F2=[2] and for each prime number p let f(p) be the smallest number h such that p is a divisor of 2h1 (clearly f(p) is a divisor of p1).

In the previous post we have already defined ordinals κpk=[ωωk1.pn1] for prime-power indices, but we now need to extend this definition to allow for all indices. So. let h be a natural number, p the smallest prime number dividing h and q the highest power of p dividing h. Let g=[h/q], then Lenstra defines

κh={κq if q divides d(κq)  andκg+κq=[κg+κq] otherwise

With these notations, the main result asserts the existence of natural numbers m,m such that

αp=[κf(p)+m]=[κf(p)]+m

Now, assume by induction that we have already determined the mystery numbers αr for all odd primes r<p, then by teh argument of last time we can effectively compute in the field [κp]. In particular, we can compute for every element its multiplicative order ord(β) and therefore also its degree d(β) which has to be the smallest number h such that ord(β) divides [2h1].

Then, by the main result we only have to determine the smallest number m such that β=[κf(p)+m] is not a p-th power in κp which is equivalent to the condition that

β(2d(β)1)/p1 if p divides [2d(β)1]

All these conditions can be verified within suitable finite fields and hence are effective. In this manner, Lenstra could extend Conway’s calculations (probably using a home-made finite field program running on a slow 1977 machine) :

[tex]pf(p)αp32[2]54[4]73[ω]+11110[ωω]+11312[ω]+4178[16]1918[ω3]+42311[ωω3]+12928[ωω2]+4315[ωω]+13736[ω3]+44120[ωω]+14314[ωω2]+1[/tex]

Right, so let’s try the case p=47. To begin, f(47)=23 whence we have to determine the smallest field containg κ23. By induction (Lenstra’s tabel) we know already that

κ2323=κ11+1=[ωω3]+1 and κ1111=κ5+1=[ωω]+1 and κ55=[4]

Because the smallest field containg 4 is [16]=F24 we have that F2(4,κ5,κ11)F2220. We can construct this finite field, together with a generator a of its multiplicative group in Sage via


sage: f1.< a >=GF(2^220)

In this field we have to pinpoint the elements 4,κ5 and κ11. As 4 has order 15 in F24 we know that κ5 has order 75. Hence we can take κ5=a(22201)/75 and then 4=κ55.

If we denote κ5 by x5 we can obtain κ11 as x11 by the following sage-commands


sage: c=x5+1

sage: x11=c.nth_root(11)

It takes about 7 minutes to find x11 on a 2.4 GHz MacBook. Next, we have to set up the field extension determined by κ23 (which we will call x in sage). This is done as follows


sage: p1.=PolynomialRing(f1)

sage: f=x^23-x11-1

sage: F2=f1.extension(f,'u')

The MacBook needed 8 minutes to set up this field which is isomorphic to F25060. The relevant number is therefore n=25060147 which is the gruesome

34648162040462867047623719793206539850507437636617898959901744136581<br/>
259144476645069239839415478030722644334257066691823210120548345667203443<br/>
317743531975748823386990680394012962375061822291120459167399032726669613
<br/>
442804392429947890878007964213600720766879334103454250982141991553270171

938532417844211304203805934829097913753132491802446697429102630902307815

301045433019807776921086247690468136447620036910689177286910624860871748

150613285530830034500671245400628768674394130880959338197158054296625733

206509650361461537510912269982522844517989399782602216622257291361930850

885916974186835958466930689748400561295128553674118498999873244045842040

080195019701984054428846798610542372150816780493166669821114184374697446

637066566831036116390063418916814141753876530004881539570659100352197393

997895251223633176404672792711603439161147155163219282934597310848529360

118189507461132290706604796116111868096099527077437183219418195396666836

014856037176421475300935193266597196833361131333604528218621261753883518

667866835204501888103795022437662796445008236823338104580840186181111557

498232520943552183185687638366809541685702608288630073248626226874916669

186372183233071573318563658579214650042598011275864591248749957431967297

975078011358342282941831582626985121760847852546207377440873367589369439

085660784239080183415569559585998884824991911321095149718147110882474280

968166266224151511519773175933506503369761671964823112231808283557885030

984081329986188655169245595411930535264918359325712373064120338963742590

76555755141425

Remains ‘only’ to take x,x+1,etc. to the n-th power and verify which is the first to be unequal to 1. For this it is best to implement the usual powering trick (via digital expression of the exponent) in the field F2, something like


sage: def power(e,n):
...: le=n.bits()
...: v=n.digits()
...: mn=F2(e)
...: out=F2(1)
...: i=0
...: while i< le :
...: if v[i]==1 : out=F2(out_mn)
...: m=F2(mn_mn)
...: mn=F2(m)
...: i=i+1
...: return(out)
...:

then it takes about 20 seconds to verify that power(x,n)=1 but that power(x+1,n) is NOT! That is, we just checked that α47=κ11+1.

It turns out that 47 is the hardest nut to crack, the following primes are easier. Here’s the data (if I didn’t make mistakes…)

[tex]pf(p)αp4723[ωω7]+15352[ωω4]+15958[ωω8]+16160[ωω]+[ω]6766[ωω3]+[ω][/tex]

It seems that Magma is substantially better at finite field arithmetic, so if you are lucky enough to have it you’ll have no problem finding αp for all primes less than 100 by the end of the day. If you do, please drop a comment with the results…

Comments closed

On2 : Conway’s nim-arithmetics

Last time we did recall Cantor’s addition and multiplication on ordinal numbers. Note that we can identify an ordinal number α with (the order type of) the set of all strictly smaller ordinals, that is, α=α : α<α. Given two ordinals α and β we will denote their Cantor-sums and products as [α+β] and [α.β].

The reason for these square brackets is that John Conway constructed a well behaved nim-addition and nim-multiplication on all ordinals On2 by imposing the ‘simplest’ rules which make On2 into a field. By this we mean that, in order to define the addition α+β we must have constructed before all sums α+β and α+β with α<α and β<β. If + is going to be a well-defined addition on On2 clearly α+β cannot be equal to one of these previously constructed sums and the ‘simplicity rule’ asserts that we should take α+β the least ordinal different from all these sums α+β and α+β. In symbols, we define

α+β=mexα+β,α+β | α<α,β<β

where mex stands for ‘minimal excluded value’. If you’d ever played the game of Nim you will recognize this as the Nim-addition, at least when α and β are finite ordinals (that is, natural numbers) (to nim-add two numbers n and m write them out in binary digits and add without carrying). Alternatively, the nim-sum n+m can be found applying the following two rules :

  • the nim-sum of a number of distinct 2-powers is their ordinary sum (e.g. 8+4+1=13, and,
  • the nim-sum of two equal numbers is 0.

So, all we have to do is to write numbers n and m as sums of two powers, scratch equal terms and add normally. For example, 13+7=(8+4+1)+(4+2+1)=8+2=10 (of course this is just digital sum without carry in disguise).

Here’s the beginning of the nim-addition table on ordinals. For example, to define 13+7 we have to look at all values in the first 7 entries of the row of 13 (that is, 13,12,15,14,9,8,11) and the first 13 entries in the column of 7 (that is, 7,6,5,4,3,2,1,0,15,14,13,12,11) and find the first number not included in these two sets (which is indeed 10).

In fact, the above two rules allow us to compute the nim-sum of any two ordinals. Recall from last time that every ordinal can be written uniquely as as a finite sum of (ordinal) 2-powers :
α=[2α0+2α1++2αk], so to determine the nim-sum α+β we write both ordinals as sums of ordinal 2-powers, delete powers appearing twice and take the Cantor ordinal sum of the remaining sum.

Nim-multiplication of ordinals is a bit more complicated. Here’s the definition as a minimal excluded value

α.β=mexα.β+α.βα.β

for all α<α,β<β. The rationale behind this being that both αα and ββ are non-zero elements, so if On2 is going to be a field under nim-multiplication, their product should be non-zero (and hence strictly greater than 0), that is,  (αα).(ββ)>0. Rewriting this we get α.β>α.β+α.βα.β and again the ‘simplicity rule’ asserts that α.β should be the least ordinal satisfying all these inequalities, leading to the mex-definition above. The table gives the beginning of the nim-multiplication table for ordinals. For finite ordinals n and m there is a simple 2 line procedure to compute their nim-product, similar to the addition-rules mentioned before :

  • the nim-product of a number of distinct Fermat 2-powers (that is, numbers of the form 22n) is their ordinary product (for example, 16.4.2=128), and,
  • the square of a Fermat 2-power is its sesquimultiple (that is, the number obtained by multiplying with 112 in the ordinary sense). That is, 22=3,42=6,162=24,

Using these rules, associativity and distributivity and our addition rules it is now easy to work out the nim-multiplication n.m : write out n and m as sums of (multiplications by 2-powers) of Fermat 2-powers and apply the rules. Here’s an example

5.9=(4+1).(4.2+1)=42.2+4.2+4+1=6.2+8+4+1=(4+2).2+13=4.2+22+13=8+3+13=6

Clearly, we’d love to have a similar procedure to calculate the nim-product α.β of arbitrary ordinals, or at least those smaller than ωωω (recall that Conway proved that this ordinal is isomorphic to the algebraic closure F2 of the field of two elements). From now on we restrict to such ‘small’ ordinals and we introduce the following special elements :

κ2n=[22n1] (these are the Fermat 2-powers) and for all primes p>2 we define
κpn=[ωωk1.pn1] where k is the number of primes strictly smaller than p (that is, for p=3 we have k=1, for p=5, k=2 etc.).

Again by associativity and distributivity we will be able to multiply two ordinals <ωωω if we know how to multiply a product

[ωα.2n0].[ωβ.2m0] with α,β<[ωω] and n0,m0N.

Now, α can be written uniquely as [ωt.nt+ωt1.nt1++ω.n2+n1] with t and all ni natural numbers. Write each nk in base p where p is the k+1-th prime number, that is, we have for n0,n1,,nt an expression

nk=[jpj.m(j,k)] with 0m(j,k)<p

The point of all this is that any of the special elements we want to multiply can be written as a unique expression as a decreasing product

[ωα.2n0]=[qκqm(q)]

where q runs over all prime powers. The crucial fact now is that for this decreasing product we have a rule similar to addition of 2-powers, that is Conway-products coincide with the Cantor-products

[qκqm(q)]=qκqm(q)

But then, using associativity and commutativity of the Conway-product we can ‘nearly’ describe all products [ωα.2n0].[ωβ.2m0]. The remaining problem being that it may happen that for some q we will end up with an exponent m(q)+m(q)>p. But this can be solved if we know how to take p-powers. The rules for this are as follows

 (κ2n)2=κ2n+1i<nκ2i, for 2-powers, and,

 (κpn)p=κpn1 for a prime p>2 and for n2, and finally

 (κp)p=αp for a prime p>2, where αp is the smallest ordinal <κp which cannot be written as a p-power βp with β<κp. Summarizing : if we will be able to find these mysterious elements αp for all prime numbers p, we are able to multiply in [ωωω]=F2.

Let us determine the first one. We have that κ3=ω so we are looking for the smallest natural number n<ω which cannot be written in num-multiplication as n=m3 for m<ω (that is, also m a natural number). Clearly 1=13 but what about 2? Can 2 be a third root of a natural number wrt. nim-multiplication? From the tabel above we see that 2 has order 3 whence its cube root must be an element of order 9. Now, the only finite ordinals that are subfields of On2 are precisely the Fermat 2-powers, so if there is a finite cube root of 2, it must be contained in one of the finite fields [22n] (of which the mutiplicative group has order 22n1 and one easily shows that 9 cannot be a divisor of any of the numbers 22n1, that is, 2 doesn’t have a finte 3-th root in nim! Phrased differently, we found our first mystery number α3=2. That is, we have the marvelous identity in nim-arithmetic

ω3=2

Okay, so what is α5? Well, we have κ5=[ωω] and we have to look for the smallest ordinal which cannot be written as a 5-th root. By inspection of the finite nim-table we see that 1,2 and 3 have 5-th roots in ω but 4 does not! The reason being that 4 has order 15 (check in the finite field [16]) and 25 cannot divide any number of the form 22n1. That is, α5=4 giving another crazy nim-identity

 (ωω)5=4

And, surprises continue to pop up… Conway showed that α7=ω+1 giving the nim-identity  (ωω2)7=ω+1. The proof of this already uses some clever finite field arguments. Because 7 doesn’t divide any number 22n1, none of the finite subfields [22n] contains a 7-th root of unity, so the 7-power map is injective whence surjective, so all finite ordinal have finite 7-th roots! That is, α7ω. Because ω lies in a cubic extension of the finite field [4], the field generated by ω has 64 elements and so its multiplicative group is cyclic of order 63 and as ω has order 9, it must be a 7-th power in this field. But, as the only 7th powers in that field are precisely the powers of ω and by inspection ω+1 is not a 7-th power in that field (and hence also not in any field extension obtained by adjoining square, cube and fifth roots) so α7=ω+1.

Conway did stop at α7 but I’ve always been intrigued by that one line in ONAG p.61 : “Hendrik Lenstra has computed αp for p43”. Next time we will see how Lenstra managed to do this and we will use sage to extend his list a bit further, including the first open case : α47=ωω7+1.

For an enjoyable video on all of this, see Conway’s MSRI lecture on Infinite Games. The nim-arithmetic part is towards the end of the lecture but watching the whole video is a genuine treat!

Comments closed