Tag: Connes

On2 : transfinite number hacking

Published January 8, 2009 by lieven


Over the last week I’ve been playing a bit with Sage to prove a few exotic identities involving ordinal numbers. Here’s one of them ( $ω$ is the first infinite ordinal number, that is, $ω = \{ 0, 1, 2, \dots \}$ ),

$(ω^{ω^{13}})^{47} = ω^{ω^{7}} + 1$

answering a question in Hendrik Lenstra’s paper Nim multiplication.

However, it will take us a couple of posts before we get there. Let’s begin by trying to explain what brought this on. On September 24th, 2008 there was a meeting, intended for a general public, called À la rencontre des déchiffreurs, celebrating the 50th birthday of the IHES.

One of the speakers was Alain Connes and the official title of his talk was “L’ange de la géométrie, le diable de l’algèbre et le corps à un élément” (the angel of geometry, the devil of algebra and the field with one element). Instead, he talked about a seemingly trivial problem : what is the algebraic closure of $F_{2}$ , the field with two elements? My only information about the actual content of the talk comes from the following YouTube-blurb

Alain argues that we do not have a satisfactory description of ${\overset{―}{F}}_{2}$ , the algebraic closure of $F_{2}$ . Naturally, it is the union (or rather, limit) of all finite fields $F_{2^{n}}$ , but, there are too many non-canonical choices to make here.

Recall that $F_{2^{k}}$ is a subfield of $F_{2^{l}}$ if and only if $k$ is a divisor of $l$ and so we would have to take the direct limit over the integers with respect to the divisibility relation… Of course, we can replace this by an increasing sequence of a selection of cofinal fields such as

$F_{2^{1!}} \subset F_{2^{2!}} \subset F_{2^{3!}} \subset \dots$

But then, there are several such suitable sequences! Another ambiguity comes from the description of $F_{2^{n}}$ . Clearly it is of the form $F_{2} [x] / (f (x))$ where $f (x)$ is a monic irreducible polynomial of degree $n$ , but again, there are several such polynomials. An attempt to make a canonical choice of polynomial is to take the ‘first’ suitable one with respect to some natural ordering on the polynomials. This leads to the so-called Conway polynomials.
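To see the ambiguity concretely, here is a minimal Python sketch that lists the monic irreducible polynomials of small degree over $F_{2}$ by plain trial division. The bitmask encoding and the function names are choices made for this illustration only, and the code makes no attempt at the compatibility conditions that single out a Conway polynomial.

```python
# Polynomials over F_2 are encoded as integers: bit i is the coefficient of x^i,
# so 0b1011 stands for x^3 + x + 1. (This encoding is an illustrative choice.)

def poly_mod(a, b):
    """Remainder of a modulo b in F_2[x] (b must be nonzero)."""
    db = b.bit_length()
    while a.bit_length() >= db:
        a ^= b << (a.bit_length() - db)  # subtracting is xor in characteristic 2
    return a

def is_irreducible(f):
    """Trial division by every polynomial of degree 1 .. deg(f) // 2."""
    n = f.bit_length() - 1
    if n < 1:
        return False
    return all(poly_mod(f, g) != 0 for g in range(2, 1 << (n // 2 + 1)))

def irreducibles(n):
    """All monic irreducible polynomials of degree n over F_2."""
    return [f for f in range(1 << n, 1 << (n + 1)) if is_irreducible(f)]

for n in range(1, 7):
    print(n, [bin(f) for f in irreducibles(n)])
```

Already in degree 3 there are two candidates, $x^{3} + x + 1$ and $x^{3} + x^{2} + 1$ , and the number of choices grows quickly with the degree.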

Conway polynomials for the prime $2$ have only been determined up to degree 400-something, so in the increasing sequence above we would already be stuck at the sixth term $F_{2^{6!}}$ …

So, what Alain Connes sets as a problem is to find another, more canonical, description of ${\overset{―}{F}}_{2}$ . The problem is not without real-life interest as most finite fields appearing in cryptography or coding theory are subfields of ${\overset{―}{F}}_{2}$ .

(My guess is that Alain originally wanted to talk about the action of the Galois group on the roots of unity, which would be the corresponding problem over the field with one element and would explain the title of the talk, but decided against it. If anyone knows what ‘coupling-problem’ he is referring to, please drop a comment.)

Surely, Connes is aware of the fact that there exists a nice canonical recursive construction of ${\overset{―}{F}}_{2}$ due to John Conway, using Georg Cantor’s ordinal numbers.

In fact, in chapter 6 of his book On Numbers And Games, John Conway proves that the symmetric version of his recursive definition of addition and multiplication on the surreal numbers makes the class $On$ of all Cantor’s ordinal numbers into an algebraically closed Field of characteristic two : ${On}_{2}$ (pronounced ‘Onto’), and, in particular, he identifies a subfield

${\overset{―}{F}}_{2} ≃ [ω^{ω^{ω}}]$

with the algebraic closure of $F_{2}$ . What makes all of this somewhat confusing is that Cantor had already defined a (badly behaving) addition, multiplication and exponentiation on ordinal numbers. To distinguish between the Cantor/Conway arithmetics, Conway (and later Lenstra) adopt the convention that any expression between square brackets refers to Cantor-arithmetic and un-squared ones to Conway’s. So, in the description of the algebraic closure just given $[ω^{ω^{ω}}]$ is the ordinal defined by Cantor-exponentiation, whereas the exotic identity we started out with refers to Conway’s arithmetic on ordinal numbers.

Let’s recall briefly Cantor’s ordinal arithmetic. An ordinal number $α$ is the order-type of a well-ordered set (a totally ordered set in which every non-empty subset has a least element), that is, if there is an order preserving bijection between two such sets then they have the same ordinal number (or you might view $α$ itself as a well-ordered set, namely the set of all strictly smaller ordinal numbers, so e.g. $0 = \emptyset, 1 = \{ 0 \}, 2 = \{ 0, 1 \}, \dots$ ).

For two ordinals $α$ and $β$ , the addition $[α + β]$ is the order-type of the totally ordered set $α ⊔ β$ (the disjoint union) ordered compatibly with the total orders in $α$ and $β$ and such that every element of $β$ is strictly greater than any element from $α$ . Observe that this definition depends on the order of the two summands. For example, $[1 + ω] = ω$ as there is an order preserving bijection $\{ \tilde{0}, 0, 1, 2, \dots \} \to \{ 0, 1, 2, 3, \dots \}$ by $\tilde{0} \mapsto 0, n \mapsto n + 1$ . However, $ω \neq [ω + 1]$ as there can be no order preserving bijection $\{ 0, 1, 2, \dots \} \to \{ 0, 1, 2, \dots, 0_{max} \}$ as the first set has no maximal element whereas the second one does. So, Cantor’s addition has the bad property that it may be that $[α + β] \neq [β + α]$ .

The Cantor-multiplication $[α . β]$ is the order-type of the product-set $α \times β$ ordered via the last differing coordinate. Again, this product has the bad property that it may happen that $[α . β] \neq [β . α]$ (for example $[2 . ω] = ω$ whereas $[ω . 2] = [ω + ω]$ ). Finally, the exponential $[β^{α}]$ is the order type of the set of all maps $f : α \to β$ such that $f (a) \neq 0$ for only finitely many $a \in α$ , and ordered via the last differing function-value.
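The non-commutativity is easy to play with on a computer. Below is a small Python sketch of Cantor’s addition and multiplication, restricted (as a simplifying assumption) to ordinals below $[ω^{ω}]$ , written as lists of pairs (exponent, coefficient) standing for $[ω^{e} . c]$ with strictly decreasing natural exponents. The representation and the names `cantor_add` and `cantor_mul` are invented for this illustration.

```python
# An ordinal below [omega^omega] is a list of (exponent, coefficient) pairs,
# exponents strictly decreasing naturals, coefficients >= 1; [] is 0.

def cantor_add(a, b):
    """Cantor sum [a + b]: terms of a below the leading term of b are absorbed."""
    if not b:
        return list(a)
    lead = b[0][0]
    head = [(e, c) for e, c in a if e > lead]
    same = [c for e, c in a if e == lead]
    if same:
        return head + [(lead, same[0] + b[0][1])] + list(b[1:])
    return head + list(b)

def cantor_mul(a, b):
    """Cantor product [a . b], using [a . (b1 + b2)] = [a . b1 + a . b2]."""
    if not a or not b:
        return []
    a0, c0 = a[0]
    result = []
    for e, n in b:
        if e > 0:
            term = [(a0 + e, n)]                  # [a . omega^e . n]
        else:
            term = [(a0, c0 * n)] + list(a[1:])   # [a . n] for finite n
        result = cantor_add(result, term)
    return result

ONE, TWO, OMEGA = [(0, 1)], [(0, 2)], [(1, 1)]

print(cantor_add(ONE, OMEGA), cantor_add(OMEGA, ONE))
print(cantor_mul(TWO, OMEGA), cantor_mul(OMEGA, TWO))
```

The first line prints the normal forms of $[1 + ω] = ω$ and $[ω + 1]$ , the second those of $[2 . ω] = ω$ and $[ω . 2]$ .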

Cantor’s arithmetic allows normal-forms for ordinal numbers. More precisely, with respect to any ordinal number $γ \geq 2$ , every ordinal number $α \geq 1$ has a unique expression as

$α = [γ^{α_{0}} . η_{0} + γ^{α_{1}} . η_{1} + \dots + γ^{α_{m}} . η_{m}]$

for some natural number $m$ and such that $α \geq α_{0} > α_{1} > \dots > α_{m} \geq 0$ and all $1 \leq η_{i} < γ$ . In particular, taking the special cases $γ = 2$ and $γ = ω$ , we have the following two canonical forms for any ordinal number $α$

$[2^{α_{0}} + 2^{α_{1}} + \dots + 2^{α_{m}}] = α = [ω^{β_{0}} . n_{0} + ω^{β_{1}} . n_{1} + \dots + ω^{β_{k}} . n_{k}]$

with $m, k, n_{i}$ natural numbers and $α \geq α_{0} > α_{1} > \dots > α_{m} \geq 0$ and $α \geq β_{0} > β_{1} > \dots > β_{k} \geq 0$ . Both canonical forms will be important when we consider the (better behaved) Conway-arithmetic on ${On}_{2}$ , next time.
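As a small worked example (easily checked from the definitions above), take $α = [ω + 5]$ . The finitely supported maps $ω \to 2$ , ordered via the last differing value, are just the binary expansions of the natural numbers, so $[2^{ω}] = ω$ . Hence the two canonical forms of $α$ are

$[2^{ω} + 2^{2} + 2^{0}] = [ω + 5] = [ω^{1} . 1 + ω^{0} . 5]$

with exponents $ω > 2 > 0$ on the left and $1 > 0$ on the right.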


Mumford’s treasure map

Published December 13, 2008 by lieven

David Mumford received, earlier this year, the 2007 AMS Leroy P. Steele Prize for Mathematical Exposition. The jury honors Mumford for “his beautiful expository accounts of a host of aspects of algebraic geometry”. Not surprisingly, the first work they mention are his mimeographed notes of the first 3 chapters of a course in algebraic geometry, usually called “Mumford’s red book” because the notes were wrapped in a red cover. In 1988, the notes were reprinted by Springer-Verlag. Unfortunately, the only red they preserved was in the title.

The AMS describes the importance of the red book as follows. “This is one of the few books that attempt to convey in pictures some of the highly abstract notions that arise in the field of algebraic geometry. In his response upon receiving the prize, Mumford recalled that some of his drawings from The Red Book were included in a collection called Five Centuries of French Mathematics. This seemed fitting, he noted: “After all, it was the French who started impressionist painting and isn’t this just an impressionist scheme for rendering geometry?””

These days it is perfectly possible to get a good grasp on difficult concepts from algebraic geometry by reading blogs, watching YouTube or plugging in equations to sophisticated math-programs. In the early seventies though, if you wanted to know what Grothendieck’s scheme-revolution was all about you had no choice but to wade through the EGA’s and SGA’s and they were notorious for being extremely user-unfriendly regarding illustrations…

So the few depictions of schemes available, drawn by people sufficiently fluent in Grothendieck’s new geometric language had no less than treasure-map-cult-status and were studied in minute detail. Mumford’s red book was a gold mine for such treasure maps. Here’s my favorite one, scanned from the original mimeographed notes (it looks somewhat tidier in the Springer-version)

It is the first depiction of $spec (Z [x])$ , the affine scheme of the ring $Z [x]$ of all integral polynomials. Mumford calls it the “arithmetic surface” as the picture resembles the one he made before of the affine scheme $spec (C [x, y])$ corresponding to the two-dimensional complex affine space $A_{C}^{2}$ . Mumford adds that the arithmetic surface is ‘the first example which has a real mixing of arithmetic and geometric properties’.

Let’s have a closer look at the treasure map. It introduces some new signs which must have looked exotic at the time, but have since become standard tools to depict algebraic schemes.

For starters, recall that the underlying topological space of $spec (Z [x])$ is the set of all prime ideals of the integral polynomial ring $Z [x]$ , so the map tries to list them all as well as their inclusions/intersections.

The doodle in the upper right corner depicts the ‘generic point’ of the scheme. That is, the geometric object corresponding to the prime ideal $(0)$ (note that $Z [x]$ is an integral domain). Because the zero ideal is contained in any other prime ideal, the algebraic/geometric mantra (“inclusions reverse when shifting between algebra and geometry”) asserts that the geometric object corresponding to $(0)$ should contain all other geometric objects of the arithmetic plane, so it is just the whole plane! Clearly, it is rather senseless to depict this fact by coloring the whole plane black as then we wouldn’t be able to see the finer objects. Mumford’s solution to this is to draw a hairy ball, which in this case, is sufficiently thick to include fragments going in every possible direction. In general, one should read these doodles as saying that the geometric object represented by this doodle contains all other objects seen elsewhere in the picture if the hairy-ball-doodle includes stuff pointing in the direction of the smaller object. So, in the case of the object corresponding to $(0)$ , the doodle has pointers going everywhere, saying that the geometric object contains all other objects depicted.

Let’s move over to the doodles in the lower right-hand corner. They represent the geometric object corresponding to principal prime ideals of the form $(p (x))$ , where $p (x)$ is an irreducible polynomial over the integers, that is, a polynomial which we cannot write as the product of two smaller integral polynomials. The objects corresponding to such prime ideals should be thought of as ‘horizontal’ curves in the plane.

The doodles depicted correspond to the prime ideal $(x)$ , containing all polynomials divisible by $x$ so when we divide it out we get, as expected, a domain $Z [x] / (x) ≃ Z$ , and the one corresponding to the ideal $(x^{2} + 1)$ , containing all polynomials divisible by $x^{2} + 1$ , which can be proved to be a prime ideal of $Z [x]$ by observing that after factoring out we get $Z [x] / (x^{2} + 1) ≃ Z [i]$ , the domain of all Gaussian integers $Z [i]$ . The corresponding doodles (the ‘generic points’ of the curvy-objects) have a predominant horizontal component as they have to express the fact that they depict horizontal curves in the plane. It is no coincidence that the doodle of $(x^{2} + 1)$ is somewhat bulkier than the one of $(x)$ as the latter one must only depict the fact that all points lying on the straight line to its left belong to it, whereas the former one must claim inclusion of all points lying on the ‘quadric’ it determines.

Apart from these ‘horizontal’ curves, there are also ‘vertical’ lines corresponding to the principal prime ideals $(p)$ , containing the polynomials all of whose coefficients are divisible by the prime number $p$ . These are indeed prime ideals of $Z [x]$ , because their quotients
$Z [x] / (p) ≃ (Z / p Z) [x]$ are domains, being the rings of polynomials over the finite fields $Z / p Z = F_{p}$ . The doodles corresponding to these prime ideals have a predominant vertical component (depicting the ‘vertical’ lines) and have a uniform thickness for all prime numbers $p$ as each of them only has to claim ownership of the points lying on the vertical line under them.

Right! So far we managed to depict the zero prime ideal (the whole plane) and the principal prime ideals of $Z [x]$ (the horizontal curves and the vertical lines). It remains to depict the maximal ideals. These are all known to be of the form
$m = (p, f (x))$
where $p$ is a prime number and $f (x)$ is an irreducible integral polynomial, which remains irreducible when reduced modulo $p$ (that is, if we reduce all coefficients of the integral polynomial $f (x)$ modulo $p$ we obtain an irreducible polynomial in $F_{p} [x]$ ). By the algebra/geometry mantra mentioned before, the geometric object corresponding to such a maximal ideal can be seen as the ‘intersection’ of a horizontal curve (the object corresponding to the principal prime ideal $(f (x))$ ) and a vertical line (corresponding to the prime ideal $(p)$ ). Because maximal ideals do not contain any other prime ideals, there is no reason to have a doodle associated to $m$ and we can just depict it by a “point” in the plane, more precisely the intersection-point of the horizontal curve with the vertical line determined by $m = (p, f (x))$ . Still, Mumford’s treasure map doesn’t treat all “points” equally. For example, the point corresponding to the maximal ideal $m_{1} = (3, x + 2)$ is depicted by a solid dot $.$ , whereas the point corresponding to the maximal ideal $m_{2} = (3, x^{2} + 1)$ is represented by a fatter point $\circ$ . The distinction between the two ‘points’ becomes evident when we look at the corresponding quotients (which we know have to be fields). We have

$Z [x] / m_{1} = Z [x] / (3, x + 2) = (Z / 3 Z) [x] / (x + 2) = Z / 3 Z = F_{3}$ whereas $Z [x] / m_{2} = Z [x] / (3, x^{2} + 1) = (Z / 3 Z) [x] / (x^{2} + 1) = F_{3} [x] / (x^{2} + 1) = F_{3^{2}}$

Because the polynomial $x^{2} + 1$ remains irreducible over $F_{3}$ , the quotient $F_{3} [x] / (x^{2} + 1)$ is no longer the prime-field $F_{3}$ but a quadratic field extension of it, that is, the finite field consisting of 9 elements $F_{3^{2}}$ . That is, we represent the ‘points’ lying on the vertical line corresponding to the principal prime ideal $(p)$ by a solid dot . when their quotient (aka residue field) is the prime field $F_{p}$ , by a bigger point $\circ$ when its residue field is the finite field $F_{p^{2}}$ , by an even fatter point $◯$ when its residue field is $F_{p^{3}}$ and so on, and on. The larger the residue field, the ‘fatter’ the corresponding point.
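Which kind of point the curve $(x^{2} + 1)$ cuts out on a given vertical line $(p)$ is easily tested. Here is a minimal Python sketch (the helper `roots_mod_p` is a name made up for this illustration) relying on the fact that a polynomial of degree 2 or 3 over $F_{p}$ is irreducible exactly when it has no root in $F_{p}$ .

```python
def roots_mod_p(coeffs, p):
    """Roots in F_p of the integer polynomial `coeffs` (constant term first)."""
    return [a for a in range(p)
            if sum(c * pow(a, i, p) for i, c in enumerate(coeffs)) % p == 0]

X2_PLUS_1 = [1, 0, 1]  # x^2 + 1

for p in (2, 3, 5, 7, 13):
    roots = roots_mod_p(X2_PLUS_1, p)
    if roots:
        # each root a gives a maximal ideal (p, x - a) with residue field F_p
        print(p, "solid dots at x =", roots)
    else:
        # a quadratic without roots is irreducible: residue field F_{p^2}
        print(p, "one fatter point (p, x^2 + 1)")
```

So on the line $(3)$ the curve passes through a single fatter point, whereas on the line $(5)$ it passes through the two solid dots $(5, x + 3)$ and $(5, x + 2)$ .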

The ‘fat-point’ signs in Mumford’s treasure map are an attempt to depict the fact that an affine scheme contains a lot more information than just the set of all prime ideals. In fact, an affine scheme determines (and is determined by) a “functor of points”. That is, to every field (or even every commutative ring) the affine scheme assigns the set of its ‘points’ defined over that field (or ring). For example, the $F_{p}$ -points of $spec (Z [x])$ are the solid . points on the vertical line $(p)$ , the $F_{p^{2}}$ -points of $spec (Z [x])$ are the solid . points and the slightly bigger $\circ$ points on that vertical line, and so on.
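For instance, a ring map $Z [x] \to F_{9}$ is determined by the image $a$ of $x$ , so there are exactly 9 such maps. The kernel of such a map is the maximal ideal $(3, f (x))$ where $f (x)$ reduces to the minimal polynomial of $a$ over $F_{3}$ . The 3 elements of $F_{3}$ give the three solid dots

$(3, x), (3, x + 1), (3, x + 2)$

and the remaining 6 elements pair up under the Frobenius map $a \mapsto a^{3}$ into the 3 fatter points

$(3, x^{2} + 1), (3, x^{2} + x + 2), (3, x^{2} + 2 x + 2)$

on the vertical line $(3)$ .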

This concludes our first attempt to decipher Mumford’s drawing, but if we delve a bit deeper, we are bound to find even more treasures… (to be continued).


noncommutative F_un geometry (2)

Published October 17, 2008 by lieven

Last time we tried to generalize the Connes-Consani approach to commutative algebraic geometry over the field with one element $F_{1}$ to the noncommutative world by considering covariant functors

$N : groups \to sets$

which over $C$ resp. $Z$ become visible by a complex (resp. integral) algebra having suitable universal properties.

However, we didn’t specify what we meant by a complex noncommutative variety (resp. an integral noncommutative scheme). In particular, we claimed that the $F_{1}$ -‘points’ associated to the functor

$D : groups \to sets \qquad G \mapsto G_{2} \times G_{3}$ (here $G_{n}$ denotes all elements of order $n$ of $G$ )

were precisely the modular dessins d’enfants of Grothendieck, but didn’t give details. We’ll try to do this now.

For algebras over a field we follow the definition, due to Kontsevich and Soibelman, of so-called “noncommutative thin schemes”. Actually, the thinness-condition is implicit in both Soulé’s approach and that of Connes and Consani : we do not consider R-points in general, but only those of rings R which are finite and flat over our base ring (or field).

So, what is a noncommutative thin scheme anyway? Well, it’s a covariant functor (commuting with finite projective limits)

$X : {Alg}_{k}^{f d} \to sets$

from finite-dimensional (possibly noncommutative) $k$ -algebras to sets. Now, the usual dual-space operator gives an anti-equivalence of categories

${Alg}_{k}^{f d} \leftrightarrow {Coalg}_{k}^{f d} \qquad A = C^{*} \leftrightarrow C = A^{*}$

so a thin scheme can also be viewed as a contra-variant functor (commuting with finite direct limits)

$X : {Coalg}_{k}^{f d} \to Sets$

In particular, we are interested in associating to any $k$ -algebra $A$ its representation functor :

$rep (A) : {Coalg}_{k}^{f d} \to Sets \qquad C \mapsto {Alg}_{k} (A, C^{*})$

This may look strange at first sight, but $C^{*}$ is a finite dimensional algebra and any $n$ -dimensional representation of $A$ is an algebra map $A \to M_{n} (k)$ and we take $C$ to be the dual coalgebra of this image.

Kontsevich and Soibelman proved that every noncommutative thin scheme $X$ is representable by a $k$ -coalgebra. That is, there exists a unique coalgebra $C_{X}$ (which they call the coalgebra of ‘distributions’ of $X$ ) such that for every finite dimensional $k$ -algebra $B$ we have

$X (B) = {Coalg}_{k} (B^{*}, C_{X})$

In the case of interest to us, that is for the functor $rep (A)$ , the coalgebra of distributions is Kostant’s dual coalgebra $A^{o}$ . This is not the full linear dual of $A$ but contains only those linear functionals on $A$ which factor through a finite dimensional quotient.

So? You’ve exchanged an algebra $A$ for some coalgebra $A^{o}$ , but where’s the geometry in all this? Well, let’s look at the commutative case. Suppose $A = C [X]$ is the coordinate ring of a smooth affine variety $X$ , then its dual coalgebra looks like

$C [X]^{o} = \oplus_{x \in X} U (T_{x} (X))$

the direct sum of all universal (co)algebras of tangent spaces at points $x \in X$ . But how do we get the variety out of this? Well, any coalgebra has a coradical (being the sum of all simple subcoalgebras) and in the case just mentioned we have

$corad (C [X]^{o}) = \oplus_{x \in X} C e_{x}$

so every point corresponds to a unique simple component of the coradical. In the general case, the coradical of the dual coalgebra $A^{o}$ is the direct sum of all simple finite dimensional representations of $A$ . That is, the direct summands of the coalgebra give us a noncommutative variety whose points are the simple representations, and the remainder of the coalgebra of distributions accounts for infinitesimal information on these points (as do the tangent spaces in the commutative case).

In fact, it was a surprise to me that one can describe the dual coalgebra quite explicitly, and that $A_{\infty}$ -structures make their appearance quite naturally. See this paper if you’re in for the details on this.

That settles the problem of what we mean by the noncommutative variety associated to a complex algebra. But what about the integral case? In the above, we used extensively the theory of Kostant-duality which works only for algebras over fields…

Well, not quite. In the case of $Z$ (or more generally, of Dedekind domains) one can repeat Kostant’s proof word for word provided one takes as the definition of the dual $Z$ -coalgebra of an algebra (which is $Z$ -torsion free)

$A^{o} = \{ f : A \to Z ~:~ A / Ker (f) \text{ is finitely generated and torsion free} \}$

(over general rings there may be also variants of this duality, as in Street’s book on Quantum groups). Probably lots of people have come up with this, but the only explicit reference I have is to the first paper I’ve ever written. So, also for algebras over $Z$ we can define a suitable noncommutative integral scheme (the coradical approach accounts only for the maximal ideals rather than all primes, but somehow this is implicit in all approaches as we consider only thin schemes).

Fine! So, we can make sense of the noncommutative geometrical objects corresponding to the group-algebras $C Γ$ and $Z Γ$ where $Γ = PSL_{2} (Z)$ is the modular group (the algebras corresponding to the $G \mapsto G_{2} \times G_{3}$ -functor). But, what might be the points of the noncommutative scheme corresponding to $F_{1} Γ$ ?

Well, let’s continue the path cut out before. “Points” should correspond to finite dimensional “simple representations”. Hence, what are the finite dimensional simple $F_{1}$ -representations of $Γ$ ? (Or, for that matter, of any group $G$ )

Here we come back to Javier’s post on this : a finite dimensional $F_{1}$ -vectorspace is a finite set. A $Γ$ -representation on this set (of n-elements) is a group-morphism

$Γ \to GL_{n} (F_{1}) = S_{n}$

hence it gives a permutation representation of $Γ$ on this set. But then, if finite dimensional $F_{1}$ -representations of $Γ$ are the finite permutation representations, then the simple ones are the transitive permutation representations. That is, the points of the noncommutative scheme corresponding to $F_{1} Γ$ are the conjugacy classes of subgroups $H \subset Γ$ such that $Γ / H$ is finite. But these are exactly the modular dessins d’enfants introduced by Grothendieck as I explained a while back elsewhere (see for example this post and others in the same series).
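For small $n$ one can enumerate these points by brute force. The following Python sketch (an illustration only; all function names are made up here) uses the fact that $Γ = PSL_{2} (Z)$ is the free product of cyclic groups of order 2 and 3, so that a group-morphism $Γ \to S_{n}$ is just a pair $(σ, τ)$ of permutations with $σ^{2} = τ^{3} = 1$ . It counts the transitive pairs and their classes under simultaneous conjugation, the latter corresponding to the conjugacy classes of subgroups of index $n$ .

```python
from itertools import permutations

def compose(p, q):
    """The permutation i -> p[q[i]]."""
    return tuple(p[i] for i in q)

def inverse(p):
    inv = [0] * len(p)
    for i, j in enumerate(p):
        inv[j] = i
    return tuple(inv)

def is_transitive(gens, n):
    """Does the group generated by gens act transitively on {0, ..., n-1}?"""
    seen, stack = {0}, [0]
    while stack:
        i = stack.pop()
        for g in gens:
            if g[i] not in seen:
                seen.add(g[i])
                stack.append(g[i])
    return len(seen) == n

def modular_points(n):
    """Return (#transitive pairs, #classes under simultaneous conjugation)."""
    sym = list(permutations(range(n)))
    ident = tuple(range(n))
    sigmas = [g for g in sym if compose(g, g) == ident]
    taus = [g for g in sym if compose(g, compose(g, g)) == ident]
    pairs = {(s, t) for s in sigmas for t in taus if is_transitive((s, t), n)}
    remaining, classes = set(pairs), 0
    while remaining:
        s, t = next(iter(remaining))
        orbit = {(compose(g, compose(s, inverse(g))),
                  compose(g, compose(t, inverse(g)))) for g in sym}
        remaining -= orbit
        classes += 1
    return len(pairs), classes

for n in range(1, 6):
    print(n, modular_points(n))
```

For $n = 1, \dots, 5$ this gives 1, 1, 2, 2 and 1 points respectively. The search runs over all of $S_{n}$ , so it is only meant for very small $n$ .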
