If machine learning, AI, and large language models are here to stay, there’s this inevitable conclusion:
Millennials are the last generation to grow up without tropical geometry
— Dave Jensen (@DaveJensenMath) April 6, 2023
At the start of this series, the hope was to find the topos of the unconscious. Pretty soon, attention turned to the shape of languages and LLMs.
In large language models all syntactic and semantic information is encoded is huge arrays of numbers and weights. It seems unlikely that
But, as with ordinary presheaves, there are just too many
For inspiration, let’s turn to evolutionary biology and their theory of phylogenetic trees. They want to trace back common (extinguished) ancestors of existing species by studying overlaps in the DNA.

(A tree of life, based on completely sequenced genomes, from Wikipedia)
The connection between phylogenetic trees and tropical geometry is nicely explained in the paper Tropical mathematics by David Speyer and Bernd Sturmfels.
The tropical semi-ring is the set
Because tropical multiplication is ordinary addition, a tropical monomial in
corresponds to the linear polynomial
gives the piece-wise linear function on
The tropical hypersurface
Now, for the relation to phylogenetic trees: let’s sequence the genomes of human, mouse, rat and chicken and compute the values of a suitable (necessarily symmetric) distance function between them:


From these distances we want to trace back common ancestors and their difference in DNA-profile in a consistent manner, that is, such that the distance between two nodes in the tree is the sum of the distances of the edges connecting them.
In this example, such a tree is easily found (only the weights of the two edges leaving the root can be different, with sum

In general, let’s sequence the genomes of
the maximum is attained at least twice.
What has this to do with tropical geometry? Well,
Here’s why: let
is attained at least twice, or that
and we recognise this as one of the defining quadratic Plucker relations of the Grassmannian
More on this can be found in another paper by Speyer and Sturmfels The tropical Grassmannian, and the paper Geometry of the space of phylogenetic trees by Louis Billera, Susan Holmes and Karen Vogtmann.
What’s the connection with
The set of all species
For an ancestor node
We also defined the distance between such
and this distance coincides with the tree distance between the nodes.
So, all ancestors nodes in a phylogenetic tree are very special
We would like to garden out such exceptional
Still, there might be regions in the space where we can do the above. So, in general we might expect not one tree, but a forest of trees formed by the
If we think of the underlying
That’s why I like to call this mental picture the tropical brain-forest.

(Image credit)
Where’s the tropical coming from?
Well, I think that in order to pinpoint these ‘optimal’
For any two
and check that this is again a
(tbc)
Previously in this series:
- The topology of dreams
- The shape of languages
- Loading a second brain
- The enriched vault
- The super-vault of missing notes