Dissimilarity Vectors of Trees and Their Tropical Linear Spaces

advertisement
Dissimilarity Vectors of Trees and Their Tropical Linear Spaces
A thesis presented to the faculty of
San Francisco State University
In partial fulfillment of
The Requirements for
The Degree
Master of Arts
In
Mathematics
by
Benjamin Iriarte
San Francisco, California
December 2010
Copyright by
Benjamin Iriarte
2010
CERTIFICATION OF APPROVAL
I certify that I have read Dissimilarity Vectors of Trees and Their Tropical Linear Spaces by Benjamin Iriarte and that in my opinion this work
meets the criteria for approving a thesis submitted in partial fulfillment
of the requirements for the degree: Master of Arts in Mathematics at
San Francisco State University.
Federico Ardila
Professor of Mathematics
Serkan Hoşten
Professor of Mathematics
Matthias Beck
Professor of Mathematics
Dissimilarity Vectors of Trees and Their Tropical Linear Spaces
Benjamin Iriarte
San Francisco State University
2010
We present an introduction to the study of weighted trees, from a combinatorial
perspective. In particular, we explore the connections of the subject with tropical
algebraic geometry and with tropical linear spaces.
Finally, we prove that the set of dissimilarity vectors of trees is contained in
the tropical Grassmannian, and then describe the complex of bounded faces of the
tropical linear space of a dissimilarity vector.
I certify that the Abstract is a correct representation of the content of this thesis.
Chair, Thesis Committee
Date
ACKNOWLEDGMENTS
This work has been supported by an NSF Grant through my advisor,
Federico Ardila. I need to give special thanks to Federico for the meetings
we have held during this year. Thanks to them, a lot of things took right
course, including this project, and I felt supported at all moments. The
capacity and willingness to listen to my sometimes unprecise ideas has
been invaluable, and has been the best form of advising I could ever get.
Thanks Federico, you have also been a teacher for life.
I also need to specially thank my family, for being there in the good
times, and the bad times as well. Most of this gratefulness goes to my
mother, an example of strength, drive, love and capacity to work.
I am very grateful to both of them for the chance to spend this year at
SFSU: I have learned a great deal of lessons.
Special thanks go also to Bernd Sturmfels. A series of conversations in
the middle of December were the thrust to get the final chapters of this
project started. He has been a great support this year. I am also grateful
with Serkan Hosten for the things that I’ve learned from him this year.
Thanks to Nathalia for her constant support. She was always there.
Thanks to Sarah Billey and Isabella Novik for allowing me to present
some of these results in the Combinatorics Seminar at UW, and also
v
to Luis Serrano and Sergey Fomin. I must also thank Filip Cools for
fruitful discussions.
vi
TABLE OF CONTENTS
1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1
1.1
Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1
1.2
Basic Definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2
2 Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5
2.1
The Tropical Grassmannians . . . . . . . . . . . . . . . . . . . . . . .
5
2.2
Tropical Linear Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.3
Results About Trees . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
3 Dissimilarity Vectors are Contained in the Tropical Grassmannian . . . . . 28
3.1
Column Reductions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
3.2
Main Result . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
4 The Tropical Linear Space of a Dissimilarity Vector . . . . . . . . . . . . . 39
4.1
Vertices of Td and Their Matroids . . . . . . . . . . . . . . . . . . . . 40
4.2
Higher Dimensional Faces of Td and Their Matroids . . . . . . . . . . 56
4.3
A Combinatorial Description of the Geometric Object Td . . . . . . . 72
4.4
4.3.1
Faces from (m, i)-good pairs. . . . . . . . . . . . . . . . . . . . 73
4.3.2
Faces from full subtrees of T . . . . . . . . . . . . . . . . . . . 79
Face Enumeration in the Simplest Cases . . . . . . . . . . . . . . . . 89
5 Further Directions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
vii
Bibliography . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94
viii
1
Chapter 1
Introduction
1.1 Motivation
This writing is, more than anything else, an excursion into the world of weighted
trees. Recently, this has become a combinatorial branch of its own, know as T theory. We find it very motivating that weighted trees are amazingly simple objects to understand, yet, a large number of natural questions about them remain
unanswered.
The T -theory comes motivated by its applications in biology and computer science. However, it is not hard to notice the intrinsic combinatorial wealth and beauty
of the subject. In particular, it turns out that weighted trees provide a handful of
examples to tropical geometry, illustrating several central concepts and providing
further intuition, conjectures and problems.
Among the most common applications, we find the reconstruction of phylogenetic
trees in evolutionary biology, from a set of known data on the taxa. Usually, these
data are the results of analysis of DNA sequences alignments of several species.
Along the lines of the writing, we provide a personal development of the subject,
2
sometimes providing proofs of the main results. The proofs themselves will sometimes be original, sometimes not. Clearly, we will only cover the smallest part of a
more developed subject, narrowed to our interests.
The final chapters will be devoted to present our own results and contributions
to the subject. In particular, we will provide a proof of an amazing relation between
weighted trees and a central object of tropical algebraic geometry, the tropical Grassmannian. Then, we will describe the tropical linear spaces that arise from weighted
trees.
We expect that our results provide further intuition and knowledge about weighted
trees, and to contribute to future endeavors in the area.
1.2 Basic Definitions
To begin, we recall the significant definitions associated with this work.
E
For every finite set E, define m
as the collection of all subsets of E of size m.
We adopt the somewhat standard convention under which [n] = {1, 2, . . . , n}.
Under this convention, notice that [n]
is the collection of subsets of [n] with size
m
m.
Recall that a tree can be defined as a connected graph that has no cycles. Every
vertex of degree 1 in a tree is called a leaf. Every vertex of a tree that is not a leaf
is called an internal vertex.
Now, assume that we have a tree containing an internal vertex v satisfying the
3
following condition: for every pair of different leaves u and w of the tree, the minimal
path from u to v intersects the minimal path from w to v at v and only at v, that
is, there is no common edge shared by both paths. In that case, we say that our
tree is a v-path-star tree.
Suppose that we have a tree T with n ≥ 1 leaves, labeled by the set [n]. Suppose
also that T does not have internal vertices of degree 2. Moreover, assume that for
each edge e of T , there exists a unique real number ω(e) associated to e such that:
• ω(e) > 0 whenever e is an internal edge of T , that is, whenever e is adjacent
to two internal vertices.
• ω(e) ∈ R whenever e is not an internal edge of T , that is, whenever e is
adjacent to a leaf. No restrictions apply to this case.
We call ω(e) the weight of e and we say that our tree T is a weighted tree. We
will let E(T ) be the set of (weighted) edges of T , V(T ) be the set of vertices of T ,
and L(T ) be the set of (labeled) leaves of T .
To briefly convey all these conditions, we sometimes say that T is an n-tree, or
simply that T is a tree. In fact, every time we refer to a tree T in this writing, we
will always refer to a weighted tree in the sense described above. In particular, note
that this convention involves the use of the letter T .
4
For every tree T , let the total weight ω(T ) of T be defined by:
X
ω(T ) :=
ω(e).
e∈E(T )
If E(T ) = ∅, we adopt the convention ω(T ) = 0.
[n]
Now, for m ∈ [n], we define a vector d ∈ R( m ) associated with the tree T , which
we call the m-dissimilarity vector of T . Notice that we have not indexed our vector
d with the integer m. Most of the time, the value of m will be clear from the context
and whenever it’s not, we will explicitly refer to d as the m-dissimilarity vector of
our tree.
For each m-set {i1 , i2 , . . . , im } ∈
[n]
m
, let di1 i2 ...im be the total weight of the
minimal subtree (w.r.t number of edges) of T that contains the leaves i1 , i2 , . . . , im .
Equivalently, let di1 i2 ...im be the total weight of the subtree of T spanned by these
leaves.
[n]
In particular, when d is the 1-dissimilarity vector, notice that d = 0 ∈ R( 1 ) .
This case is not very interesting. In case d is the n-dissimilarity vector, we have
d = ω(T ) ∈ R( n ) . Neither is this very interesting.
[n]
5
Chapter 2
Background
2.1 The Tropical Grassmannians
We first mention the idea of classical Grassmannians. Let F be a field of characteristic 0. The space of all m-dimensional subspaces of F n is called the Grassmannian
Gm,n,F . Given m linearly independent vectors v1 , v2 , . . . , vm of F n spanning an mdimensional subspace V , form the m × n matrix M whose rows are the vectors
v1 , v2 . . . , vm .
[n]
Let P ∈ F ( m ) be the vector of all maximal minors of M where, for all {i1 , i2 , . . . , im } ∈
[n]
, Pi1 i2 ...im is the minor coming from columns i1 , i2 , . . . , im of M . Call P the vecm
tor of Plücker coordinates of V . The following theorem and proof can be found
in [15].
Theorem 2.1. The m-dimensional linear subspaces of F n are uniquely determined
by their vector of Plücker coordinates, modulo multiplication by a constant non-zero
scalar.
Proof. Consider the m×n matrix M just defined. Any other m×n matrix with equal
6
row space is obtained as ∆M for some invertible matrix ∆. Each maximal minor of
∆M is then det(∆) times the corresponding maximal minor of M . Now, suppose
that the maximal minors of M are a constant multiple λ of the corresponding
maximal minors of N . We show that M and N have equal rowspace. For some
ρ = {ρ1 < ρ2 < · · · < ρm } ∈ [n]
, the maximal submatrix Mρ of M obtained from
m
columns ρ satisfies det(Mρ ) 6= 0. Thus, det(Mρ ) = λ det(Nρ ) 6= 0, where Nρ is the
corresponding maximal submatrix of N . We can find ρ because the rank of both
M and N is precisely m. Define M 0 = Mρ−1 M and N 0 = Nρ−1 N . Clearly, the row
span of N 0 coincides with the row span of N and the same is true for M 0 and M .
The columns ρ of M 0 and N 0 form the identity matrix. Consider some % ∈ [n]
. We
m
have
det(M%0 ) = det(Mρ−1 M% ) = det(Mρ−1 )det(M% )
=
1
det(Nρ−1 )λdet(N% ) = det(Nρ−1 )det(N% )
λ
= det(Nρ−1 N% ) = det(N%0 ).
Therefore, the maximal minors of M 0 and N 0 are equal. Take a pair 1 ≤ i ≤ m
and 1 ≤ j ≤ n. Consider the columns ρ1 , . . . , ρbi , . . . , ρm and j. The minor given by
these columns is (−1)l (M 0 )ij in M 0 and (−1)l (N 0 )ij in N 0 , for l ∈ {0, 1}. But these
two are equal and so (M 0 )ij = (N 0 )ij , that is, M 0 = N 0 .
Now, suppose that we have a vector x ∈ F n and we want to check if x ∈ V . To
7
the matrix M , append the row x after the m-th row to obtain an (m + 1) × n matrix
M x . If x is a point in V , then all the maximal minors of M x have to be 0, clearly.
On the other hand, if x 6∈ V , then M x is a rank-(m + 1) matrix, so it must
have m + 1 linearly independent columns as well, which then give rise to a non-zero
maximal minor.
Thus, x ∈ V if and only if all the maximal minors of M x are 0.
Now, notice that the maximal minors of M x can be expressed using the Plücker
coordinates of V , doing cofactor expansion of the determinants along the last row.
To do this, for every {i1 , i2 , . . . , im } ∈ [n]
, recall that Pi1 i2 ...im is the maximal minor
m
of M coming from columns i1 , i2 , . . . , im : Pi1 i2 ...im is a Plücker coordinate of V . Then,
we can see that the maximal minors of M x are:
m+1
X
(−1)r Pi1 i2 ...ibr ...im im+1 xir for all 1 ≤ i1 < i2 < · · · < im < im+1 ≤ n.
r=0
Therefore, x is a point in V if and only if all these sums are 0. We will keep
this characterization in mind when defining the completely analogous notion of
tropical linear space. The tropical linear spaces will arise after tropicalizing the
sums presented above.
We are now ready to provide some background on tropical Grassmannians.
To begin, consider the field K = C{{t}} of dual Puiseux series. Recall that this
8
is the algebraically closed field of formal expressions
ω=
k=p
X
ck tk/q
−∞
where p ∈ Z, cp 6= 0, q ∈ Z+ and ck ∈ C for all k ≤ p. It is the algebraic
closure of the field of dual Laurent series over C. The field comes equipped with a
standard valuation val : K 7→ Q ∪ {−∞} by which val(ω) = p/q. As a convention,
val(0) = −∞.
Let X = (Xij ) be an m × n matrix of indeterminates and let K[X] denote the
polynomial ring over K generated by these indeterminates. Fix a second polynomial
n
ring in m
indeterminates over the same field: K[Y ] = K[Yi1 i2 ...im : {i1 , i2 , . . . , im } ∈
[n]
].
m
Let φm,n : K[Y ] 7→ K[X] be the homomorphism of rings taking Yi1 ...im to the
maximal minor of X obtained from columns i1 , . . . , im .
The Plücker ideal or ideal of Plücker relations will be the homogeneous
prime ideal Im,n =ker(φm,n ), which contains all the algebraic relations or syzygies
among the m × m minors of any m × n matrix with entries in K.
In particular, letting F = K during the first lines of this section, we can see
that the Plücker coordinates Pi1 i2 ...im with {i1 i2 . . . im } ∈ [n]
of the m-dimensional
m
vector space V considered, lie in the variety V (Im,n ) of Im,n .
For m ≥ 2, the Plücker ideal has a Gröbner basis consisting of quadrics; a
comprehensive study of these ideals can be found in Chapter 14 of the book by
9
Miller and Sturmfels [15] and in Sturmfels [19]. In particular, the tropicalization of
some of these quadrics, known as the three-term Plücker relations, will give rise
to the notion of tropical Plücker relations that we present in the following section.
We note that for the purposes of studies in evolutionary biology and phylogenetics, the max convention should be adopted for tropical geometry. We follow that
convention here.
Now, the Plücker ideal is a polynomial ideal in K[Y ]. We can therefore define the
notion of its tropical variety in the usual way, as we now recall. Let R = R ∪ {−∞}.
([n])
Consider a finite set A ⊆ Z≥0m and let:
f=
X
cα
α∈A
Y
Yραρ ∈ K[Y ].
ρ∈([n]
m)
The tropicalization of f is given by:
trop(f ) = max{val(cα ) +
α∈A
X
ρ∈(
[n]
m
αρ yρ }.
)
The tropical hypersurface T ( trop(f )) of f is the set of points in R
([n]
m)
where
trop(f ) attains its maximum twice or, equivalently, where trop(f ) is not differentiable. It is the analogous tropical notion of the classical hypersurface of 0’s in K( m )
[n]
of f . By the fundamental theorem of tropical algebraic geometry [20, Theorem 2.1],
the classical and tropical hypersurfaces are related in the expected way: the later
10
is the (closure of) the pointwise valuation of the former. The actual content of the
theorem says that the same will still hold for polynomial ideals and their varieties.
In Theorem 2.2, we present a narrower version of the fundamental theorem, suitable
for our purposes.
With these tools, we now define the tropical analog of the Grassmannian Gm,n,K .
The tropical variety T (Im,n ) of the Plücker ideal Im,n is denoted by Gm,n and is
called the tropical Grassmannian. It is defined as the intersection:
T (Im,n ) =
\
T ( trop(f )) .
f ∈Im,n
We have the following characterization of Gm,n , which is a direct application of
the fundamental theorem of tropical algebraic geometry:
Theorem 2.2. The following subsets of R
([n]
m)
coincide:
• The tropical Grassmannian Gm,n .
[n]
• The closure of the set {(val(cρ ))ρ∈([n]) : (cρ )ρ∈([n]) ∈ V (Im,n ) ⊆ K( m ) }.
m
m
An important consequence of this theorem will be the following. Suppose that
a vector P ∈ K( m ) arises as the set of maximal minors of a full rank m × n matrix
[n]
with entries from K. That is, suppose that P is the vector of Plücker coordinates of
some m-dimensional vector space in Kn , as V . Then, its componentwise valuation
p = val(P ) = (val(Pρ ))ρ∈([n]) will lie in the tropical Grassmanian Gm,n . Naturally, p
m
11
will also satisfy the tropical Plücker relations that we will describe soon, and it will
do so as much as P satisfies the classical Plücker relations, and the former are the
tropicalization of a subset of the latter.
2.2 Tropical Linear Spaces
In this section we introduce one of the main objects of this writing.
Let p ∈ R( m ) . We say that p satisfies the tropical Plücker relations with the
[n]
max convention if the maximum of the set
{pSij + pSkl , pSik + pSjl , pSil + pjk }
[n]
is attained at least twice for all S ∈ m−2
and {i, j, k, l} ⊆ [n]\S
.
4
For all {i1 , i2 , . . . , im } ∈ [n]
, define ei1 i2 ...im = ei1 + ei2 + · · · + eim , where the
m
sum takes place in Rn with its canonical basis {e1 , e2 , . . . , en }.
Let Hm be the m-hypersimplex of Rn . Recall that Hm is the polytope obtained
as the convex hull of all vectors ei1 i2 ...im with {i1 , i2 , . . . , im } ∈ [n]
. Each of these
m
vectors is a vertex of Hm .
[n]
Suppose that p ∈ R( m ) . We define a regular subdivision Dp of Hm from p. Let
R ⊆ Rn+1 be the polytope obtained as the convex hull of all points (ei1 i2 ...im , pi1 i2 ...im )
with {i1 , i2 , . . . , im } ∈ [n]
. Consider the face of R consisting of all points r ∈ R that
m
maximize a linear functional (−x, 1) · r with x ∈ Rn . This face is the convex hull of
12
the vertices (ei1 i2 ...im , pi1 i2 ...im ) that it contains. Now, let Bx be the collection of sets
{i1 , i2 , . . . , im } ∈ [n]
for which (ei1 i2 ...im , pi1 i2 ...im ) is a point in our face. This is the
m
collection of sets {i1 , i2 , . . . , im } ∈ [n]
for which pi1 i2 ...im − (xi1 + xi2 + · · · + xim ) is
m
maximal. Finally, construct a face Px of Dp obtained from x as the convex hull of
all ei1 i2 ...im with {i1 , i2 , . . . , im } ∈ Bx .
Now, let P be a subpolytope of Hm . To recall, this means that the vertices of P
are of the form ei1 i2 ...im with {i1 , i2 , . . . , im } ∈ [n]
. Let BP be the collection of sets
m
{i1 , i2 , . . . , im } ∈ [n]
for which ei1 i2 ...im ∈ P. We say that P is matroidal or that P
m
is a matroid polytope if the set BP is the collection of bases of a rank-m matroid
over [n], see Oxley [16, Chapter 1] for a general introduction to the language of
matroids.
The following is a theorem of David Speyer [21], which motivates the introduction
of tropical Plücker relations.
Theorem 2.3. The following assertions are equivalent:
[n]
• the vector p ∈ R( m ) satisfies the tropical Plücker relations;
• the one skeleta of Dp and Hm are the same;
• every face of Dp is matroidal.
[n]
Now, suppose that p ∈ R( m ) satisfies the tropical Plücker relations. Define
13
L(p) ⊆ Rn as the set of all x for which the maximum of the set
{pi1 i2 ...ibr ...im im+1 + xir |r ∈ [m + 1]}
is attained at least twice, for all {i1 , i2 , . . . , im , im+1 } ∈
[n]
m+1
. The set L(p) will be
called the m-dimensional tropical linear space associated to p.
For any x ∈ Rn , consider the subpolytope Px of Hm introduced earlier. Recall
that Px is a face of Dp obtained as the convex hull of all points ei1 i2 ...im for which
pi1 i2 ...im − (xi1 + xi2 + · · · + xim ) is maximal. Also, notice that BPx = Bx . As p
satisfies the tropical Plücker relations, we conclude that Bx is the set of bases of
some rank-m matroid over [n], which we call Mx .
We now present a remarkable theorem of Speyer [21].
Theorem 2.4. x ∈ L(p) if and only if Mx is a loopless matroid.
Recall that a matroid over a finite set E is said to be loopless if every element
of E is contained in at least one basis.
Proof. To begin, notice that L(p) can be described as the set of all x for which the
maximum of the set
{pi1 i2 ...ibr ...im im+1 − xi1 + xi2 + · · · + xc
ir + · · · + xim + xim+1 |r ∈ [m + 1]}
is attained at least twice, for all {i1 , i2 , . . . , im , im+1 } ∈ [n]m+1 .
14
So let x ∈ L(p) and suppose, without loss of generality, that [m] ∈ Bx . Take
any element of the set [n]\[m]. We can assume without loss of generality that this
element is m + 1. By the definition of L(p), the maximum of the set
{p12...br...m(m+1) − x1 + x2 + · · · + xbr + · · · + xm + x(m+1) |r ∈ [m + 1]}
is attained at least twice. This maximum is attained by p12...m −(x1 + x2 + · · · + xm ).
But then, any other choice of subindices for which the maximum is attained
must contain m + 1, and this set of subindices will be an element of Bx . Therefore,
Mx is loopless.
Now, suppose that Mx is loopless and we want to prove x ∈ L(p). Without loss
of generality, it is enough to show that the maximum of the set
{p12...br...m(m+1) − x1 + x2 + · · · + xbr + · · · + xm + x(m+1) |r ∈ [m + 1]}
is attained at least twice.
Let e[m+1] = e1 + e2 + · · · + em + em+1 . For all reals t ≥ 0, let Mt = Mx−te[m+1]
and let Bt = Bx−te[m+1] .
Notice that for large enough values t > 0, we have Bt ⊆ [m + 1]. Let t∗
be the infimum of these values. Then Mt = Mt∗ for all t ≥ t∗ . As t ranges
from 0 to ∞, the matroid Mt changes at a finite number of times t, which we call
0 = t0 < t1 < t2 < · · · < tq = t∗ , and remains constant throughout (t` , t`+1 ) for all
15
0 ≤ ` < q.
Let t ∈ (t` , t`+1 ). Then, notice that the set of bases of Mt coincides with the
subset of bases of Mt` that meet [m + 1] at a maximal number of points.
Now, suppose that every element of [m + 1] is contained in a basis of Mt` .
Let i be the maximal number of elements of [m + 1] contained in a basis of Mt` .
Assume that [i] ⊆ I for some I ∈ Bt` . If j ∈ ([m + 1]\[i]), find J ∈ Bt` for which
j ∈ J. Then, by the maximality of i and considering the bases I and J, we can see
that ((I − k) ∪ j) ∈ Bt` for some k ∈ [i]. Therefore, for all j ∈ [m + 1], there exists
J ∈ Bt` such that j ∈ J and |J ∩ [m + 1]| = i. That is, every element of [m + 1] lies
in basis of Mt` which meets [m + 1] at a maximal number of points.
Therefore, every element of [m + 1] is contained in a basis of Mt .
On the other hand, the set of bases of Mt coincides with the subset of bases of
Mt`+1 that meet [m + 1] at a minimal number of points. If every element of [m + 1]
is contained in a basis of Mt , then every element of [m + 1] is contained in a basis
of Mt`+1 .
As Mx = Mt0 is loopless, so in particular every element of [m + 1] is contained in
a basis, inductively we obtain that every element of [m + 1] is contained in a basis
of Mtq = Mt∗ .
But notice that x ∈ L(p) if and only if |Bt∗ | ≥ 2, which is what we just proved.
A different approach to tropical linear spaces is fully presented now. Let p ∈
16
[n]
R( m ) satisfy the tropical Plücker relations with the max convention. Consider the
nonempty unbounded n-dimensional polyhedron Pp := {x ∈ Rn |xi1 +xi2 +· · ·+xim ≥
pi1 i2 ...im for all {i1 , i2 , . . . , im } ∈ [n]
}.
m
The polyhedron Pd coincides with the notion of the envelope of Hm with
respect to p, as presented in [11].
Let the reduced tropical linear space of p be the set Pp0 := {x ∈ Pp | If y ≤
x componentwise for some y ∈ Pp , then we have y = x}.
Take x ∈ ∂(Pp ) and define Bx to be the collection of all sets {i1 , i2 , . . . , im } ∈
[n]
m
for which
xi1 + xi2 + · · · + xim = pi1 i2 ...im .
The set Bx is the set of bases of a matroid Mx with rank m, over the set [n].
To understand why this is true, consider the polytope R of Rn+1 previously
defined.
For each r ∈ R, notice that (−x, 1) · r ≤ 0. The set of all such points r for
which (−x, 1) · r = 0 is a face of R. Of course, this face is then the convex hull
of all vertices (ei1 i2 ...im , pi1 i2 ...im ) of R with {i1 , i2 , . . . , im } ⊆ Bx , and its projection
Px back in the m-hypersimplex Hm is the convex hull of the vertices ei1 i2 ...im with
{i1 , i2 , . . . , im } ⊆ Bx . Given that p obeys the tropical Plücker relations with the
max convention, we know that Px is matroidal, so indeed Bx is the set of bases of
a matroid Mx .
Now, notice that x ∈ Pp0 if and only if Mx is a loopless matroid.
17
Consider the set Lm,n of all loopless matroids of rank m on the set [n]. Following
the spirit of our previous notation, denote the set of bases of a matroid M ∈ Lm,n
by B(M ).
For each M ∈ Lm,n , define the set Pp (M ) := {x ∈ Pp |xi1 + xi2 + · · · + xim =
pi1 i2 ...im whenever {i1 , i2 , . . . , im } ∈ B(M )}.
Clearly Pp (M ) is a polyhedron, which can be empty for some choices of M .
Moreover, notice that Pp (M ) is a (closed) proper face of Pp or a face of the boundary
complex ∂(Pp ) of Pp . Notice also that we can write
Pp0 =
[
Pp (M ).
M ∈Lm,n
Therefore, the set Pp0 is a polyhedral subcomplex of ∂(Pp ). It has been proved
in [21] that Pp0 is pure (m − 1)-dimensional, or equivalently (as we shall see), that
L(p) is pure m-dimensional.
We have:
• For x ∈ Rn , x ∈ L(p) if and only if Mx is loopless.
• For x ∈ ∂(Pp ), x ∈ Pp0 if and only if Mx is loopless.
Now, notice that for any x ∈ Rn we may find a real t for which x + te[n] ∈ Pp .
In particular, for every x ∈ Rn , there exists a unique t for which x + te[n] ∈ ∂(Pp ).
On the other hand, notice that L(p) and the matroid Mx are invariant under
translation by e[n] .
18
Therefore, we conlude that L(p) and Pd0 are the same object if considered in the
tropical projective space TPn−1 = Rn /(1, . . . , 1)R, and Pp0 arises from L(p) after a
natural choice of representative for each class. Under this choice, the maximum in
the definition of L(p) is precisely 0. This allows us to write Pp0 = L(p)/(1, . . . , 1)R.
Let S ⊆ Rn . The subset S is said to be tropically convex if
(max{a + x1 , b + y1 }, max{a + x2 , b + y2 }, . . . , max{a + xn , b + yn }) ∈ S
for all x, y ∈ S and a, b ∈ R.
Now, notice that the set
{p12...br...m(m+1) + xr |r ∈ [m + 1]}
is tropically convex. By the definition, the intersection of two tropically convex sets
is tropically convex. We then see that the tropical linear space L(p) is tropically
convex. It is proved in [7] that tropically convex sets are contractible. Therefore,
we see that L(p) is a contractible subspace of Rn . It deformation retracts onto Pp0 ,
which is then also contractible in Rn . On the other hand, for x ∈ ∂(Pp ), notice
that if Mx contains a loop, then x belongs to the relative interior of an unbounded
face of ∂(Pp ). To see this, consider the loop in the defining equations of the face.
Therefore, the complex Pp0 can be obtained from the polyhedron Pp by removing the
relative interior of a number of unbounded faces, in particular, the interior of Pp .
19
Finally, we want to emphasize the fact that Pp0 may contain unbounded faces.
In any case, it is common practice to consider only the complex of bounded faces of
Pp0 . As noted, this coincides with the complex of bounded faces of the polyhedron
Pp , and it will be the main object of our study.
Let the tight span Td of p be the complex of bounded faces of Pd .
It has been proved by Herrmann and Joswig in [11, Proposition 2.3] that Tp
coincides with the bounded faces of the dual complex of the subdivision of Hm
induced by p.
The tight span Tp is also known to be a contractible space, see for example
Dress [8] or Hirai [12].
Several studies about the combinatorial structure of this object have been done,
in particular as it relates to an important problem in evolutionary biology: to determine trees from their metric structure. There have been important results and
conjectures on the face enumeration of Tp [10], among which we find most relevant
to us the ones from Speyer [21].
[n]
Conjecture 2.1 (The f -vector Conjecture). Let p ∈ R( m ) satisfy the tropical
Plücker relations. Then, the f -vector of Tp satisfies the following bounds:
n − 2 − 2i n − i − 2
fi ≤
for all i with 0 ≤ i ≤ m − 1.
d−1−i
i
In particular, series-parallel tropical linear spaces realize the f -vector of the
f -vector conjecture. An example of these are the tree-tropical linear spaces of
20
Speyer [21]. These spaces can be regarded as approximations to the tropical linear
spaces of dissimilarity vectors, which we study in Chapter 4. On the other hand,
tropical linear spaces which are realizable have f -vectors which are bounded by the
f -vector conjecture [22].
We present the main idea of this Section in a separate Proposition.
[n]
Proposition 2.5 (Speyer, Herrmann, Joswig). Let p ∈ R( m ) satisfy the tropical
Plücker relations with the max convention. Then Pp0 = L(p)/(1, . . . , 1)R and Tp
coincides with the subcomplex of bounded faces of the dual complex to the subdivision
of Hm induced by p.
2.3 Results About Trees
Consider a tree T with m-dissimilarity vector d.
In this writing, we prove that d satisfies the tropical Plücker relations. We will
later provide a proof of this fact by obtaining d as the componentwise valuation of a
vector of classical Plücker coordinates of an m-dimensional vector space of Kn . By
the fundamental theorem of tropical algebraic geometry, this will moreover imply
that d ∈ Gm,n . The present writing is then in reality an investigation of the objects
that we have associated to the tropical Plücker relations, as obtained from d.
To begin, we point out that a connection between phylogenetic trees and tropical
geometry had been noted some time ago. That these two subjects are mathemat-
21
ically related can be traced back to Pachter and Speyer [17], Speyer and Sturmfels [20], and Ardila and Klivans [2]. The precise nature of this connection has been
the matter of some recent papers by Bocci and Cools [3] and Cools [6]. Later, we
will describe a concise relation between m-dissimilarity vectors and tropical Grassmannians Gm,n .
In order to present the classical motivating example, we develop some concepts.
Consider the polynomial ring K[Yij : {i, j} ∈ [n]
]. In this ring, consider the
2
ideal of Plücker relations, or the Plücker ideal I2,n :
I2,n = hYij Ykl − Yik Yjl + Yil Yjk : 1 ≤ i < j < k < l ≤ ni.
The generators of I2,n presented are called the quadric relations, or simply
quadrics. As remarked earlier, they form a Gröbner basis of I2,n . Even more,
it is known that the quadrics form a tropical basis of I2,n , by which it is meant
that
G2,n =
\
T ( trop(Yij Ykl − Yik Yjl + Yil Yjk )) .
1≤i<j<k<l≤n
[n]
We can restate this by saying that G2,n is the set of points y ∈ R( 2 ) for which
the maximum of the set {yij + ykl , yik + yjl , yil + yjk } is attained at least twice, for
all {i, j, k, l} ∈ [n]
.
4
[n]
Notice that, as promised, a vector of R( 2 ) lies in the tropical Grassmannian G2,n
if and only if it satisfies the tropical Plücker relations.
22
The main motivating example for studying these tropical Grassmannians in connection with trees comes from the following theorem, also known as the four point
condition theorem. It is a theorem of Buneman [5], widely applicable in evolutionary biology.
Theorem 2.6 (Pachter and Sturmfels [18]). The set of 2-dissimilarity vectors of
trees is equal to the tropical Grassmannian G2,n .
The theorem of Buneman is used as a tool for reconstructing shapes of trees from
their metrics. The extent of its applicability relies on the fact that we can project
generic points in R( 2 ) on the tropical Grassmannian G2,n . See Ardila [1].
[n]
One of the main open problems in computational biology is to device a way to
use the higher m-dissimilarity vectors to carry on the same purpose. It is commonly
known that such approach would allow more reliable statistical solutions of the tree
reconstruction problem.
From here sprouts the natural problem of characterizing m-dissimilarity vectors
of trees.
In the case of m = 2, there are other characterizations of dissimilarity vectors.
Among these, we present a result of Dress [9] which relates tropical linear spaces
and 2-dissimilarity vectors.
[n]
Theorem 2.7. A vector d ∈ R( 2 ) is a 2-dissimilarity vector if and only if Td is a
tree.
23
Describing the object Td for general m-dissimilarity vectors will be the main
goal of Chapter 4. See also Sturmfels and Yu for pictures of tight spans of generic
metrics [23].
Now, with the theorem of Buneman as motivation, the question had been posed
of whether or not the set of m-dissimilarity vectors of trees is contained in the
tropical Grassmannian Gm,n , for all values of m. This was initially asked by Lior
Pachter and David Speyer [17].
The major obstacle in trying to prove such result comes mainly from the fact
that no tropical basis is known for the Plücker ideals Im,n for m ≥ 3. Notice that the
knowledge of such bases would reduce the problem to a finite number of verifications
for each value of m. We have not found tropical bases for these ideals.
A different approach to understanding m-dissimilarity vectors for m ≥ 3 is to
find a relation between them and the 2-dissimilarity vectors, which are all well
understood.
Along this line of thought, a key result of Cools is the existence of a map that,
from the 2-dissimilarity vector of a tree T , computes its m-dissimilarity vector. We
provide a combinatorial proof of this fact.
Proposition 2.8. Consider a tree T and let m ≥ 3. Let Cm ⊆ Sm be the set of
cyclic permutations. For any σ ∈ Cm , define
dσ = d1σ(1) + dσ(1)σ2 (1) + dσ2 (1)σ3 (1) + dσ3 (1)σ4 (1) + · · · + dσm−1 (1)σm (1) .
24
Then
d12...m =
1
· min dσ .
2 σ∈Cm
Proof. Without loss of generality, assume that m = n. Consider an internal vertex
v of T . This vertex induces a partition on the set of leaves of T under which two
leaves i and j belong to the same class if and only if the minimal path from i to v
and the minimal path from j to v share a common edge. But then, each of the sums
dσ above can be seen to consider at least twice every edge adjacent to v. Thus, dσ
considers every edge of T at least twice.
Now, let σ ∈ Cn be such that dσ is minimal. Suppose that some edge e adjacent
to v is considered more than twice in dσ . Notice that e must then be internal, so
ω(e) > 0.
Let e be the class of leaves of T containing e in their minimal path to v. Then,
there exist a∗ , a1 , . . . , ai ∈ e and b∗ , b1 , . . . , bj ∈ ([n]\e) such that
σ = (· · · , a∗ , b1 , · · · , bj , a1 , · · · , ai , b∗ , · · · ).
Now, define
σ 0 = σ(a∗ , ai )(a∗ , bj )
= (· · · , a∗ , a1 , · · · , ai , b1 , · · · , bj , b∗ , · · · ) ∈ Cn .
25
But then dσ0 ≤ dσ − 2ω(e). This contradicts the minimality of dσ .
Therefore, every edge is considered exactly twice in dσ , which proves the result.
Remark. We expand on the argument presented. Abusing the notation of the proof,
for every leaf i of T , let div be the total weight of the minimal path from i to v.
Then:
da∗ b1 = da∗ v + db1 v
da1 bj = da1 v + dbj v
dai b∗ = dai v + db∗ v
dai b1 = dai v + db1 v .
From here we obtain:
da∗ b1 + da1 bj + dai b∗ − da∗ a1 + dai b1 + dbj b∗
= ((da∗ v + da1 v ) − da∗ a1 )
+ dbj v + db∗ v − dbj b∗
≥ 2ω(e) + 0 = 2ω(e).
From this result we could try to study m-dissimilarity vectors and the way in
which they induce a metric on T . An initial attempt would go along the lines of
26
trying to invert the presented map. One immediate application of the result is
presented now.
Corollary 2.9. Suppose that we have a tree T . Then, the following identity holds
for all {i, j, k, l, p} ∈ [n]
:
5
1
dij = (2dijk + 2dijl + 2dijp
3
− dipl − dikl − dikp
− djpl − djkl − djkp
+ 2dplk ).
Corollary 2.9 is a direct consequence of the identity
dijk =
1
(dij + dik + djk ) for all {i, j, k} ∈
2
[n]
3
.
In order to visualize the computation involved, it could be useful to think of the
following construction.
Let A = {i, j, k, l, p}. Consider the subdivision of a 3-dimensional tetrahedron
into four tetrahedra, induced by selecting a point i in the interior of the tetrahedron.
Then, label the remaining vertices j, k, l, p. Note that A3 is the set of 2-faces of
the subdivision and A2 is the set of edges. Let C` be the free Q-module with
A
basis `+1
, and define a Q-module homomorphism ∆ : C2 7→ C1 under which
27
∆[ijk] = 21 ([ij] + [ik] + [jk]), for all [ijk] ∈
A
3
.
Then, carry on the computation
∆ (2[ijk] + 2[ijl] + 2[ijp] − [ipl] − [ikl] − [ikp] − [jpl] − [jkl] − [jkp] + 2[plk])
to obtain 3[ij].
To finish, we note that for m ≥ 4 the situation is rather unclear.
28
Chapter 3
Dissimilarity Vectors are Contained in the
Tropical Grassmannian
The result presented in this chapter is based on two papers of Cools [6] and Bocci
and Cools [3], where the cases m = 3, m = 4 and m = 5 are handled.
We answer the question of Pachter and Speyer affirmatively for all m:
Theorem 3.1. Let T be a tree with m-dissimilarity vector d. Then, d ∈ Gm,n .
This result has more recently been proved by Manon [14], using very interesting
ideas from representation theory.
In order to prove Theorem 3.1, we need to introduce a new family of graphtheoretical-trees with special combinatorial properties.
We will refer to this new kind of tree using the letter U . In particular, we do
this to stress the difference with the conventions adopted in Chapter 1 for the letter
T.
To start, let U be a tree with n leaves labeled by the set [n], with n ≥ 1. Suppose
ω
also that U has a weight function E(U ) −
→ R≥0 . The function ω will extend to a
29
function from the subtrees of U to the nonnegative reals in the natural way. In the
latter case, we will speak of total weights of subtrees, following our convention. In
particular, notice that this time we are allowing internal edges of weight 0, and not
allowing external edges with negative weight. The tree U will be refered to as
being ultrametric if it further satisfies that:
• U is trivalent;
• U is rooted;
• U is `-equidistant. That is, the total weight of the minimal path from every
leaf to the root is a constant `;
• ω induces a metric on [n]. In particular, it separates points.
Note that the fourth condition directly implies ` > 0.
On the other hand, there is also an independent notion of an ultrametric.
A metric space S with distance function d : S×S 7→ R≥0 is called an ultrametric
space if the following inequality holds for all x, y, z ∈ S:
dxz ≤ max{dxy , dyz }.
It is a well known fact that finite ultrametric spaces are realized by ultrametric
trees, see for example [4, Lemma 11.1].
30
Note that the condition on ultrametric spaces S is equivalent to saying that for
all x, y, z ∈ S, the maximum of the set {dxy , dxz , dyz } is attained at least twice.
Now, suppose that we have a tree T with 2-dissimilarity vector d. From here,
[n]
define a vector d0 ∈ R( 2 ) by:
d0ij = 2t∗ + dij − din − djn for all different i, j ∈ [n].
Let t∗ > 0 be sufficiently large, so that d0ij > 0 for all different i, j. Observe that
d0in = 2t∗ for all i ∈ [n − 1]. Now, pick {i, j, k} ∈ [n−1]
and consider the set
3
{dij − din − djn , dik − din − dkn , djk − djn − dkn }.
We want to compare the members of this set. Comparing them pairwise, we
obtain three new sets to study:
{dij + dkn , dik + djn }
{dij + dkn , din + djk }
{dik + djn , din + djk }.
Note that these sets involve only three numbers. By the theorem of Buneman, one
of these three sets contains the same number twice, and that number is greater than
or equal to the third one. This translates back to concluding that the maximum of
the set
{dij − din − djn , dik − din − dkn , djk − djn − dkn }
31
is attained at least twice. Therefore, d0 |([n−1]) induces an ultrametric on the set
2
[n − 1]. Hence, d0 |([n−1]) is realized by an ultrametric tree.
2
Following this ingenious idea, Cools shows somewhat technically that in order
to prove Theorem 3.1, it suffices to prove the following variation:
Theorem 3.2. Let U be an ultrametric tree with m-dissimilarity vector d. Then,
d ∈ Gm,n .
This is the theorem that we will prove. First, we will need to introduce some
terminology.
3.1 Column Reductions
Let n ≥ 3. Suppose that we are given integers 1 ≤ a, b ≤ n with a 6= b and let
ca,b be the operator acting on Puiseux matrices for which, for any n × n matrix M ,
ca,b (M ) is the matrix obtained from M by subtracting column b to column a. We
know that ca,b preserves the determinant, i.e. det (ca,b (M )) = det(M ). For l ≥ 1, let
(cal ,bl ◦ · · · ◦ ca2 ,b2 ◦ ca1 ,b1 ) (M ) be the matrix obtained from M by first subtracting
column b1 to column a1 , then subtracting column b2 to column a2 , and so on up to
subtracting column bl to column al . Call this matrix a column reduction of M if
the following conditions are met:
• 1 ≤ a1 , . . . , al , b1 , . . . , bl ≤ n;
• the numbers a1 , a2 , . . . , al are pairwise different;
32
• whenever 1 ≤ k ≤ l, the number bk is different from a1 , . . . , ak .
For simplicity, we will accept M as a column reduction of itself.
3.2 Main Result
We are now ready to prove Theorem 3.1. We do this by proving Theorem 3.2 for
m ≥ 3.
Proposition 3.3. Suppose that 3 ≤ m ≤ n. Let U be an ultrametric tree with
m-dissimilarity vector d, all of whose edges have rational weight.
For each edge e of U , denote by h(e) the total weight of the minimal path from
the top node of e to any leaf below e.
Also, let a1 (e), . . . , an−2 (e) be n − 2 generic complex numbers associated to e.
For all i ∈ [n − 2] and j ∈ [n], let tij be the sum of the monomials ai (e)th(e) ,
where e runs over all edges in the minimal path from leaf j to r.
Notice that tij ∈ K.
33
Then, define a matrix:

1
 1

 t
t12
 11


 (t11 )2 (t12 )2
M =

 t21
t22

 .
..
 ..
.


t(n−2)1 t(n−2)2
Finally, for all ρ ∈
[n]
m

...
...
...
...
..
.
...
1


t1n 



(t1n )2 
.

t2n 

.. 
. 


t(n−2)n
, let mρ be the m × m upper minor of M coming from
the columns ρ.
Then, val(mρ ) = dρ .
Proof. First, consider the following matrix for some p1 , p2 , . . . , pn−2 ∈ K:

1
1


 t +p
t12 + p1
11
1



 (t11 + p1 )2
(t12 + p1 )2

N =
 t21 + p2
t22 + p2


..
..

.
.


t(n−2)1 + pn−2 t(n−2)2 + pn−2

...
1
...
t1n + p1
...
(t1n + p1 )2
...
..
.
t2n + p2
..
.
. . . t(n−2)n + pn−2







.







Then, notice that the situation is completely analogous to proving that val (det(N )) =
ω(U ). However, by first expanding the quadric terms of the second row, then sub-
34
tracting multiples of the first row to all rows to cancel the p-terms, and then using
the second row to cancel the appropriate remaining terms in the third row, we see
that det(N ) = det(M ).
Therefore, there is no loss of generality if we prove that val (det(M )) = ω(U ).
This is the plan.
As U is trivalent with n leaves, we know that U has n − 2 internal nodes of
degree three, one node (the root) of degree two and 2(n − 1) edges.
Let ≤U be the tree order of U with respect to r, that is, the order on the set
of nodes of U by which v ≤U w if and only if v lies in the path from r to w in U .
Let v1 , v2 , . . . , vn−1 be the n − 1 internal nodes of U , numbered in such a way that
if vi ≤U vj , then j ≤ i. We must have vn−1 = r.
Consider an injective function α : vi 7→ ai from the set of internal nodes to the
leaves of U , so that vi ≤U ai for all i with 1 ≤ i ≤ n − 1. Now, for each of these
values of i, let bi be the unique leaf such that bi 6= aj for all j with 1 ≤ j ≤ i, and
such that vi ≤U bi .
To show the existence of α, we construct it succesively starting with α(v1 ), then
α(v2 ) and then continuing up until we define α(vn−1 ). Suppose that we have already
defined α(v1 ), . . . , α(vi−1 ) for some i < n − 1. Consider the maximal subtree Ui of
35
U whose root is vi , i.e. Ui is the subtree below vi . If this tree has m leaves, then
it has m − 1 internal nodes, including vi itself. So far, we have not defined α for
nodes between r and vi , but we have defined it for all internal nodes of Ui different
from vi . Therefore, there are exactly m − 2 leaves of the tree Ui which have been
assigned to some of v1 , . . . , vi−1 under α, so there are 2 leaves which we can assign
to vi : α(vi ) can be either one of them. Incidentally, this also gives us the existence
and uniqueness of the respective bi .
Now, we want to establish the equality
Pn−1
i=1
h(vi ) = ω(U ) − `. This equality is
clearly true when U has 2 or 3 leaves, so that n = 2 or n = 3. Let now n ≥ 4 and
suppose that we have proved the result for all trees with i leaves, with i < n. Recall
n is being taken as the number of leaves in U , which is rooted `-equidistant with
root r = vn−1 . We know that the equality holds for each of the subtrees U1 , . . . ,
Un−2 below v1 , . . . , vn−2 , respectively. Let Un−2 be `n−2 -equidistant and let Un−3 be
`n−3 -equidistant. There are two cases to distinguish. If vn−2 <U vi for all i < n − 2,
P
then n−2
i=1 h(vi ) = (ω(U ) − ` − (` − `n−2 )) − `n−2 = ω(U ) − 2` by induction, so
Pn−1
i=1 h(vi ) = ω(U ) − `. Otherwise, suppose that vn−2 and vn−3 are incomparable
in <U . Then Un−2 and Un−3 are disjoint graphs and we have

X
vi ∈V(Un−2 )

h(vi ) = ω(U ) − 

X
vj ∈V(Un−3 )

h(vj ) + `n−3  − (` − `n−3 ) − (` − `n−2 )−`n−2
36
by induction. Reordering we get:
X
X
h(vi ) +
vi ∈V(Un−2 )
h(vj ) = ω(U ) − 2`,
vj ∈V(Un−3 )
so if we add h(vn−1 ) = ` to both sides, we get our result.
Now, consider the column reduction M ∗ = can−1 ,bn−1 ◦ · · · ◦ ca2 ,b2 ◦ ca1 ,b1 (M )
Q
∗
with
of M . We claim that the valuation of all the nonzero monomials ni=1 Mi,σ(i)
σ ∈ Sn in the sum
det(M ∗ ) =
X
σ∈Sn
is precisely
Pn−1
i=1
sgn(σ)
n
Y
!
∗
Mi,σ(i)
i=1
h(vi ) + ` = ω(U ). To see this, notice that for all i, 1 ≤ i ≤ n − 1,
we have:
∗
= 0;
• M1a
i
∗
is ` + h(vi );
• the valuation of M3a
i
∗
• the valuation of Mja
is h(vi ) if j 6= 1 and j 6= 3;
i
• the only nonzero term in the first row of M ∗ is the 1 in column bn−1 .
Because of our generic choice of coefficients, we can find some monomial term in
the sum det(M ∗ ) with valuation ω(U ) that does not get cancelled.
37
r = v9
5
(p)
5
(q)
9
v7
v8
2
(g)
3
(f )
v5
v1
1
1
(a) (b)
1
2
(c)
2
1
v6 (z)
3
2
(x)
1
(h)
v2
1
1
(d) (e)
4
2
(y)
v3
1
1
(r) (s)
5
6
4
(w)
v4
1
1
(u) (v)
7
8
9
10
Figure 3.1:
A rooted 10-tree.
The injective function α
:=
{(v1 , 1), (v2 , 4), (v3 , 6), (v
,
8),
(v
,
3),
(v
,
7),
(v
,
2),
(v
,
9),
(v
,
5)}
is
depicted,
4
5
6
7
8
9
P
as well as the equality 9i=1 h(vi ) = 35 − 9.
Example 3.1. Consider the 9-equidistant 10-tree of Figure 3.1 with total weight
35. The second row of the matrix M associated to this tree is the following vector
with generic complex coefficients:
[at1 + f t4 + pt9
,bt1 + f t4 + pt9
,ct2 + gt4 + pt9
,
dt1 + ht2 + gt4 + pt9
,et1 + ht2 + gt4 + pt9
,rt1 + xt3 + zt4 + qt9
,
st1 + xt3 + zt4 + qt9
,ut1 + yt3 + zt4 + qt9
,vt1 + yt3 + zt4 + qt9
,
wt4 + qt9 ]
38
Using the operator (c5,10 ◦ c9,10 ◦ c2,5 ◦ c7,9 ◦ c3,5 ◦ c8,9 ◦ c6,7 ◦ c4,5 ◦ c1,2 ) suggested by
the figure we obtain the column reduction M ∗ whose second row is the vector:
[(a − b)t1
,
(b − e)t1 − ht2 + (f − g)t4
,
− et + (c − h)t2
,
(d − e)t1
,
et1 + ht2 + (g − w)t4 + (p − q)t9
,
(r − s)t1
,
(s − v)t1 + (x − y)t3
,
(u − v)t1
,
vt1 + yt3 + (z − w)t4
,
wt4 + qt9 ]
It has valuation vector:
(1, 4, 2, 1, 9, 1, 3, 1, 4, 9) =
(h(v1 ), h(v7 ), h(v5 ), h(v2 ), h(v9 ), h(v3 ), h(v6 ), h(v4 ), h(v8 ))
,
where v1 , v7 , v5 , v2 , v9 , v3 , v6 , v4 , v8 are the preimages of 1, 2, 3, 4, 5, 6, 7, 8, 9 under α,
P
respectively in that order. Also notice that 9i=1 h(vi ) = 35 − 9.
We have shown that the m-dissimilarity vector of a tree T gives a point in the
tropical Grassmannian Gm,n , and therefore satisfies the tropical Plücker relations.
Thus, it gives rise to a tropical linear space. We now study this tropical linear space,
in its tight span version.
39
Chapter 4
The Tropical Linear Space of a
Dissimilarity Vector
The main object of this chapter is a tree T with m-dissimilarity vector d. To keep
our discussion interesting, from now on and for the remaining part of this writing,
d will never be a 1-dissimilarity vector or an n-dissimilarity vector. The purpose of
the chapter is to describe, from both a geometrical and combinatorial point of view,
the space Td .
During the whole chapter, we will sometimes use the symbol ⊆st to denote the
relation of being a subtree of. We will also use (st to denote the relation of being
a proper subtree of. We consciously use these symbols sporadically, combining
them with the use of the words “is a subtree of”. The motivation of the use of this
symbol will be emphasizing the fact that we are comparing trees.
Consider the tree in(T ) ⊆st T that consists of the internal edges and vertices of
T . If T does not contain internal edges or vertices, as a technical convenience we let
in(T ) = Λ, where Λ is the abstract tree consisting of an empty set of vertices and
edges, and which we consider to be a proper subtree of every other tree. We call
40
in(T ) the internal tree of T .
If S ⊆st T , let T /v S be the tree obtained from T by collapsing the subtree S to
a single vertex v.
4.1 Vertices of Td and Their Matroids
Our first and main result on the combinatorial structure of the tight span Td of d
is a description of its vertices.
We prove that the set of vertices of Td is in bijection with the set of subtrees of
in(T ) that satisfy certain conditions.
To state these conditions, let S ⊆st in(T ). Let S be the maximal subtree of T
such that in(S) = S. Notice that S is the subtree of T consisting of all edges and
vertices of T that are in S or adjacent to S. By definition S (st S. We call S the
extended tree of S. It will also be convenient to refer to the set L(S) of leaves of
S as the set of extended leaves of S.
Computing degrees in T , the number of extended leaves |L(S)| of S is
X
v∈V(S)
deg(v) − 2(|V(S)| − 1).
41
The number of vertices of S is
X
deg(v) − 2(|V(S)| − 1) + |V(S)|.
v∈V(S)
We will say that S is an m-good subtree of T if S ⊆st in(T ), |L(S)| ≤ m − 1
and |L(S)| ≥ m + 1.
Using these definitions, we are now ready to state the main result of this section.
Theorem 4.1. Let T be a tree with m-dissimilarity vector d. The set of vertices of
the tight span Td is in bijection with the set of m-good subtrees of T .
In the case where T is a trivalent tree, that is, when each internal edge of T
is of constant degree 3, we can enunciate our result in a more compact way. In this
case deg(v) = 3 for all v ∈ V(S), so the number of extended leaves of S becomes
X
deg(v) − 2(|V(S)| − 1) = 3|V(S)| − 2(|V(S)| − 1) = |V(S)| + 2.
v∈V(S)
Also, the number of vertices of S becomes 2|V(S)| + 2 = (|V(S)| + 2) + |V(S)|.
Therefore, the main result for this case can be stated as follows.
Theorem 4.2. Suppose that T is a trivalent tree with m-dissimilarity vector d. The
set of vertices of the tight span Td is in bijection with the set of trees S ⊆st in(T )
such that |L(S)| ≤ m − 1 ≤ |V(S)|.
To facilitate the exposition of the proof, we introduce some additional notation.
42
Let S ⊆st in(T ). For each leaf i ∈ [n] of T , there is a unique minimal path from i
to S made up solely of edges from E(T )\E(S). Let ω(i, S) be the sum of the weights
of all edges in this minimal path. Notice that ω(i, S) coincides with the intuitive
notion of distance from the leaf i to the subtree S.
From now on, assume that S is an m-good subtree of T as in Theorem 4.1.
For all i ∈ [n], define the entries of x ∈ Rn in the following way:
xi = ω(i, S) +
ω(S)
.
m
Proposition 4.3. The point x ∈ Rn just defined is a vertex of Td .
Proof. It has already been noted that our statement is equivalent to proving that
x is a vertex of the polyhedron Pd . This is what we do. The proof is divided into
several steps.
Notice that for each m-set {i1 , i2 , . . . , im } ⊆ [n] we have:
xi1 + xi2 + · · · + xim = ω(S) + ω(i1 , S) + ω(i2 , S) + · · · + ω(im , S).
Let Ti1 i2 ...im be the subtree of T spanned by the leaves i1 , i2 , . . . , im .
Step I: We have xi1 + xi2 + · · · + xim ≥ di1 i2 ...im with equality if and only if S ⊆st
Ti1 i2 ...im and Ti1 i2 ...im /v S is a v-path-star tree.
To avoid the excessive use of subscripts, let us assume without loss of generality
that i1 = 1, i2 = 2, . . . , im = m. Let Tm = T12...m .
43
First, suppose that Tm does not intersect S at an edge or a vertex. Notice
that ω(Tm ) = d12...m by definition. There exists a unique path P = (vS , vm )
with vS ∈ V(S) and vm ∈ V(Tm ) which consists purely of edges from the set
E(T )\ (E(S) ∪ E(Tm )). This is the minimal path connecting S and Tm . Notice
that this minimal path is made up of internal edges of T , which therefore have
positive weight. As a consequence, ω(P ) > 0. Also, notice that
ω(1, P ) + ω(2, P ) + · · · + ω(m, P ) = ω(1, vm ) + ω(2, vm ) + · · · + ω(m, vm )
≥ d12...m .
However, ω(i, S) = ω(i, P ) + ω(P ) for all i ∈ [m], so
ω(1, S) + ω(2, S) + · · · + ω(m, S) > d12...m .
This implies x1 + x2 + · · · + xm > d12...m .
Now, suppose that S and Tm do intersect at some tree S ∩ Tm . For ease of
notation, let:
Te = T /v (S ∩ Tm ),
Se = S/v (S ∩ Tm ) ⊆st Te and
Tem = Tm /v (S ∩ Tm ) ⊆st Te.
44
Notice that Se is precisely v if S ⊆st Tm .
We can compute some of the new weights:
e = ω(S) − ω(S ∩ Tm ),
ω(S)
ω(Tem ) = ω(Tm ) − ω(S ∩ Tm ) = d12...m − ω(S ∩ Tm ).
e and ω(i, v) as computed in Te.
Now, for all i ∈ [m], define the values of ω(i, S)
We can see that
ω(S) + ω(1, S) + ω(2, S) + · · · + ω(m, S)
e + ω(S ∩ Tm ) + ω(1, S)
e + ω(2, S)
e + · · · + ω(m, S)
e
= ω(S)
e + ω(S ∩ Tm ) + ω(1, v) + ω(2, v) + · · · + ω(m, v).
= ω(S)
But looking at our tree Te, notice that
ω(1, v) + ω(2, v) + · · · + ω(m, v) ≥ ω(Tem ) = d12...m − ω(S ∩ Tm ).
Substituting this in our previous equality, we get
e + ω(S ∩ Tm ) + ω(1, v) + ω(2, v) + · · · + ω(m, v) ≥ ω(S)
e + d12...m .
ω(S)
45
This gives us finally the following inequality:
e = ω(S ∩ Tm ) + ω(1, v) + ω(2, v) + · · · + ω(m, v)
x1 + x2 + · · · + xm − ω(S)
≥ d12...m .
e = 0, that is,
In particular, notice that we can only have equality when ω(S)
whenever Se is a vertex. In that case, we know that Se can only be v, which means
that S = S ∩ Tm . Therefore, a necessary condition for equality is that S ⊆st Tm .
At this point, we have already proved that x ∈ Pd . It remains to study the cases
where we have equality in the defining inequalities of Pd .
We know that S ⊆st Tm . But now, in order to attain the equality in x1 + x2 +
· · · + xm ≥ d12...m , we require that
ω(1, v) + ω(2, v) + · · · + ω(m, v) = ω(Tem ).
This is also a sufficient condition. In order for it to hold, we need that in Te,
the minimal path from i to v and the minimal path from j to v to intersect at v
and only at v, for all different i, j ∈ [m]. That is tantamount to saying that Tem is a
v-path-star tree.
Therefore, x1 + x2 + · · · + xm = d12...m if and only if S ⊆st Tm and Tem is a
46
v-path-star tree. This settles the first step of the proof.
Let us now describe these conditions completely in terms of S. We will first need
to introduce some additional concepts. This is when all the previously developed
notation will begin to come in handy.
Consider the subtree S. It has a number of leaves |L(S)| ≤ m − 1. Pick a leaf
l ∈ L(S). There are some leaves of T whose minimal path to S meets S precisely
at l. These are all the leaves i ∈ [n] such that ω(i, S) = ω(i, l). Recall the extended
tree S of S, which has at least m + 1 leaves, that is |L(S)| ≥ m + 1. This motivates
introducing the following sets.
For all l ∈ L(S), let Hl be the set of leaves of T whose minimal path to S meets
l. Analogously, for all l ∈ L(S) let Rl be the set of leaves of T whose minimal
path to S meets l. Notice that the collection of sets Rl define a partition of [n] as
l ranges over all leaves of S. Let R be the set of classes of this partition, so that
R = {Rl }l∈L(S) . Define H to be the analogous object for S, H = {Hl }l∈L(S) . It is
worth noting that H is also a collection of pairwise disjoint subsets of [n], but not
necessarily a partition of [n].
Also, note that each set in the collection H can be written as a disjoint union of at
least two sets from the collection R, R is a refinement of H. Furthermore, we can say
P
something about the size of these collections: |R| = v∈V(S) deg(v)−2(|V(S)|−1) ≥
47
m + 1 and |H| = |L(S)| ≤ m − 1.
Step II: We have xi1 +xi2 +· · ·+xim = di1 i2 ...im if and only if |{i1 , i2 , . . . , im }∩Hl | ≥ 1
for all l ∈ L(S) and |{i1 , i2 , . . . , im } ∩ Rl | ≤ 1 for all l ∈ L(S).
In order for S to be a subtree of Ti1 i2 ...im , a necessary and sufficient condition
is that |{i1 , i2 , . . . , im } ∩ Hl | ≥ 1 for all l ∈ L(S). On the other hand, in order for
Ti1 i2 ...im /v (S ∩ Ti1 i2 ...im ) to be a v-path-star tree, a necessary and sufficient condition
is that |{i1 , i2 , . . . , im } ∩ Rl | ≤ 1 for all l ∈ L(S). This settles the second step of our
proof.
For each leaf i ∈ [n] of T , let i be the leaf of S such that i ∈ Ri . Notice that the
leaf i also represents the class of i in the partition of [n] induced by S, so we follow
the usual notation for partitions.
Step III: For every pair of different leaves i, j ∈ [n] = L(T ), there exists an (m − 1)set {k1 , k2 , . . . , km−1 } ⊆ L(T )\{i, j} such that xk1 +xk2 +· · ·+xkm−1 +xi = dk1 k2 ...km−1 i
and xk1 + xk2 + · · · + xkm−1 + xj = dk1 k2 ...km−1 j .
Using a relabeling of the leaves of T when necessary, let us prove without loss of
generality that for a pair of different leaves i, j ∈ L(T )\[m − 1] we can construct a
set [m − 1] ⊆ L(T ) such that x1 + x2 + · · · + xm−1 + xi = d12...(m−1)i and x1 + x2 +
· · · + xm−1 + xj = d12...(m−1)j .
We know that i ∈ Ri and j ∈ Rj and it might be the case that i = j. We do
this in a case by case fashion.
48
If i, j ∈ Hl for some l ∈ L(S), choose one leaf of T from each of the sets in
the collection H\Hl to form a set {1, 2, . . . , |L(S)| − 1} of leaves of T . Clearly
i, j 6∈ (R1 ∪ R2 ∪ · · · ∪ R|L(S)|−1 ). As |R\{Ri , Rj }| ≥ m − 1 and |L(S)| − 1 < m − 1,
we complete our set of leaves by picking a total of (m − 1) − (|L(S)| − 1) new
leaves from the sets in the collection R\{R1 , R2 , . . . , R|L(S)|−1 , Ri , Rj }, no two of
which come from the same member of the collection. We thus form a set of leaves
{1, 2, . . . , m − 1} = [m − 1] of T which satisfies our conditions:
x1 + x2 + · · · + xm−1 + xi = d12...(m−1)i and
x1 + x2 + · · · + xm−1 + xj = d12...(m−1)j .
If there does not exist some Hl with l ∈ L(S) such that i, j ∈ Hl , then pick one
leaf from each member of the collection of sets H to form a set {1, 2, . . . , |L(S)|} of
leaves of T , but do this in such way that [|L(S)|]∩Ri = ∅ and [|L(S)|]∩Rj = ∅. This
is possible because we can write each member of H as a disjoint union of at least
two members of R. Notice also that we are now using the fact that |L(S)| ≤ m − 1.
Otherwise, the output of our last construction would be a set of more than m − 1
leaves of T . As |R\{Ri , Rj }| ≥ m − 1, if |L(S)| < m − 1, we can pick one leaf from
each of any m − 1 − |L(S)| different sets of the collection R\{R1 , R2 , . . . , R|L(S)| } to
49
complete our set [m − 1] of leaves of T . Again
x1 + x2 + · · · + xm−1 + xi = d12...(m−1)i and
x1 + x2 + · · · + xm−1 + xj = d12...(m−1)j .
This settles the third step of the proof.
Step IV: The defining equations of Pd at which equality is attained by x are a rank-n
system of equations which define a unique point in Rn . Therefore, x is a vertex of
Pd .
By the previous step, we know that the defining equations of Pd at which equality
is attained let us determine the value of xi − xj for all different i, j ∈ [n]. They also
let us determine xi1 + xi2 + · · · + xim for some set of leaves {i1 , i2 , . . . , im } ⊆ L(T ).
Thus, they are a rank-n system of equations.
The natural question to ask now is whether two different m-good subtrees S and
S 0 of T give rise to different vertices of Td . This is indeed the case.
Proposition 4.4. Let S and S 0 be m-good subtrees of T . Consider the vertices x
50
and x0 of Td given by the equalities, for all i ∈ [n]:
ω(S)
,
m
ω(S 0 )
x0i = ω(i, S 0 ) +
.
m
xi = ω(i, S) +
Then, if S and S 0 are different trees then x and x0 are different vertices of Td .
Proof. To begin, assume that S and S 0 are different trees and suppose that ω(S 0 ) ≤
ω(S). Let x, x0 ∈ Rn be given, for all i ∈ [n], by
ω(S)
,
m
ω(S 0 )
x0i = ω(i, S 0 ) +
.
m
xi = ω(i, S) +
Following the proof of Proposition 4.3, let H and R be the collections of sets of
leaves of T associated to S, and let H0 and R0 be the collections of sets of leaves of
T associated to S 0 . Take a leaf l of S that is not a vertex of S 0 and such that the
minimal path from every element of Hl ∈ H to l does not meet S 0 . Using the ideas
of the proof, construct a set of leaves [m] of T (relabeling when necessary) such that
|[m] ∩ Hl | ≥ 2 and such that
x1 + x2 + · · · + xm = d12...m .
This construction is possible because S has at most m − 1 leaves and at least
51
m + 1 extended leaves.
Now, take two different leaves i, j ∈ ([m] ∩ Hl ) and consider the set Ri0 ∈ R0 .
Notice that i and j are both in Ri0 , so |[m] ∩ Ri0 | > 1. Therefore,
x01 + x02 + · · · + x0m > d12...m .
Combining these results we see that x 6= x0 .
The final step of our study needs to be concerned with making sure that all these
vertices of Td that we have exposed are actually all the vertices of Td .
In order to prove this, we need to change our perspective on tight spans. In
particular, Td can be described as the set of bounded faces of the dual complex of
the subdivision of the m-hypersimplex Hm induced by d. Under this perspective,
vertices of Td correspond to (n − 1)-dimensional faces of this subdivision. These
are the maximal faces of the subdivision because Hm lies in the affine space of Rn
consisting of points x such that x1 + x2 + x3 + · · · + xn = m.
Let us begin by understanding the (n − 1)-dimensional face associated to the
known vertices of Td that we have described. Formally, let S be an m-good subtree
of T . Let H and R be the collections of sets of leaves of T associated to S, as in
the proof of Proposition 4.3. Define x ∈ Rn as usual by the following equalities, for
all i ∈ [n]:
xi = ω(i, S) +
ω(S)
.
m
52
Recall that Bx is the collection of m-sets {i1 , i2 , . . . , im } ⊆ [n] for which xi1 +
xi2 + · · · + xim = di1 i2 ...im . The face Px of the subdivision Dd that corresponds to x
is the convex hull of all points ei1 i2 ...im with {i1 , i2 , . . . , im } ∈ Bx . Now, to describe
Bx we can use our knowledge of H and R. Indeed, notice that {i1 , i2 , . . . , im } ∈ Bx
if and only if
|{i1 , i2 , . . . , im } ∩ Hl | ≥ 1 for all l ∈ L(S),
|{i1 , i2 , . . . , im } ∩ Rl | ≤ 1 for all l ∈ L(S).
We would like to describe Px with inequalities.
Proposition 4.5. Let
Px0 ={y ∈ Hm |
X
yj ≥ 1 for all l ∈ L(S) and
X
yj ≤ 1 for all l ∈ L(S)}.
j∈Rl
j∈Hl
Then, Px = Px0 .
Proof. We study the vertices of Px0 . Let y ∈ Px0 and suppose that 0 < y1 < 1.
If there exists an element of R1 \1, say 2, such that 0 < y2 < 1, then y +
(δ, −δ, 0, 0, . . . , 0) is also a point of Px0 for any δ with |δ| sufficiently small. Thus, y
cannot be a vertex of Px0 .
Otherwise, we have 0 <
P
j∈R1
= y1 < 1. Because y ∈ Hm , there must exist
one element of [n]\1, say 2, such that 0 < y2 < 1. It better be the case that
53
0<
P
j∈R2
= y2 < 1 because otherwise, per the case already handled we would be
done. If there exists some l ∈ L(S) such that 1, 2 ∈ Hl , or if neither one among 1 and
2 is contained in some Hl with l ∈ L(S), then y + (δ, −δ, 0, 0, . . . , 0) is also a point
of Px0 for any δ with |δ| sufficiently small. If only one of them, say 1, is contained in
P
some Hl for l ∈ L(S) and if j∈Hl yj > 1, then again y + (δ, −δ, 0, 0, . . . , 0) is also a
P
point of Px0 for any δ with |δ| sufficiently small. If however j∈Hl yj = 1 then notice
that we must have |Hl | ≥ 2, so we can assume that 2 ∈ Hl , which takes us back to
a case already studied.
Therefore, all vertices of Px0 are 0-1 vectors. But notice that a 0-1 vector belongs
to Px0 if and only if it is equal to ei1 i2 ...im for some {i1 , i2 , . . . , im } ∈ Bx . As Px ⊆ Px0
clearly, we conclude Px = Px0 .
We use Proposition 4.5 to prove the following result:
Proposition 4.6. Let y be a generic point in the interior of Hm . Then, there exists
an m-good subtree S of T such that the point x ∈ Rn defined by
xi = ω(i, S) +
ω(S)
m
for all i ∈ [n], satisfies y ∈ Px .
Remark. Our generic choice of y means precisely that y ∈ Hm does not satisfy the
equalities that we now describe. Let v be a vertex of T , or v ∈ V(T ) in our usual
notation. The vertex v defines a partition Πv on the set of leaves [n] of T . Under
54
this partition, two different leaves i, j ∈ [n] belong to the same class if and only if
the minimal paths from i to v and from j to v contain a common edge adjacent to
v. The number of classes in this partition is equal to the degree of v if v ∈ V (in(T ))
and equal to 2 if v ∈ L(T ).
For each a ∈ Πv , let σa =
P
i∈a
yi . By the statement that y is generic, we require
specifically that σa 6= m − 1 for all v ∈ V(T ) and for all classes a ∈ Πv .
Proof. We first introduce a subset Vy of the set of internal vertices of T :
Vy = {v ∈ V (in(T )) |m − σa > 1 for all a ∈ Πv }.
We prove that Vy 6= ∅. To begin, notice that if we take a vertex v ∈ V (in(T )) \Vy
and there exist two different classes a, b ∈ Πv such that σa > m − 1 and σb > m − 1,
then m ≥ σa +σb > 2m−2 or m < 2. This contradicts one of the earliest assumptions
made in this chapter. Thus, for each v ∈ V (in(T )) \Vy there exists a unique a ∈ Πv
such that σa > m − 1.
If Vy = ∅, among all pairs (v, σa ) with v ∈ V (in(T )) and a ∈ Πv such that
σa > m − 1, take the one with σa minimal. The class a corresponds to a unique
edge e of T adjacent to v. Let u be the other vertex adjacent to the edge e. If
u ∈ V (in(T )) so that deg(u) ≥ 3, then by the minimality of σa and because y is an
interior point of Hm (in particular, y > 0 componentwise), it can only be the case
that σb > m − 1 where b ∈ Πu is the class corresponding to the edge e. However, in
55
this later case we obtain m = σa + σb > 2m − 2 or m < 2, a contradiction. Finally,
if u 6∈ V (in(T )), then u has to be a leaf of T . But then 1 > yu = σa > m − 1 or
m < 2, the first inequality coming from the fact that y ∈ Hm .
Therefore, Vy 6= ∅.
It is not difficult to check that Vy defines a tree S ⊆st in(T ). More precisely, if
we have three different vertices u, v ∈ Vy and w ∈ V (in(T )) such that w lies in the
minimal path from u to v, then w ∈ Vy . For example, if σc > m − 1 for some c ∈ Πw ,
then that implies that either σa > m − 1 or σb > m − 1 holds for some a ∈ Πv or
some b ∈ Πu .
Now, S must have at most m − 1 leaves. Otherwise, for every leaf l of S, let
al ∈ Πl correspond to the edge of S adjacent to l. Then:
m≥
X
l∈L(S)
(m − σal ) >
X
1 = |L(S)| ≥ m.
l∈L(S)
Also, S must have at least m + 1 extended leaves. To see this, notice that the
set of leaves L(S) of S (the set of extended leaves of S) and the set of vertices
V(S) = Vy of S are disjoint. For each leaf l ∈ L(S), let al ∈ Πl correspond to
the edge e adjacent to S. It has to be the case that σal > m − 1 or 1 > m − σal .
Otherwise, if we let v be the vertex of Vy adjacent to e and a ∈ Πv be the class
corresponding to the edge e, then we would have σa > m − 1, contradicting the fact
that v ∈ Vy . But then, as the set of leaves L(S) of S induces a partition of [n], we
56
see that:
X
m=
l∈L(S)
(m − σal ) <
X
1 = |L(S)| ≤ m.
l∈L(S)
To sum up these arguments, if l ∈ L(S) and a ∈ Πl is the class corresponding
to the edge of S adjacent to l, then m − σa > 1. On the other hand, if l ∈ L(S)
and a ∈ Πl is the class corresponding to the edge adjacent to both S and l, then
m − σa < 1. Therefore, if we let x be defined as in the statement of the theorem,
then y ∈ Px per the description of Px in terms of inequalities.
4.2 Higher Dimensional Faces of Td and Their Matroids
Let us now understand the remaining faces of the tight span Td associated to T .
The idea is to study the internal faces of Px for a vertex x of Td .
Suppose that we have a subtree S ⊆st in(T ), where |L(S)| ≤ m − 1. Suppose
also that there exists some A ⊆ L(S) with 0 ≤ |A| = i ≤ m − 1, and such that the
number of extended leaves of S that are not adjacent to an element of A is at least
m + 1 − i. Computing degrees in T , this final assumption is equivalent to having:
X
deg(v) − 2(|V(S)| − 1) + 2i ≥ m + 1.
v∈V(S)\A
We call the pair (S, A) an (m, i)-good pair whenever each of these conditions
is satisfied.
57
Notice that taking i = 0, the left hand side of our inequality becomes precisely
|L(S)|, the number of extended leaves of S. Thus, we regard the conditions as a
generalization of the conditions imposed on S when we studied the vertices of Td .
In fact, if we let (S, A) be a (m, i)-good pair, then, as deg(v) ≥ 3 for all v ∈ A, we
get:
X
v∈V(S)
X
deg(v) − 2(|V(S)| − 1) ≥
deg(v) + 3i − 2(|V(S)| − 1) ≥ m + 1 + i.
v∈V(S)\A
Therefore, if (S, A) is an (m, i)-good pair, then (S, ∅) is also a (m, 0)-good pair,
which is equivalent to saying that S is an m-good subtree of T . Furthermore, the
former is a stronger condition than the later whenever i > 0.
Actually, this last phenomenon is part of a more general situation: If (S, A) is
an (m, i)-good pair and B ⊆ A, then (S, B) is an (m, j)-good pair where |B| = j.
Lemma 4.7. Suppose that (S, A) is an (m, i)-good pair and let x be the vertex of
Td associated to S. Recall the subpolytope Px of Hm arising from x. Consider the
matroid polytope consisting of all y ∈ Px that satisfy the following i equalities:
X
yj = 1 for all l ∈ A.
j∈Hl
Then, the face of Td dual to this matroid polytope is i-dimensional.
Proof. Computing the degrees in T , note that for each l ∈ L(S), the number of
58
extended leaves of S adjacent to l is deg(l) − 1.
Also, note that the total number of extended leaves of S which are not adjacent
to a leaf of S is
X
X
deg(v)−(2(|V(S)| − 1) − |L(S)|) =
v∈V(S)\L(S)
deg(v)−2(|V(S)|−1)+|L(S)|.
v∈V(S)\L(S)
With these motivations, let
ϕ
RL(S)\A × R −
→ R,

ϕ(r, c) =
i+
X
rl (deg(l) − 1) + c 
l∈L(S)\A
Define k = (kl )l∈L(S)\A by letting kl =

X
deg(v) − 2(|V(S)| − 1) + |L(S)| .
v∈V(S)\L(S)
1
deg(l)−1
for all l ∈ L(S)\A and 1 =
(1, 1, 1, . . . , 1) ∈ RL(S)\A . Notice that 0 < k < 1 componentwise. We have:
ϕ(k, 0) = i +
X
1 + 0 = i + |L(S)\A| = i + (|L(S)| − i) = |L(S)|
l∈L(S)\A
≤ m − 1 < m.
59
Moreover:
X
ϕ(1, 1) = i +
l∈L(S)\A
X
=i+
X
(deg(l) − 1) +
v∈V(S)\L(S)
deg(l) − (|L(S)| − i) +
l∈L(S)\A
=
X
l∈L(S)\A
=
X
deg(v) − 2(|V(S)| − 1) + |L(S)|
deg(l) +
X
deg(v) − 2(|V(S)| − 1) + |L(S)|
v∈V(S)\L(S)
X
deg(v) + 2i − 2(|V(S)| − 1)
v∈V(S)\L(S)
deg(v) + 2i − 2(|V(S)| − 1) ≥ m + 1 > m.
v∈V(S)\A
That is, ϕ(k, 0) < m < ϕ(1, 1).
Therefore, there exists a vector r∗ ∈ RL(S)\A and a real number c∗ such that
(k, 0) < (r∗ , c∗ ) < (1, 1) componentwise, and such that ϕ(r∗ , c∗ ) = m.
Using this pair (r∗ , c∗ ), define a vector y∗ ∈ Rn by:



1/|Hl | if j ∈ Hl and l ∈ A,



yj∗ =
r∗l /|Rj | if j ∈ Hl and l ∈ L(S)\A,




 c∗ /|R | if j ∈
6 Hl for all l ∈ L(S).
j
Then y∗ ∈ Px satisfies the following and only the following i equalities among
all defining inequalities of Px , including those inherited from the description of Hm :
X
j∈Hl
yj∗ = 1 for all l ∈ A.
60
The consequence worth stressing here is that y∗ is then an interior point of Hm .
But furthermore, we conclude that the set of all y ∈ Px satisfying the aforementioned
i equalities is an (m − 1 − i)-dimensional internal face of Px , of which y∗ is an
interior point. That is, y∗ lies in the relative interior of the defined face. In turn,
this face corresponds to its dual, an i-dimensional face of Td .
Remark. We will refer to the situation of the previous proof by saying that the
(m, i)-good pair (S, A) gives rise to an i-dimensional face of Td .
Let’s now describe the matroid associated to the i-dimensional face F of Td
arising from an (m, i)-good pair (S, A). Let x be the vertex of Td associated to S
and Px the subpolytope of Hm associated to x. The set of bases of the matroid of
F comes from the 0-1 vectors y ∈ Px satisfying the equalities
X
yj = 1 for all l ∈ A.
j∈Hl
These 0-1 vectors, in turn, correspond to all m-sets {i1 , i2 , . . . , im } ∈ Bx for
which
|{i1 , i2 , . . . , im } ∩ Hl | = 1 for all l ∈ A.
We then see that the set of bases of our rank-m matroid over [n] associated to
61
F is precisely the collection BF of all m-sets {i1 , i2 , . . . , im } ⊆ [n] satisfying:
|{i1 , i2 , . . . , im } ∩ Hl | = 1 for all l ∈ A,
|{i1 , i2 , . . . , im } ∩ Hl | ≥ 1 for all l ∈ L(S)\A,
|{i1 , i2 , . . . , im } ∩ Rl | ≤ 1 for all l ∈ L(S).
A point w ∈ ∂(Pd ) belongs to F if and only if BF ⊆ Bw . In particular, notice
that x ∈ F .
Lemma 4.8. Let F be the face of Td arising from an (m, i)-good pair (S, A) with
i > 0. Let x0 be the vertex of Td arising from an m-good subtree S 0 of T . Then, a
necessary condition for having x0 ∈ F is that S 0 ⊆st S.
Proof. First, note that S is not a vertex of T and m > 2.
To S, associate the collections H = {Hl }l∈L(S) and R = {Rl }l∈L(S) of subsets of
[n]. To S 0 , associate the collections H0 = {Hl0 }l∈L(S 0 ) and R0 = {Rl0 }l∈L(S 0 ) of subsets
of [n].
Let BF be the set of bases of the matroid of the face F . That is, let BF be the
collection of all m-sets {i1 , i2 , . . . , im } ⊆ [n] satisfying
|{i1 , i2 , . . . , im } ∩ Hl | = 1 for all l ∈ A,
|{i1 , i2 , . . . , im } ∩ Hl | ≥ 1 for all l ∈ L(S)\A,
|{i1 , i2 , . . . , im } ∩ Rl | ≤ 1 for all l ∈ L(S).
62
Also, recall that the set of bases Bx0 of the matroid of x0 is the collection of all
m-sets {i1 , i2 , . . . , im } ⊆ [n] satisfying
|{i1 , i2 , . . . , im } ∩ Hl0 | ≥ 1 for all l ∈ L(S 0 ),
|{i1 , i2 , . . . , im } ∩ Rl0 | ≤ 1 for all l ∈ L(S 0 ).
Suppose that S 0 is not a subtree of S and let u ∈ L(S 0 )\V(S) be such that the
minimal path from any element of Hu0 to u does not meet S. We want to prove that
x0 is not a vertex of F , or that BF 6⊆ Bx0 . Carefully consider our set Hu0 . For each
l ∈ L(S), take one element from Hl \Hu0 to form a subset [|L(S)|] = {1, 2, . . . , |L(S)|}
of leaves of T disjoint from Hu0 -after a relabeling of the leaves if necessary. This
is possible because the leaves of S are at least of degree 3 in T . As (S, A) is an
(m, i)-good pair, the number of elements in L(S) not adjacent to an element of A
is at least m + 1 − i. If we let u ∈ L(S) be the vertex at which the minimal path
from u to S meets S, we see that the number of elements in L(S)\u not adjacent to
an element of A is then at least m − i. Moreover, as the elements of [|L(S)|] come
from |L(S)| different members of R, this shows that the number of extended leaves
l ∈ L(S)\u not adjacent to an element of A and such that [|L(S)|] ∩ Rl = ∅, is at
least m − |L(S)| = m − i − (|L(S)| − i). Therefore, we can complete our set [|L(S)|]
of leaves of T up to a set [m] such that [m] ∩ Hu0 = ∅ and [m] ∈ BF . But then,
[m] 6∈ Bx0 .
63
Proposition 4.9. There is a one-to-one function from the set of (m, i)-good pairs
(S, A), to the set of i-dimensional faces of Td .
Proof. Let x be the vertex of Td coming from S and let Px be the associated
subpolytope of Hm . Map the (m, i)-good pair (S, A) to the face F of Td dual to
the matroid polytope of all y ∈ Px satisfying
X
yj = 1 for all l ∈ A.
j∈Hl
Per Lemma 4.7, F is i-dimensional. We need only be concerned with proving
that our map is injective.
Let (S, A) and (S 0 , A0 ) be two good pairs with corresponding i-dimensional faces
F and F 0 . Suppose that S and S 0 have, respectively, associated vertices x and x0 of
Td . Using Lemma 4.8, we see that a necessary condition for having x0 ∈ F is that
S 0 ⊆st S. Also, a necessary condition for having x ∈ F 0 is that S ⊆st S 0 . Therefore,
as x ∈ F and x0 ∈ F 0 , if F = F 0 then we must have S = S 0 .
However, Lemma 4.7 shows that if (S, A) and (S, A0 ) are (m, i)-good pairs with
A=
6 A0 , then the corresponding faces of Td are different. In that case, the relative
interiors of the internal faces of Px given by A and A0 are nonempty and disjoint.
If T is a trivalent tree, we can restate these results without further mention of
64
extended leaves. In that case, we have
X
deg(v)−2(|V(S)|−1)+2i = 3(|V(S)|−i)−2(|V(S)|−1)+2i = (|V(S)| − i)+2.
v∈V(S)\A
Therefore, whether (S, A) is a good pair or not does not depend on the specific
choice of i-set A, but only on the value i and the tree S. We can check the following
result:
Proposition 4.10. Let T be a trivalent tree, 0 ≤ i ≤ m − 1 and S ⊆st in(T ).
Then, a pair (S, A) with A ⊆ L(S) and |A| = i is an (m, i)-good pair if and only
if |L(S)| ≤ m − 1 and |V(S)| ≥ m − 1 + i. All such (m, i)-good pairs give rise to
i-dimensional faces of Td . Furthermore, each i-dimensional face of Td corresponds
to at most one such pair.
The first central theorem of this section establishes that good pairs give rise to
all i-dimensional faces of Td whenever 0 ≤ i < m − 1.
Theorem 4.11. Let T be a tree with m-dissimilarity vector d. Suppose that Td
is the associated tight span and let 0 ≤ i < m − 1. Then, there is a one-to-one
correspondence between (m, i)-good pairs (S, A) and i-dimensional faces of Td .
Proof. We need only prove that for every i-dimensional face F of Td with 0 ≤ i <
m − 1, there is a good pair giving rise to it.
Let x be a vertex of F and let S be the m-good subtree of T giving rise to x.
65
Let F ◦ be the face of the subdivision of Hm dual to F . We know F ◦ is an
(n − 1 − i)-dimensional internal face of Px , with 0 ≤ i < m − 1. There exist sets
A ⊆ L(S) with |A| = i1 and B ⊆ L(S) with |B| = i2 such that i = i1 + i2 , and such
that F ◦ can be described as the set of all y ∈ Px satisfying the equalities
X
yj = 1 for all l ∈ A,
j∈Hl
X
yj = 1 for all l ∈ B.
j∈Rl
It must be the case that the number of extended leaves of S that are not adjacent
to an element of A is at least m + 1 − i1 . Otherwise, as F ◦ is a subset of Hm , this
number would have to be precisely m − i1 , and then all y ∈ F ◦ would be forced to
satisfy the following equalities from the description of Px :
X
yj = 1 for all extended leaves l ∈ L(S) not adjacent to an element of A,
j∈Rl
X
yj = 1 for all l ∈ A.
j∈Hl
However, this is a rank-m system of equations, so F ◦ would have to live in an
(n − m)-dimensional space of Rn . This is false because F ◦ is (n − 1 − i)-dimensional
with 0 ≤ i < m − 1.
Therefore, the number of extended leaves of S that are not adjacent to an element
of A is at least m + 1 − i1 and furthermore, (S, A) is an (m, i1 )-good pair.
66
Let S B be the subtree of in(T ) obtained as the span of the set of vertices V(S)∪B.
Notice that S B arises after appending to S the i2 edges adjacent to both S and
B. Elements of B now become leaves of S B . Our tree S B does have to be a subtree
of in(T ) because of the assumed equalities for B in the description of F ◦ and the fact
that F ◦ is internal. Also, notice that no element of B is adjacent to an element of
P
A in T . Otherwise, we would have j∈Rl yj = 0 for some l ∈ L(S)\B, which again
contradicts that F ◦ is internal. As there are at least m + 1 − i1 extended leaves of S
which are not adjacent to an element of A and these leaves include the i2 elements
of B, we conclude that there must be at least m + 1 − i = m + 1 − i1 − i2 extended
leaves of S B which are not adjacent to elements of A ∪ B.
The set A is also a set of leaves of S B , so A ∪ B ⊆ L(S B ). These sets satisfy
A ∩ B = ∅, showing that S B has at least i leaves. Let HB = {HlB }l∈L(S B ) and
RB = {RlB }l∈L(S B ) be the collections of subsets of [n] = L(T ) associated to S B . By
construction L(S B )\ (A ∪ B) ⊆ L(S), so all elements y ∈ F ◦ must satisfy the
following conditions:
X
yj = 1 for all l ∈ A ∪ B,
j∈HlB
X
yj ≥ 1 for all l ∈ L(S B )\ (A ∪ B) .
j∈HlB
As F ◦ lies in Hm , we see from here that |L(S B )| ≤ m. If |L(S B )| = m,
then among these expressions, equality must hold in each and everyone of the
67
last |L(S B )| − i inequalities, giving us a rank-m system of equations which all elements of F ◦ must satisfy. As we have seen, this yields a contradiction because
dim(F ◦ ) = n − 1 − i > n − m.
Thus |L(S B )| ≤ m − 1. Moreover, we have now all the necessary ingredients to
conclude that (S B , A ∪ B) is an (m, i)-good pair.
To finish, let xB be the vertex of Td associated to S B and consider:
PxB ={y ∈ Hm |
X
yj ≥ 1 for all l ∈ L(S B ) and
j∈HlB
X
yj ≤ 1 for all l ∈ L(S B )}.
j∈RlB
Then, F ◦ is precisely the set of all y ∈ PxB that satisfy the following i equalities,
each of which comes from the description of PxB :
X
yj = 1 for all l ∈ A ∪ B.
j∈HlB
Therefore, (S B , A ∪ B) is an (m, i)-good pair giving rise to F .
It remains to study the pathological case of the (m − 1)-dimensional faces of Td .
We have had glimpses of it previously. For example, we already know that some
(m − 1)-dimensional faces of Td do arise from (m − 1)-good pairs (S, A). In that
case, notice that this means that S has exactly m − 1 leaves and that the number of
extended leaves of S that are not adjacent to leaves of S is at least 2. In the case of
trivalent trees, this translates to saying that S has exactly m − 1 leaves and a least
68
2(m − 1) vertices.
To see why these are not necessarily all (m − 1)-dimensional faces of Td , we first
need to introduce some additional terminology.
Suppose that S ⊆st in(T ) is such that the collection H of sets of leaves of T
associated to S is a partition of [n]. We then say that S is a full subtree of T .
This is a natural tag for S, as can be checked by doing some small examples. One
example of full subtrees that we have found all the way up to this point are the
subtrees of in(T ) that arise as extended trees.
One favorable aspect of a full m-subtree S of T is that any proper subtree of
S that properly contains in(S) is an m-good subtree of T . However, neither S nor
in(S) are m-good subtrees of T . In particular, notice that full m-subtrees of T do
not arise as extended trees of m-good subtrees of T .
We want to prove the following result:
Theorem 4.12. Let T be a tree with m-dissimilarity vector d and let Td be the
associated tight span. Let F be the union of the following sets:
• the set of (m, m − 1)-good pairs (S, A);
• the set of full m-subtrees of T .
Then, there is a one-to-one correspondence between the elements of F and the (m −
1)-dimensional faces of Td .
In the case of trivalent trees, this result can be stated in a more convenient way.
69
Theorem 4.13. Let T be a trivalent tree with m-dissimilarity vector d and let Td
be the associated tight span. Let F be the union of the following sets:
• the set of subtrees S of in(T ) with m − 1 leaves and at least 2(m − 1) vertices,
• the set of subtrees S of in(T ) with m leaves and 2(m − 1) vertices.
Then, there is a one-to-one correspondence between the elements of F and the (m −
1)-dimensional faces of Td .
To understand why this result holds and how full m-subtrees of T come into
play, let’s study an (m − 1)-dimensional face F of Td and the way in which it arises
from T . Let F ◦ be the dual of F in the subdivision of Hm . We know that F ◦ is
(n − m)-dimensional. Also, F ◦ is an internal face of some matroid polytope Px ,
where x is a vertex of F . Let S be the m-good subtree of T giving rise to x.
There exist pairs (A, B) of sets of vertices of T such that
• A ⊆ L(S) with |A| = i1 ;
• B ⊆ L(S) with |B| = i2 ;
• m − 1 ≤ i1 + i2 ;
70
• F ◦ can be described as the set of all y ∈ Px satisfying the equalities:
X
yj = 1 for all l ∈ A,
j∈Hl
X
yj = 1 for all l ∈ B.
j∈Rl
Among all such pairs, choose one with i1 + i2 maximal.
No element of B is adjacent in T to an element of A because F ◦ is an internal
face of Px . Therefore, we must have i1 + i2 ≤ m. Extend S to a tree S B as in
the proof of the theorem, and notice that A ∪ B is a set of i1 + i2 leaves of S B .
Notice also that S B ⊆st in(T ) because F ◦ is internal. If we consider the collections
HB = {HlB }l∈L(S B ) and RB = {RlB }l∈L(S B ) of subsets of [n] associated to S B , then
all points y ∈ F ◦ satisfy
(F1)
X
yj = 1 for all l ∈ A ∪ B,
j∈HlB
(F2)
X
yj ≥ 1 for all l ∈ L(S B )\ (A ∪ B) ,
j∈HlB
(F3)
X
yj ≤ 1 for all extended leaves l ∈ L(S B ) not adjacent to a leaf of S B .
j∈RlB
The family (F1) of expressions contains i1 + i2 equalities and the family (F2)
contains |L(S B )| − (i1 + i2 ) inequalities. Both (F1) and (F2) consist of a total
of |L(S B )| expressions and we check that |L(S B )| ≤ m. If i1 + i2 = m − 1 <
71
m = |L(S B )|, we can see that the family (F2) consists of precisely one inequality,
which is then forced to be an equality. As L(S B )\ (A ∪ B) ⊆ L(S), the pair
L(S B )\ (A ∪ B) ∪ A, B contradicts the maximality of i1 + i2 .
Therefore, i1 + i2 = |L(S B )| and the family (F2) of expressions must be void.
That is, L(S B ) = A ∪ B.
Assume that |L(S B )| = i1 + i2 = m − 1. The family (F3) must contain at
least one inequality. If it contains at least 2 inequalities, then (S B , A ∪ B) is an
(m, m − 1)-good pair giving rise to F . On the other hand, if it contains one and
only one inequality, then this inequality is forced to be an equality. That is, if the
family (F3) contains one expression, then there exists one and only one extended
P
leaf l of S B not adjacent to a leaf of S B , and for this l we have j∈RB yj = 1.
l
Moreover, l ∈ L(S)\B. Therefore, we see that the pair (A, B ∪ l) contradicts the
maximality of i1 + i2 .
Finally, assume that |L(S B )| = i1 + i2 = m. In that case, the family (F3) must
also be void because F ◦ is internal. We identify here the pathology we were looking
for. In this final case, notice that S B is a subtree of in(T ) which has m leaves and
such that all elements of L(S B ) are adjacent to elements of L(S B ). Notice that
this final condition is equivalent to saying that the collection HB associated to S B
defines a partition of [n] into m classes. The (n − m)-dimensional internal face of
72
the subdivision of Hm arising from S B is the set of all y ∈ Hm satisfying
X
yj = 1 for all l ∈ L(S B ).
j∈HlB
The matroid associated to this maximal face of Td is then the rank-m transversal
matroid whose bases are the transversals of the collection HB = {HlB }l∈L(S B ) .
Using our now fully developed methods, it can be checked that the following
holds. Let S be a full m-subtree of T and let H = {Hl }l∈L(S) be the collection of
subsets of [n] associated to S. Also, let S 0 be a tree such that in(S) (st S 0 (st S and
let x0 be the vertex of Td coming from S 0 . Then, the intersection of Px0 and the space
P
of all y ∈ Rn such that j∈Hl yj = 1 for all l ∈ L(S), is an (n − m)-dimensional
face of Px0 . As m > 1, for every such S we can find S 0 .
Therefore, every full m-subtree of T gives rise to an (m − 1)-dimensional face of
Td . Moreover, considering the matroids of the faces, no two different full m-subtrees
give rise to the same (m − 1)-dimensional face because the bases of the matroids
would be different.
4.3 A Combinatorial Description of the Geometric Object Td
We now use our results to study an i-dimensional face F of Td . Notice that F is a
face of Pd , so F maximizes some linear functional on Pd . Therefore, F is the convex
hull of its vertices and F is a polytope. Also, every vertex of Pd that lies in F must
73
be a vertex of F : restrict the domain of the linear functional on Pd maximized by
this vertex to F . On the other hand, every vertex of F is a vertex of Pd , as can be
seen from the description of both F and Pd in terms of inequalities. Thus, in order
to understand F , we can find all vertices of Td that are contained in F . This is our
initial plan.
We will omit any consideration of the empty face in the face lattice of a polytope,
so we speak of and consider vertices as “minimal” faces. The same rule will apply
to the construction of f -vectors, where we will omit the entry f−1 .
For a tree S ⊆st T and a set A ⊆ L(S), let SA be the subtree of S spanned by
the vertices in V(S)\A.
4.3.1 Faces from (m, i)-good pairs.
We begin studying a face F of Td coming from an (m, i)-good pair (S, A) with i > 0.
Lemma 4.14. Let F be a face of Td coming from an (m, i)-good pair (S, A) with
i > 0. If x0 is a vertex of F coming from an m-good subtree S 0 of T , then SA ⊆st
S 0 ⊆st S. Also, if a tree S 0 satisfies SA ⊆st S 0 ⊆st S, then S 0 is an m-good subtree
of T giving rise to a vertex x0 of F .
Proof. To S, associate the collections H = {Hl }l∈L(S) and R = {Rl }l∈L(S) of subsets
of [n]. To S 0 , associate the collections H0 = {Hl0 }l∈L(S 0 ) and R0 = {Rl0 }l∈L(S 0 ) of
subsets of [n].
Per Lemma 4.8, we know that if x0 is a vertex of F , then S 0 ⊆st S.
74
Let us now assume that S 0 ⊆st S and that x0 is a vertex of F .
We now see that S 0 must contain all leaves of S different from A. If there
exists some l ∈ L(S)\ (A ∪ V(S 0 )), then we must have |{i1 , i2 , . . . , im } ∩ Hl | = 1
for all m-sets {i1 , i2 , . . . , im } ∈ BF ∩ Bx0 . However, it is easy to construct a set
{i1 , i2 , . . . , im } ∈ BF for which |{i1 , i2 , . . . , im } ∩ Hl | = 2 because |L(S)| ≤ m − 1.
Therefore, BF 6⊆ Bx0 and x0 is not a vertex of F .
Let’s now see that S 0 must contain all the vertices of S adjacent to elements of
A. For any l ∈ A, let v be the vertex of S adjacent to l.
First, if S 0 = l, then |Rv0 ∩ {i1 , i2 , . . . , im }| ≥ 2 for all {i1 , i2 , . . . , im } ∈ BF
because l ∈ A and m > 2: S 0 cannot be equal to l. But then, S 0 must contain v.
Suppose on the contrary that v 6∈ V(S 0 ) and let u be the vertex at which the minimal
path from v to S 0 meets S 0 . The vertex v is of degree at least 3 in T . Choose an
edge (v, w) 6= (v, l) such that w does not lie in the minimal path from v to S 0 . Now:
• If w ∈ V(S), there exists a leaf l0 ∈ L(S)\l such that the minimal path from
l0 to (v, w) meets (v, w) precisely at w.
• If w 6∈ V(S), then w ∈ L(S).
In the first case, no element of BF lies in Bx0 because if {i1 , i2 , . . . , im } ∈ BF ,
then |{i1 , i2 , . . . , im } ∩ Ru0 | ≥ 2. In the second case, as |L(S)| ≤ m − 1, we can
construct an m-set {i1 , i2 , . . . , im } ∈ BF for which |{i1 , i2 , . . . , im } ∩ (Hl ∪ Rw ) | = 2,
so |{i1 , i2 , . . . , im } ∩ Ru0 | ≥ 2. Thus, in both cases, we conclude BF 6⊆ Bx0 .
75
This proves that if x0 is a vertex of F coming from an m-good subtree S 0 of T ,
then SA ⊆st S 0 ⊆st S.
For the other direction, let S 0 satisfy SA ⊆st S 0 ⊆st S. Note that the set of
extended leaves of S which are not adjacent to an element of A is contained in
L(SA ). Moreover, L(SA ) must be the union of this set and A. As the number of
extended leaves of S which are not adjacent to an element of A is at least m + 1 − i,
then |L(SA )| ≥ i + (m + 1 − i) = m + 1. If S 0 is a subtree of S containing SA , then
m + 1 ≤ |L(SA )| ≤ |L(S 0 )|,
|L(S 0 )| ≤ |L(S)| ≤ m − 1.
Therefore, S 0 is an m-good subtree of T that gives rise to a vertex x0 of Td .
Now, notice that L(S 0 ) ⊆ A ∪ L(S) . As a consequence, for {i1 , i2 , . . . , im } ∈
BF we have
|{i1 , i2 , . . . , im } ∩ Rl0 | ≤ 1 for all l ∈ L(S 0 ).
Furthermore, every leaf of S 0 must be a leaf of S or adjacent to an element of A.
In both cases, we check that if {i1 , i2 , . . . , im } ∈ BF , then
|{i1 , i2 , . . . , im } ∩ Hl0 | ≥ 1 for all l ∈ L(S 0 ).
Thus, for each such S 0 we have BF ⊆ Bx0 , or x0 ∈ F .
76
Remark. Notice that a tree S 0 satisfying SA ⊆st S 0 ⊆st S can be described in our
usual notation as (SA )B for some B ⊆ A, the subtree of S spanned by the set of
vertices V(SA ) ∪ B. Recall that (SA )B can also be seen as the tree arising from SA
after appending the edges adjacent to both SA and B. For example, (SA )A = S. But
then, note that every such tree (SA )B with B ⊆ A is a subtree of S which contains
SA , and there are a total of 2i = 2|A| of these trees. We conclude that the vertices
of F are given by subtrees (SA )B with B ⊆ A, and the number of such vertices is
2i = 2|A| .
Let us now study the faces of a face F coming from an (m, i)-good pair (S, A)
with i > 0.
The argument given at the beginning of this section generalizes to all faces F 0 of
F . If F 0 is a face of Pd contained in F , restricting the domain of the linear functonal
on Pd maximized by F 0 to F shows that F 0 is also a face of F . If F 0 is a face of
F , the description of both Pd and F in terms of inequalities shows that F 0 is also
a face of Pd . Therefore, to study the faces of F we need only study the faces of Pd
contained in F .
Lemma 4.15. Let F be a face of Td coming from an (m, i)-good pair (S, A) with
i > 0. For a triple C ⊆ B ⊆ A, the pair (SA )B , C is an (m, |C|)-good pair. The
set of triples C ⊆ B ⊆ A is in bijection with the set of faces of our i-dimensional
face F via the map taking the triple C ⊆ B ⊆ A to the face of Td arising from the
B
(m, |C|)-good pair (SA ) , C . Moreover, the set of k-dimensional subfaces of F is
77
in one-to-one correspondence with the triples C ⊆ B ⊆ A such that |C| = k via this
map.
Proof. Notice that if F 0 is a face of F , then F 0 must also come from an (m, j)-good
pair (S 0 , A0 ) with j ≤ i. For example, consider the dimension of F 0 to check this.
But the vertices of F 0 must be vertices of F as well, so:
• S 0 must be equal to (SA )B for some B ⊆ A;
B\C
0
• SA
for some C ⊆ B ⊆ A, and then A0 = B\ (B\C) =
0 must be equal to (SA )
C.
Therefore, the pair (S 0 , A0 ) must actually be equal to (SA )B , C .
To check that every pair (SA )B , C with C ⊆ B ⊆ A is indeed a good pair, it
is enough to check that (SA )B , B is a good pair. In order to do this, recall that
we want to verify that the number of extended leaves of (SA )B not adjacent to B
is at least m + 1 − |B|. This is equivalent to verifying |L(SA )\{B}| ≥ m + 1 − |B|.
As (S, A) = (SA )A , A is a good pair, by analogy we know that |L(SA )\{S}| ≥
m + 1 − |A| = m + 1 − i. But the containments B ⊆ A ⊆ L(SA ) show that
|L(SA )\{B}| = |L(SA )\{S}| + |A\B|
≥ (m + 1 − |A|) + (|A| − |B|)
= m + 1 − |B|.
78
Therefore, the pair (SA )B , C with C ⊆ B ⊆ A is a good pair. It gives rise to
a |C|-dimensional face of F .
Now, suppose that we have triples C ⊆ B ⊆ A and C 0 ⊆ B 0 ⊆ A. From them, we
0
obtain faces with associated pairs ((SA )B , C) and ((SA )B , C 0 ). These faces are equal
0
if and only if the pairs coincide, so that (SA )B = (SA )B and C = C 0 . Of course, then
B = B0 .
Using Lemma 4.15, let us now count the number of k-dimensional subfaces of
F . They are in one-to-one correspondence with the triples C ⊆ B ⊆ A such that
|C| = k. For the last triples, there are ki choices for C. Then, for a fixed C, there
are 2i−k choices for B. We conclude that the f -vector of F is given by
i−k
fk (F ) = 2
i
for all k with 0 ≤ k ≤ i.
k
The total number of faces of F is then 3i .
Now, for a triple C ⊆ B ⊆ A with associated k-dimensional face F 0 of F , we
describe all triples C 0 ⊆ B 0 ⊆ A giving rise to a face of F containing F 0 . Notice
that a necessary condition is that C ⊆ C 0 . Also, we require that B\C 0 = B 0 \C 0 . But
then, B 0 = (B 0 \C 0 ) ∪ C 0 = (B\C 0 ) ∪ C 0 = B ∪ C 0 . These conditions can be seen to be
sufficient conditions. Therefore, all faces of F containing F 0 come from triples
C 0 ⊆ (B ∪ C 0 ) ⊆ A with C ⊆ C 0 .
79
The number of such triples is then given by the number of sets C 0 ⊆ A containing
i−k
C, which is 2i−k . The number of such triples for which |C 0 | = p is then given by p−k
.
That is, the number of p-dimensional faces of F containing any given k-dimensional
i−k
face of F is p−k
. Letting k = 0 and p = i − 1, we see that the polytope F is simple.
To finish, let us expose the reason why we are obtaining these familiar results.
Theorem 4.16. Let T be a tree with m-dissimilarity vector d and let Td be the
associated tight span. If F is a face of Td coming from an (m, i)-good pair (S, A),
then F is an i-dimensional polytope and its face lattice is isomorphic to the face
lattice of the i-dimensional cube.
Proof. Consider the map
φ
{Triples of F } →
− {Faces of the 0-1 i-dimensional cube in RA },
φ
C⊆B⊆A→
− Convex hull of all 0-1 vectors with 0’s in A\B and 1’s in B\C.
The map φ induces an isomorphism between the lattice of faces of F and the
lattice of faces of the i-cube.
4.3.2 Faces from full subtrees of T .
We now want to develop a similar study for an (m − 1)-dimensional face F of Td
coming from a full m-subtree S of T . However, as any proper face F 0 of F is a
face of Pd with dimension less than m − 1, this will be considerably simplified if we
80
remember that then F 0 must come from a good pair, which is a case we have already
taken care of.
Let us first find the vertices of F . Notice that we no longer have restrictions on
the value of m, so m ≥ 2.
Lemma 4.17. Let F be an (m − 1)-dimensional face of Td coming from a full msubtree S of T . A tree S 0 satisfying in(S) (st S 0 (st S is an m-good subtree of T
and gives rise to vertex of F . All vertices of F are obtained in this way.
Proof. Let x0 be a vertex of F . Let S 0 be the m-good subtree of T giving rise to x0 .
If S 0 is not a subtree of S, let u ∈ L(S 0 )\V(S) be such that the minimal path from
any element of Hu0 to u does not meet S. Construct a transversal of the collection
H that does not contain any member of Hu0 . The construction is possible because
the element l ∈ L(S) at which the minimal path from u to S meets S is of degree
at least 3. This shows that BF 6⊆ Bx0 , so x0 6∈ F , yielding a contradiction.
If S 0 ⊆st S, then we must have also in(S) ⊆st S 0 . In case m = 2, we have
in(S) = Λ and the contaiment is trivial. If m ≥ 3 and in(S) is not a subtree of S 0 ,
there exists an element u ∈ L(S 0 ) such that for every transversal [m] of H we have
|Ru0 ∩ [m]| ≥ 2. This argument is completely analogous to the one given during the
related discussion for good pairs, only easier. Then, BF 6⊆ Bx0 and again we obtain
a contradiction.
Now, we know that S 0 cannot be S or in(S) because then (S 0 , ∅) would not be a
good pair.
81
Thus, if x0 is a vertex of F , the tree S 0 must satisfy that in(S) (st S 0 (st S.
In that case, as we noted earlier, (S 0 , ∅) is an (m, 0)-good pair and gives rise to
a vertex x0 of Td as required. Also, every transversal of H is now clearly a member
of the collection Bx0 , so we obtain BF ⊆ Bx0 or x0 ∈ F .
Now, every S 0 satisfying in(S) (st S 0 (st S can be described as (in(S))A with
∅ ( A ( L(S), and there are 2n − 2 of the latter. Thus, the number of vertices of
F is 2n − 2.
It is easy to check the following result:
Lemma 4.18. Let F be an (m − 1)-dimensional face of Td coming from a full
m-subtree S of T . The proper faces of F arise from the pairs (in(S))A , B with
∅ ( A ( L(S) and B ( A. All such pairs are good. In particular, the i-dimensional
proper faces of F correspond to the (m, i)-good pairs (in(S))A , B with ∅ ( A (
L(S), B ( A and |B| = i < m − 1.
The number of pairs in Lemma 4.18 can be computed easily. Pick first a set B,
for which there are mi choices. Then, for a fixed B, the number of choices for A is
2m−i − 2. Therefore, the f -vector of F for this case is given by
m−i
fi (F ) = 2
m
−2
for all i with 0 ≤ i ≤ m − 2,
i
fm−1 (F ) = 1.
The total number of faces of F is 3m − 2m+1 + 2.
82
Now, consider two good pairs:
A
• (in(S)) , B with ∅ ⊆ B ( A ( L(S) and 0 ≤ |B| = k < m − 1,
•
A0
(in(S)) , B
0
with ∅ ⊆ B0 ( A0 ( L(S) and k ≤ |B 0 | = i < m − 1.
Necessary and sufficient conditions for the second face to contain the first are
B ⊆ B 0 and A0 \B 0 = A\B 0 . These conditions are equivalent to B ⊆ B 0 and A0 =
A ∪ B 0 . In particular, when k = 0 and i = m − 2, the conditions break down
to finding, for a fixed A satisfying ∅ ( A ( L(S), all B 0 with |B 0 | = m − 2 and
|A\B 0 | = 1. For our polytope F , this time the nature of the answer does depend on
A.
If |A| = 1, then B 0 comes from picking any m − 2 elements of the set L(S)\A, so
there are m − 1 = m−1
possibilities. If |A| = m − 1, then B 0 comes from selecting
m−2
any |A| − 1 members of A and we again find m − 1 = m−1
possibilities. If however
m−2
1 < |A| < m − 1, then B 0 comes from picking |A| − 1 members of A and m − 1 − |A|
members of L(S)\A, and there are |A| (m − |A|) different ways of doing this. The
number of vertices on which the latter situation springs up is given by the number
of subsets of [m] with at least 2 elements and at most m − 2 elements, 2m − 2m − 2
if m ≥ 3 and 0 if m = 2.
Remark. In particular, if m = 2 or m = 3 then our polytope F is simple, but not
anymore if m ≥ 4.
Let us now describe the cover relations of the lattice of faces of F . We restrict
the lattice to faces of dimension at most m − 2. To simplify the notation, we will
83
consider L(S) = [m] and we will always let A be a nonempty proper subset of [m].
Vertices of F are now pairs (A, ∅). Faces of F of dimension at most m − 2 are now
pairs (A, B), where B will always be assumed to be a proper subset of A. The face
lattice is ranked. The rank of a pair (A, B) is |B|.
If B = ∅, (A, B) is minimal.
If |B| = m − 2, (A, B) is maximal.
If |B| < m − 2, we describe the cover of (A, B).
If |A| − |B| = 1, then |A| < m − 1. The cover of (A, B) are exactly all pairs
(A ∪ i, B ∪ i) with i ∈ ([m]\A). There are m − |A| such coverings. The new pair
satisfies |A ∪ i| − |B ∪ i| = 1.
If |A| − |B| ≥ 2, we distinguish two cases:
• If |A| < m − 1, the cover of (A, B) are exactly all pairs (A ∪ i, B ∪ i) with
i ∈ ([m]\B). There are m − |B| such coverings. The new pair satisfies |A ∪
i| − |B ∪ i| = |A| − |B| if i 6∈ A, and |A ∪ i| − |B ∪ i| = |A| − |B| − 1 if i ∈ A.
• If |A| = m − 1, the cover of (A, B) consists of all pairs (A ∪ i, B ∪ i) with
i ∈ (A\B). There are |A| − |B| such coverings. The new pair satisfies |A ∪
i| − |B ∪ i| = |A| − |B| − 1.
Remark. In the case m = 2, F is a line segment. In case F = 3, F is isomorphic
to an hexagon. In case m = 4, F can be seen to be isomorphic to the rhombic
dodecahedron.
84
Finally, for a pair (A, B) of rank k with k < m − 1, we count the number of
pairs (A0 , B 0 ) of rank i with k < i < m − 1 greater than or equal to (A, B) in our
lattice. The pair (A0 , B 0 ) must satisfy B ⊆ B 0 and A0 = A ∪ B 0 . Letting C = B 0 \B,
the problem becomes that of counting all sets C ⊆ ([m]\B) with |C| = i − k such
that [m] ) A ∪ C ) B ∪ C. Now, A ∪ C ) B ∪ C if and only if (A\B) \C 6= ∅.
If k = m − 2, (A, B) is maximal.
Assume that k < m − 2.
For this exercise, we distinguish four cases:
• If i < |A| and i − k < m − |A|, then C is any (i − k)-subset of [m]\B. The
number of choices is m−k
.
i−k
• If i ≥ |A| and i − k < m − |A|, then C is any (i − k)-subset of [m]\B which
does not contain A\B. There are m−k
− m−|A|
choices.
i−k
i−|A|
• If i < |A| and i − k ≥ m − |A|, then C is any (i − k)-subset of [m]\B which
|A|−k
does not contain [m]\A. There are m−k
−
choices.
i−k
(i−k)−(m−|A|)
• If i ≥ |A| and i − k ≥ m − |A|, then C must be any (i − k)-subset of [m]\B
|A|−k
not containing [m]\A or A\B. There are m−k
− m−|A|
− (i−k)−(m−|A|)
i−k
i−|A|
choices.
These results give a hint of the precise nature of the polytope F . Recall that the
(m − 1)-dimensional pyrope Pm−1 , or (m − 1)-pyrope, can be realized as the
85
polytope conv ([−1, 0]m−1 ∪ [1, 0]m−1 ), the convex hull of the 0-1 and 0-(−1) cubes
in Rm−1 . See Joswig and Kulas [13] for a detailed description.
Theorem 4.19. An (m − 1)-dimensional face F of Td coming from a full m-subtree
S of T has a face lattice isomorphic to the face lattice of the (m − 1)-pyrope Pm−1 .
Proof. Fix a choice of l ∈ L(S). For every pair of sets (A, B) satisfying ∅ ( A (
L(S) and B ( A, consider the finite set VA,B ⊆ RL(S)\l that we now construct. For
each C ⊆ B:
• If l ∈ L(S)\ (A\C), define v ∈ RL(S)\l as the vector of 1’s in A\C and 0’s
everywhere else. Then, let v ∈ VA,B .
• If l ∈ A\C, define v ∈ RL(S)\l as the vector of 0’s in A\C and −1’s everywhere
else. Then, let v ∈ VA,B .
Consider the map
φ
{ Faces of ∂(F ) } →
− { Faces of ∂(Pm−1 ) },
under which the face of ∂(F ) arising from the pair in(S)A , B with ∅ ( A ( L(S)
and B ( A gets mapped to the convex hull of the set VA,B .
Using the ideas discussed up to this point, it is routine to check that φ is a lattice
isomorphism between the face lattices of ∂(F ) and ∂(Pm−1 ), where of course we
are considering the (m − 1)-pyrope as living in RL(S)\l . The map φ induces a lattice
isomorphism between the face lattices of F and Pm−1 .
86
To close this section, we prove a short result about the behaviour of the intersection of two faces F and F 0 of Td , each of which comes from a good pair. As we
will see, these intersections behave nicely.
Theorem 4.20. Let F and F 0 be faces of Td . Suppose that F comes from an
(m, |A|)-good pair (S, A) and that F 0 comes from an (m, |A0 |)-good pair (S 0 , A0 ).
Then, F ∩ F 0 6= ∅ if and only if
0
SA ⊆st S 0 and SA
0 ⊆st S.
Moreover, in the case when our faces meet, F ∩F 0 comes from the (m, |A∩A0 |)-good
pair (S ∩ S 0 , A ∩ A0 ). In particular, F ∩ F 0 is of dimension |A ∩ A0 |.
Proof. Suppose that F ∩ F 0 6= ∅. Then, there exist C ⊆ A and C 0 ⊆ A0 such that
0
0 C
0
0
0
(SA )C = (SA
0 ) . Letting B = A\C and B = A \C , we can restate this as: there
exist B ⊆ A and B 0 ⊆ A0 such that SB = SB0 0 . From here we see that indeed
0
SA ⊆st S 0 and SA
0 ⊆st S.
0
0
Now, suppose that SA ⊆st S 0 and SA
0 ⊆st S and we want to prove F ∩ F 6= ∅.
Observe that
SA\V(S 0 ) ⊆st S 0 ,
0
SA
0 \V(S) ⊆st S.
87
Also, V(SA\V(S 0 ) ) ∩ (A0 \V(S)) = ∅, so from the first containment we obtain
SA\V(S 0 ) = SA\V(S 0 )
A0 \V(S)
0
⊆st SA
0 \V(S) .
0
Analogously, from the second containment we obtain SA
0 \V(S) ⊆st SA\V(S 0 ) . Thus,
0
SA\V(S 0 ) = SA
0 \V(S) , which proves the first assertion.
For the second one, observe that
(A ∩ A0 ) ∩ (A\V(S 0 )) = ∅,
(A ∩ A0 ) ∩ (A0 \V(S)) = ∅.
0
0
Therefore, from the equality SA\V(S 0 ) = SA
=
0 \V(S) , we see that the pairs SA\V(S 0 ) , A ∩ A
0
0
SA
give rise to a common face of F and F 0 .
0 \V(S) , A ∩ A
Now
0
SA\V(S 0 ) = SA\V(S 0 ) ∩ SA
⊆st (S ∩ S 0 ) .
0 \V(S)
On the other hand V(S∩S 0 ) ⊆ V(S)\ (A\V(S 0 )), which implies (S ∩ S 0 ) ⊆st SA\V(S 0 ) .
The consequence is
0
SA\V(S 0 ) = (S ∩ S 0 ) = SA
0 \V(S) .
This shows that the pair (S ∩ S 0 , A ∩ A0 ) does give rise to a common face of F
and F 0 of dimension |A ∩ A0 |.
To see that this is the maximal common face, let a ∈ (A\A0 ) and then let v be
88
the vertex of S adjacent to a. Then, as SA ⊆ S 0 , v ∈ V(S 0 ).
By the choice of a and considering the edge (v, a), either (v, a) ∈ E(S 0 ) or
0
(v, a) 6∈ E(SA
0 ), but not both. Exactly one of them must hold. Thus, no subface of
F of the form (SA )B , C with a ∈ C ⊆ B ⊆ A can also be a subface of F 0 . This
can be restated by saying that if (SA )B , C gives a common face of F ∩ F 0 for some
C ⊆ B ⊆ A, then C ⊆ (A ∩ A0 ). Thus, the dimension of F ∩ F 0 can be at most
|A ∩ A0 |.
Remark. Notice that we actually have the containment
0
0
(SA ∪ SA
0 ) ⊆st (S ∩ S ) ,
and from there we can check that
0
0
(SA ∪ SA
0 ) = (S ∩ S )A∩A0 .
One application of this result is to decide whether or not F is a face of F 0 : we
0
have that F ⊆ F 0 if and only if A ⊆ A0 , S ⊆st S 0 , and SA
0 ⊆st SA . These results
generalize our study of the i-dimensional faces of Td .
Notice that if F is contained in F 0 , S 0 is obtained from S by appending some or
none of the edges adjacent to both S and L(S). In particular, we can only pick edges
that are not adjacent to an element of A. Now, recall that [n] is the set of leaves of
T , and then notice that if |L(S)| is the number of leaves of S and |L(S)| < m − 1,
89
for all l ∈ L(SA )\[n] we have that S l , A ∪ l is an (m, |A ∪ l|)-good pair. More
generally, for all B ⊆ L(SA )\[n] with |B| ≤ m − 1 − |L(S)|, the pair S B , A ∪ B is
an (m, |A ∪ B|)-good pair. Using these observations, some enumeration of the faces
of Td containing F could be possible for the case of trivalent trees, but we leave this
for a subsequent writing.
4.4 Face Enumeration in the Simplest Cases
In this brief section, we enumerate the faces of Td for some restricted values of m.
Let m = 2. The 0-dimensional faces are in bijection with the set of subtrees of
in(T ) with at most 1 leaf and at least 1 vertex. Therefore, the 0-dimensional faces
of Td correspond to the internal vertices of T . The 1-dimensional faces correspond
to either subtrees of in(T ) with exactly 1 leaf and at least 2 vertices, or to subtrees
of in(T ) with 2 leaves and 2 vertices. The former do not exist, and the latter are
the internal edges of T . We can see that Td is a tree isomorphic to in(T ).
For m = 2, Td has n − 2 vertices and n − 3 edges.
Let’s assume that T is trivalent.
Let m = 3. The 0-dimensional faces of Td correspond to subtrees of in(T ) with
at most 2 leaves and at least 2 vertices. These are all nontrivial paths in in(T ) and
are in bijection with 2-subsets of internal vertices of T . Therefore, there are n−2
2
vertices of Td . The 1-dimensional faces of Td correspond to the pairs (S, A) where
S is an internal subtree of T with at most 2 leaves and at least 3 vertices, and A
90
is a leaf of S. All such trees S must be internal paths of T with at least 3 vertices,
and they have exactly 2 leaves. The set of these trees is in bijection with 2-subsets
of internal edges of T , of which there are n−3
. Thus, the number of edges of Td
2
is (n − 3)(n − 4) = 2 n−3
, the factor 2 coming from the choices for A. Now, the
2
2-dimensional faces of Td correspond to either internal subtrees of T with 2 leaves
and at least 4 vertices, or to internal subtrees of T with 3 leaves and 4 vertices. To
count the first, consider 2-sets of internal edges of T , the number of which there are
n−3
. From here, discount the number of internal paths of T with 3 vertices, the
2
3-paths of in(T ).
Consider exclusively the tree in(T ) and orient all its edges towards one leaf l of
in(T ). We count many 3-paths by picking any vertex v 6= l not adjacent to l, and
then considering the 3-path containing v and contained in the oriented path from v
to l. There are n − 4 such paths. Now, computing the degrees in in(T ), if M is the
number of vertices of degree 3 in in(T ), then the total number of 3-paths in in(T )
is n − 4 + M . Therefore, the number of internal subtrees of T with 2 leaves and
at least 4 vertices is n−3
− (n − 4) − M . Now, the number of internal subtrees of
2
T with 3 leaves and 4 vertices corresponds to the number of vertices of degree 3 in
in(T ), computing the degrees in in(T ). There are M of them. We conlude that the
number of 2-dimensional faces of Td is n−4
= n−3
− (n − 4) − M + M .
2
2
We see that Td attains the f -vector of the f -vector conjecture for the case of 2
and 3-dissimilarity vectors of trivalent trees. This will no longer hold for sufficiently
91
big trees T for m ≥ 4.
92
Chapter 5
Further Directions
There is a result of Pachter and Speyer [17] which claims that the metric of T can
only be reconstructed from its m-dissimilarity vector d if and only if n ≥ 2m − 1.
We would like to understand how this result can be seen in our work. In the case
of trivalent trees, we believe there is a simple explanation for this fact.
Now, we have found that the faces of the tight span Td carry with them a
wealth of information about the tree T . Specifically, we refer to the collections H
and R. These collections give rise to a family of matroids that can be described using
trees. Very nicely, for the pathological faces of maximal dimension, these matroids
degenerate to transversal matroids. It is of our interest to fully understand these
matroids.
For example, notice that the 2-circuits of the matroids carry information about
the collections R. We do not know how to identify the collections H.
This has some interesting applications. It could be possible to devise an efficient
way to compute the vertices of the polytope Pd . In that case, we would know a lot
of information about the collections R and H. Is this information enough to recover
completely the collections? If so, can these collections be used to recover the shape
93
of T ?
From a different perspective, it might also be possible to obtain a theoretical
characterization of m-dissimilarity vectors of trees, for example, by requiring that
the family of matroids associated to Dd satisfy certain properties.
In general, we have shown a combinatorial richness associated to Td . The question is, how can we use it?
94
Bibliography
[1] Federico Ardila, Subdominant matroid ultrametrics, Annals of Combinatorics 8
(2005), no. 4, 379–389.
[2] Federico Ardila and Caroline J Klivans, The bergman complex of a matroid and
phylogenetic trees, Journal of Combinatorial Theory, Series B 96 (2006), no. 1,
38–49.
[3] Cristiano Bocci and Filip Cools, A tropical interpretation of¡ i¿ m¡/i¿dissimilarity maps, Applied Mathematics and Computation 212 (2009), no. 2,
349–356.
[4] Hans-Joachim Böckenhauer and Dirk Bongartz, Algorithmic aspects of bioinformatics, no. Part I, Springer, 2007.
[5] Peter Buneman, A note on the metric properties of trees, Journal of Combinatorial Theory, Series B 17 (1974), no. 1, 48–50.
[6] Filip Cools, On the relation between weighted trees and tropical grassmannians,
Journal of Symbolic Computation 44 (2009), no. 8, 1079–1086.
[7] Mike Develin and Bernd Sturmfels, Tropical convexity, Doc. Math 9 (2004),
1–27.
[8] Andreas Dress and Werner Terhalle, The tree of life and other affine buildings,
Proceedings of the International Congress of Mathematicians, vol. 3, 1998,
pp. 565–574.
[9] Andreas WM Dress, Trees, tight extensions of metric spaces, and the cohomological dimension of certain groups: a note on combinatorial properties of
metric spaces, Advances in Mathematics 53 (1984), no. 3, 321–402.
[10] Sven Herrmann and Michael Joswig, Bounds on the f -vectors of tight spans,
arXiv preprint math/0605401 (2006).
[11]
, Splitting polytopes, Münster J. Math 1 (2008), no. 1, 109–141.
95
[12] Hiroshi Hirai, Characterization of the distance between subtrees of a tree by the
associated tight span, Annals of Combinatorics 10 (2006), no. 1, 111–128.
[13] Michael Joswig and Katja Kulas, Tropical and ordinary convexity combined,
2008.
[14] Christopher Manon, The m− dissimilarity map and representation theory of
sl m, arXiv preprint arXiv:1003.3970 (2010).
[15] Ezra Miller and Bernd Sturmfels, Combinatorial commutative algebra, vol. 227,
Springer, 2005.
[16] James G Oxley and JG Oxley, Matroid theory, vol. 1997, Oxford university
press Oxford, 1992.
[17] Lior Pachter and David Speyer, Reconstructing trees from subtree weights, Applied Mathematics Letters 17 (2004), no. 6, 615–621.
[18] Lior Pachter and Bernd Sturmfels, Algebraic statistics for computational biology, vol. 13, Cambridge University Press, 2005.
[19] Peter Paule and Bernd Sturmfels, Algorithms in invariant theory, Springer,
2008.
[20] David Speyer and Bernd Sturmfels, The tropical grassmannian, Advances in
Geometry 4 (2004), no. 3, 389–411.
[21] David E Speyer, Tropical linear spaces, SIAM Journal on Discrete Mathematics
22 (2008), no. 4, 1527–1558.
[22]
, A matroid invariant via the¡ i¿ k¡/i¿-theory of the grassmannian, Advances in Mathematics 221 (2009), no. 3, 882–913.
[23] Bernd Sturmfels and Josephine Yu, Classification of six-point metrics, Electronic Journal of Combinatorics 11 (2004), R44.
Download