SMITH NORMAL FORM OF A MULTIVARIATE MATRIX ASSOCIATED WITH PARTITIONS CHRISTINE BESSENRODT AND RICHARD P. STANLEY∗ Abstract. Considering a question of E. R. Berlekamp, Carlitz, Roselle, and Scoville gave a combinatorial interpretation of the entries of certain matrices of determinant 1 in terms of lattice paths. Here we generalize this result by reο¬ning the matrix entries to be multivariate polynomials, and by determining not only the determinant but also the Smith normal form of these matrices. A priori the Smith form need not exist but its existence follows from the explicit computation. It will be more convenient for us to state our results in terms of partitions rather than lattice paths. E. R. Berlekamp [1, 2] raised a question concerning the entries of certain matrices of determinant 1. (Originally Berlekamp was interested only in the entries modulo 2.) Carlitz, Roselle, and Scoville [3] gave a combinatorial interpretation of the entries (over the integers, not just modulo 2) in terms of lattice paths. Here we will generalize the result of Carlitz, Roselle, and Scoville in two ways: (a) we will reο¬ne the matrix entries so that they are multivariate polynomials, and (b) we compute not just the determinant of these matrices, but more strongly their Smith normal form (SNF). A priori our matrices need not have a Smith normal form since they are not deο¬ned over a principal ideal domain, but the existence of SNF will follow from its explicit computation. A special case is a determinant of π-Catalan numbers. It will AMS Classiο¬cation: 05A15, 05A17, 05A30 Keywords: Lattice paths, partitions, determinants, Smith normal form, π-Catalan numbers. ∗ This author’s contribution is based upon work supported by the National Science Foundation under Grant No. DMS-1068625. 1 2 CHRISTINE BESSENRODT AND RICHARD P. STANLEY abcde+bcde +bce+cde+ ce+de+c+e +1 bce+ce+c +e+1 c+1 1 de+e+1 e+1 1 1 1 1 1 Figure 1. The polynomials πππ for π = (3, 2) be more convenient for us to state our results in terms of partitions rather than lattice paths. Let π be a partition, identiο¬ed with its Young diagram regarded as a set of squares; we ο¬x π for all that follows. Adjoin to π a border strip extending from the end of the ο¬rst row to the end of the ο¬rst column of π, yielding an extended partition π∗ . Let (π, π ) denote the square in the πth row and π th column of π∗ . If (π, π ) ∈ π∗ , then let π(π, π ) be the partition whose diagram consists of all squares (π’, π£) of π satisfying π’ ≥ π and π£ ≥ π . Thus π(1, 1) = π, while π(π, π ) = ∅ (the empty partition) if (π, π ) ∈ π∗ /π. Associate with the square (π, π) of π an indeterminate π₯ππ . Now for each square (π, π ) of π∗ , associate a polynomial πππ in the variables π₯ππ , deο¬ned as follows: ∑ ∏ π₯ππ , (1) πππ = π⊆π(π,π ) (π,π)∈π(π,π )/π where π runs over all partitions contained in π(π, π ). In particular, if (π, π ) ∈ π∗ /π then πππ = 1. Thus for (π, π ) ∈ π, πππ may be regarded as a generating function for the squares of all skew diagrams π(π, π )/π. For instance, if π = (3, 2) and we set π₯11 = π, π₯12 = π, π₯13 = π, π₯21 = π, and π₯22 = π, then Figure 1 shows the extended diagram π∗ with the polynomial πππ placed in the square (π, π ). SMITH NORMAL FORM OF A MULTIVARIATE MATRIX 3 Write π΄ππ = ∏ π₯ππ . (π,π)∈π(π,π ) Note that π΄ππ is simply the leading term of πππ . Thus for π = (3, 2) as in Figure 1 we have π΄11 = πππππ, π΄12 = πππ, π΄13 = π, π΄21 = ππ, and π΄22 = π. For each square (π, π) ∈ π∗ there will be a unique subset of the squares of π∗ forming an π × π square π(π, π) for some π ≥ 1, such that the upper left-hand corner of π(π, π) is (π, π), and the lower righthand corner of π(π, π) lies in π∗ /π. In fact, if πππ denotes the rank of π(π, π) (the number of squares on the main diagonal, or equivalently, the largest π for which π(π, π)π ≥ π), then π = πππ + 1. Let π(π, π) denote the matrix obtained by inserting in each square (π, π ) of π(π, π) the polynomial πππ . For instance, for the partition π = (3, 2) of Figure 1, the matrix π(1, 1) is given by β‘ β€ π11 πππ + ππ + π + π + 1 π + 1 β’ β₯ π(1, 1) = β£ ππ + π + 1 π+1 1 β¦, 1 1 1 where π11 = πππππ + ππππ + πππ + πππ + ππ + ππ + π + π + 1. Note that for this example we have det π(1, 1) = π΄11 π΄22 π΄33 = πππππ ⋅ π ⋅ 1 = πππππ2 . If π is a commutative ring (with identity) and π an π × π matrix over π , then we say that π has a Smith normal form (SNF) over π if there exist matrices π, π ∈ GL(π, π ) (the set of π × π matrices over π which have an inverse whose entries also lie in π , so that det π and det π are units in π ) such that π ππ has the form π ππ = diag(π1 π2 ⋅ ⋅ ⋅ ππ , π1 π2 ⋅ ⋅ ⋅ ππ−1 , . . . , π1 ), where each ππ ∈ π . If π is a unique factorization domain and π has an SNF, then it is unique up to multiplication of the diagonal entries by units. If π is a principal ideal domain then the SNF always exists, but not over more general rings. We will be working over the polynomial 4 CHRISTINE BESSENRODT AND RICHARD P. STANLEY ring (2) π = β€[π₯ππ : (π, π) ∈ π]. Our main result asserts that π(π, π) has a Smith normal form over π , which we describe explicitly. In particular, the entries on the main diagonal are monomials. We will give two proofs for this result. The main tool used for the ο¬rst proof is a recurrence for the polynomials πππ . To state this recurrence we need some deο¬nitions. Let π = rank(π). For 1 ≤ π ≤ π, deο¬ne the rectangular array β‘ β€ (1, 2) (1, 3) ⋅ ⋅ ⋅ (1, ππ − π + 1) β’ β₯ β’ (2, 3) (2, 4) ⋅ ⋅ ⋅ (2, ππ − π + 2) β₯ β’ β₯ π π = β’ .. β₯ . β£ β¦ (π, π + 1) (π, π + 2) ⋅ ⋅ ⋅ (π, ππ ). Note that if ππ = π, then π π has no columns, i.e., π π = ∅. Let ππ denote the set of all subarrays of π π that form a vertical reο¬ection of a Young diagram, justiο¬ed into the upper right-hand corner of π π . Deο¬ne ∑ ∏ Ωπ = π₯ππ . πΌ∈ππ (π,π)∈πΌ For instance, if π2 = 4, then π 2 = [ (1, 2) (1, 3) (2, 3) (2, 4) ] , so Ω2 = 1 + π₯13 + π₯12 π₯13 + π₯13 π₯24 + π₯12 π₯13 π₯24 + π₯12 π₯13 π₯23 π₯24 . ( ) In general, π π will have π rows and ππ − π columns, so Ωπ will have πππ terms. We also set Ω0 = 1, which is consistent with regarding π 0 to be an empty array. Next deο¬ne for 1 ≤ π ≤ π, ππ = {(π, π) ∈ π : 1 ≤ π ≤ π, ππ − π + π < π ≤ ππ }. In other words, ππ consists of those squares of π that are in the same row and to the right of a square appearing as an entry of π π . SMITH NORMAL FORM OF A MULTIVARIATE MATRIX 5 In particular π1 = ∅. (When ππ = π, then ππ consists of all squares strictly to the right of the main diagonal of π.) Set ∏ ππ = π₯ππ , (π,π)∈ππ where as usual an empty product is equal to 1. In particular, π0 = 1. We can now state the recurrence relation for πππ . Theorem 1. Let 2 ≤ π ≤ π + 1 = rank(π) + 1. Then π0 Ω0 π1π − π1 Ω1 π2π + ⋅ ⋅ ⋅ + (−1)π ππ Ωπ ππ+1,π = 0. Moreover, for π = 1 we have π0 Ω0 π11 − π1 Ω1 π21 + ⋅ ⋅ ⋅ + (−1)π ππ Ωπ ππ+1,1 = π΄11 . Before presenting the proof, we ο¬rst give a couple of examples. Let π = (3, 2), with π₯11 = π, π₯12 = π, π₯13 = π, π₯21 = π, and π₯22 = π as in Figure 1. For π = 1 we obtain the identity (1+π+π+ππ+ππ+πππ+πππ+ππππ+πππππ)−(1+π+ππ)(1+π+ππ)+ππ = πππππ, For π = 2 we have (1 + π + π + ππ + πππ) − (1 + π + ππ)(1 + π) + ππ = 0, while for π = 3 we get (1 + π) − (1 + π + ππ) + ππ = 0. For a further example, let π = (5, 4, 1), with the variables π₯ππ replaced by the letters π, π, . . . , π as shown in Figure 2. For π = 1 we get π11 −(1+π+ππ+πππ+ππππ)(1+π+π +ππ +βπ+βππ +πβπ+πβππ +π πβππ) +ππ(1 + π + ππ + ππ + πππ + ππβπ)(1 + π) = ππππππ πβππ, where π11 = 1 + π + π + π + ⋅ ⋅ ⋅ + ππππππ πβππ, a polynomial with 34 terms. For π = 2 we get (1+π+π+ππ+βπ+πππ+πβπ+πβπ+ππβπ+ππβπ+πππβπ+πππβπ+ππππβπ+πππππβπ) −(1+π+ππ+πππ+ππππ)(1+π+βπ+πβπ)+ππ(1+π+ππ+ππ+πππ+ππβπ) = 0. 6 CHRISTINE BESSENRODT AND RICHARD P. STANLEY a b c d f g h i e j Figure 2. The variables for π = (5, 4, 1) For π = 3 we have (1+π+π+ππ+βπ+πππ+πβπ+ππβπ+πππβπ)−(1+π+ππ+πππ+ππππ)(1+π+βπ) +ππ(1 + π + ππ + ππ + πππ + ππβπ) = 0. Proof of Theorem 1. First suppose that π ≥ 2. We will prove the result in the form π0 Ω0 π1π = π1 Ω1 π2π − ⋅ ⋅ ⋅ + (−1)π−1 ππ Ωπ ππ+1,π by an Inclusion-Exclusion argument. Since π0 = Ω0 = 1, the lefthand side is the generating function for all skew diagrams π(1, π)/π, as deο¬ned by equation (1). If we take a skew diagram π(2, π)/π and append to it some element of π1 (that is, some squares on the ο¬rst row forming a contiguous strip up to the last square (1, π1 )), then we will include every skew diagram π(1, π)/π. However, some additional diagrams πΏ will also be included. These will have the property that the ο¬rst row begins strictly to the left of the second. We obtain the ο¬rst two rows of such a diagram πΏ by choosing an element of π2 and adjoining to it the set π2 . The remainder of the diagram πΏ is a skew shape π(3, π)/π. Thus we cancel out the unwanted terms of π1 Ω1 π2π by subtracting π2 Ω2 π3π . However, the product π2 Ω2 π3π has some additional terms that need to be added back in. These terms will correspond to diagrams π with the property that the ο¬rst row begins strictly to the left of the second, and the second begins strictly to the left of the third. We obtain the ο¬rst three rows of such a diagram π by choosing an element of π3 and adjoining to it the set π3 . The remainder of the diagram π is a skew shape π(4, π)/π. Thus we cancel out the unwanted terms SMITH NORMAL FORM OF A MULTIVARIATE MATRIX 7 of π2 Ω2 π3π by adding π3 Ω3 π4π . This Inclusion-Exclusion process will come to end when we reach the term ππ Ωπ ππ+1,π , since we cannot have π + 1 rows, each strictly to the left of the one below. This proves the theorem for π ≥ 2. When π = 1, the Inclusion-Exclusion process works exactly as before, except that the term π΄11 is never cancelled from π0 Ω0 π11 = π11 . Hence the theorem is also true for π = 1. β‘ The result below is stated for π(1, 1), but it applies to any π(π, π) by replacing π with π(π, π). Theorem 2. There are an upper unitriangular matrix π and a lower unitriangular matrix π in SL(π , π + 1) such that π ⋅ π(1, 1) ⋅ π = diag(π΄11 , π΄22 , . . . , π΄π+1,π+1 ). In particular, det π(1, 1) = π΄11 π΄22 ⋅ ⋅ ⋅ π΄ππ (since π΄π+1,π+1 = 1). For instance, in the example of Figure 1 we have π ⋅ π(1, 1) ⋅ π = diag(πππππ, π, 1) . Proof of Theorem 2. Induction on π, the result being trivial for π = 0 (so π = ∅). Assume the assertion holds for partitions of rank less than π, and let rank(π) = π. For each 1 ≤ π ≤ π, multiply row π + 1 of π(1, 1) by (−1)π ππ Ωπ and add it to the ο¬rst row. By Theorem 1 we get a matrix π ′ whose ο¬rst row is [π΄11 , 0, 0, . . . , 0]. Now by symmetry ′ we can perform the analogous operations on the [ columns of π ] . We π΄11 0 then get a matrix in the block diagonal form . The 0 π(2, 2) row and column operations that we have performed are equivalent to computing π ′ ππ′ for upper and lower unitriangular matrices π ′ , π′ ∈ SL(π , π + 1), respectively. The proof now follows by induction. β‘ Note. The determinant above can also easily be evaluated by the LindstroΜm-Wilf-Gessel-Viennot method of nonintersecting lattice paths, but it seems impossible to extend this method to a computation of SNF. 8 CHRISTINE BESSENRODT AND RICHARD P. STANLEY We now come to the second approach towards the SNF which does not use Theorem 1. Indeed, we will prove the more general version below where the weight matrix is not necessarily square; while this is not expounded here, the previous proof may also be extended easily to any rectangular subarray (regarded as a matrix) of π∗ whose lower-right hand corner lies in π∗ /π. The inductive proof below will not involve inclusion-exclusion arguments; again, suitable transformation matrices are computed explicitly stepwise along the way. Given our partition π, let πΉ be a rectangle of size π × π in π∗ , with top left corner at (1, 1), such that its corner (π, π) is a square in the added border strip π∗ β π; thus π΄ππ = 1 and πππ = 1. We denote the corresponding matrix of weights by ππΉ = (πππ )(π,π)∈πΉ . Theorem 3. Let πΉ be a rectangle in π∗ as above, of size π × π; assume π ≤ π (the other case is dual). Then there are an upper unitriangular matrix π ∈ SL(π , π) and a lower unitriangular matrix π ∈ SL(π , π) such that β β 0 π΄1,1+π−π β β β0 β π΄2,2+π−π 0 β, π ⋅ ππΉ ⋅ π = β . . β. β . . . 0 β β 0 π΄π,π In particular, when π is of rank π and πΉ is the Durfee square in π∗ , we have the result in Theorem 2 for ππΉ = π(1, 1). Proof. We use induction on the size of π. For π = ∅ we have π = (1), πΉ can only be a 1 × 1 rectangle, π = 1 = π, π΄11 = 1 and ππΉ = (1), and the assertion is obviously true. We now assume that the result holds for all rectangles as above in partitions of π. We take a partition π′ of π+1 and a rectangle πΉ ′ with its corner on the rim π′∗ βπ′ ; we will produce the required transformation matrices inductively along the way. ∗ First we assume that we can remove a square π = (π, π) from π′ and obtain a partition π = π′ β π such that πΉ ′ ⊆ π∗ . By induction, we thus SMITH NORMAL FORM OF A MULTIVARIATE MATRIX 9 have the result for π and πΉ = πΉ ′ ⊂ π∗ ; say this is a rectangle with corner (π, π), π ≤ π. Then π < π and π ≥ π, or π ≥ π and π < π. We discuss the ο¬rst case, the other case is analogous. Set π§ = π₯ππ . Let π‘ ∈ π ⊂ π′ ; denote the weights of π‘ with respect to π by π΄π‘ , ππ‘ , and with respect to π′ by π΄′π‘ , ππ‘′ . Let π = ππΉ = (πππ )(π,π)∈πΉ and π ′ = (πππ′ )(π,π)∈πΉ be the corresponding weight matrices. We clearly have π΄′π,π+π−π = { π§π΄π,π+π−π π΄π,π+π−π for 1 ≤ π ≤ π . for π < π ≤ π When we compute the weight of π‘ = (π, π) ∈ πΉ with π ≤ π with respect to π′ , we get two types of contributions to πππ′ . For a partition π ⊆ π′ (π, π) with π β∈ π, i.e., π ⊆ π(π, π), the corresponding weight summand in πππ is multiplied by π§, and hence the total contribution from all such π is exactly π§πππ . On the other hand, if the partition π ⊆ π′ (π, π) contains π , then it contains the full rectangle β spanned by the corners π‘ and π , that is, running over rows π to π and columns π to π; π arises from β by gluing suitable (possibly empty) partitions πΌ, π½ to its right and bottom side, respectively, and π′ (π, π) β π = (π(π, π + 1) β πΌ) ∪ (π(π + 1, π) β π½) . Summing the terms for all such π, we get the contribution ππ,π+1 ⋅ππ+1,π . Clearly, when π > π, then the square π has no eο¬ect on the weight of π‘. Hence πππ′ = { π§πππ + ππ,π+1 ⋅ ππ+1,π πππ for 1 ≤ π ≤ π . for π < π ≤ π We now transform π ′ . Multiplying the (π + 1)-th row of π ′ by ππ,π+1 and subtracting this from row π, for all π ≤ π, corresponds to multiplication from the left with an upper unitriangular matrix in SL(π, π ) 10 CHRISTINE BESSENRODT AND RICHARD P. STANLEY and gives β π§π11 β . β .. β β π§π β π1 π1′ = β βππ+1,1 β β .. β . ππ1 ⋅⋅⋅ ⋅⋅⋅ ⋅⋅⋅ ⋅⋅⋅ ⋅⋅⋅ ⋅⋅⋅ β π§π1π .. β . β β π§πππ β β β ππ+1,π β β .. β . β πππ By induction, we know that there are upper and lower unitriangular matrices π = (π’ππ )1≤π,π≤π, π = (π£ππ )1≤π,π≤π , respectively, deο¬ned over π , such that ππ π = (0, diag(π΄1,1+π−π , . . . , π΄π,π )). We then deο¬ne an upper unitriangular matrix π ′ = (π’′ππ )1≤π,π≤π ∈ SL(π, π ) by setting { π§π’ππ for π ≤ π < π π’′ππ = . π’ππ otherwise With ππ = (π˜ππ ), we then have β π§ π˜11 β . β .. β β π§ π˜ β π1 π ′ π1′ = β ˜ βππ+1,1 β . β . β . π˜π1 ⋅⋅⋅ ⋅⋅⋅ ⋅⋅⋅ ⋅⋅⋅ ⋅⋅⋅ ⋅⋅⋅ and hence we obtain the desired form via β π§ π˜1π .. β . β β π§ π˜ππ β β β, π˜π+1,π β .. β β . β π˜ππ π ′ π1′ π = (0, diag(π§π΄1,1+π−π , . . . , π§π΄π,π+π−π , π΄π+1,π+1+π−π , . . . , π΄ππ )) . Next we deal with the case where we cannot remove a square from π such that the rectangle πΉ ′ is still contained in the extension of the smaller partition π of π. This is exactly the case when π′ is a rectangle, with corner square π = (π, π) (say), and πΉ ′ = π′∗ is the rectangle with its corner at (π + 1, π + 1). Then π is the only square that can be removed from π′ ; for π = π′ β π we have π∗ = π′∗ β (π + 1, π + 1). We now use the induction hypothesis for the partition π of π and the rectangle πΉ = π′ ⊂ π′∗ = πΉ ′ . ′ SMITH NORMAL FORM OF A MULTIVARIATE MATRIX 11 We keep the notation for the weights of a square π‘ = (π, π) with respect to π, π′ , and set π§ = π₯π = π₯ππ . Clearly, we have for the monomial weights to π′ : { π§π΄π,π+π−π for π ≤ π π΄′π,π+π−π = . 1 for π = π + 1 Now we consider how to compute the matrix π ′ = (πππ′ )(π,π)∈πΉ ′ from the values πππ , (π, π) ∈ πΉ . Arguing analogously as before, we obtain ⧠ β¨ π§πππ + 1 for 1 ≤ π ≤ π, 1 ≤ π ≤ π ′ πππ = 1 for π ≤ π + 1 and π = π + 1 .  β© 1 for π = π + 1 and π ≤ π + 1 As a ο¬rst simpliο¬cation on π ′, we subtract the (π + 1)-th row of π ′ from row π, for all π ≤ π (corresponding to a multiplication from the left with a suitable upper unitriangular matrix), and we obtain the matrix β β π§π11 . . . π§π1π 0 β . .. β .. β .. .β . . . . ′ β. π1 = β β β π§π . . . π§π 0 β β π1 ππ 1 ... 1 1 Subtracting the last column from column π, for all π ≤ π, transforms this (via postmultiplication ( )with a lower unitriangular matrix as reπ§π 0 quired) into π2′ = . By induction we have an upper and a 0 1 lower unitriangular matrix π ∈ SL(π, π ), π ∈ SL(π, π ), respectively, with ππ π = (0, diag(π΄1,1+π−π , . . . , π΄ππ )). Then ( )( )( ) ( ) π 0 π§π 0 π 0 0 diag(π§π΄1,1+π−π , . . . , π§π΄ππ ) 0 = , 0 1 0 1 0 1 0 0 1 and we have the assertion as claimed. β‘ We conclude with an interesting special case, namely, when π is the “staircase” (π − 1, π − 2, . . . , 1) and each π₯ππ = π, we get that πππ is the π-Catalan number that is denoted πΆ˜π+2−π−π (π) by FuΜrlinger and Hofbauer [6][8, Exer. 6.34(a)]. For instance, πΆ˜3 (π) = 1 + 2π + π 2 + π 3 . 12 CHRISTINE BESSENRODT AND RICHARD P. STANLEY Theorem 2 gives that the matrix π2π−1 = [πΆ˜2π+1−π−π (π)]π π,π=1 has SNF diag(π ( 2π−1 2 2π−5 ) , π (2π−3 2 ) , π ( 2 ) , . . . , π 3 , 1), and the matrix π2π = [πΆ˜2π+2−π−π (π)]π+1 π,π=1 2π 2π−2 2π−4 has SNF diag(π ( 2 ) , π ( 2 ) , π ( 2 ) , . . . , π, 1). The determinants of the matrices ππ were already known (e.g., [4],[5, p. 7]), but their Smith normal form is new. References [1] E. R. Berlekamp, A class of convolutional codes, Information and Control 6 (1963), 1–13. [2] E. R. Berlekamp, Unimodular arrays, Computers and Mathematics with Applications 20 (2000), 77–83. [3] L. Carlitz, D. P. Roselle, and R. A. Scoville, Some remarks on ballot-type sequences, J. Combinatorial Theory 11 (1971), 258–271. [4] J. Cigler, π-Catalan und π-Motzkinzahlen, Sitzungsber. OΜAW 208 (1999), 3–20. [5] J. Cigler, π-Catalan numbers and π-Narayana polynomials, preprint; arXiv:math.CO/0507225. [6] J. FuΜrlinger and J. Hofbauer, π-Catalan numbers, J. Combinatorial Theory, Series A 40 (1985), 248–265. [7] R. Stanley, Enumerative Combinatorics, vol. 1, second ed., Cambridge University Press, Cambridge, 2012. [8] R. Stanley, Enumerative Combinatorics, vol. 2, Cambridge University Press, New York/Cambridge, 1999. Institut fuΜr Algebra, Zahlentheorie und Diskrete Mathematik, Leibniz UniversitaΜt Hannover, D-30167 Hannover, Germany. E-mail address: bessen@math.uni-hannover.de Department of Mathematics, Massachusetts Institute of Mathematics, Cambridge, MA 02139, USA. E-mail address: rstan@math.mit.edu