CANONICAL DECOMPOSITIONS AND INVARIANTS FOR DATA ANALYSIS MARLOS VIANA UNIVERSITY OF ILLINOIS AT CHICAGO EYE CENTER Contents 1. Introduction 1 2. A general principle of interpretability 2 3. Symmetry studies 3 4. Data indexed by S3 7 The canonical projections 9 Interpreting the regular invariants 9 A set of bases for the regular invariants 10 The invariant basis 11 A numerical example 12 5. Data indexed by S4 13 The partition λ = 22 16 6. Comments and summary 16 Appendix A. 18 References 20 Abstract. Data analytic applications of the canonical decomposition theorem are illustrated in the study of data indexed by groups of permutations. The concept of canonical invariants is discussed and interpreted in studies of data indexed by the permutation of three and four symbols. It is shown that the invariants can be obtained directly from the (finite group) Fourier transforms for the data. 1. Introduction Riley et al. (2002) observe that If a physical system is such that after application of a particular symmetry transformation the final system is indistinguishable from the original system then its behavior, and hence the functions that describe its behavior, must have the corresponding property of invariance when subject to the same transformations. From Ch.8 in The Handbook of Algebra, Vol.6, (2009) M. Hazenwinkel, ed., Elsevier ( The Netherlands) pp 565-584. 1 2 MARLOS VIANA UNIVERSITY OF ILLINOIS AT CHICAGO EYE CENTER The study of these transformations is a study of the symmetries of the system. More generally, as Bacry (1967) shows, the study of the symmetries of a physical system often suggests the study of the symmetries of certain physical laws and theories, and not infrequently, leads to symmetryrelated principles in the form of conservation laws. In this Chapter we propose a data analytic interpretation of these principles. To illustrate, consider the group G = {1, v, h, o} of vertical (v) and horizontal (h) line reflections, point reflection (o) and the identity (1) planar transformations under composition of functions. Its multiplication table ∗ 1 v h o 1 1 v h o (1.1) v v 1 o h h h o 1 v o o h v 1 shows that G is Abelian. Of course, these are the automorphisms of a rectangle. Examples of data indexed by these transformations appear, for example, in Viana (2006) and Viana (2008, Ch.1) . Indicating by {x1 , xv , xh , xo } the data indexed by these transformations, it then follows that the four scalar quantities I1 = x1 +xo +xv +xh , Iv = x1 +xv −xo −xh , Ih = x1 +xh −xo −xv , Io = x1 +xo −xv −xh , (1.2) are the canonical invariants of the data space under the given group action. These data summaries vary, as a result of the underlying group action, only up to companion points in the same invariant subspace. For example, the invariant x1 + xo − xv − xh is such that xt∗1 + xt∗o − xt∗v − xt∗h = ±(x1 + xo − xv − xh ) for all t ∈ G. Ideally, then, it is of interest to interpret and describe both the underlying group actions and the resulting invariants. 2. A general principle of interpretability As a general principle, group actions can be interpreted as degrees of arbitrariness, or irrelevance, that are chosen upon the labels and need to be resolved. For example, it can be shown that the sum and n − 1 sets of pairwise comparisons in a sample of size n are the invariants that result when the assignment of observations to the labels {1, 2, . . . , n} is (determined to be) arbitrary in such way that its degree of arbitrariness amounts to shuffling the labels in all possible n! ways. These invariants, in dimensions of 1 and n − 1 respectively, appear in the practice of data analysis as the sample mean its standard deviation. If, however, the data are labeled by n equally spaced points along a circle, and the degree of arbitrariness is now restricted to shuffling the labels according to CANONICAL DECOMPOSITIONS AND INVARIANTS FOR DATA ANALYSIS 3 the n cyclic permutations, then the resulting invariants are the n linear combinations of the original data, with coefficients the n sets of Fourier coefficients cos(2πjk/n)+i sin(2πjk/n), j, k = 0, . . ., n−1. Appying this principle to the present example, the invariants in (1.2) can be characterized as those data that should be retained when the arbitrariness of where is left (right) and where is up (down) is resolved. This degree of arbitrariness can be identified by studing the group action, defined by the multiplication table in (1.1) , each line of which relabels what is up-down, leftright. We may refer to it as planar orientation and then say that the pairwise comparisons in (1.2) are the invariants of planar orientation. For example, x1 + xo − xv − xh gives a one-dimensional subspace regardless of the planar orientation then. Equivalently, it compares the responses to point symmetry against the combined responses of vertical and horizontal responses, regardless of the planar orientation. In this example, all invariants were in dimension of one, adding to the dimension (4) of the original vector space for the data indexed by the elements of G. The simplest example is given by the invariants x` ± xr associated with bilateral, say, left (`) and right (r) measurements, determined by the identity transformation and the transposition, indicated by (r`), of left and right. When an invariant is derived from a regular group action (the group acting on itself by multiplication on the right or on the left) we call it a regular invariant. In the next sections the data analytic methods of symmetry studies will be applied to determine and interpret the regular invariants in higher dimensions. 3. Symmetry studies Methods of representation theory can be systematically applied in the analysis of data indexed by a set of labels where a group action can be defined and often justified by an experimental context. While group representations provide the connection between the labels and the data space, the canonical decomposition theorem, reducing the identity matrix in the data space as a sum of algebraically orthogonal projections, provides the conditions with which the statistical theory of quadratic forms can be formulated to study the resulting decomposition of the sum of squares, or analysis of variance. The study of these projections also lead to useful summaries of the data in the form of canonical invariants, which can be used as descriptive tools or interpreted for further inference in the analysis of variance context. These basic elements constitute the core of all symmetry studies as introduced by the author in Viana (2008), where a detailed introduction to a number of data analytical applications is described. The algebraic foundations for this chapter are also summarized in Viana (2008). Specifically, every symmetry study includes the identification of: (1) A set V of labels with v elements; 4 MARLOS VIANA UNIVERSITY OF ILLINOIS AT CHICAGO EYE CENTER (2) The observed data x defined in a linear subspace V ⊂ Rv and indexed by those labels (the structured data); (3) A rule or group action, with which the symmetry transformations in G are applied to V; (4) The classes and multiplicities of the resulting elementary orbits, subsets of V, where G acts transitively; (5) The resulting linear representations of these actions in the corresponding data vector subspaces; (6) The canonical projection matrices Pχ indexed by the irreducible characters χ of G; (7) The canonical invariants Pχ x on the data x, and their interpretations; (8) A statistical analysis of the canonical invariants and, if applicable, their analysis of variance P P x0 x = χ x0 Pχx based on the canonical decomposition I = χ Pχ of the identity operator in the data subspaces. When necessary, these canonical projections may be grouped together to describe the V. SpecifiP cally, if there are qλ orbits of type λ, each with oλ elements, classifying the v = λ qλ oλ points in V then, for each class of orbits with vλ = qλ oλ points, the identity matrix in V reduces according to Ivλ = Pλ1 + Pλ2 + . . ., where Pλj is the canonical projection constructed with qλ copies of the canonical projection Pjλ in the vector space associated with the elementary orbits of type λ, and indexed by the irreducible representations j = 1, 2, . . . of the underlying group G. The reduction in the original space V is then Iv = Diag(. . . , Ivλ , . . .), with as many components as the different types of orbits identified by the action of G on V. To illustrate, consider the set V = {uu, yy, uy, yu} of binary sequences in length of two, or, equivalently, the set of all mappings s from {1, 2} into {u, y}, so that ϕ1 (τ, s) = sτ −1 , τ −1 s ϕ1 : {1, 2} −−→ {1, 2} − → {u, y}, s ∈ V, τ ∈ S2 = {1, (12)}, (3.1) and ϕ2 (σ, s) = σs, s σ ϕ2 : {1, 2} − → {u, y} − → {u, y}, s ∈ V, σ ∈ S2 = {1, (uy)}, (3.2) are actions of S2 on V. Action ϕ1 classifies the sequences up to permutations of the positions {1, 2} of the letters, whereas ϕ2 classifies the sequences up to permutations of the symbols {u, y}. Action ϕ1 generates 3 orbits {uu}, {yy} and {uy, yu} and ϕ2 generates 2 orbits, {uu, yy} and {uy, yu}. In the position symmetry case, the representation is isomorphic to # # " " 1 0 0 1 ), t ≡ (12), ), ξt = Diag(1, 1, ξ1 = Diag(1, 1, 1 0 0 1 taking values in GL(R4 ). Equivalently, writing V = R4 , we observe that the ξ determines the decomposition V = W1 ⊕ W2 ⊕ W3 , CANONICAL DECOMPOSITIONS AND INVARIANTS FOR DATA ANALYSIS 5 where the stable subspaces W1 and W2 reduce isomorphically as a subspace U1 associated with the unit representation and W3 reduces as the sum of an isomorphic copy of U1 and a subspace USgn associated with the sign representation. Therefore, V = U1 ⊕ U1 ⊕ U1 ⊕ USgn = V1 ⊕ VSgn, | {z } (3.3) V1 showing a decomposition of V into a direct sum of the irreducible representations of S2 in which we collected together the isomorphic copies. This is the canonical decomposition of V. Next, to construct projections P1 and PSgn of V on the irreducible subspaces V1 and VSgn define X χβ (τ )ξτ /g, Pβ = nβ τ ∈G where χβ is the irreducible character of the irreducible representation β of G with |G| = g elements, and nβ its dimension. In the present case (G = S2 ), β ∈ {1, Sgn} with corresponding characters χ1 ≡ 1 and χSgn = Sgnτ , so that P1 = (ξ1 + ξt)/2 and PSgn = (ξ1 − ξt )/2. When the projections are evaluated relative to a basis of V = V1 ⊕ VSgn on which ξτ = Diag(I3 ⊗ 1τ , Sgnτ ), where In indicates the n × n identity matrix, we obtain 1 P1 = [Diag(I3 ⊗ 1, 1) + Diag(I3 ⊗ 1, −1)] = Diag(I3 ⊗ 1, 0) = Diag(1, 1, 1, 0), 2 and 1 [Diag(I3 ⊗ 1, 1) − Diag(I3 ⊗ 1, −1)] = Diag(I3 ⊗ 0, 1) = Diag(0, 0, 0, 1). 2 2 It then follows that (1) P12 = P1 , PSgn = PSgn; (2) P1 PSgn = PSgn P1 = 0 and (3) I4 = P1 + PSgn, PSgn = so that Pβ is a projection on a subspace isomorphic to Vβ . These properties remain valid if Pβ is evaluated relative to any representation Mξτ M−1 equivalent to ξ. In this case, Pβ transforms as MPβ M−1 and (1),(2) and (3) remain unchanged. For example, relative to the basis for V indexed by {uu, yy, uy, yu}, we have P1 = Diag(1, 1, A2); PSgn = Diag(0, 0, Q2), where An indicates the n × n matrix with all entries equal to 1/n and Qn = In − An . Similarly, in the letter symmetry case, ξ is given by " # " # 0 1 0 1 ξ1 = Diag(I2 , I2 ), ξt = Diag( ), )), 1 0 1 0 so that now V = W1 ⊕ W2 and in each one of these two stable subspaces ξ reduces isomorphically as the sum of the unit and the sign representations. Collecting the isomorphic copies of the 6 MARLOS VIANA UNIVERSITY OF ILLINOIS AT CHICAGO EYE CENTER corresponding irreducible subspaces U1 and USgn we have then V = U1 ⊕ U1 ⊕ USgn ⊕ USgn , | {z } | {z } V1 VSgn which is the canonical decomposition of V, with corresponding projections P1 = (ξ1 + ξt )/2 and PSgn = (ξ1 − ξt )/2. When these matrices are evaluated relative to a basis of V = V1 ⊕ VSgn on which ξτ = Diag(I2 ⊗ 1τ , I2 ⊗ Sgnτ ) the result is P1 = 1 [Diag(I2 ⊗ 1, I2 ⊗ 1) + Diag(I2 ⊗ 1, I2 ⊗ −1)] = Diag(I2 ⊗ 1, I2 ⊗ 0) = Diag(1, 1, 0, 0), 2 and PSgn = 1 [Diag(I2 ⊗ 1, I2 ⊗ 1) − Diag(I2 ⊗ 1, I2 ⊗ −1)] = Diag(I2 ⊗ 0, I2 ⊗ 1) = Diag(0, 0, 1, 1). 2 It then follows that Pβ is a projection on a subspace isomorphic to Vβ and, regardless of the chosen 2 basis for V, P12 = P1 , PSgn = PSgn ; P1 PSgn = PSgn P1 = 0 and I4 = P1 + PSgn. In particular, P1 = Diag(A2 , A2 ), PSgn = Diag(Q2 , Q2 ). The arguments illustrated in the above example leads to the proof of following result (Viana, 2008; Serre, 1977): Theorem 3.1 (Canonical Decomposition). Let ρ be a linear representation of G into GL(V), ρ1 , . . . , ρh the distinct non-isomorphic irreducible representations of G with g elements, with corresponding characters χ1 , . . . , χh and dimensions n1 , . . . , nh . Then, X χi (τ )ρτ /g, Pi = ni τ ∈G is a projection of V onto a subspace Vi , sum of mi isomorphic copies of the irreducible subspaces P associated with ρi , i = 1, . . ., h. Moreover, PiPj = 0, for i 6= j, Pi2 = Pi and i Pi = Iv , where P v = dim V = hi=1 mi ni . The Fourier inverse formula. In several experiments, however, the data are naturally indexed by G in itself, so that V = G. In this case, there is a one-to-one correspondence X X b x(β) = xτ βτ ←→ xτ = nβ tr [βτ −1 b x(β)], τ ∈G b β∈G between the experimental data {xτ , τ ∈ G} and the Fourier transforms x b(β) over the irreducible b indicates the set of all representations β of G, in dimension of nβ , where in the above expression G irreducible representations of G. Observe, in particular, that the (regular) invariants in (1.2) are the Fourier transforms of the data indexed by G. More generally, we have: Proposition 3.1. The rows of the Fourier transform b x(β) matrix reduce as β under the regular action. CANONICAL DECOMPOSITIONS AND INVARIANTS FOR DATA ANALYSIS 7 Proof. Assume, without loss of generality that β is in dimension of 2, select the first row of b x(β) and write βijτ to indicate the ij entry of βτ . We want to show that G acting on the components u= X τ τ β11 , v= τ ∈G X τ τ β12 τ ∈G of the first row of b x(β) as points in the group algebra of G according to the regular action (uσ −1 , vσ −1) gives βσ , so that the representation coincides with (one of the two copies of) β, that is σ σ σ σ uσ −1 = uβ11 + vβ21 , vσ −1 = uβ12 + vβ22 Indeed, σ σ uβ11 + vβ21 = X X τ σ τ σ τ [β11 β11 + β12 β21 ] = τ τσ τ β11 = τ X γ (γβ11 )σ −1 = uσ −1 , γ σ + vβ σ = vσ −1 , proving the proposed equality and concluding the proof. and similarly uβ12 22 We now replace τ by xτ (the scalar data indexed by G) so that the rows of the Fourier transform b is in dimension of m b x(β) should give a basis for an invariant subspace tagged by β. If β ∈ G then there are m such bases, corresponding to the m copies with which β appear in the regular representation of G. Repeating the same argument for the remaining rows of xhat(β) we see that, in matrix form, Proposition 3.1 can be expressed as b x(β)βσ = X xτ σ−1 βτ . (3.4) τ In particular, the rows (or columns if the action if the regular action is from the right) form a set of bases for the isomorphic copies of β. In the next two sections these results will be applied to the study of the regular invariants of S3 and S4 . 4. Data indexed by S3 To illustrate, consider the data shown in Table (4.1), the observed frequencies with which the DNA words in the permutation orbit Oact = {act, tac, cta, cat, tca, atc} = {sτ −1 ; τ ∈ S3 }, of the mapping s = act appear in 9 subsequent 900−bp long regions along the BRU isolate of the Human Immunodeficiency Virus Type I. The entire 9229 bp-long single-strain DNA sequence is available from the National Center for Biotechnology Information 1http://www.ncbi.nlm.nih.gov/ 1 data base using the accession 8 MARLOS VIANA UNIVERSITY OF ILLINOIS AT CHICAGO EYE CENTER number K02013. 1 2 3 4 act 8 16 16 tac 7 17 13 15 6 7 7 17 11 12 9 11 18 8 9 6 14 5 17 15 cat 14 15 16 14 21 17 15 10 tca 11 18 10 17 11 16 14 7 15 9 14 15 8 cta atc 8 14 5 5 16 8 (4.1) 9 13 9 13 11 11 11 12 10 total 62 89 78 75 83 81 78 47 78 It is assumed in this section that we write the elements of S3 in the order of {1, (123), (132), (12), (13), (23)}, so that Oact is obtained by evaluating sτ −1 in that order, where s(1) = a, s(2) = c, s(3) = t. In what follows, moreover, the components of vectors such as x0 = (act, tac, cta, cat, tca, atc), indicate in short notation the data and at the same time the corresponding permutations in S3 . Other common examples occur in ranking preferences studies, or scalar data in psychometric studies where scalar responses to the different sequences of presentation of a stimuli are recorded. Having identified the labels and the structured data, we remark again that in the present the study the labels and the symmetries of potential interest coincide. In particular, we will study the case in which all permutations in S3 are allowed to act on the labels (which are in one to one correspondence with S3 ) according its multiplication table 1 2 3 4 5 6 1 1 2 3 4 5 6 2 2 3 1 6 4 5 3 3 1 2 5 6 4 , (4.2) 4 4 5 6 1 2 3 5 5 6 4 3 1 2 6 6 4 5 2 3 1 where we identify 1, (123), (132), (12), (13), (23) with 1, 2, 3, 4, 5, 6, respectively. This is called the regular action, and the resulting permutation representation is the regular representation of S3 and indicated here by φ. For example, when τ = (12), φτ is the permutation matrix defined by rows 1 and 4 of Table (4.2). Because this action is transitive, Step (4) of Section 2 leads to a simple single-orbit symmetry study. CANONICAL DECOMPOSITIONS AND INVARIANTS FOR DATA ANALYSIS 9 b = {1, Sgn, β} the irreducible representations of S3 , The canonical projections. Indicate by G where 1 is the symmetric or trivial representation, Sgn the alternating (or parity) representation, both in dimension of 1, and β is in dimension of two. It appears with two isomorphic copies in the regular representation of S3 . Their irreducible characters are given by τ χ1 χSgn χβ 1 1 1 2 (123) 1 1 −1 (132) 1 1 −1 , (12) 1 −1 0 (13) 1 −1 0 (23) 1 −1 0 from which the evaluation of the canonical projections X Pχ = nχ χ(τ )φτ /6, τ ∈G 0 0 gives P1 = ee /6 where ee is the 6 × 6 matrix of ones, 1 1 1 −1 −1 −1 1 1 1 −1 −1 −1 1 1 1 −1 −1 −1 PSgn = 1/6 , Pβ = 1/3 −1 −1 −1 1 1 1 1 1 1 −1 −1 −1 −1 −1 −1 1 1 1 2 −1 −1 0 0 0 −1 2 −1 0 0 0 −1 −1 2 0 0 0 . 0 0 0 2 −1 −1 0 0 −1 2 −1 0 0 0 0 −1 −1 2 Interpreting the regular invariants. Borrowing language from physics (Bacry, 1967), we find it useful to think of (the resulting actions of the) permutations {1, (123), (132)} as the rotations and {(12), (13), (23)} as the reversals. This is also justified when S3 is interpreted as the dihedral group D3 , in which case its action on the vertices of the regular triangle describes its 3-fold rotations as seen from “both sides”. Applying the interpretability principles introduced in Section 2, we observe that (1) P1 x gives the symmetric invariant act + tac + cta + cat + tca + atc; (2) PSgn x gives the alternating invariant act + tac + cta − cat − tca − atc, comparing the total effect of rotations with the total effect of reversals, and (3) Pβ x evaluates as an invariant of within-rotation variability and within-reversal variability, giving up an invariant subspace in dimension of four. The degree of arbitrariness induced by the regular action can be interpreted as the arbitrariness intrinsic to the notion of rotations and reversals, which in a sense is equivalent to a notion of 10 MARLOS VIANA UNIVERSITY OF ILLINOIS AT CHICAGO EYE CENTER planar orientation. To see this, observe that although there are 3! = 6 distinct assignments of the symbols {a, c, t} to the labeled (as 1, 2, 3, say) vertices of a regular triangle, there are only two distinct assignments to those vertices when their labels are removed (by shuffling): namely to the two distinct classes {(a, c), (c, t), (t, a)}, {(c, a), (a, t), (t, c)} of oriented edges. These two classes are the rotations and the reversals. It is simple to verify that the action of the permutations act, tac, cta, cat, tca, atc on these edges replicates precisely the regular action of S3 . With this language, we can then say that (1)-(3) above are the canonical invariants of rotations and reversals. We remark, however, that more generally, (1)-(3) are the regular invariants of S3 . A set of bases for the regular invariants. The projection Pβ suggests the basis {2e1 − e2 − e3 , 2e2 − e1 − e3 } ≡ {u, v}. Observing that −u − v = 2e3 − e1 − e2 , it is simple to verify that the action of S3 on the indices of {u, v} gives a representation " # " # " # " # " # " # 1 0 0 1 −1 −1 0 1 −1 −1 1 0 β' , , , , , , 0 1 −1 −1 1 0 1 0 0 1 −1 −1 where the matrices follow the order of S3 described above. The resulting Fourier transform X b x(β) = xτ βτ τ of the structured data x at β is then " # act − cta − tca + atc tac − cta + cat − tca b x(β) = ' β ⊕ β. −tac + cta + cat − atc act − tac + tca − atc From Proposition 3.1, we identify the two bases Iβ = {act − cta − tca + atc, tac − cta + cat − tca}, and Iβ0 = {−tac + cta + cat − atc, act − tac + tca − atc}, from rows 1 and 2 of b x(β), respectively. These are the invariants we are searching for (in dimension of two). The two (isomorphic) subspaces spanned by these two bases reduce as β under the regular action of S3 , that is, (3.4) obtains. The alternating invariant. Clearly, a trivial application of Proposition p1 with the alternating character gives the invariant ISgn = act + tac + cta − cat − tca − atc comparing rotations and reversals. CANONICAL DECOMPOSITIONS AND INVARIANTS FOR DATA ANALYSIS 11 The invariant basis. The vectors {I1 , ISgn, Iβ , Iβ0 } define the change of basis matrix B= relative to which 1 1 1 1 1 −1 0 1 0 −1 1 −1 1 1 1 1 1 −1 −1 −1 0 0 −1 1 , −1 1 −1 0 1 1 0 −1 0 0 1 −1 BP1 B−1 = Diag(1, 0, 0, 0, 0, 0), BPSgnB−1 = Diag(0, 1, 0, 0, 0, 0), BPβ B−1 = Diag(0, 0, 1, 1, 1, 1). Similarly, the regular action of S3 extended to the components of {I1 , ISgn, Iβ , Iβ0 } gives the representation φ̃τ = Diag(1, Sgnτ , βτ , βτ ), relative to which the canonical projections are, respectively, BP1 B−1 , BPSgn B−1 and BPβ B−1 . Interpreting voting preferences. Adopting the language of (complete) voting preferences, in which the data are often the frequencies with which voters select one of the n! rankings of n candidates or objects, we observe that Iβ = {act − cta − tca + atc, tac − cta + cat − tca}, can be characterized (as highlighted by a) as a basis for the space describing “more radical” (comparing positions 1 and 3) vs “less radical” (comparing positions 2 and 3) position substitutions for the symbol a. The same description applies to Iβ0 , relative to position substitutions of c. These invariants resolve the arbitrariness present when the labeling (1, (123), (132), (12), (13), (23)) ≡ (act, tac, cta, cat, tca, atc) (4.3) is implicitly made. That is equivalent to resolving the arbitrary choice of which labeled as the identity transformation. Also equivalently, resolving the arbitrary choice of the generator for the orbit Os = {sτ −1 ; τ ∈ S3 }. From Proposition 3.1, a different choice of generator would be represented in a space spanned by a basis that is a linear superposition of Iβ . It is also opportune to remark that these data analytic considerations do not depend on any parametric assumption on the underlying probability distributions as they refer to the labels for the data. 12 MARLOS VIANA UNIVERSITY OF ILLINOIS AT CHICAGO EYE CENTER The generic interpretation for the regular invariants of S3 . Following with the interpretations on page 9, it is then possible to summarize the the regular invariants ISgn and Iβ (or Iβ0 ) as (1) The comparison of rotations and reversals, in dimension of 1; (2) The joint within-rotation, within-reversal variability, in dimension of 2. A numerical example. Figure 4.1 displays the two components regular invariant Iβ = {act − cta − tca + atc, tac − cta + cat − tca} ≡ {v1 , v2 }, for the DNA (log transformed) frequency data from Table (4.1). Indicating by ,→ and ←- the two opposing (chiral) orientations of planar rotation, we observe that v1 = |act {z − cta} + |atc {z − tca}, ←- two-fold ,→ one-fold v2 = |tac {z − cta} + |cat {z − tca} ←- one-fold ,→ two-fold Figure 4.1. The regular invariant Iβ along the BRU isolate CANONICAL DECOMPOSITIONS AND INVARIANTS FOR DATA ANALYSIS 13 5. Data indexed by S4 Table 5.1 shows the frequencies xτ , abstracted and adapted from Diaconis (1989), with which voters chose each one of the 24 different rankings (τ ) of four candidates. τ x(τ ) τ x(τ ) τ x(τ ) τ x(τ ) agct 29 tgca 37 cgta 22 gcta 26 agtc 11 actg 25 tacg 43 gtac 44 acgt 19 atgc 46 tgac 24 ctga 57 atcg 50 gcat 24 gatc 35 catg 34 gact 50 gtca 54 ctag 49 tagc 26 cgat 22 cagt 26 tcga 29 tcag 67 . (5.1) For example, xagct = 29 voters ranked candidate a first followed by candidates G,C and T ranked 2, 3 and 4 respectively, whereas 37 voters ranked candidate T followed by candidates G,C and A. The choice of letters is intensional, as the same type of frequency counts appears in molecular biology studies. The complete symmetry study of data indexed by S4 is the study of 5 vector (invariant) subspaces, V = V1 ⊕ VSgn ⊕ 3Vβ ⊕ 3Vγ ⊕ 2Vη reducing the original data space. Each of the three copies of the subspaces Vβ and Vγ are in dimension of three, whereas the two copies of Vη are in dimension of two each. There are, in addition, the alternating (VSgn ) and the trivial (V1 ) subspaces. In what follows we will describe the bases {Iβ , Iβ0 , Iβ00}, {Iγ , Iγ0 , Iγ00} and {Iη , Iη0 } for 3Vβ , 3Vγ and 2Vη , respectively. β= 1 0 0 0 1 0 0 0 1 −1 1 1 0 1 0 0 0 1 0 −1 0 −1 0 0 0 0 1 0 0 −1 0 1 0 −1 0 0 1 0 0 1 −1 −1 0 0 1 1 0 0 0 1 0 1 −1 −1 1 0 0 0 0 1 0 1 0 0 −1 0 1 −1 −1 0 0 1 −1 1 1 −1 0 0 0 0 1 1 0 0 1 −1 −1 0 1 0 1 0 0 0 0 1 1 −1 −1 0 0 −1 −1 0 0 0 1 0 0 −1 0 0 0 1 −1 0 0 0 0 −1 0 1 0 1 −1 −1 −1 1 1 0 1 0 −1 0 0 0 0 −1 1 −1 −1 0 1 0 0 −1 0 −1 0 0 1 −1 −1 −1 1 1 0 0 1 −1 0 0 0 0 −1 0 1 −1 −1 1 0 0 0 1 0 −1 1 −1 −1 0 0 −1 0 0 1 −1 0 0 1 −1 −1 0 −1 1 −1 0 0 1 −1 0 −1 1 1 0 0 −1 0 0 1 −1 −1 −1 0 1 0 0 1 (12) (13) (14) (23) (24) (34) (123) (132) (234) (243) (134) (143) (124) (142) (1234) (13)(24) (1432) (1324) (12)(34) (1423) (1243) (14)(23) (1342) . . (5.2) (5.3) Table (5.2) shows an irreducible representation of S4 in dimension of three (Viana, 2008), indexed in correspondence with the elements of S4 in Table (5.3). When the labels for the data are generated from agct by the permutations in (5.3), in that order, acting on the positions {1, 2, 3, 4} of the 14 MARLOS VIANA UNIVERSITY OF ILLINOIS AT CHICAGO EYE CENTER symbols (via sτ −1 ), then the resulting complete set of labels is given by matrix (5.4). Applying Proposition 3.1 to β with the labels in (1.1), the resulting bases (rows of the Fourier transform of x at β) are shown in (5.5), (5.6), and (5.7). agct gact cgat tgca acgt atcg agtc cagt gcat atgc actg tgac cgta tacg gtca tagc ctag gcta (5.4) tcag gatc ctga catg tcga gtac agct − gact + acgt + atcg + agtc − gcat + actg + atgc − gtca − gcta − gatc − gtac Iβ = −cgat + acgt − gcat + cagt + atgc − tgac − ctag + tagc + ctga − tcag − gtac + tcga , −tgca + atcg + actg − cgta − gtca + tacg − gcta + ctag − ctga + tcag − tcga + catg Iβ0 (5.5) gact − cgat + gcat − cagt − cgta + gtca + gcta − ctag − ctga + gatc + gtac − catg = agct + gact + tgca − acgt + atcg − cagt − atgc + gtca + tacg − tagc − ctga − tcga , −atcg + agtc − actg + atgc + tgac − tacg − ctag + tagc + gatc − tcag + gtac − catg Iβ00 (5.6) gact − tgca + gcat − tgac + gtca − tacg + gcta − tagc + gatc − tcag + gtac − tcga = −acgt + agtc − cagt + actg − atgc + cgta + gcta − tagc − ctga + gatc − tcga + catg . agct + gact + cgat + acgt − atcg + gcat + cagt − actg − tacg − ctag − tcag − catg (5.7) each one spanning one copy of Vβ . Inspection of these bases shows that (1) Iβ describes “g vs a” at positions 1, 3, 4; (2) Iβ0 describes “g vs c” at position 1, 3, 4; (3) Iβ00 describes “g vs t” at positions 1, 3, 4. The position 2, not present above, is in fact g’s position in the generating sequence “agct”. In each subspace, the three coordinates correspond to position 1, 3, 4. Evaluation of the voting data, from Table (5.1), identifies xga = (−1.99, −0.63, 1.16) ∈ Iβ , xgc = (0.77, 1.93, −2.51) ∈ Iβ0 , xgt = (0.30, −1.64, −2.79) ∈ Iβ00 , the rows 1, 2 and 3 of b x(β). The inner product unitary for β in these subspaces is given by 3 1 1 (x, y) = 1/2 x0 1 3 −1 y, 1 −1 3 CANONICAL DECOMPOSITIONS AND INVARIANTS FOR DATA ANALYSIS 15 which gives ||xga|| = 2.87, ||xgc|| = 4.5 and ||xgt|| = 3.14, thus suggesting that the comparison “g vs c” is relatively more evident than “g vs t” or “g vs a”. The reduction in the second invariant space in dimension of three. The companion (distinct) irreducible representation of S4 is simply γτ = Sgnτ βτ . That is, multiply the matrices βτ in (5.2) by the parity Sgnτ of the corresponding permutation. Consequently, the bases for the invariant subspaces of interest are the rows of Fγ = X xτ Sgnτ βτ , τ specifically, agct + gact − acgt − atcg − agtc − gcat + atgc + actg − gtca + gcta − gatc + gtac Iγ = cgat − acgt + cagt − gcat + atgc − tgac − tagc − ctag + tcag − ctga + tcga + gtac , tgca − atcg + actg − cgta + tacg − gtca + ctag + gcta − tcag + ctga − catg − tcga Iγ0 (5.8) −gact + cgat − cagt + gcat − cgta + gtca − ctag − gcta + gatc + ctga + catg − gtac = agct − gact − tgca + acgt − atcg − cagt − atgc + tacg + gtca + tagc + ctga − tcga , atcg − agtc + atgc − actg + tgac − tacg − tagc − ctag + tcag + gatc + catg − gtac Iγ00 (5.9) −gact + tgca + gcat − tgac − tacg + gtca + tagc − gcta + tcag + gatc − tcga − gtac = acgt − agtc − cagt − atgc + actg + cgta + tagc − gcta + gatc + ctga − catg − tcga . agct − gact − cgat − acgt + atcg + cagt + gcat − actg − tacg − ctag + tcag + catg (5.10) To outline the interpretation of these subspaces we need some notation. Indicate by ±Nk the sum of (the data indexed by) the permutations with the symbol N in the k-th position that differ only by cyclically moving the remaining symbols in one direction (+) and in the opposite direction. For example, then, the first axis I1γ = agct + gact − acgt − atcg − agtc − gcat + atgc + actg − gtca + gcta − gatc + gtac Iγ can represented as ±a1 ± g1 . It then follows that I1γ = ±a1 ± g1 , I2γ = ±a3 ± g3 , I3γ = ±a4 ± g4 . Similarly, 0 0 0 00 00 00 I1γ = ±c1 ± g1 , I2γ = ±c3 ± g3 , I3γ = ±c4 ± g4 , and I1γ = ±t1 ± g1 , I2γ = ±t3 ± g3 , I3γ = ±t4 ± g4 . 16 MARLOS VIANA UNIVERSITY OF ILLINOIS AT CHICAGO EYE CENTER The partition λ = 22. ({1, 2}, {3, 4}) ({1, 3}, {2, 4}) ({1, 4}, {2, 3}) ({2, 3}, {1, 4}) ({2, 4}, {1, 3}) ({3, 4}, {1, 2}) u v 1 (12) (13) (14) (23) (24) 1 1 4 5 2 3 2 4 2 6 1 2 3 5 6 3 3 1 4 2 1 4 4 6 5 3 5 1 6 5 6 6 3 2 5 4 u −u u−v v v u−v v −u + v −v u u −v (34) (123) (234) (134) (124) (1234) 1 4 2 4 5 4 3 5 3 6 4 5 2 1 1 2 6 1 5 6 6 5 1 6 4 2 4 1 3 2 6 3 5 3 2 3 −u −u + v −v −u + v −v u−v −u + v −u u−v −u u−v −v (1324) (1243) 6 5 4 1 2 4 5 3 3 6 1 2 −u v −u + v u τ 1 0 −1 0 1 −1 0 1 0 1 1 −1 0 1 −1 1 0 −1 1 0 1 0 0 −1 −1 0 −1 1 0 −1 0 −1 −1 1 −1 1 −1 1 −1 0 1 −1 1 −1 −1 0 −1 0 0 −1 0 −1 −1 1 1 −1 1 0 1 −1 1 −1 1 −1 −1 0 0 −1 0 1 0 −1 −1 0 1 0 −1 0 0 1 1 0 0 1 −1 1 0 1 −1 1 1 0 0 1 1 0 0 0 0 2 0 0 0 −1 −1 −1 −1 −1 −1 −1 −1 0 2 0 0 2 0 0 2 0 Interpretation. Keeping in mind that in the generating permutation the symbol g occupies position number 3, we observe that each subspace compares g against the cyclic (eventually chiral) changes of the remaining symbols in the remaining positions, one subspace for each one of the three positions 1, 2 or 4 to which g is moved to. 6. Comments and summary These Chapter is an invitation to developing new applications of symmetry arguments to the study of experimental data indexed by permutations. It emphasized the methodological aspects of reducing the data space in subspaces where the resulting data summaries can be studied by descriptive and inferential statistical methods. We observe that: CANONICAL DECOMPOSITIONS AND INVARIANTS FOR DATA ANALYSIS 17 (1) It should be well understood that the isomorphic copies of the subspaces determined by b x are not invariant subspaces, as clearly demonstrated by Figures 4.1 and ??. The invariant subspace is the direct sum (indexed by the corresponding irreducible representation) of the two copies; (2) Proposition 3.1 was obtained by letting S` act on the left (position symmetry, as in (3.1). A similar result can be obtained for right actions (3.2), resolving symbol arbitrariness; (3) Symmetry studies of regular actions (the group acting on itself) are simpler to interpret compared with other group actions in which the action gives intransitivities. That is, in the regular case, the step (4) described in Section 3 always gives a single orbit. Otherwise, when orbits appear, as illustrated by (3.3), the canonical projections collect together pieces from different orbits and label the resulting classes by the irreducible representations, in one-toone correspondence with the canonical projections. This “popping out into non-isomorphic subspaces” clearly complicates any eventual interpretations. This is why the representations might be restricted to the underlying orbits, for data analytic purposes. This chapter illustrated the evaluation and interpretation of canonical invariants in symmetry studies of data indexed by permutations of three and four symbols. 18 MARLOS VIANA UNIVERSITY OF ILLINOIS AT CHICAGO EYE CENTER Appendix A 2 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 B=6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 4 1 1 1 0 1 −1 −1 0 1 −1 1 −1 1 −1 0 1 1 −1 0 1 1 −1 1 −1 1 −1 −1 0 1 1 −1 1 1 1 0 −1 1 1 0 −1 1 1 −1 1 1 1 −1 1 1 1 0 −1 1 1 0 −1 1 1 −1 1 1 −1 1 −1 1 1 1 0 1 −1 1 −1 1 −1 −1 0 1 1 1 0 1 −1 −1 0 1 −1 0 1 1 1 1 0 1 −1 0 1 0 1 1 0 0 0 −1 1 −1 0 0 1 0 −1 0 −1 0 −1 1 0 0 0 −1 0 1 0 1 1 0 0 0 −1 1 0 1 0 −1 1 1 0 0 0 −1 0 0 1 0 −1 1 −1 −1 −1 0 1 1 −1 1 1 0 0 −1 0 1 0 1 0 −1 0 0 −1 0 0 1 −1 0 0 −1 −1 1 −1 0 0 1 0 −1 0 −1 0 −1 1 0 −1 0 1 0 0 0 1 0 −1 1 −1 0 −1 −1 0 −1 1 −1 1 0 −1 1 0 0 1 −1 0 0 1 −1 1 0 1 −1 −1 1 0 0 0 1 −1 0 1 0 1 −1 0 1 0 −1 −1 0 1 1 0 0 0 1 1 1 0 1 0 1 1 0 0 0 0 1 0 1 0 −1 0 0 0 −1 0 0 −1 1 −1 1 −1 0 0 −1 −1 0 1 0 1 0 −1 −1 0 0 −1 1 0 0 0 1 0 1 −1 −1 1 0 −1 0 1 0 −1 0 1 −1 1 0 1 −1 0 0 0 0 0 0 1 0 0 1 −1 −1 0 −1 0 1 0 1 0 0 −1 −1 1 −1 −1 0 0 0 −1 0 0 −1 0 0 0 1 1 0 1 0 −1 −1 0 −1 0 0 1 1 1 0 −1 −1 0 0 −1 0 0 0 −1 0 1 −1 0 −1 0 −1 −1 0 0 0 1 1 0 0 1 0 0 0 1 0 0 0 0 −1 −1 0 −1 1 0 1 0 0 0 0 1 0 −1 0 1 −1 0 0 1 0 0 0 −1 0 −1 1 0 0 0 0 0 −1 0 1 0 −1 −1 0 0 −1 0 1 0 0 1 1 0 0 −1 1 0 0 1 0 0 −1 0 −1 0 0 0 1 −1 0 −1 −1 0 0 0 0 1 0 1 −1 −1 0 −1 1 1 0 1 −1 0 0 1 −1 1 −1 1 −1 0 −1 0 0 1 −1 0 0 −1 1 −1 0 0 1 1 0 0 1 0 1 1 −1 1 1 1 0 0 0 −1 1 0 1 0 1 −1 0 −1 0 −1 1 0 −1 0 −1 −1 0 1 0 −1 0 −1 0 0 1 −1 0 1 −1 0 −1 1 0 1 −1 0 1 −1 0 0 1 0 0 −1 0 0 1 0 0 −1 −1 0 0 1 1 0 1 0 −1 1 −1 0 0 0 BB0 = where 1 Diag(1, 1, β, β, γ, γ) 24 4 −2 2 −1 −2 4 −1 2 β = 96 , γ = 48 2 −1 4 −2 −1 2 −2 4 6 2 2 6 −2 −1 −3 2 −2 2 −3 −1 −1 −3 −1 −1 6 −1 1 −1 −3 1 −3 −1 1 −3 −1 −1 6 2 2 3 1 −1 −3 2 6 −2 1 3 −1 1 2 −2 6 1 −1 −3 −1 −1 3 1 1 6 2 −1 −3 1 3 −1 2 6 −1 1 −3 1 1 −3 1 −1 3 2 −2 1 −3 1 −1 . 3 2 −2 6 3 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 5 CANONICAL DECOMPOSITIONS AND INVARIANTS FOR DATA ANALYSIS w1 compares the dihedral images of the words acgt and agtc. a ‚ ‚ +‚ c ‚ ‚ ‚ a ‚ ‚ +‚ t ‚ ‚ ‚ a ‚ ‚ −‚ g ‚ ‚ ‚ a ‚ ‚ −‚ c ‚ ‚ ‚ t g c g c t g t g ‚ ‚ +‚ c ‚ ‚ ‚ g ‚ ‚ +‚ t ‚ ‚ ‚ g ‚ ‚ −‚ a ‚ ‚ ‚ g ‚ ‚ −‚ t ‚ ‚ ‚ t a c a t c a c c ‚ ‚ +‚ g ‚ ‚ ‚ c ‚ ‚ +‚ a ‚ ‚ ‚ c ‚ ‚ −‚ a ‚ ‚ ‚ c ‚ ‚ −‚ t ‚ ‚ ‚ a t g t t g a g t ‚ ‚ +‚ g ‚ ‚ ‚ t ‚ ‚ +‚ a ‚ ‚ ‚ t ‚ ‚ −‚ g ‚ ‚ ‚ t ‚ ‚ −‚ c ‚ ‚ ‚ a c g c c a g a 19 20 MARLOS VIANA UNIVERSITY OF ILLINOIS AT CHICAGO EYE CENTER a + t g c t + a c g a − g t c g − c a , (A.1) t References Bacry, H. 1967. Leçons sur la theorie des groupes et les symétries des particules elémentaires, Dunot, Paris, France. Diaconis, P. 1989. A generalization of spectral analysis with application to ranked data, Annals of Statistics 17, 949–979. Riley, K. F., M. P. Hobson, and S. J. Bence. 2002. Mathematical methods for physics and engineering, 2nd ed., Cambridge University Press, New York, NY. Serre, Jean-Pierre. 1977. Linear representations of finite groups, Springer-Verlag, New York. Viana, M. 2006. Symmetry studies and decompositions of entropy, Entropy 8, no. 2, 88–109. . 2008. Symmetry studies- an introduction to the analysis of structured data in applications, Cambridge University Press, New York, NY.