VECTORS and PSEUDO(?)VECTORS INTRODUCTION A vector is defined to be a quantity that has a length (magnitude) and a direction. Vectors are defined independently of coordinate systems, but when we actually work problems, it is usually convenient to choose a coordinate system. After choosing a coordinate system, we can represent a vector by its components along the coordinate axes. These components determine the length and direction of the vector and the definition of a vector is often stated in terms of the properties of the vector components under rotations of the coordinate axes. We will always use Cartesian coordinate systems. We distinguish between two different types of rotations. In a “passive” rotation the physical system is unchanged, but we rotate the coordinate axes. In an “active” rotation we rotate the physical system and leave the coordinate axes unchanged. We will first treat passive coordinate rotations and discuss active rotations later. Under passive rotations of coordinate axes, vectors themselves are unchanged. After choosing a coordinate system, we define unit vectors, ei (i=1,2,3), along the coordinate axes. We can represent a vector A by A=A1e1+A2e2+A3e3 where we indicate vectors by bold faced print. We will use a short hand notation by writing A=Aiei where we introduce the convention that we sum over repeated indices. The unit vectors are chosen to be perpendicular to each other. They satisfy where , the Kronecker delta, is 1 if i=j and zero otherwise. We still have to specify the orientation of the coordinate axes. We usually do this by using a right handed coordinate system, requiring that where we use a right hand rule to determine the direction of the cross product. In making a general passive “rotation” to another Cartesian coordinate system we assume that the unit vectors in the new system satisfy and , but we do not require that . The orientation is not necessarily preserved. If , the new coordinate system is left handed and we say that the rotation is “improper.” An improper rotation from one coordinate system to another is one that cannot be made continuously starting from the identity rotation. An example of an improper rotation is a coordinate inversion . Since a vector is independent of coordinate system, it must be true that For an inversion, and . VECTOR CROSS PRODUCTS Vector cross products are very useful in formulating the laws of classical physics. Many textbooks1,2,3,4 state that the transformation properties of cross products under passive improper rotations are different from “true” vectors. Feynman5, for example, says that “The original quantities, like the angular momenta and the angular velocities, and things of this kind were not really vectors at all!” He goes on to say that “there are ‘honest’ vectors, for example a r in space…. But the other kind of vector, which has to do with rotations, is of a different nature.” The “honest” vector is called a “polar” vector while the other kind of vector is called an “axial” vector or a “pseudovector.” We will show that cross products have precisely the same transformation properties as polar vectors under any passive orthogonal transformation. This property of the cross product has been pointed out previously6,7, but has been largely ignored. The distinction between polar and axial vectors arises only when an improper, active rotation of the physical system occurs and the coordinate system remains fixed. Before proving that cross products transform in the same way as polar vectors under any passive rotation, we will first argue that the claim that they transform differently cannot be true. Let A and B be two polar vectors and let Under a change of coordinates, A and B are invariant but the above texts1-5 claim that if the coordinate change is an improper rotation then Commonsense tells us that this is unreasonable since A and B are invariant. Commonsense is not always a reliable guide, but in this case it turns out to be right. can only be true if we must switch from a right hand rule to a left hand rule if we change from a right hand coordinate system to a left hand coordinate system. Many authors5,8 say it is necessary to link the handedness of the cross product to the handedness of the coordinate system. These authors use the same symbol for the cross product in any coordinate system. This creates the illusion that the equations are independent of coordinates, when in fact their rule for the cross product depends on the coordinate system. The principle of relativity states that the laws of physics are the same in all inertial frames of reference. That means that the same definition of cross product must be used in all inertial coordinate systems. Switching from one definition to another depending on the orientation of the coordinate system violates this principle. It also makes it impossible to find the direction of angular momentum or a magnetic dipole moment without first choosing a coordinate system. We will always interpret “ ” to mean that a right hand rule is used. If the cross product of two polar vectors is a “pseudovector,” this will be true for the cross product of any two polar vectors. However, it is obvious that if C is a polar vector, it always possible to find two other polar vectors A and B perpendicular to C so that . This is just a geometric accident. Does imply that e3 is a pseudovector? If it did, then it would be impossible to express a vector in component form because the unit vectors would behave differently under improper rotations. TRANSFORMATION PROPERTIES OF CROSS PRODUCTS Arfken and Weber5 define cross products first by using a right hand rule for determining the direction of and defining where θ is the smallest angle between the two vectors. They call this the “geometric” definition of a vector cross product. Then they express the cross product using the familiar determinant, This leads to the equation C3= A1B2-A2B1 with similar expressions for the other components. Arfken and Weber call this the “algebraic” definition of a cross product. They conclude that under an inversion If C were a polar vector, under an inversion we would have . This leads them to state that C does not transform like a polar vector and hence under an inversion. If we make the replacements the expression above, then in This gives the correct result that Arfken and Weber’s algebraic definition is consistent with the geometric definition in a right hand coordinate system, but it is inconsistent with the geometric definition in a left hand coordinate system. Other texts1,2,3 do the same thing, and this seems to be the source of the confusion. Using the “algebraic” definition gives for any orientation of the coordinate axes. This is inconsistent with using a right hand rule in a left hand coordinate system. We will show below that the cross product of two polar vectors transforms exactly the same way as polar vectors under any passive rotation, proper or improper. To do this it is useful to introduce the Levi-Civita symbol . It is defined so that and changes sign if any two indices are interchanged. This means that if any two indices are equal. There are only six non-zero elements: This is useful because the determinant of a 3X3 matrix R is defined by and satisfies (this can be generalized to an NXN matrix) . (1) This relation can be used to prove that the determinant of the product of two matrices is the product of the determinants of the matrices and that the determinant of the transpose of a matrix is equal to the determinant of the matrix. It can also be used to find the inverse of a matrix. Since we can represent the cross product of two vectors as a determinant, we can use the Levi-Civita symbol as a short hand way to represent a cross product in component form. The determinant expansion for the cross product can be conveniently written as , remembering to sum over repeated indices. You should remember that this is true only in a right hand coordinate system. We will derive the appropriate form for a left hand coordinate system. We will consider transformations of vector components between two Cartesian coordinate systems with a common origin. We will assume that the initial coordinate system is a right-handed system with unit basis vectors that satisfy (using a right hand rule for the cross product) , and (2) (3) We can transform to another set of orthonormal basis vectors by using the “rotation” matrix (4) where (5) It is also true that (6) We can expand a vector using either set of basis vectors. (7) The components of the vector in the different coordinate systems are related by (8) The rotation matrix itself is an orthogonal matrix and If If is the transpose of R, then the rotation is improper and the orientation of the coordinate system changes. It is easy to determine cross products of the new basis vectors. We have We use Eq.(1) to obtain . For an improper rotation (9) and in a left hand coordinate system We assume that A and B are two polar vectors and . We can find the components of C in the two different coordinate systems, starting from the right-hand system. (10) Thus (11) and . (12) We see that (13) Thus the components of C transform exactly like the components of a polar vector and e3 has the same transformation properties as e1 and e2 under any passive coordinate transformation. Many authors8 claim that under an inversion . It is clear from Eq. (12) that this is not correct and under an inversion. PSEUDOSCALAR PRODUCT A related claim is often made that the triple scalar product, , where A, B, and C are all polar vectors, is a “pseudoscalar” under inversions of a coordinate system and it is called the pseudoscalar product. If this were true, it would mean that changes signs if the coordinate system is inverted. Again, this defies commonsense and is wrong. This is proved incorrect by using the equations derived above. Start from a right hand coordinate system and make a transition to another orthogonal coordinate system. We note that and . The same result is obtained for the triple scalar product regardless of the orientation of the coordinate system. Inverting the coordinate system gives . The triple scalar product does not change sign if the rotation is improper and it is not a pseudoscalar. ACTIVE ROTATIONS We will now consider active rotations which means that the physical system is rotated, keeping the coordinate system fixed. Vectors are invariant under coordinate rotations, but they change under rotations of the system. Here we have after the rotation. In order to determine , it is convenient to also rotate the coordinate system along with the active rotation as we did for passive rotations, but in the end we calculate everything in the original system. Then the components of in the rotated coordinate system are the same as the components of A in the original system. We want to compute the . What we get is, using the same rotation matrix as before (Eq.(5)), . (14) Thus the rotation matrix that gives us the components of the rotated vector in the original coordinate system, is the transpose (or inverse since the matrix is orthogonal) of the matrix that previously gave us a passive rotation of the coordinate system. Now suppose we ask how angular momentum, , transforms under a general rotation. Angular momentum in the rotated physical system is still defined by . If we use a right hand coordinate system, We have to evaluate We start with Eq.(1) and multiply by . Thus (15) and (16) Cross products of polar vectors do not transform in the same way as polar vectors under active improper rotations. There is an extra factor of det(R). There is an important difference for physical quantities resulting from cross products under active rotations that distinguish them from polar vectors. Under a pure active inversion L does not change signs, but r and p do. Under a passive rotation, proper or improper, none of them changes sign. LORENTZ TRANSFORMATIONS We have been assuming that one observer is simply comparing the components of vectors in different Cartesian coordinate systems. A Lorentz transformation is different because it explicitly enables two observers to compare the results of measurements made of the same physical system in different reference frames. When one gives a coordinate system a boost, the observer is also boosted. When the coordinate system is inverted, the observer is also inverted. Although Lorentz transformations must be used to transform between two coordinate systems that are moving relative to each other, it is still relevant to ask how vectors, quantities with length and direction, transform under rotations between two coordinate systems at rest with respect to each other and compare the results with the Lorentz transformation. Consider two observers at rest, but inverted with respect to each other. Assume there is a current loop at the origin in each system with its plane in the plane (or plane). B transforms as components of a second rank tensor with two spatial indices and it is easy to see that in one frame is related to in the other frame by . Since the z-axes are inverted with respect to each other, the two observers claim the magnetic fields are in opposite directions. This is exactly what many people claim happens when one makes a passive coordinate inversion. The observations in the two reference frames appear to be inconsistent, but they are actually consistent. Each observer uses a right-hand rule in his reference frame to determine the direction of the magnetic field, but each observer sees the other observer using a left-hand rule. Taking this into account, they agree. One possible reason for the confusion surrounding the transformation properties of cross products might be because of the natural expectation that cross products, represented by second rank tensors in relativity, should transform the same way under Lorentz transformations as they do under simple rotations as long as the rotations are limited to coordinate systems at rest with respect to each other. We will see below that this is not true. In relativity, electric and magnetic fields transform as components of a second rank tensor. For Lorentz transformations involving only a rotation, proper or improper, the space and time components do not mix and the space components transform as (assuming an orthogonal transformation) . (17) It is unnecessary to distinguish between covariant and contravariant indices since there is no mixing of space and time components. The space components are related to the magnetic field by (up to a possible sign, depending on which convention is used) (18) For example and, using Eq.(15), (19) Now use so that (20) For a Lorentz transformation, there is an extra factor of det(R) in the transformation law compared to Eq.(8). The Lorentz transformation does not give the same result as an improper rotation of the coordinate system for coordinate systems at rest. REFERENCES 1. H. Goldstein, Classical Mechanics, (Addison-Wesley Publ. Co., Reading, Mass., 1980) Second Edition, p 171-172. 2. L. Landau and E. Lifschitz, Classical Theory of Fields (C, pp. 19-20. 3. P. Morse and H. Feshbach, Methods of Mathematical Physics (McGraw-Hill Book Company Inc., New York, 1953), Part I.p.10. 4. G. B. Arfken and H.J. Weber, Mathematical Methods for Physicists (Elsevier Press Academic Press, Burlington, MA 2005) 5. Richard Feynman, Feynman Lectures on Physics, (Addison-Wesley Publ. Co., Reading, Mass., 2005) Vol. 1 Chap. 52 6. A. V. Masket, Am. J. Phys. 34 164 (1966) 7. H. Gelman, Am. J. Phys. 38 599 (1970) 8. W. Hauser, Am. J. Phys. 54 168 (1986) 9. See, for example, D. J. Griffiths, Introduction to Electrodynamics, (Prentice-Hall Inc., New Jersey, 1999) Third Edition, p 12.