Honors Topics in Electrodynamics

Physics 212 Honors topics in electrodynamics Introduction and syllabus Spring, 2003 George Gollin University of Illinois at Urbana-Champaign 2003 Physics 212, Spring 2003. Introduction and syllabus .1 2 Honors topics in electrodynamics Physics 212 (...though it's officially PHYCS 199&HO, and it will be renumbered "PHYCS 222" next fall) Instructor: George Gollin; Loomis 437D. 333-4451; g-gollin@uiuc.edu Teaching assistant: To be announced Course secretary: Donna Guzy; Loomis 441. 333-4452; d-guzy@uiuc.edu Prerequisites: Physics 111 and associated math courses. Corequisites: Concurrent registration in Physics 112. Description: In Physics 212 (called PHYCS 199&HO by the registrar for the time being), students will investigate some of the most interesting topics in electricity and magnetism at a more sophisticated level than would be appropriate for Physics 112. We welcome all interested students who have done well in Physics 111; it is likely that about half of Physics 212 students will be physics majors. Enrollment is limited (by fire safety considerations) to 30 students. Electrodynamics is a beautiful subject which has, at its heart, the fever-dream alienness of Special Relativity. The subject offers students their first opportunity to see how profoundly different physical reality is from our familiar (and grossly inaccurate) classical world. We will cover a number of subjects related to those presented in Physics 112, but with a different slant: our intention is to delve more deeply into a limited number of topics in order to supplement the material of Physics 112. We will call upon the superior mathematical skills of our Physics 212 students to allow the course to deal with electrodynamics in a more sophisticated manner which complements (but does not replace) Physics 112. The format of the course comprises weekly meetings in which students will work together in small groups (closely supervised by the instructor and teaching assistant) to solve a number of simple problems, then to discuss the conclusions one can draw from the results. In this way, students will derive for themselves some of the surprising features of our post-classical physical world. For example, the simple fact that the speed of light is constant gives rise to the existence of magnetic fields, the nature of special relativity, and the origins of electromagnetic radiation. We will review mathematical techniques as required. Students will be expected to become comfortable using calculus to solve problems as necessary. Completion of each of the weekly homework sets will require an hour or two. Grades will be based most strongly on participation in the weekly meetings and performance on problem sets. Physics 212, Spring 2003. Introduction and syllabus .2 3 We want the course to be interesting, engaging, challenging, and lots of fun for both the students and the staff. Since Physics 212 is a "topics" course, we'll get to focus on the cool stuff, while relying on Physics 112 to provide broad coverage of the entire subject. It is likely that most students who are capable of working at the required level for this honors course will receive high course grades. In more detail, we will cover: 1. Special relativity: time dilation and Lorentz contraction 2. Special relativity: non-simultaneity; the Lorentz transformations 3. A microscopic version of Gauss’ law 4. Why superposition works 5. Pinwheels (curl and the vector potential) 6. The origin of the magnetic field as a consequence of special relativity 7. Operational amplifier circuits (meet in Loomis 336) 8. Electronic models of mechanical systems (meet in Loomis 336) 9. Analog computers (meet in Loomis 336) 10. Calculating magnetic fields using the vector potential 11. Maxwell's equations 12. Visualizing fields and potentials with Mathematica (meet in Loomis 257) 13. Studying electromagnetic radiation with Mathematica (meet in Loomis 257) 14. Oops! It doesn't really work like that! (Quantum mechanics enters the picture) 15. The electrodynamics of barbecuing We will review techniques involving differential equations and vector calculus as necessary. Do not be alarmed by the amount of calculus in some of the units! It’ll sink in as you work with it. Office hours: Professor Gollin: just call, or walk in. Email is also usually a reasonably quick way to reach me. Teaching assistant: TBA Physics 212, Spring 2003. Introduction and syllabus .3 4 Physics 212, Spring 2003. Introduction and syllabus .4 5 Physics 212 Honors topics in electrodynamics Unit 1 Special relativity: time dilation and Lorentz contraction Spring, 2003 George Gollin University of Illinois at Urbana-Champaign 2003 Physics 212, Spring 2003. Unit 1 Special relativity: time dilation and Lorentz contraction 1.1 6 Unit 1: Special relativity-- time dilation and Lorentz contraction Introduction: the speed of light is finite, and constant. You're going to love this: its consequences are the weirdest things you'll see in physics before you learn about quantum mechanics. All the strange features of special relativity come from the fact that the speed of light is exactly 2.9972458 × 108 meters per second. It doesn't matter whether the source of light moves with respect to the measuring apparatus: any device measuring c (the speed of light) will obtain, within experimental error, this value. Nothing else works this way: sound travels at fixed speed with respect to the air, for example. The value 2.9972458 × 108 meters per second really is exact: it serves as a definition, along with some time standard, of the length of one meter. The speed of light is very close to 1 foot per nanosecond, so we'll use that as a convenient approximation this week and next week. Here’s what you’ll discover today: • Events which are simultaneous in one frame of reference are not necessarily simultaneous in another frame. • The rate at which time passes in a moving frame of reference is slowed down. • Moving objects become shorter along their direction of motion. Exercise 1.1: constant c; simultaneity. (a) Mr. Urkin, sociopathic co-proprietor of Urkin's Deli and Hardware Emporium, is capable of throwing a snowball at 50 miles per hour. While riding as a passenger in a delivery truck traveling 40 miles per hour, he hurls snowballs at stationary pedestrians ahead of, and behind, Urkin's Deli and Hardware Emporium 40 mph rearward victim foward victim the moving truck. Calculate the speeds of the snowballs from the perspectives of the two pedestrians shown in the diagram. Physics 212, Spring 2003. Unit 1 Special relativity: time dilation and Lorentz contraction 1.2 7 (b) The starship Nostromo, drifting in deep space, passes a large interstellar recreational facility which is also adrift, as shown in the figure. Observers on the recreational facility see the Nostromo as moving to the right, with speed 0.8c. While the ship is between the ends of the rec facility, it fires its communications lasers to send signal pulses in the forward and aft directions to receivers built into the ends of the rec facility. receiver burst of laser light N Nostromo v=0.8c burst of laser light receiver Six Flags Over τ Ceti iv According to technicians on the recreation facility, how fast are the two light pulses traveling as they pass through the receiver hardware built into each end of the rec facility? Note that (slowly moving) snowballs do not behave the same way as bursts of light! Physics 212, Spring 2003. Unit 1 Special relativity: time dilation and Lorentz contraction 1.3 8 (c) The Sulaco, another fast ship, glides past the same drifting interstellar recreational facility. As before, observers on the recreational facility see the ship moving to the right with speed 0.8c. Unfortunately, Sulaco's X-band antenna strikes the rec facility's TV parabola, interrupting the final episode of Pride and Prejudice. Sulaco's wrecked antenna is midway along the ship's length; the rec facility's ruined parabola is also midway along the facility's length. Observers on the recreation facility see the Sulaco's length as 600 feet. The rec facility's length, according to the manufacturer, is 2000 feet. S receiver Sulaco boom! v=0.8c receiver Six Flags Over τ Ceti iv 600 feet 2000 feet According to technicians on the recreation facility, when does the flash of light from the collision reach the rec facility's right- and left-receivers (assume the flash occurs at t = 0)? According to technicians on the recreation facility, when does light from the collision reach Sulaco's aft periscope (located at the rear of the ship)? When does light reach the forward periscope? Physics 212, Spring 2003. Unit 1 Special relativity: time dilation and Lorentz contraction 1.4 9 (d) From the perspective of the Sulaco, the collision happens because the incompetently-piloted recreational facility smashes into Sulaco’s antenna as it lumbers past at 0.8c, moving to the left, as shown in the figure. S Sulaco boom! v=0.8c Six Flags Over τ Ceti iv According to cosmonauts on the Sulaco, does the flash of light from the collision reach the Sulaco's forward and aft periscopes simultaneously? According to cosmonauts on the Sulaco, does the flash of light from the collision reach the rec facility's right- and left-receivers simultaneously? Discussion Imagine we use the receipt of the light flash as a way to synchronize clocks on the rec facility: we set a pair of stopped clocks, one at each end of the rec facility, to t = +1000 nsec. When the light flash arrives at the left end, we start the left clock ticking. When the light flash arrives at the right end, we start the right clock ticking. How do the clocks look from the two frames of reference (recreation facility's and Sulaco's)? Physics 212, Spring 2003. Unit 1 Special relativity: time dilation and Lorentz contraction 1.5 10 Exercise 1.2: a light clock; time dilation (a) Mrs. Urkin, the irritable, but surprisingly fit co-proprietress of Urkin's D&HE, drifts in deep space aboard the gigantic starship Vulcan Flashrocket. Since she realizes that the speed of light is approximately one foot per nanosecond, Mrs. Urkin constructs a clock using two mirrors, separated by a distance of 500 million feet, and a bouncing light pulse. The pulse is of brief duration-- the laser that produced it fired for only a nanosecond-- and it bounces between the mirrors without attenuation. 500 million feet Urkin's Deli and Hardware Emporium p ath of bouncing light p ulse Vulcan Flashrocket Bombs R Us Free delivery As shown in the figure, she holds the lower mirror in one hand, moving it briefly into place to reflect the descending light beam. After each successful reflection she removes the mirror. Because Mrs. Urkin knows her resting pulse rate, she has constructed the device so that she will need to move the mirror into position immediately after each heart beat. It is fortunate that she does not suffer from cardiac arrhythmia since the bomb stored below the light clock will detonate if its optical sensor is struck by the light beam. What is Mrs. Urkin's pulse rate? Physics 212, Spring 2003. Unit 1 Special relativity: time dilation and Lorentz contraction 1.6 11 (b) The Flashrocket is passed by that hazardous interstellar recreational facility while Mrs. Urkin is occupied with her light clock. Fortunately, no collision takes place. From the perspective (the "frame of reference") of technicians on the rec facility, the Flashrocket is seen to glide past the facility at speed v, as shown in the figure below. Urkin 's Deli a nd Hard ware Emporiu m Vulca n Flashro cke v (=0.8c) Free delivery Six Flags Over τ Ceti iv As seen from the rec facility's perspective, does Mrs. Urkin's bomb detonate? 500 million feet (c) As seen from the recreational facility, Mrs. Urkin's light pulse travels along a diagonal path, as shown below. Keep in mind that the Flashrocket is seen to move with speed v, and that the light pulse is seen to travel one foot per nanosecond! mirror assembly moves with speed v c path of light pulse How long is the path followed by the light pulse during one lower-mirror-to-upper-mirror-tolower-mirror trip? (Express your answer in terms of v, c, and so forth.) If v = 0.8c, how long does it take for the light pulse to make the bottom-to-top-to-bottom trip? Physics 212, Spring 2003. Unit 1 Special relativity: time dilation and Lorentz contraction 1.7 12 (d) If we define ∆t to be the time required for one bottom-to-top-to-bottom light pulse cycle in Mrs. Urkin's frame of reference, and ∆t' to be the time required for one cycle according to observers on the recreational facility, what is the ratio ∆t' / ∆t? Express your result in terms of v and c. (e) What is Mrs. Urkin's pulse rate, according to observers on the recreational facility? Discussion • To Mrs. Urkin, everything seems fine. To observers in the other frame, Mrs. Urkin is doing everything slowly. • This MUST be true if something (light) exists which has the same velocity in all frames regardless of any source-observer relative motion. • All systems used to measure time are affected the same way (heart beat, light clocks, mechanical clocks,...) Physics 212, Spring 2003. Unit 1 Special relativity: time dilation and Lorentz contraction 1.8 13 Exercise 1.3: measuring the length of a moving object using a single clock. (a) Floating comfortably aboard the Vulcan Flashrocket, Mrs. Urkin measures the length of the Nostromo by timing how long it takes for the Nostromo to coast past her position, moving to the right at speed v. Here's how Mrs. Urkin's length measurement looks to the crew of the Nostromo: they see the Flashrocket gliding past with speed v and, naturally, see that Mrs. Urkin's clock is ticking slowly. The Nostromo crew can measure the length of their own ship (let's call it L) by seeing how long it takes the Flashrocket to fly the full length of the Nostromo. (Let's call the time required ∆t). L N ∆t v = 0.8c Free delivery Flashrocket Nostromo Nostromo ∆t′ Flashrocket clock stops when its clock lever strikes Nostromo's aft probe Urkin's Deli and Hardware Emporium Flashrocket ∆t Flashrocket clock starts when its clock lever strikes Nostromo's forward probe (As seen from the Nostromo's frame of reference) What is L in terms of ∆t and so on? (L is the Nostromo's rest length. This is a ridiculously easy problem!) Physics 212, Spring 2003. Unit 1 Special relativity: time dilation and Lorentz contraction 1.9 14 (b) In Mrs. Urkin’s rest frame, the measurement looks as shown in the figure. Her clock is started and stopped by the probes mounted near Nostromo’s nose and tail. Naturally, she sees that Nostromo’s clocks (not shown) are ticking slowly, and are not synchronized. (Note that Nostromo’s aft clock must still read ∆t, however, as it passes Mrs. Urkin.) L´ N ∆t Nostromo Flashrocket clock stops when Nostromo's aft probe strikes clock lever Free delivery Flashrocket v = 0.8c ∆t´ Urkin's Deli and Hardware Emporium ? Flashrocket clock starts when Nostromo's forward probe strikes clock lever (As seen from the Flashrocket's frame of reference) Define L′ to be the Nostromo length measured by Mrs. Urkin; the time interval which passed on the Flashrocket clock is ∆t′. In terms of v and ∆t′, what is the value she obtains for L′? (Clearly, we have another difficult problem here!) (c) What is the ratio L' / L, expressed only in terms of v and c? (Lorentz contraction!) Keep in mind that ∆t′ < ∆t. Discussion All methods of performing a length measurement (tape measure, timing how long it takes a moving object to coast past a fixed point,...) will yield consistent results in one frame of reference. Physics 212, Spring 2003. Unit 1 Special relativity: time dilation and Lorentz contraction 1.10 15 Exercise 1.4: is there any “transverse” contraction? Lengths parallel to the direction of motion become shorter as v → c. What happens to lengths perpendicular to the direction of motion? Consider a long stick which carries a pair of paint balls, spaced along the edge of the stick which is perpendicular to its velocity. The stick crashes into a stationary ruler, as shown in the figure. 1 2 How far apart are the paint marks which are left on the ruler? 3 paint balls on moving stick Imagine that a second observer views the same process (the very same ruler-stick collision shown in the above diagram), but from a frame in which the stick is at rest and the ruler is in motion. 1 How far apart will this observer think the paint marks are? 2 What conclusion can you draw concerning contraction perpendicular to the direction of motion from this system? 3 paint balls on stationary stick Summary and wrap-up discussion • Time is messed up: Physics 212, Spring 2003. Unit 1 Special relativity: time dilation and Lorentz contraction 1.11 16 1. moving clocks take longer per tick by a factor of 1 • • • • ( ) 1 − v2 c2 . 2. simultaneity doesn't work: clocks "at the front" read earlier times than clocks "at the back" All time references are affected identically: light clocks, heartbeat,... Conclusions are valid if anything (light, neutrinos, gravitons,...) is found to move with constant velocity, independent of source-observer motion. Some things don't change when one switches frames of reference (e.g. bomb did/didn't explode). ( ) Length is messed up: moving objects are shorter by a factor of 1 − v 2 c 2 , but only along their direction of motion. Comments about homework assigned for next week Physics 212, Spring 2003. Unit 1 Special relativity: time dilation and Lorentz contraction 1.12 17 Reading and homework should be completed by this Friday, 4pm. Please leave your finished problem set in the TA’s mailbox by that time. We are available for help and questions as they arise; please call (let the phone ring until a secretary picks it up if I am not in my office), drop in, or send us email. Required reading: 1. Introduction and syllabus handout. 2. Notes on Special Relativity, pp. 1 – 23 as necessary to clarify points that remain obscure after class. Optional reading, just for fun: Einstein’s Dreams, Alan Lightman. Warner Books (paperback), 179 pages, 1994. Comments from Amazon.com: "The book takes flight when Einstein takes to his bed and we share his dreams, 30 little fables about places where time behaves quite differently. In one world, time is circular; in another a man is occasionally plucked from the present and deposited in the past: "He is agonized. For if he makes the slightest alteration in anything, he may destroy the future ... he is forced to witness events without being part of them ... an inert gas, a ghost ... an exile of time." Einstein's Bridge, John Cramer. Avon Books (paperback), 1998. A high energy physics experiment at the SSC lab in Texas opens a channel into another universe. There are good things and bad things lurking out there, and both find their way through the portal. I suppose this really is about a quantum version of General Relativity, rather than Special Relativity, but it was really fun to read. Problem 0 (5 points): Make your own hardcopies… Print (from the web) your own copy of the solutions to this week’s in-class exercises and write a (true!) statement on your problem set asserting that you have successfully printed your personal copy. Problem 1 (10 points): KS0 meson decay Neutral K mesons (“kaons”) are unstable particles composed of a quark and an antiquark. They can be produced copiously in energetic collisions between stable particles. The neutral kaon mass is roughly 1/2 the proton mass; short-lived neutral kaons usually decay into pairs of (lighter) pi-mesons through the decays KS0 → π+π- and KS0 → π0π0. The average KS0 lifetime for a kaon at rest is 0.8935 × 10-10 seconds. A beam of fast KS0 is produced at a national laboratory. The average distance traveled by a kaon before decaying in flight is found to be approximately 9.2 cm. How fast are the kaons moving? (They are not traveling faster than c!!) You should assume all kaons are moving at the same speed v and use the value c = 3 × 1010 cm/sec in your calculations. Physics 212, Spring 2003. Unit 1 Special relativity: time dilation and Lorentz contraction 1.13 18 Problem 2 (10 points): The space telescope sees clearly Two photons from a pair of faint, distant objects simultaneously enter the open end of the Hubble Space Telescope’s barrel. One of the objects is moving at 0.5 c directly towards the telescope, while the other is moving at 0.5 c directly away from the telescope. Assuming that the distance from the telescope’s open end to its primary mirror is 20 feet, calculate how long it takes for each of the two photons to arrive at the primary mirror after entering the open end of the telescope barrel (as determined by observers in the rest frame of the telescope). Problem 3 (10 points): The Minoans synchronize their watches The identical sister ships Linear A and Linear B pass each other in deep space, as shown from the perspective of the Linear A’s crew in the figure below. In its rest frame, each ship is 2000 feet long; the crew of Linear A sees the other ship glide by with speed 0.8 c. As the midpoint of Linear B passes the midpoint of Linear A, parabolic antennas on the two ships collide. The brief flash of light from the collision travels outwards from the point of contact. v=0.8c ? ? Linear B L-B boom! Linear A L-A 2000 feet (a) According to the crew of Linear A, how long is the ship Linear B? (b) According to the crew of Linear B, how long is the ship Linear A? (c) Both ships carry (initially stopped) clocks at their forward and aft ends. Crew members stationed next to each of the clocks start them ticking as soon as they see the light from the explosion. After all clocks have finally begun ticking, how do the clocks on Linear B look to the crew on Linear A? If you conclude that the Linear A crew believes them to be out-of-synch, which Linear B clock reads the later time according to Linear A observers? Physics 212, Spring 2003. Unit 1 Special relativity: time dilation and Lorentz contraction 1.14 19 Physics 212 Honors topics in electrodynamics Unit 2 Special relativity: Non-simultaneity; the Lorentz transformations Spring, 2003 George Gollin University of Illinois at Urbana-Champaign 2003 Physics 212, Spring 2003. Unit 2 Special relativity: Non-simultaneity; the Lorentz transformations 2.1 20 Unit 2: Special relativity-- Non-simultaneity; the Lorentz transformations Recap of last week Introduction: reference frames and measurements. The only quantitative features of special relativity we've seen so far are the rates at which moving clocks slow down and moving objects shrink. If we're clever, we can use this to bootstrap our way into drawing other quantitative conclusions concerning measurements such as the non-simultaneity of events which are simultaneous in a different reference frame. What do we mean by "reference frame?" The idea is simple: it's just a coordinate system in which all our clocks and rulers are at rest, and all our clocks are synchronized. A measurement of a spatial interval between two events amounts to determining the distance between the two clocks placed next to the events. The time interval is just the difference in the readings on the faces of the two clocks. For example, the space-time interval between waking up and entering my office involves a distance of about 2 miles and a time interval that depends on how sleepy I am. Synchronizing clocks in a reference frame is easy: we preset all of them so that a clock which is R feet from the origin reads +R nanoseconds. Each clock (initially paused) starts running when it senses the light from a strobe, placed at the origin, which fires at time zero. Keep in mind that most measurements involve both position and time: measuring the length of a moving object as it glides past our (stationary) ruler requires us to see where the two ends of the object are, in comparison to the scale printed on our ruler, at the same time. Timing how long a moving object needs to coast past our stopwatch requires that the stopwatch remain in the same place during the timed interval. We'll continue to use the approximation c = 1 foot per nanosecond this week. Here’s what you’ll find/work on today: • A quantitative expression for the non-simultaneity of events viewed from one frame when the events occur simultaneously in another frame • Putting it all together: deriving the Lorentz transformations Physics 212, Spring 2003. Unit 2 Special relativity: Non-simultaneity; the Lorentz transformations 2.2 21 Exercise 2.1: quantitative description of non-simultaneity. We know from last week that clocks at the front of a moving object read earlier times than clocks at the back of the object. Let's work this up, quantitatively, now. Here's the setup: a flash is produced at the tail of the Nostromo as its rear clock reads zero. The light travels the length of the ship, arriving at a sensor at its nose. We can describe the production and reception of the light in terms of a pair of events (an event is something which happens at one point in space and lasts only an instant). The first "event" is the production of the spark near the Nostromo's aft clock. The second event is the arrival at the forward sensor of light from the spark. event 1 N t=0 Nostromo t=0 Nostromo t = L/c flash of light rest length L event 2 N t = L/c flash of light Since Nostromo's rest length is L, and all its clocks are synchronized in its rest frame, we know that the light will arrive at the forward clock when that clock reads L/c. Perhaps the light from the arriving flash is used to expose some photographic film to record permanently the reading on the forward clock when the light arrives. There is a well-defined spatial and temporal interval between the two events in this reference frame (∆x = L and ∆t = L/c). From the perspective of observers who see Nostromo moving to the right, the ship's length is Lorentz contracted to L 1 − ( v c ) . Nostromo's forward clock is seen to show an earlier time 2 than its aft clock. (Let's call this difference δ.) The two events look like this: Physics 212, Spring 2003. Unit 2 Special relativity: Non-simultaneity; the Lorentz transformations 2.3 22 L 1 − ( v c) event 1 2 N Nostromo t=0 t = -δ v flash of light 0 Six Flags Over τ Ceti iv L 1 − ( v c) event 2 0 2 N t = L/c + δ vt' Nostromo t = L/c flash of light ct' t' Six Flags Over τ Ceti iv t' Event 1: spark is produced at the aft end of Nostromo. The Nostromo's tail clock reads 0; the nose clock -δ. (We already know that δ is a positive number, since the forward clock reads an earlier time than the aft clock.) Since we are viewing events from the rest frame of the observers on the rec facility, all clocks on the recreation facility read zero. Event 2: light from spark reaches forward end of Nostromo. From the photograph taken in the Nostromo rest frame, we know that Nostromo's forward clock will read L/c when the light arrives. The tail clock always reads δ later than the nose clock, according to rec facility observers. The rec facility clocks all read t'. In time t', the light from the spark has traveled a distance ct' to catch up with the nose of Nostromo, while the Nostromo itself has moved a distance vt'. There is a well-defined spatial and temporal interval between the two events in this reference frame. (∆x' = ct' and ∆t' = t' -- note that I'm using coordinates with primes to describe variables in the rec facility frame.). Physics 212, Spring 2003. Unit 2 Special relativity: Non-simultaneity; the Lorentz transformations 2.4 23 (a) Note that (in the rec facility rest frame) the light beam travels a distance ∆x' = c∆t' which is the same as the sum of the Lorentz-contracted Nostromo length and the distance traveled by the Nostromo. Write an equation which says the same thing as the previous sentence, and then use this to solve for ∆t' in terms of L, v, and c. (b) When event 2 occurs in the rec facility rest frame, Nostromo’s forward clock must read L/c and its aft clock must still be ahead of the forward clock by an amount δ. As a result, the reading on Nostromo’s slowly-ticking aft clock has advanced from 0 to L/c + δ during the time it took for a rec facility clock to advance (in the rec facilty rest frame) from 0 to ∆t'. Since the moving Nostromo clocks are ticking slowly in comparison to the rec facility’s clocks, we must have L c + δ = ∆t ′ 1 − v 2 c 2 . Use this fact to derive an expression for δ in terms of L, v, and c. Please simplify your expression as much as possible. Discussion How do the rec facility clocks look to the Nostromo crew? Physics 212, Spring 2003. Unit 2 Special relativity: Non-simultaneity; the Lorentz transformations 2.5 24 Exercise 2.2: Deriving the Lorentz transformations Observers in different frames of reference which are in relative motion will tend to disagree about lengths, distances, time intervals, and the synchronization of clocks. Here’s a summary of what we know so far: 1. A single, moving clock ticks slowly so that a time interval ∆t between two events measured by this one clock will be shorter than the time interval ∆t' measured by observers who see that this clock is in motion: ∆t = ∆t ′ 1 − v 2 c 2 . 2. The length L' of a moving object is shorter than the length L of the object as measured in the object’s rest frame: L 1 − v 2 c 2 = L′ . 3. Clocks which are separated by a distance ∆x in their rest frame and are synchronized in their rest frame are seen to be out-of-synch in a frame in which the clocks are moving with speed v by an amount −v∆x c 2 . (The negative sign means that the forward clock reads an earlier time than the aft clock.) Imagine that a pair of events is seen by observers in the two different coordinate systems O and O'. Observers in O (the "unprimed" frame) measure the space-ime intervals between the events to be ∆x, ∆y, ∆z, ∆t while observers in O' measure the interval to be ∆x′, ∆y′, ∆z′, ∆t′. (I’m defining ∆x = x2 – x1, ∆t = t2 – t1, and so forth.) Let’s assume that the origins of O and O' coincide when clocks at the origins of both frames read zero. Frame O' (according to observers in O) is moving with velocity v along the x axis of O as shown in the figure. (Note that the sign of v matters.) O O' y y' v x z x' event 2 (x2, y2, z2, t2) or (x2', y2', z2', t2') event 1 (x1, y1, z1, t1) or (x1', y1', z1', t1') z' The Lorentz transformations allow us to calculate the space-time interval between the two events measured by observers in one frame if we know the interval between the events measured by observers in the other frame and also know the relative velocities of the two frames. They allow us to write ∆x′ as a function of ∆x, ∆t, v, and c and also ∆t′ as a (different) function of ∆x, ∆t, v, and c. (a) Let’s redraw the figure used in the last exercise in order to make the choice of primed and unprimed coordinates agree with the discussion in this exercise. Here’s the redrawn figure: Physics 212, Spring 2003. Unit 2 Special relativity: Non-simultaneity; the Lorentz transformations 2.6 25 event 1 N Nostromo t′ = 0 v flash of light t=0 Six Flags Over τ Ceti iv t=0 ∆x′ = rest length event 2 N Nostromo v∆t t′ = ∆t′ flash of light ∆x = c∆t t = ∆t Six Flags Over τ Ceti iv t = ∆t In the rec facility frame the light beam travels a distance which is the same as the sum of the Lorentz-contracted Nostromo length and the distance traveled by the Nostromo. The equation which says the same thing as this is ∆x = ∆x′ 1 − v 2 c 2 + v∆t . Solve this equation for ∆x' as a function of ∆x and ∆t. Physics 212, Spring 2003. Unit 2 Special relativity: Non-simultaneity; the Lorentz transformations 2.7 26 (b) Since the two events are separated by ∆x′ and ∆t′ in the Nostromo rest frame, we know that ∆t ′= ∆x′ c . We also know that ∆t = ∆x c and that ∆t c = ∆x c 2 . Use these facts to rewrite your answer to the previous exercise as an equation for ∆t', and simplify it as much as possible to express ∆t' as a function of ∆t and ∆x. Discussion You have derived the Lorentz transformations! The frames O and O' correspond to the rest frames of the rec facility and the Nostromo respectively; the space-time separation of any pair of events measured by observers in the unprimed frame can be used to derived the space-time interval between the events that would be measured by observer in the primed frame. Just to make sure we have them written down compactly: define: γ = 1 1 − v 2 c 2 so that ∆x′ = γ ( ∆x − v∆t ) ∆x = γ ( ∆x′ + v∆t ′ ) ∆y′ = ∆y ∆z′ = ∆z ( ∆t ′ = γ ∆t − v∆x c 2 ) ( ) ∆t = γ ∆t ′ + v∆x′ c 2 . If we know the intervals in one frame, we can calculate what they'll be in the other frame. Keep in mind that positive v means that O' is moving to the right, as viewed from O. Note that we can extract all of the quantities we’ve calculated so far using the Lorentz transformations as long as we choose the right pair of events. Keep in mind that almost all measurements require you to know something about space and time. For example, a time interval measured using a single clock is done so that the spatial separation between the “starting” event and “stopping” event used to define the time interval is zero. Physics 212, Spring 2003. Unit 2 Special relativity: Non-simultaneity; the Lorentz transformations 2.8 27 Summary and wrap-up discussion • Time AND space are messed up. 1. rate of passage of time is slowed in a moving frame: ( ∆t = ∆t ′ 1 − v 2 c 2 • • ) 2. simultaneity doesn't work: clocks "at the front" read earlier by v∆x / c than clocks "at the back," where ∆x is the separation between the clocks as measured in their own rest frame. 3. moving objects are shorter: L ′ = L 1 − v 2 c 2 where L is 2 ( ) the rest length of the object. The Lorentz transformations can be used to calculate the intervals ∆x, ∆y, ∆z, ∆t between events observed from one reference frame using the intervals ∆x', ∆y', ∆z', ∆t' between the same events observed from another reference frame. v → c. Comments about homework assigned for next week Please do not ever ever ever search for solutions to assigned problems in texts, lecture notes or problem set solutions from other courses, or solutions distributed in previous offerings of this course! Scout’s honor! Physics 212, Spring 2003. Unit 2 Special relativity: Non-simultaneity; the Lorentz transformations 2.9 28 Reading and homework should be completed by 4pm Friday. Required reading: Notes on Special Relativity, pages as appropriate to our work. Optional reading, just for fun: The Feynman Lectures on Physics (volume I), chapter 15. The Forever War, Joe Haldeman. Avon books (paperback), 1974. This science fiction novel won both the Hugo and Nebula awards. Time dilation plays a role in the messes the characters make of their lives. Problem 0 (5 points): Make your own hardcopies… Print (from the web) your own copy of the solutions to this week’s in-class exercises as well as last week’s problem set and write a (true!) statement on your problem set asserting that you have successfully printed your personal copies. Problem 1 (10 points): Can anything go faster than c? In this problem you'll derive the relativistic velocity addition formula. To replenish the Nostromo's perilously low supply of jell-o, Mrs. Urkin's fast shuttle is launched on a resupply mission. As seen from Nostromo, the shuttle streaks away with constant velocity u in the positive x' direction. At position x1' and time t1' the shuttle strikes a dust mote, vaporizing it. Somewhat later, the shuttle hits a micrometeoroid. N Nostromo U event 1: Mrs. Urkin clobbers a dust mote u ∆x', ∆t' (Nostromo rest frame) N Nostromo event 2: Mrs. Urkin strikes a micrometeoroid U u The light flashes from both events are seen by Nostromo's cook, who calculates (after correcting for the light's propagation time to the ship) that the space-time interval between the events was (∆x', ∆t') = (x2' - x1', t2' - t1'). Not surprisingly, the cook notices that u = ∆x' / ∆t'. The same events are viewed by Mrs. Urkin's personal trainer on board the interstellar recreation facility. He sees the Nostromo travelling with speed v in the positive x direction, and determines that the space-time interval between the two events was (∆x, ∆t) = (x2 - x1, t2 - t1). He concludes that the shuttle craft was moving in the positive x direction with speed ∆x / ∆t. (a) By making use of the Lorentz transformations, calculate ∆x / ∆t in terms of u (the shuttle's velocity as seen from Nostromo), v (Nostromo's velocity, as seen from the rec facility), and c. Physics 212, Spring 2003. Unit 2 Special relativity: Non-simultaneity; the Lorentz transformations 2.10 29 (b) How does your expression for ∆x / ∆t behave in the limit that both u and v are close to c? Problem 2 (10 points): The train-tunnel "paradox" Here's how this one works. We have a tunnel, 800 feet long in its rest frame, with doors on each end which can be used to seal the tunnel. The train is 1,000 feet long in its own rest frame, as shown in the illustration: Tunnel rest frame: The train travels at speed 0.8 c so that its Lorentz-contracted length allows it to fit entirely inside the tunnel. When (synchronized in their rest frame) tunnel clocks by both doors read zero, just as the train is neatly centered inside the tunnel, the doors slam shut, trapping the entire train inside the tunnel. The front of the train crashes through the right-side tunnel door 125 nanoseconds later, but the (closed) left-side door was able to close without interfering with the train. Physics 212, Spring 2003. Unit 2 Special relativity: Non-simultaneity; the Lorentz transformations 2.11 30 Train rest frame: From the train's rest frame, things look rather different: the Lorentz-contracted tunnel rushes towards it as shown in the following figure. The tunnel is much too short to be able to trap the train entirely inside itself, yet both doors will drop, and neither will touch the train (although the front of the train will crash through the already-closed right-side door). How can this be possible? Physics 212, Spring 2003. Unit 2 Special relativity: Non-simultaneity; the Lorentz transformations 2.12 31 Problem 3 (10 points): An introduction to partial derivatives We will begin using calculus to address problems next unit. Here’s a reminder about the definition of an ordinary derivative, and an introduction to partial derivatives. (a) The Urkins are hiking in eastern France’s Jura mountains. They have been told by residents of a nearby village that the altitude above sea level is well-described by the function h(x,y) = 10-7 x3, where x and y are the distances due east and due north (in meters) of the center of town. A plot of the elevation near the town would therefore look something like this: altitude is a function only of x village 100 50 h(x,y) = 10-7 x3 1000 0 500 -50 -100 0 y -500 -500 0 x 500 -1000 1000. (i) The Urkins hike due east from the point (x, y) one morning, stopping for lunch at the point (x + a, y). Write an algebraic expression for their change in altitude, ∆h = h ( x + a, y ) − h ( x, y ) and for the average slope of their path, m = ∆h a . Please simplify your result as much as possible, expressing it as a function of x and a. (ii) Evaluate the average slope in the limit that a → 0 , expressing your answer as a function of x. (Since the definition of the derivative of a function of one variable is df ( x ) dx = lim  f ( x + ∆x ) − f ( x )  ∆x , your answer shouldn’t come as a surprise to you!) ∆x → 0 Physics 212, Spring 2003. Unit 2 Special relativity: Non-simultaneity; the Lorentz transformations 2.13 32 (b) The next day the Urkins are hiking in a region where the altitude is well-described by the function h(x,y) = 10-10 x3y, where x and y are the distances due east and due north of the center of a different town. An elevation plot would therefore look something like the following: altitude as a function of x,y is 10-10 x3y 100 50 altitude 0 500 -50 -100 0 y -500 -500 0 x 500 (i) The Urkins hike due east from the point (x, y) to the point (x + a, y). Write an algebraic expression for their change in altitude, ∆h = h ( x + a, y ) − h ( x, y ) and for the average slope of their path, m = ∆h a . Simplify your result, expressing it as a function of x, y, and a. (ii) Evaluate the average slope in the limit that a → 0 as a function of x and y. You have just evaluated a partial derivative! The definition of a partial derivative of a function of several variables is quite similar to the definition of the derivative of a function of one variable: ∂f ( x, y ) f ( x + ∆x , y ) − f ( x , y ) ∂f ( x, y ) f ( x , y + ∆y ) − f ( x , y ) = lim and = lim . ∆x → 0 ∆y →0 ∂x ∆x ∂y ∆y y constant x constant The machinery is easy to use: just treat all “the other” variables as if they were constants and take the derivative of the function in the usual way. Physics 212, Spring 2003. Unit 2 Special relativity: Non-simultaneity; the Lorentz transformations 2.14 33 Physics 212 Honors topics in electrodynamics Unit 3 A microscopic version of Gauss’ law Spring, 2003 George Gollin University of Illinois at Urbana-Champaign 2003 Physics 212, Spring 2003. Unit 3 A microscopic version of Gauss’ law 3.1 34 Unit 3: A microscopic version of Gauss’ law Recap of last week Introduction: the connection between Coulomb’s and Gauss’ laws The Physics 112 discussion of the electric field and electrostatic potential describes them as quantities which depend on the global distribution of electric charge: V (r ) = E (r ) = Qi ⇒ all charges 4πε 0 r − ri ∑ 1 ∑ 1 Qi all charges 4πε 0 r − ri 2 ρ ( r ′) 1 ∫ 4πε 0 r − r ′ all space r − ri r − ri ⇒ ∫ 1 d3r ′ ρ ( r ′) r − r ′ 4πε 0 r − r ′ 2 r − r ′ all space d3r ′ . By “global” I mean that you’ll need to know where all the charge is in order to calculate the sums or integrals. Don’t be alarmed by the volume integrals—an integral is just a gigantic sum, which is probably why an integral sign is meant to resemble a large letter “S.” It is possible to craft equations which relate characteristics of the fields (or potentials) to the charge distribution using only on local properties of the charge density. (You’ll do that today.) In particular, knowing the charge density at a single point in space lets us know about the spatial derivatives of the electric field at that same point in space. The electric field falls off like 1/r2 while the area of a surface surrounding a finite charge distribution grows like r2. One consequence of this is Gauss’ law: the net flux of electric field through any closed surface depends only on the amount of charge inside the surface. This is another global statement since knowledge of the field over an extended surface is necessary to calculate the flux in order to establish the connection between the electric field and the (extended) charge distribution. Again, it is possible to concoct a local-properties-only formulation of Gauss’ law. Naturally, the electric field is the force per Coulomb felt by a point test charge placed somewhere, while the electric potential is the test charge’s potential energy per Coulomb. Physics 212, Spring 2003. Unit 3 A microscopic version of Gauss’ law 3.2 35 Here’s what you’ll find/work on today: • Verifying that Gauss’ law is true even when the electric field is not perpendicular to the Gaussian surface enclosing a point charge • A microscopic (differential) version of Gauss’ law which depends only on local properties of the electric field and charge distribution. • Another differential equation (Poisson’s equation) which relates the potential to the charge distribution. Exercise 3.1: Coulomb ⇒ Gauss in cylindrical geometry Gauss’ law describes the connection between the flux of electric field lines passing through a closed surface and the net charge contained inside the surface. The idea is relatively simple: electric field lines can only begin on positive charges and can only end on negative charges. If there are no charges inside a closed surface, any field line entering the surface must also exit. If the surface does enclose a net positive charge, more field lines will leave it than will enter. Naturally, the situation is reversed if the surface encloses a net negative charge. The flux of electric field lines passing through a small area patch ∆A depends on how strong the field is, how large the area of the patch is, and how close the electric field lines are to being perpendicular to the surface of the patch. In symbols, ∆Φ = E i ∆A = E ∆A cosθ , where the direction associated with the area patch is along the (outward) normal to the surface of the patch. Summing the flux of electric field over a closed surface involves a sum (an integral) over all area patches which comprise the surface: Φ= ∑ surface ∆Φ = ∑ surface E i ∆A  → ∆A→dA ∫ surface dΦ = ∫ E idA . surface The formal statement of Gauss’ law is ∫ surface E idA = Qenclosed ε0 . It’s easy to evaluate this integral when the surface is a sphere of radius R which is centered on a point charge Q: the integral is just the product of the strength of the electric field at a distance R and the surface area of a sphere, since all the cosines in the dot product are unity: Q Q E idA = E × 4π R 2 = × 4π R 2 = . 2 ∫ 4πε 0 R ε0 surface The dot product inside the integral is important: it is what makes Gauss’ law true regardless of the shape of the bounding surface used in the integral. Physics 212, Spring 2003. Unit 3 A microscopic version of Gauss’ law 3.3 36 To convince yourself (or at least to make it seem more plausible) that the shape doesn’t matter, consider an infinitely long cylindrical surface of radius a surrounding a point charge Q as in the following figure. φ dA x a E θ z Q y The charge is at the origin, so the distance to an area patch on the surface of the cylinder at (r, z, φ) = (a, z, φ) is (a2 + z2)1/2. In addition, the cosine of the angle between the electric field and the normal to the area patch is cos(θ) = a / (a2 + z2)1/2. An area patch of extent dφ, dz has area dA = a dφ dz. For this surface, evaluate the electric field flux through the cylinder, namely ∫ E idA . surface Don’t get spooked by this: a surface integral involves taking a double integral, which can be written this way:   a E idA = ∫ E  dA ∫ 2 2   a +z  surface surface =    Q a   ∫ 4πε a 2 + z 2  a 2 + z 2  dA  surface  0 ( =∫ +∞ −∞ . )  2π     Q a ∫   dz. φ a d   0  4πε 0 a 2 + z 2   a 2 + z 2     ( ) E cos θ dA (A correct calculation will yield the answer Q/ε0. You will probably find the trig substitution z = a⋅tan(α), in combination with the identity 1 + tan2(α) = sec2(α) = 1/cos2(α), to be helpful.) Discussion Physics 212, Spring 2003. Unit 3 A microscopic version of Gauss’ law 3.4 37 Exercise 3.2: A microscopic version of Gauss’ law If a surface encloses a distribution of charge with density ρ ( r ) C/m3, Gauss’ law can be written Q 1 ∫ E idA = enclosed = ∫ ρ ( r ) dV ε0 surface ε 0 volume since the amount of charge contained inside a small volume element dV is ρdV and the integral just sums up the charge contained by all volume elements inside the bounding surface. Let’s develop a microscopic version of this. Consider a small volume which is a parallelepiped centered at (x,y,z) with side lengths dx, dy, dz as shown in the figure. Assume that ρ and E vary slowly over the size of the volume so that the charge inside the volume is . y dz dy x dx z If the volume is small enough, the surface integral can be rewritten as a sum over the faces of the box so that Gauss’ law becomes i =6 ρ ( x, y, z ) dxdydz . ∑ Ei idAi = i =1 ε0 (a) Write an explicit expression for each of the six area patch vectors dAi in terms of the unit vectors x̂ , ŷ , ẑ and the volume’s side lengths dx, dy, dz. Physics 212, Spring 2003. Unit 3 A microscopic version of Gauss’ law 3.5 38 (b) The electric field at the center of one particular face of the box can be written as dx dx dx dx         E  x − , y, z  = Ex  x − , y, z  xˆ + E y  x − , y, z  yˆ + Ez  x − , y, z  zˆ         2 2 2 2 while the field at the center of the opposite face is dx dx dx dx         E  x + , y, z  = Ex  x + , y, z  xˆ + E y  x + , y, z  yˆ + Ez  x + , y, z  zˆ .         2 2 2 2 There are similar expressions for the electric field at the other four faces of the box. Use this fact, and the definition of a partial derivative, to rewrite Gauss’ law for a vanishingly small volume entirely in terms of ρ and partial derivatives of the components of the electric field. (You should obtain ∂E x ( x, y , z ) ∂E y ( x , y , z ) ∂E z ( x, y , z ) ρ ( x , y , z ) + + = .) ε0 ∂x ∂y ∂z In Cartesian coordinates, the definition of the operator ∇ (“del”) is ∇ ≡ xˆ ∂ ∂ ∂ + yˆ + zˆ so ∂x ∂y ∂z that the divergence of any vector field G can be written ∇iG = ∂G x ( x , y , z ) ∂G y ( x, y , z ) ∂G z ( x , y , z ) + + . ∂x ∂y ∂z As a result, the microscopic (differential) version of Gauss’ law can be written compactly as ∇i E = Physics 212, Spring 2003. Unit 3 A microscopic version of Gauss’ law ρ . ε0 3.6 39 Discussion: divergence (and the divergence theorem) • A net “outflow” of electric field lines automatically means that the electric field has positive divergence. A net “inflow” means the field has negative divergence. • The divergence theorem (true for all vector fields): use ∇i E = ∫ E idA = surface ρ to rewrite Gauss’ law as ε0 ρ (r ) ∫ ε 0 dV = volume ∫ ∇i E dV : volume ( ) The flux of field through a closed surface is the same as the integral of the field’s divergence over the volume enclosed by the surface. In general, for any vector field G it is true that ∫ G idA = surface ∫ volume ( ∇iG) dV . Exercise 3.3: Poisson’s equation The equation ∇i E = ρ depends only on the local properties of the charge density and the ε0 derivatives of the electric field. We can work up a similar equation connecting the potential with the charge density with the help of the work-energy theorem. (This only applies to static systems, where all free charge stays put, and all currents are constant!) Recall that the (small) change in an object’s potential energy is ∆U = F i ∆r when it is pushed through a displacement ∆r by a force F . If you are applying the force to overcome the effects of an electric field opposing your efforts, your force will satisfy F = −QE . Imagine that you move the test charge through a displacement ∆r = xˆ ∆x + yˆ ∆y + zˆ ∆z in three steps, as illustrated below. y 3. dz 2. dy Q starts here Q ends here 1. dx x z Physics 212, Spring 2003. Unit 3 A microscopic version of Gauss’ law 3.7 40 The work you do will be ∆U = ∆U1 + ∆U 2 + ∆U 3 with ∆U1 = F i xˆ ∆x = Fx ∆x, ∆U 2 = F i yˆ ∆y = Fy ∆y, ∆U 3 = F i zˆ∆z = Fz ∆z . ∆U1 ∂U ∂U ∂U = Fx , = Fy , = Fz so = Fx , with some thought it should be apparent that ∂x ∂y ∂z ∆x ∂U ∂U ∂U + yˆ + zˆ = ∇U . that F = xˆ ∂x ∂y ∂z Since Because F = −QE and U = QV , we can conclude that E = −∇V (static systems only) . Use this to construct Poisson’s equation, which relates various second derivatives of the potential to the charge distribution, from the differential form of Gauss’ law. ρ ( x, y , z ) ∂ 2V ∂ 2V ∂ 2V . The operator ∇ 2 is called the (You should obtain ∇ V ≡ 2 + 2 + 2 = − ε0 ∂x ∂y ∂z “Laplacian.”) 2 Discussion Summary and wrap-up discussion • • The 1/r2 nature of the electric field leads to Gauss’ law which relates the flux of electric field through a closed surface to the net charge enclosed by the surface. In combination with the connection between the electric field and the electrostatic potential, the microscopic (differential) version of Gauss’ law can be used to produce Poisson’s equation, which relates various second derivatives of the potential with the local charge density. Comments about homework assigned for next week Physics 212, Spring 2003. Unit 3 A microscopic version of Gauss’ law 3.8 41 Physics 212, Spring 2003. Unit 3 A microscopic version of Gauss’ law 3.9 42 Reading and homework should be completed by 4pm Friday. Required reading: Purcell, chapter 1, sections 7 through 10. Chapter 2, sections 1 through 5 and 7 through 12. Note that Purcell uses CGS, instead of MKS (SI), units so there won’t be any 4πε0’s anywhere. Read over (but don't worry about understanding) the Unit 4 material we'll tackle in class next week. Optional reading: The Feynman Lectures on Physics (volume II), chapter 2, sections 1-5 and 7-8; chapter 3, sections 1-3. Comment on notation: In general, I’ll use xˆ , yˆ , zˆ to represent unit vectors parallel to the x, y, and z axes. Problem 0 (5 points): Make your own hardcopies… Print (from the web) your own copy of the solutions to this week’s in-class exercises as well as last week’s problem set and write a (true!) statement on your problem set asserting that you have successfully printed your personal copies. Problem 1 (10 points): This is not why the universe is still expanding Theoretical food chemists in the Urkin's D&HE Research Division are completing an analysis of a hypothetical universe filled with electrically charged mayonnaise. In this alternative reality the electrostatic potential created by the mayo is found to be V ( x, y, z ) = V0 ( sin kx + cos ky ) where k is a constant. (Note that the potential depends only on x and y.) (a) Calculate the electric field E ( x, y, z ) produced by mayonnaise. (b) Calculate the net charge density in Coulombs per cubic meter ρ ( x, y, z ) responsible for creating this potential. Physics 212, Spring 2003. Unit 3 A microscopic version of Gauss’ law 3.10 43 Problem 2 (10 points): At the surface of the FlashOven The electric field inside, and at the external surface of Urkin’s Vulcan FlashOven is accurately described as E ( x, y, z ) = zˆ E0 x 2 yz. The oven is a rectangular box whose sides are bounded by the planes x = 0, x = a, y = 0, y = b, z = 0, z = c . (a) Evaluate the integral ∫ E ⋅ dA over the surface of the oven. surface ( dA points outwards; the six faces have dA = ± xˆ dydz, ± yˆ dzdx, and ± zˆ dxdy. ) (b) Evaluate the integral ∫ ∇ ⋅ E dV over the interior volume of the oven. volume (c) How might the divergence theorem helped you in part (b)? Problem 3 (10 points): The point is… Anthrax, the Urkin’s pet cat, is resting comfortably at the origin after eating an electrically charged canary. He exudes the (radial) electric field E ( x, y , z ) = x xˆ + y yˆ + z zˆ (x 2 + y2 + z2 ) 3 2 where xˆ , yˆ , zˆ are unit vectors parallel to the x, y, and z axes. (Assume this expression is accurate for all points outside the cat.) Prove that the density of electric charge ρ ( x, y, z ) is zero for all points (x, y, z) external to the cat. Physics 212, Spring 2003. Unit 3 A microscopic version of Gauss’ law 3.11 44 Physics 212 Honors topics in electrodynamics Unit 4 Why superposition works; Green’s functions Spring, 2003 George Gollin University of Illinois at Urbana-Champaign 2003 Physics 212, Spring 2003. Unit 4 Why superposition works; Green’s functions 4.1 45 Unit 4: Why superposition works; Green’s functions Recap of last week Introduction: superposition One of the most important/useful/convenient/fundamental properties of the equations which characterize electric and magnetic fields is that all the fields and potentials obey the superposition principle. Imagine that we are able to solve for the electric field (call it E1 ) in a universe which contains nothing but charge Q1 at position r1 and also that we can solve for E2 in a (different) universe which contains nothing but charge Q2 at r2 . Because of superposition we can be sure that the electric field in a universe which contains both charges will be the sum of the fields we found doing our pair of lone-charge-universe field calculations: Eboth charges present = E1 + E2 . Superposition allows us to break a complicated problem up into a number of simple problems which we can solve, then to sum the simple problems’ solutions to generate the solution to the complicated problem. In the two-charge example mentioned above Q1 and Q2 are sources of the electric field. Naturally, they don’t contribute equally to the electric field everywhere: the closer an observer is to one of the charges, the more important that charge’s contribution to the measured electric field will be. If we placed unit charges at r1 and r2 we would need to weight their contributions to the ( field at r by 1 4πε 0 r − ri 2 ) . (For sources which are different from one Coulomb we will also need to scale their contributions according to how much charge each particle carries.) ( The unit-charge weighting function 1 4πε 0 r − ri 2 ) is a Green’s function (named for the British miller George Green (1793-1841), whose hobby was mathematics). A Green’s function is a “helper” function which tells us how important each source’s contribution is when we superpose the effects (fields) of all the individual sources. Physics 212, Spring 2003. Unit 4 Why superposition works; Green’s functions 4.2 46 Discussion: superposition and Poisson’s equation Last week we cooked up a pair of powerful differential equations which can be used (at least in principle) to generate the electrostatic potential (and from it the electric field) if we know the charge distribution: ∇ 2V = − ρ ( x, y , z ) ε0 (with solution V ( r ) = ∫ all space 1 ρ ( r ′) 4πε 0 r − r ′ d3r ′ ) and E = −∇V . The ∇2V expression is Poisson’s equation, named for the French mathematician Simeon Denis Poisson (1781 - 1840). It is, in a sense, the master equation for the electrostatic potential: given ρ, there will always exist a solution to Poisson’s equation which we can construct (perhaps numerically, as opposed to analytically) by calculating an integral. An important property of Poisson’s equation is that the solutions superpose. Imagine that we have already calculated the potential V1 which corresponds to a charge distribution ρ1, and also the potential V2 which corresponds to a different charge distribution ρ2. If we want to know the potential Vsum which corresponds to a charge distribution ρsum = ρ1 + ρ2 all we have to do is add the potentials we previously found: Vsum = V1 + V2. It’s not necessary to crank through an integral over the combined charge distribution ρsum. From Physics 112 you already know about the simplest version of this: a pair of charges q1 and q2 at positions r1 and r2 will yield a potential 1  q1 q2  + V (r ) =  . 4πε 0  r − r1 r − r2  It is obvious that two identical charges (q1 = q2) would not, in general, contribute equally to the potential except at points which are equidistant from both charges. The weighting function which determines the size of the ith charge’s contribution to the overall potential is the (Coulomb Green’s) function 1 ( 4πε 0 r − ri ) . Here’s what you’ll find/work on today: • Proving that potentials which satisfy Poisson’s equation really do obey the principle of superposition. • Investigating superposition for some other systems. Exercise 4.1: Proving that Coulomb potentials really do obey the principle of superposition Physics 212, Spring 2003. Unit 4 Why superposition works; Green’s functions 4.3 47 Poisson’s equation says that ∇ 2V = − ρ ( x, y , z ) 1 ρ ( r ′) 3 d r′ . with solution V ( r ) = ∫ 4πε 0 r − r ′ ε0 all space Imagine that you’ve solved for the potential V1 corresponding to a particular charge distribution ρ1, and also for the potential V2 which corresponds to a different charge distribution ρ2. Prove ρ ( x, y , z ) that ∇ 2Vsum = − sum where Vsum = V1 + V2 and ρsum = ρ1 + ρ2 . ε0 discussion Exercise 4.2: Another superposition proof The net force on a forced, damped harmonic oscillator is given by ma = − kx − β v + f ( t ) . The damping, described by the –βv term to the right of the “=” sign, is velocity-dependent and tends to slow the oscillator down. The external driving force is f(t). We can rewrite the equation this way: d 2x dx + kx = f ( t ) . m 2 +β dt dt Imagine that our oscillator is being driven by a particular driving force f1(t) and that we have already solved for its position as a function of time x1(t). A different oscillator with identical mass and damping is driven by a driving force f2(t), and we have already solved for this oscillator’s position as a function of time x2(t). Prove that when one of the oscillators is subjected to a force fsum(t) = f1(t)+ f2(t), it will move so that its position is xsum(t) = x1(t) + x2(t) . discussion Physics 212, Spring 2003. Unit 4 Why superposition works; Green’s functions 4.4 48 Exercise 4.3: Yet again another proof The “Karma” K ( x) of central Illinois is described by the differential equation A d 3 K ( x) d 2 K ( x) dK ( x) + B +C + DK ( x) = U ( x) . 3 2 dx dx dx The function U(x) (incorrectly described in the Bhagavad-Gita as “the source of all visible kinetic energy in the Universe”) depends on the local weather. Imagine that you have already solved for the Karma on rainy days (Krain(x) corresponding to Urain(x)) and on days with hail (Khail(x) corresponding to Uhail(x)). During an especially unpleasant day in which it rains and hails simultaneously, the central Illinois U(x) function is Urain+hail(x) = Urain(x) + Uhail(x). Does the mid-state Karma function obey the superposition principle? (Is it true that Krain+hail(x) = Krain(x) + Khail(x))? Prove your answer. discussion Exercise 4.4: Yet again another (and not, perhaps, the last) proof The “Karma” K ( x) of Indiana behaves differently from that in Illinois, obeying the differential equation A d 2 K ( x) 2 + B ( K ( x) ) = U ( x) . 2 dx As in Illinois, the function U(x) depends on the weather. Imagine that you have already solved for the Indiana Karma on rainy days (Krain(x) corresponding to Urain(x)) and on days with hail (Khail(x) corresponding to Uhail(x)). Once again, when it rains and hails simultaneously, the function U(x) is given by Urain+hail(x) = Urain(x) + Uhail(x). Does the Indiana Karma function superpose? (Is it true that Krain+hail(x) = Krain(x) + Khail(x))? Prove your answer. discussion Exercise 4.5: Probably the last superposition proof Physics 212, Spring 2003. Unit 4 Why superposition works; Green’s functions 4.5 49 The Iowa Karma function K ( x) obeys the differential equation AK ( x) dK ( x) + BK ( x) = U ( x) . dx As before, U(x) depends on the weather. Imagine that you know Krain(x), corresponding to Urain(x), and Khail(x), corresponding to Uhail(x). When it rains and hails simultaneously, the function U(x) is again given by Urain+hail(x) = Urain(x) + Uhail(x). Does the Iowa Karma function superpose? (Is it true that Krain+hail(x) = Krain(x) + Khail(x))? Prove your answer. discussion: linear vs. nonlinear differential equations Linear equations will obey superposition, while nonlinear ones (often, usually, always??) will not. discussion: cheap audio equipment “Total harmonic distortion:” In 1 2 3 4 5 (kHz) Out 1 2 3 4 5 (kHz) Pure frequency in yields various frequencies out if amplifier is (slightly) nonlinear. Physics 212, Spring 2003. Unit 4 Why superposition works; Green’s functions 4.6 50 Summary and wrap-up discussion • • The principle of superposition generally holds for linear differential equations. The Green’s function for a linear system (which obeys superposition) is the “helper function” which indicates how to sum the effects caused by various sources. For example, the Coulomb potential’s Green’s function is G ( r , ri ) ≡ 1 ( 4πε 0 r − ri ) . Comments about homework assigned for next week Physics 212, Spring 2003. Unit 4 Why superposition works; Green’s functions 4.7 51 Reading and homework should be completed by 4pm Friday. Required reading: The Feynman Lectures on Physics (volume I), chapter 25, sections 1 and 2. Read over (but don't worry about understanding) the Unit 5 material we'll tackle in class next week. Problem 0 (5 points): Make your own hardcopies… Print (from the web) your own copy of the solutions to this week’s in-class exercises as well as last week’s problem set and write a (true!) statement on your problem set asserting that you have successfully printed your personal copies. Problem 1 (10 points): Anthrax’s personal MP3 player sounds funny Anthrax, the Urkins’ unpleasant feline, enjoys listening to music on a defective MP3 player. The relationship between the player’s input signal Ain ( t ) and its output signal Aout ( t ) is described by the equation Aout ( t ) = 10−3 dAin ( t ) . dt Notice that this equation is linear: superposition works for it. (a) The closing note of Grunt Like a Chicken Thief is a sustained 500 Hz triangle wave whose amplitude vs. time is shown in the following figure: V +0.5 V t -0.5 V 1 ms Ain is a 500 Hz triangle wave Draw an accurate plot of the defective MP3 player’s output signal, Aout. (b) Any periodic function can be well-represented by a (possibly infinite) sum of sines and cosines. In particular, the triangle-wave final wail of Grunt Like a Chicken Thief can be represented this way: 4 k =∞ 1 Ain ( t ) = − 2 ∑ 2 cos ( k ⋅ π ×103 t ) . π k =1,3,5,... k Physics 212, Spring 2003. Unit 4 Why superposition works; Green’s functions 4.8 52 An expression of this sort is called a Fourier series. Using this, along with the knowledge gleaned from your answer to part (a), derive an expression for the Fourier series which represents the following 500 Hz square wave: +1 V V t -1 V 1 ms 500 Hz square wave . Notice that superposition allows you to draw conclusions about the behavior of the input and output signals based on an understanding of how the individual Fourier components (the sines and cosines) behave when passed through the defective MP3 player. Problem 2 (10 points): Starburn In this problem you will find it convenient to use (struggle with??) a volume integral in spherical coordinates (r,θ,φ) instead of the more familiar Cartesian (x,y,z) coordinates. Please find Brian or me if you need help setting up the problem’s solution. Here’s how spherical coordinates work: z dV = dr × rdθ × r sinθ dφ r θ dr θ φ y φ dθ dV dφ x The lengths of the sides of an almost-rectangular volume element dV at (r,θ,φ) are dr, r dθ, and r sinθ dφ. Its volume is therefore dV = dr × rdθ × r sinθ dφ = r2 sinθ dr dθ dφ. A volume integral of a function F can be written in the two coordinate systems as Physics 212, Spring 2003. Unit 4 Why superposition works; Green’s functions 4.9 53 rf θ f φ f  y f  x f       = ∫  ∫  ∫ F ( x, y, z ) dx  dy  dz or I spherical = ∫  ∫  ∫ F ( r ,θ ,φ ) r sin θ dφ  rdθ  dr .     zi  ri   θi  φi  yi  xi zf I cartesian You will derive an argument against a model of our universe as infinite and static, in which new stars appear periodically to replace old stars as they burn out. It is thought that there are roughly 1020 stars inside the 15 billion lightyear diameter observable universe. (This number could be wrong by a factor of 1000!) Since a lightyear is approximately 1016 meters, let’s assume that the average density of stars in our universe is 1 star per 1058 cubic meters, and that we can treat a distant star as a point source of light. If all stars shine with the same luminosity as our sun, each one is pouring 3.8×1026 watts of power into space. (Recall that a watt is the same as a joule per second.) Naturally, the intensity of power per square meter a distance r from a star is just the fraction of the area of a sphere of radius r subtended by one square meter, namely 1 ( 4π r 2 ) . (a) Imagine that starlight falls on a sphere of radius 1 π floating in space. (The cross-sectional area of the sphere is a square meter.) Using the assumptions mentioned above, derive an expression for the power, in watts, striking the sphere from all stars within a distance R from the sphere. Since it will be necessary for you to sum the intensity from lots of stars, you will find it convenient to recall how to do an integral in spherical coordinates using the volume element dV = r 2 sin θ drdθ dφ with θ ∈ ( 0, π ) but φ ∈ ( 0, 2π ) . (Neglect light from the sun!) (b) The intensity of sunlight reaching the ground at noon in Champaign on a clear spring day is about 1000 watts per square meter. What value (if any) of R would make the starlight illuminating the sphere in part (a) be just as bright as this? (Express you answer in lightyears, please.) (c) Using the same assumptions, estimate the intensity of starlight at the Earth's surface. Physics 212, Spring 2003. Unit 4 Why superposition works; Green’s functions 4.10 54 Problem 3 (10 points): New technology ice cream Research scientists at Urkin’s D&HE contemplate upgrading their sauerkraut mill so that it can be used to prepare gallons of ever-popular brussels sprout ice cream. The complication is that the ice cream matrix needs to be agitated at frequency 4 Hz in order to set properly, while the brussels sprouts need to detect vibrations at 3 Hz in order to develop the correct flavor. Can the modified kraut mill handle the task? This is, the scientists, realize, a problem of superposition. Will the simultaneous installation of new 4 Hz and 3 Hz drivers in the kraut mill manage to deliver both frequencies to the ice cream under preparation? Sophisticated computer simulations of the sauerkraut mill show that the mixing amplitude x ( t ) delivered to the ice cream in the mill’s crucible by a single driver supplying force f ( t ) obeys one of the following equations: (a) (b) (c) d 4x d 2x + B + C = f (t ) dt 4 dt 2 d 2x dx A 2 + Bx + Cx 2 = f ( t ) dt dt 3 d x d 2x dx A 3 + B 2 + C + Dx = f ( t ) . dt dt dt A The coefficients A, B, C, and D are constant. The researchers realize that the question is this: when two drivers exert a combined force f ( t ) = f 4 Hz ( t ) + f 3 Hz ( t ) will the response be the same as the sum of the single-driver responses (so that x ( t ) = x4 Hz ( t ) + x3 Hz ( t ) )? If this is the case, the sauerkraut mill upgrade will work nicely. Which (if any) of the possible response functions (a), (b), and (c) will allow a successful upgrade? Please prove, or otherwise justify, your answers. Physics 212, Spring 2003. Unit 4 Why superposition works; Green’s functions 4.11 55 Physics 212 Honors topics in electrodynamics Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé Spring, 2003 George Gollin University of Illinois at Urbana-Champaign 2003 Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.1 56 Unit 5: Pinwheels (curl and the vector potential); le souvenir du temps passé Recap of last week Introduction: Does it spin? We’ve been looking at some of the characteristics of static electric potentials and fields with the help of a variety of tools such as Poisson’s equation. As you’ve seen in Physics 112, the connection between the electric potential and electric field is closely tied to the idea that the work done moving a charge from one place to another is independent of the path taken. If our charges are not static (in particular, if they are accelerating) all bets are off. We’re going to investigate another kind of differential operator today: the curl of a vector function. Curl will prove useful when we discuss systems in which currents are flowing, and when we investigate (derive/discover) the existence of electromagnetic radiation. We will also look at a vector analog of the scalar potential. This is less mysterious than it sounds: the scalar potential at r created by a point charge q at position r ′ moving with velocity v is approximately V (r ) ≈ 1 q 4πε0 r − r ′ while the vector potential A is approximately A( r ) ≈ 1 q µ0 q v = v 2 4πε0c r − r ′ 4π r − r ′ since µ 0 = 1 . ε 0c 2 . (There are some subtleties concerning what we need to do to make these equations exactly true.) Here’s what you’ll find/work on today: • properties of the electric field that are impossible to find in fields generated by static charges • curl and divergence of electric fields • Stokes’ Theorem • The vector potential • Time retardation’s influence on the vector potential Exercise 5.1: Pinwheel statics (or should we say dynamics?) Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.2 57 Imagine we make a tiny pinwheel with point charges q on each of its four blades as drawn in the following figure. Electric field lines in the vicinity of the pinwheel, caused by (static) charges placed elsewhere are shown in the diagram. The pinwheel’s center bearing is kept fixed. Let’s see if it is possible for the electric field to cause the pinwheel to spin faster and faster. q y ∆y δ = 2 2 q q q ∆x δ = 2 2 z pinwheel center at (x,y) x ∆x ∆y ∆x ∆y , y, 0) , ( x, y + , 0) , ( x − , y, 0) , ( x, y − , 0) . 2 2 2 2 Recall that torque is r × F . Assuming that the pinwheel’s arms are of equal lengths (so that ∆x = ∆y = δ), write an expression for the torque on the pinwheel in terms of the x and y components of the electric field. (Assume the z component of the electric field is zero.) (a) The charges are at positions ( x + Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.3 58 (b) If you haven’t already done so, transform your answer to part (a) into an expression involving partial derivatives of the fields by taking the limit that ∆x and ∆y go to zero. discussion You should end up with something like this: τ = q δ 2  ∂E y ∂E x    zˆ . ∂y  2  ∂x y If our pinwheel also had had a pair of arms parallel to the z axis, as drawn here, we could have calculated torque components along all three axes to find  ∂E y ∂Ex   ∂Ez ∂E y   ∂Ex ∂Ez  zˆ +   xˆ +  ∂y  ∂z  ∂x  ∂z  ∂y  ∂x τ =  ≡ 2   δ ˆ y     2 δ2 x z (∇ × E ) 2 . ∇ × E is called the “curl of E .” If the electric field has a nonzero curl, it’ll make the pinwheel spin. How might such electric fields look? Here are two examples. Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.4 59 Exercise 5.2: Can the electric field from a charge distribution do this? If no charges are moving, we can use Poisson’s equation to calculate the potential, and then take its gradient to solve for the electric field: ∇ 2V = − ρ ( x, y , z ) 1 ρ ( r ′) 3 d r ′ , and E = −∇V . so that V ( r ) = ∫ 4 r r πε − ε0 ′ 0 all space ( ) When the electric field is the gradient of a scalar field is it possible for ∇ × E = −∇ × ∇V to be non-zero? (Prove your answer.) Exercise 5.3: What’s the divergence of a curl? Let’s say we find a different vector field (call it A ) whose curl is non-zero. Can the divergence of the curl of A ( ∇i ∇ × A ) also be different from zero? (Prove your answer.) ( ) discussion Path independence, work done along a closed path, line integrals, and curl... Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.5 60 Exercise 5.4: Line integrals around microscopic loops; curl Imagine that an electric field (with non-zero curl) does work moving a charged particle counterclockwise around a tiny rectangular loop, as shown in the diagram. The midpoints of the ∆x ∆y ∆x ∆y four sides are at positions ( x + , y, 0) , ( x, y + , 0) , ( x − , y, 0) , and ( x, y − , 0) ; 2 2 2 2 the work done by the electric field is ∫ qE idl . loop y ∆y ∆x Rectangular loop is centered at (x,y) z x (a) Calculate the work done by the electric field, and then re-express it in terms of ∆x, ∆y, and various partial derivatives of the electric field. (You should assume that the electric field along each side of the loop is well-represented by its value at the midpoint of the side, and write the field explicitly in terms of its x and y components: E ( x, y, z ) = Ex ( x, y, z ) xˆ + E y ( x, y, z ) yˆ .) (b) Re-express your answer in terms of ∇ × E if you haven’t already done so. Discussion dA vs. dA ; n̂ (normal to the loop via the right hand rule). Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.6 61 Exercise 5.5: Line integrals around lots of microscopic loops Now imagine that we tile part of the x-y plane with several small loops as shown below. Prove that the work done by the field when a charge moves around the outer perimeter of the tiled region is equal to the sum of the work done moving a charge around each of the small loops: ( ∑ ∫ small loops i th loop ) qE idl = ∫ big loop qE idl . ∆x y z ∆y 1 2 3 4 x Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.7 62 Discussion: Stokes’ theorem From an earlier exercise, you should have concluded that ∫ = E idl tiny loop ( ∇ × E ) inˆ dx dy = ( ∇ × E ) idA where n̂ is a unit vector normal to the surface of the loop (its direction is given by the right hand rule) and dA = n̂ dx dy . We can use this to rewrite ( ∑ ∫ small loops i th loop ) E idl = ∫ big loop E idl as ∑ small loops ( ∇ × E ) idA i ⇒ ∫ ( ∇ × E ) idA surface surrounded by big loop = ∫ E idl . big loop This is Stokes’ Theorem, named for the Irish mathematical physicist George Gabriel Stokes (1819-1903); it is true for any vector field. Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.8 63 Discussion: the Helmholtz theorem Suppose we are told what the divergence and curl are for some vector function F ( x, y, z ) , but don’t know the explicit form of F itself: ∇ i F ( x , y , z ) = D ( x, y , z ) and ∇ × F ( x, y , z ) = C ( x, y , z ) . (Naturally, ∇iC will be zero since the divergence of a curl is always zero.) Is there a general technique we can use to determine F if we know C and D? (Being told that ∇i E = ρ ε 0 and ∇ × E = 0 for a static charge distribution is an example of this.) According to something called the Helmholtz theorem, we can always write any well-behaved vector function as the sum of a gradient and a curl as long as C and D go to zero sufficiently rapidly at large distances. The exact form is  1 F ( r ) = −∇    4π  1 D ( r′ ) 3  ′ + ∇ × d r   ∫ r − r′   all space   4π C ( r′ ) 3  ∫ r − r′ d r′  . all space  If you use this to calculate something, keep in mind that the partial derivatives in the gradient and curl are taken with respect to x, y, z and not with respect to x′, y′, z ′ . As a result, terms which depend only on the primed variables act like constants when you evaluate the partial derivatives. Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.9 64 Discussion: The vector potential The scalar potential for a stationary point charge at position r ′ is V ( r ) = 1 q . 4πε0 r − r ′ If we have a continuous distribution of static charge so that the amount of charge contained in a small volume dV is ρ dV , we can calculate the potential at r by doing an integral ( d 3 r ′ is just a volume element): V (r ) = 1 ∫ ρ ( r ′) 4πε 0 r − r ′ all space d 3r ′ . Are there ever times when we might want to superpose something that is a vector instead of a scalar? Certainly! It is possible to determine the electric field by calculating the gradient of the potential; since the potential obeys the superposition principle when we have several charges in our universe, the electric field also obeys superposition. We know that the electric field at r caused by a small amount of stationary charge ρ dV at r ′ is 1 ρ dV r − r ′ . E (r ) = 4πε 0 r − r ′ 2 r − r ′ (The ( r − r ′ ) r − r ′ piece is just a unit vector pointing away from the charge.) As a result, we can calculate the electric field (without referring explicitly to the potential) this way: E (r ) = ∫ all space 1 ρ ( r ′) r − r ′ 4πε 0 r − r ′ 2 r − r′ d 3r ′ . ( d 3 r ′ is just the volume element dV ′ .) An integral is really just a big sum. The presence of the vector’s direction as part of the integrand may make the integral harder to evaluate, but that’s just a practical matter-- it doesn’t make the integral’s meaning (more??) obscure. Remember that we're only dealing with the case right now that the charge density is unchanging: the number of Coulombs per cubic meter at a particular point never changes, no matter how long we wait. We can have constant ρ when charges are moving: as long as the amount of charge dq = ρdV flowing into a small volume element equals the amount flowing out, we'll never have a change in the amount of charge contained by dV. Here’s another quantity that has a direction associated with it. Let's say our charge distribution is swirling around so that the charge dq = ρ ( x, y, z ) dV at the point (x,y,z) has velocity v ( x, y, z ) . By the way, ρ ( r ) v ( r ) is called the current density J ( r ) : it corresponds to the Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.10 65 number of amperes per second per square meter flowing through a surface at r which is perpendicular to J ( r ) . I'm going to impose the constraint that J ( r ) never changes with time, even though its value can differ from place to place. For grins, let’s define the vector potential associated with this sort of current flow as follows: A( r ) ≡ 1 1 J ( r ′) 3 µ0 J ( r ′ ) 3 d r = ′ 2 ∫ 4πε 0 r − r ′ ∫ 4π r − r ′ d r ′ c all space all space since µ0 = 1 . ε 0c 2 Keep in mind that this equation is really shorthand for three distinct equations: Ax ( r ) = µ0 J x ( r′) 3 µ0 J y ( r′) 3 µ0 J z ( r′ ) 3 d r′, Ay ( r ) = ∫ d r ′, Az ( r ) = ∫ d r′ . 4π r − r ′ 4π r − r ′ 4π r − r ′ all space all space all space ∫ Of what use could this possibly be? Think about the scalar potential: the reason that it is useful to work with the function 1 ρ ( r ′) 3 V (r ) = ∫ d r′ 4πε 0 r − r ′ all space is that it is the solution to Poisson’s equation ∇ 2V = − ρ ( x, y , z ) . ε0 Imagine that we encounter an equation which resembles Poisson’s equation, except that the function of x,y,z on the right side is f ( x, y, z ) = ρ ( x, y, z ) vx ( x, y, z ) instead of ρ ( x, y, z ) , and the constant on the right side is -µ0 instead of -1/ε0. Our equation would be ∇ 2 Ax = − µ0 f ( x, y, z ) = − µ0 ρ ( x, y, z ) vx ( x, y, z ) . with solution Ax ( r ) = µ0 f ( r ′ ) 3 µ0 ρ ( r ′ ) v x ( r ′ ) 3 d r′ = ∫ d r′ . 4π r − r ′ 4π r − r′ all space all space ∫ (This would correspond to a charge distribution ρ ( x, y, z ) swirling around, with the velocity of the charge differing from place to place, so that the “local velocity” of charge at ( x, y, z ) is vx ( x, y, z ) . Neither function is time dependent.) Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.11 66 If we encounter similar equations involving the y and z velocity components, we could multiply both sides of each equation by the appropriate unit vector, then add them: ∇ 2 Ax xˆ = − µ0 ρ ( x, y, z ) vx ( x, y, z ) xˆ   ∇ 2 Ay yˆ = − µ0 ρ ( x, y, z ) v y ( x, y, z ) yˆ   ∇ 2 Az zˆ = − µ0 ρ ( x, y, z ) vz ( x, y, z ) zˆ  ⇒ ∇ A = − µ ρ ( x, y , z ) v ( x, y , z ) . 2 0 Why might we ever bump into an equation of this sort? How could something as abstract as the vector potential possibly matter? Who knows? (Who cares?) In any event, if for some reason we do encounter this ∇2 A equation, we’ll already know how to solve it because of our experience with Poisson’s equation: µ0 ρ ( r ′ ) v ( r ′ ) 3 A( r ) = ∫ d r′ . − 4 π r r ′ all space (There actually is a good reason for messing with the vector potential: the magnetic field is exactly the same as the curl of the vector potential: B = ∇ × A . We’ll deal with this later.) We might want to discuss the vector potential for a point charge in motion, but this sort of charge distribution does not satisfy the constraint that ρ ( x, y, z ) is constant. (As the point charge moves, the charge density changes a great deal at each point the charge occupies during its travels.) However, as long as the charge is moving slowly compared to the speed of light, it's not a bad approximation to use A( r ) = µ0 ρ ( r ′ ) v ( r ′ ) 3 µ0 qδ ( r ′ − rq ( t ) ) v ( t ) 3 µ qv ( t ) d r′ ≈ ∫ d r′ = 0 r − r′ r − r′ 4π 4π 4π r − rq ( t ) all space all space ∫ where rq ( t ) and v ( t ) represent the charge's position and velocity. Exercise 5.6: Sketching the vector potential A slowly moving point charge is passing through the origin at time t = 0 with constant velocity v = v0 zˆ . Ignoring any effects associated with the difference between t and t ′ mentioned earlier, draw a reasonably good sketch of the vector potential associated with this charge’s behavior at t = 0. Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.12 67 Exercise 5.7: Sketching another vector potential Two slowly moving point charges are passing through the points ( x, y, z ) = (0, −1,0) and ( x, y, z ) = (0, +1,0) with constant velocities v = v0 zˆ . Ignoring any effects associated with the difference between t and t ′ mentioned earlier, draw a reasonably good sketch of the vector potential at t = 0. Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.13 68 Exercise 5.8: Sketching yet again another vector potential Two slowly moving point charges are passing through the points ( x, y, z ) = (0, −1,0) and ( x, y, z ) = (0, +1,0) with constant velocities v = v0 zˆ and v = −v0 zˆ respectively. Ignoring any effects associated with the difference between t and t ′ mentioned earlier, draw a reasonably good sketch of the vector potential at t = 0. Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.14 69 Discussion: retardation The correct expressions for the scalar and vector potentials include the fact that an observer at a position r sees the charge as it was at an earlier time: the information about the position of the charge takes a finite amount of time to propagate from the charge to the observer. (We see the sun as it was eight minutes ago; we see Alpha Centauri as it was 4.35 years ago.) If the charge density and/or current density changes with time, the integrals used to calculate the potentials will need to take this into account. If the observer at position r (at observer-time t) sees charge at position r ′ , the time at which the charge was really at that point was at a time earlier than t by ∆t = t − t ′ = r − r ′ c . We need to modify our equations like this: V (r ) = ∫ all space A( r ) = 1 ρ ( r ′) 4πε 0 r − r ′ d 3r ′ ⇒ V (r ,t) = ∫ all space 1 ρ ( r ′, t − r − r ′ c ) 4πε 0 r − r′ d 3r ′ µ0 J ( r ′ ) 3 µ 0 J ( r ′, t − r − r ′ c ) 3 , ⇒ = d r A r t d r′ ′ ( ) ∫ 4π r − r ′ ∫ 4π r − r′ all space all space What happens to the potentials from a finite charge distribution with total charge Q when we 1 Q include retardation? It's tempting to think that V and A will satisfy V ( r , t ) ≈ 4πε 0 r − rQ ( t ′ ) and A ( r , t ) ≈ µ0 Qv ( t ′ ) if we're far from the charge. (The retarded time t ′ satisfies 4π r − rQ ( t ′ ) t ′ = t − r − rq ( t ′ ) c ). However, this is wrong! The problem arises because Q ≠ ∫ ρ ( r ′, t − r − r ′ c ) d 3r ′ . As we integrate over the volume all space containing the charge cloud, the integral passes through different regions at different times due to retardation. (It will pick up contributions to V from distant regions further in the past than from close regions.) Here's a concrete example. Imagine that a small charge, shaped in its rest frame like a tiny cube with side a, is moving towards an observer. The observer will evaluate the potential (scalar or vector) caused by this charge as a sum of contributions from small slices in space as the cube moves towards him, as shown in the figure. Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.15 70 a 1 − v2 c2 1− v c 29 1 3 5 7 9 30 2 4 6 8 a v distant observer a 1 − v2 c2 cube is here when information from first slice reaches observer cube is here when information from last slice reaches observer The observer will take a "snapshot" of the amount of charge filling different regions of space and will calculate the net potential as the sum of the contributions from all the small regions of space in his snapshot. Because of retardation, the "local time" (the retarded time) in each region of space in the snapshot will be different: the further away a region is, the earlier in time he'll be seeing it. The slice of space labeled "1" in the observer's snapshot just happened to be holding the back of the Lorentz-contracted cube. Since the cube is moving, the front of the cube just happened to be occupying slice 30 at the correct time for it to appear in the snapshot: slice 30 is seen at a less retarded time than slice 1 since it is closer to the observer. As far as the observer is concerned, the potential must be the same as it would be with slices 1 through 30 all containing electric charge! (If the charge were moving away from the observer, there would seem to be less charge there.) With some algebra, it's easy to show that the length of the region in space which seems to hold charge is a factor of 1 (1 − v c ) longer than the side of the charge which is parallel to its line-ofsite to the observer. If the charge were not moving directly towards/away from the observer, the correction would involve the component of velocity parallel to the observer's line-of-site: 1 (1 − v ⋅ εˆ c ) where εˆ is the unit vector from the object to the observer, as shown in the following figure. Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.16 71 line-of-sight to observer observer εˆ v As a result, our expressions for the potentials associated with a point charge need to be written V (r ,t) = Q 1 × 4πε 0 r − rQ ( t ′ ) (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c ) A( r , t ) = µ0 Qv ( t ′ ) 1 × . 4π r − rQ ( t ′ ) (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c ) and 1 These are know as the Liénard-Wiechert potentials. (A.Liénard, 1898; E.Wiechert, 1900.) Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.17 72 Exercise 5.9: Retardation Consider a point charge which oscillates along the z axis so that its position is ( x, y, z ) = (0,0, a sin(ω t )) . The amplitude and frequency of its motion are a=0.001 feet and ω=2π×109 radians per second (so that the period of its motion is one nanosecond.) Including effects associated with the finite speed of light (c ≈ 1 foot per nanosecond), sketch how the vector potential looks along the line y = z = 0 within a few feet of the origin. (Note that v ⋅ εˆ ≈ 0 so you can ignore the 1 (1 − v ⋅ εˆ c ) correction.) Discussion Summary and wrap-up discussion • • • • • • properties of the electric field that are impossible to find in fields generated by static charges curl and divergence Stokes’ Theorem The vector potential Time retardation’s influence on the vector potential Why did we do this, anyway? Comments about homework assigned for next week Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.18 73 Reading and homework should be completed by 4pm Friday. Required reading: Purcell, chapter 2, sections 13 - 16; chapter 6, section 3. The Feynman Lectures on Physics (volume II), chapter 21, section 5. Read over the remaining Unit 5 material we didn’t get to in class this week. Also read over (but don't worry about understanding) the Unit 6 material we'll tackle next week. Optional reading: The Feynman Lectures on Physics (volume II), chapter 14. Problem 0 (5 points): Make your own hardcopies… Print (from the web) your own copy of the solutions to this week’s in-class exercises as well as last week’s problem set and write a (true!) statement on your problem set asserting that you have successfully printed your personal copies. Problem 1 (10 points): Anthrax’s litter box is exposed to an electric field Mrs. Urkin creates an E field across the top Anthrax’s cat box in order to lay claim to the invention of the field of electroscatology. (The top of the cat’s litter box is in the x-y plane at z = 0 with 0 ≤ x ≤ 1, 0 ≤ y ≤ 1. ) She experiments with different field configurations as described below. (a) The first electric field configuration is E ( x, y, z ) = xyz ( xˆ + yˆ + zˆ ) at the top of the aquarium. Is it possible for this particular electric field to be the gradient of a scalar potential V ( x, y, z ) ? Be sure to prove, or otherwise justify your answer. (b) The next electric field configuration is E ( x, y, z ) = yz xˆ + zx yˆ + xy zˆ at the top of the aquarium. Can this electric field be the gradient of a scalar potential? Prove your answer. (c) Another configuration is E ( x, y, z ) = x 2 xˆ - y 2 yˆ . Can this electric field be the gradient of a scalar potential? Prove your answer. (d) The final field configuration is E ( x, y, z ) = y 2 xˆ - x 2 yˆ . Can this electric field be the gradient of a scalar potential? Prove your answer. Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.19 74 Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.20 75 Problem 2 (10 points): Being clever may help you avoid doing the line integrals Mr. Urkin fabricates Christmas presents for his nephews from electrically charged marbles and polyvinyl chloride sewer pipes as illustrated in the figure. One of the “toys” comprises a square loop of (frictionless) sewer pipe in which a marble rolls. Another is a circular loop of pipe, also frictionless, in which a second marble rolls. Each marble carries charge q. 1 meter 1 meter plastic pipe plastic pipe y y x 1 meter charged marble x charged marble (a) Placed flat on the floor during tests with its center at ( x, y ) = ( 0, 0 ) and its sides parallel to the x and y axes, the square loop is exposed to an electric field E ( x, y, z ) = x yˆ . (The floor is in the plane z = 0.) Assuming the marble does not lose any energy as it rounds the corners, calculate the change in its kinetic energy after the field drives it once around the loop. (b) The round loop is tested in a similar fashion: it is placed in the same electric field, with its center at the origin. How much kinetic energy does the marble gain traveling once around the loop? Problem 3 (10 points): Vector potential sketch Read through the in-class material about retardation, digesting it as best as you can (see me with questions) and then do Unit 5’s in-class exercise 5.9. Physics 212, Spring 2003. Unit 5 Pinwheels (curl and the vector potential); le souvenir du temps passé 5.21 76 Physics 212 Honors topics in electrodynamics Unit 6 The origin of the magnetic field as a consequence of special relativity Spring, 2003 George Gollin University of Illinois at Urbana-Champaign 2003 Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.1 77 Unit 6: The origin of the magnetic field as a consequence of special relativity Recap of last week Introduction: magnetic fields are just electric fields seen from a different perspective. Magnetic fields and forces exist because of Coulomb’s law and special relativity. Though observers in one frame may determine that a current-carrying wire is electrically neutral, observers in another frame might conclude that the wire does carry a net electric charge. We’ll derive this today, for the particular case of motion parallel (or antiparallel) to the current. You’ll slog through much less vector calculus today than last week! Here’s what you’ll find/work on today: • analyzing the charge density in a wire loop which suddenly finds itself carrying a current, according to observers in two different frames of reference • describing how electric fields from a static charge transform when changing to a frame in which the charge is in motion Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.2 78 Exercise 6.1: Whap! They’re moving! A rectangular loop is made from a thin filament of superconducting wire as drawn below. Initially no current flows in the wire; there are as many (positive) metal ions in the wire’s crystal lattice as there are conduction electrons. The spacing between adjacent electrons is a while the spacing between positive metal ions is b. (Naturally, since the wire carries no net charge, a = b.) before... ...after synchronized clocks wire loop u electron flow u metal ions electrons b a When all (synchronized) clocks in the lab frame read zero, all the electrons are suddenly accelerated from rest to a speed u so that the flow of electrons in the wire loop is in a counterclockwise sense. (a) According to observers in the lab frame (the rest frame of the still-stationary metal ions), what is the spacing between adjacent (but now-moving) electrons? Discussion ...but what about Lorentz contraction? Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.3 79 (b) An observer in a different frame of reference sees the wire loop glide past with speed v, moving to the left as shown in the figure. When the observer’s clock reads zero, clocks close to the right-most edge of the (moving) wire loop are also seen to read zero. v Sketch the readings on the other (moving) clocks’ faces at the instant the right-side clocks read zero. discussion Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.4 80 (c) Keep in mind that an electron is accelerated when the lab-frame clock next to it reads zero. From the perspective of the observer who sees the wire loop moving to the left, the densities of electrons and metal ions are identical before any of the electrons begin circulating in the wire loop. Describe the relative densities of electrons and metal ions in each of the four sides of the loop after all the electrons have been set in motion. It may help to think in terms of which electrons begin moving when. (d) What would happen to a test charge, stationary in the frame of the observer who sees the wire loop moving to the left, if the test charge were placed close to the bottom side of the loop? How would this charge behave when seen from the rest frame of the wire loop? discussion Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.5 81 Exercise 6.2: Quantitatively, now... Let’s consider the bottom leg of the wire loop now, after the electrons have started moving. You should have concluded that the electron spacing, viewed from the lab frame (the rest frame of the wire), does not change after the electrons are accelerated. (How could it possibly change? In this frame all electrons started moving simultaneously, so there was never a time interval during which one electron was moving but its neighbor was not.) a metal ions u a electrons (a) Calculate (in terms of a and u) the spacing between electrons in the electrons’ rest frame. (Even though u c for realistic currents you’ll need to consider the effects of special relativity.) n ( n − 1) 2 n ε + ε + … 1! 2! (This works even when n isn’t an integer, and is useful for approximations.) Some examples for −1 2 ε 1 : 1 + ε ≈ 1 + ε 2 ; 1 (1 + ε ) ≈ (1 − ε ) ; 1 1+ ε = (1 + ε ) ≈ 1 − ε 2 . In these exercises you’ll (b) Recall that the binomial expansion for (1 + ε ) is (1 + ε ) = 1 + n n only need to calculate things to second order accuracy in any of the velocities present in your equations (terms proportional to v 3 , u 3 , uv 2 ... can be ignored). Use the binomial expansion to calculate a version of the relativistic velocity addition formula which is accurate to second order in the velocities. Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.6 82 (c) In another frame of reference the wire is seen to move to the left at speed v c . (By this I mean that the metal lattice is moving with speed v). How fast are the electrons moving in this frame? (Please use the second-order expression you just derived.) (d) Calculate the spacing between metal ions in the frame in which the wire is moving. Do the same for the spacing between electrons in this frame. (e) Use the binomial expansion to generate an approximate expression for the spacing between adjacent metal ions in the frame with the wire moving. Please include all terms up to second order in the ratios of small velocities (u and v). Do the same thing for the electrons. (f) Calculate the apparent charge density in the wire in terms of the fundamental charge (e), a, u, v, and c. Verify that your expression yields zero if v = 0 or u = 0. Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.7 83 Discussion Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.8 84 Exercise 6.3: Real numbers Let’s calculate what happens in a real wire. Imagine that we have a 1000 ampere current flowing in a heavy copper cable with crosssectional area 1 cm2. The density of copper is about 9 grams per cm3 while the mass of a single copper atom is approximately 10-22 grams. As a result, every cubic centimeter of copper cable holds 9 × 1022 copper atoms. Each copper atom gives up two conduction electrons so that there are 1.8 × 1023 mobile electrons per cm3 of cable. The charge on each electron is 1.6 × 10-19 Coulombs. (a) How fast are the electrons moving, on average? (b) How large is the electric field generated by the (charged!!) wire at a distance of 10 centimeters according to a test charge which observes the wire moving to the left with speed 100 m/s? (Use Gauss’ law!) Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.9 85 (c) If the test charge is an electron (with mass 9.1 × 10-31 kg), how large is the acceleration it experiences due to the passage of the current-carrying wire? (d) In the wire loop’s rest frame the charge moves with initial velocity 100 m/s parallel to the wire. Its acceleration is nearly the same in this frame. What is the radius of curvature of its path at the instant that it is moving parallel to the wire? Discussion Exercise 6.4: How the fields transform (let’s call this one “optional” in case we don’t have enough time) Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.10 86 Relativistic effects will also deflect a moving test charge which heads directly towards the current-carrying wire. Let’s investigate this now, starting with the fact that Gauss’ law is always true, even for charges in motion. The flux of electric field through a closed surface is always ∫ E idA = Qenclosed ε0 , even when the enclosed charge is in motion. (An experimental proof surface of this: the electric field far from a hydrogen atom is nearly zero, even though its proton is nearly at rest, while its electron is moving with v ∼ c 137 .) We’ll need to determine how the electric field at a point in space transforms as we shift frames of reference. Let’s say that observers in two frames of reference, O (the "unprimed" frame) and O', measure the electric field at the origins of their respective coordinate systems just as the origins coincide. Frame O' (according to observers in O) is moving with velocity v along the x axis of O as shown in the figure. In the figure, the field measurements have already been performed. O O' y y' v x z z' x' electric field lines Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.11 87 (a) The field lines have been created by a pair of capacitor plates which are stationary in O as shown in the next figure. The surface charge densities are ±σ . O O' y −σ y' v x z x' capacitor plates z' +σ What are the surface charge densities σ ′ measured by observers in O'? (b) What is E ′ E ? Discussion Fields which are perpendicular to the direction of motion... (Note: in O there is no magnetic field.) Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.12 88 (c) In a different universe, field lines have been created by a pair of capacitor plates which are stationary in O as shown, but whose faces are perpendicular to the direction of motion of O'. As before, the surface charge densities are ±σ . capacitor plates O What are the surface charge densities σ ′ measured by observers in O'? O' y y' +σ x z −σ v x' z' (d) What is E ′ E ? Discussion Fields which are parallel to the direction of motion... (Note: in O there is no magnetic field.) Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.13 89 Discussion You should have concluded that E ′ = E and E⊥′ = ( ) 1 − v 2 c 2 E⊥ = γ E⊥ where E and E⊥ represent electric field components parallel to and perpendicular to the relative velocities of the two frames. An important point: this is only true if there is no magnetic field in the frame in which the electric charge (the surface charge on the capacitors) is at rest. Naturally, this applies to electric fields created by any charge distribution, not just the sheets of charge on capacitor plates. In particular, the electric field around a moving point charge becomes stronger in the plane perpendicular to its direction of travel. If we use the density of field lines to represent the strength of the electric field, we’ll find our field lines seem to bunch up like this: v charge, at rest in O charge, as seen from O'. To a test charge moving towards the current-carrying wire, the ions’ and electrons’ velocities will appear as follows (in the rest frame of the test charge): metal ions electrons test charge Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.14 90 (e) On the following diagram (which shows one pair of conduction electrons and metal ions, placed symmetrically about the test charge), sketch the field lines at the test charge associated with the two electrons. (In the test charge’s rest frame, the wire is moving towards the test charge.) test charge Discussion Direction of the force, and comparison with qv × B ... Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.15 91 Summary and wrap-up discussion • Magnetic force is just a consequence of relativity: observers in different frames do not agree about charge densities! Comments about homework assigned for next week Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.16 92 Reading and homework should be completed by 4pm Friday. Required reading: Purcell, chapter 5, sections 4 - 6; chapter 5, section 9. Read over (but don't worry about understanding) the Unit 7 material we'll tackle next week. Optional reading: The Feynman Lectures on Physics (volume II), chapter 26. The Feynman Lectures on Physics (volume II), chapter 42. (Some of these problems require you to know how to multiply matrices, and to remember that the transpose of a matrix is what you get when you swap its rows and columns. Please see me for a quick lesson if you've never learned about this.) Problem 0 (5 points): Make your own hardcopies… Print (from the web) your own copy of the solutions to this week’s in-class exercises as well as last week’s problem set and write a (true!) statement on your problem set asserting that you have successfully printed your personal copies. Problem 1: More on Lorentz transformations Imagine observers in the frame O see another frame O' moving in the positive x direction with velocity v so that the origins of the two coordinate systems overlap at t = t' = 0. We can use the Lorentz transformations to relate the space-time coordinates of an event in one frame with those in another frame. Defining β ≡ v c and γ ≡ 1 1 − β 2 , the Lorentz transformations become x′ = γ ( x − β ct ) , y′ = y, z ′ = z, t ′ = γ ( t − β x c ) x = γ ( x′ + β ct ′ ) , y = y′, z = z′, t = γ ( t ′ + β x′ c ) . Keep in mind that v is the velocity of the primed frame's coordinate axes as seen from the unprimed frame. We can write the Lorentz transformations in matrix form this way:  ct ′  γ −γβ  x′   −γβ γ  =  y′   0 0    0  z′   0 0 0 1 0 0   ct  0   x  0  y    1  z  and  ct   γ +γβ  x   +γβ γ  =  y  0 0    0 z  0 Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 0 0 1 0 0   ct ′ 0   x′  . 0  y′    1  z′  6.17 93 More compactly, defining ct ′  ct   γ −γβ  x′  x  −γβ γ     x′ ≡ , x≡ , and Λ ≡   y′   y  0 0      0  z′  z  0 0 0 1 0 0 0  0  1 allows us to write the Lorentz transformations as x′ = Λ x , x = Λ −1 1 0 −1 −1 x′ where ΛΛ = Λ Λ = I ≡  0  0 0 1 0 0 0 0 1 0 0 0  . 0  1 Any quantity which transforms this way under changes of reference frame is called a 4-vector. Another four vector can be built from a particle's energy E and momentum p : E c  p  p≡ x .  py     pz  since it transforms like this: p′ = Λ p . Naturally, this is just shorthand for the four equations p′x = γ ( px − β E c ) , p′y = p y , p′z = pz , E ′ c = γ ( E c − β px ) .  γ −γβ  −γβ γ (a) For Λ defined as   0 0  0  0 0 0 1 0 0  γ +γβ  +γβ γ 0  , prove that its inverse Λ −1 is   0 0 0   1 0  0 0 0 1 0 0 0  . 0  1 (b) A proton at rest has energy E proton = m p c 2 ≈ 938 MeV . (Its momentum, not surprisingly, is zero.) The protons which circulate inside the Fermilab Tevatron have energies close to 1,000 GeV. (1 GeV = 1,000 MeV.) What value of γ does a Tevatron proton have? Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.18 94 (c) Somehow a Tevatron proton captures an electron, becoming a fast-moving hydrogen atom without changing its speed. An electron at rest has Eelectron = me c 2 ≈ 0.511 MeV . What is the electron's energy in the rest frame of Fermilab's sedentary buffalo herd? Problem 2: Field transformation laws As you've seen, the electric and magnetic fields are intimately connected by relativity. We can construct the "electromagnetic field strength tensor" out of the fields' components like this:  0 − Ex c − E y c − Ez c  E c 0 By  − Bz x  . F≡  E y c Bz 0 − Bx    0  Bx  Ez c − By What makes this a "second rank tensor" (a 4-vector is a "first rank tensor") is that the electric and magnetic fields transform when one changes frames this way:  0 − E x′ c − E y ′ c − E z ′ c    E ′ c ′ ′ − Bz 0 By  x   = ΛF Λ T = ΛF Λ ′ F = E ′ c B ′ − Bx′  0 z  y   E ′ c −B ′ ′ Bx 0   z y If an observer in one frame of reference measures E and B at the point (x,y,z) at time t, that observer can predict what an observer in a different frame would measure for E ′ and B′ should their field-measuring devices glide past each other just as the measurements are being performed. (a) The electric and magnetic fields of a 1 Coulomb charge Q are measured by a pair of gizmos as shown in the figure. From the perspective of observers in frame O, the charge is at rest at the origin and one of the field-measuring devices is also at rest, with position (x,y,z) = (0,1,0). Observers in O see the second field-measuring gizmo moving at high speed, with velocity v = β cxˆ . Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.19 95 E B v=0 E B v=βc 1 meter y Q x z At time t = 0 this gizmo passed very close to the stationary device at (x,y,z) = (0,1,0). The readings on the dials of the moving device were photographed by observers in O as it was illuminated by the lights from the stationary gizmo's panel lights. • Calculate the components of the electric and magnetic field vectors measured by the stationary device at the time the photograph was taken. • Calculate the components of the electric and magnetic field vectors measured by the moving device. • According to observers in the rest frame of the moving device, how far away is the charge Q? (b) The same gizmos are now placed on the x axis with the stationary device at position (x,y,z) = (1,0,0) and the moving device traveling along the x axis with velocity v = β cxˆ . As the moving device brushes past the stationary device, photographs are taken. • Calculate the components of the electric and magnetic field vectors measured by the stationary device at the time the photograph was taken. • Calculate the components of the electric and magnetic field vectors measured by the moving device. • According to observers in the rest frame of the moving device, how far away is the charge Q? Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.20 96 Problem 3: And now for something completely different The idealized “operational amplifier” shown in the figure has the following properties: VV+ • • • Vout + Vout = G(V+ - V-) No current enters its inputs, regardless of how large the input voltages V+ and V- become. The output voltage is always exactly Vout = G(V+ - V-) where G is the (constant) gain of the amplifier. The gain G is “nearly” infinite. The inputs to which V+ and V- are connected are called the amplifier’s “non-inverting” and “inverting” inputs, respectively. Note that voltages can be bipolar: both positive and negative values are fine. (a) In the following circuit, what is Vout as a function of Vin? Vin Vout + (b) What is Vout as a function of Vin for this circuit? Keep in mind that no current flows into the amplifier’s input terminals and that Ohm’s law requires the voltage drop across a resistor to be equal to the product of its resistance and the current flowing through it. . Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.21 97 R2 = 20kΩ - Vin Vout = ? R1 = 10kΩ + Physics 212, Spring 2003. Unit 6 The origin of the magnetic field as a consequence of special relativity 6.22 98 Physics 212 Honors topics in electrodynamics Unit 7 Operational amplifier circuits Spring, 2003 George Gollin University of Illinois at Urbana-Champaign 2003 Physics 212, Spring 2003. Unit 7 Operational amplifier circuits 7.1 99 Unit 7: Operational amplifier circuits Recap of last week Introduction: opamps and other circuit elements We’re going to build toys today. It’s a lot more fun to build electronic circuits with active components (amplifiers and the like) than it is to be stuck in the passive-only world of electrical circuits built only with resistors and capacitors. Our principal building block will be the operational amplifier (opamp), a differential amplifier whose “open loop” gain is so high, and input impedance so large, that it is always used with feedback to control its behavior. Let’s discuss opamps using a particular example, the venerable LF411. This opamp is built on a single sliver of silicon, and comprises twenty-two transistors, one diode, one capacitor, and eleven resistors. You can find a description of it at the Texas Instruments web site http://www.ti.com/: run a search on “LF411” to find the data sheets. One of the many ways the chip is packaged is in an 8-pin dual-inline package (DIP): IN- and IN+ are the “inverting” and “non-inverting” inputs to the opamp. OUT is, of course, the opamp’s output pin. VCC+ and VCC- are the power supply pins, held somewhere between ±3.5 and ±18 volts. The pins BAL1 and BAL2 may be used to fine-tune the amplifier’s output when the inputs IN+ and IN- are held at exactly the same voltage. We will leave them disconnected. We’ll use a symbols like the following to represent the opamp, omitting the power and balance pins. or + Physics 212, Spring 2003. Unit 7 Operational amplifier circuits 7.2 100 (It’s OK to draw them with the non-inverting input at the top instead of the bottom.) The output voltage is Vout = G (V+ − V− ) , where G is the opamp’s “open loop” gain. According to Texas Instruments, the typical open loop gain for an LF411 is 200,000. (However, the output cannot go higher or lower than the power supply voltages!) The LF411’s input impedance is typically 1012Ω: increasing the input voltage by a volt will only change the current flowing into the input pin by a picoamp. The device can drive output currents as large as 20 milliamps, and output voltages within a few volts of VCC+ and VCC-, so it has enough “oomph” to drive a pair of headphones without additional amplification. (Headphones have input impedances of several dozen ohms and can handle input voltages of a couple of volts before melting.) We NEVER use an opamp without feedback between its output and its inputs: its gain, “input offset voltage,” and other parameters are not particularly stable against changes in power supply voltage, temperature, and so forth. The following circuit is a BAD BAD THING! ~ Vin - Bad circuit!!! + Vout = -GVin Here’s how to think about an opamp: 1. An opamp’s gain is (almost) infinite; 2. An opamp’s input impedance is (almost) infinite. This sounds crazy, but there are three very useful rules of thumb which follow from this: 1. In a properly designed circuit (which won’t “pin” its output), the opamp does whatever is necessary to make its inputs (almost) exactly equal; 2. (almost) Absolutely no current flows into the opamp’s input pins. (almost) As much current as needed flows from (or to) the output pin to maintain its output voltage. 3. There must always be a DC path between the opamp output and (at least) one input, even if it’s made with a huge resistor that does very little in setting the circuit’s behavior. Here’s what you’ll find/work on today: • Analysis (and construction!) of a few simple opamp circuits Physics 212, Spring 2003. Unit 7 Operational amplifier circuits 7.3 101 Exercise 7.1: Opamp voltage follower You analyzed this circuit as a homework problem. Be sure that you understand how the circuit works. (Do you see why Vout as a function of Vin behaves the way it does?) Vout = ? Vin + Exercise 7.2: Mystery amplifier #1 This was another homework problem, and is the circuit that we have setup for you on the breadboard at your lab bench. Make sure you are clear about how the circuit works. R2 = 20kΩ Vin Vout = ? R1 = 10kΩ + Discussion What happens to Vout if we load the above circuit’s output, as shown? Vin + Physics 212, Spring 2003. Unit 7 Operational amplifier circuits Vout load resistance 7.4 102 Hands-on time: We’ve already assembled the simple “inverting amplifier” circuit in the previous exercise for you, provided power to the opamp chip, attached a signal generator to its input, and set up the oscilloscope to trigger sensibly. Note that we’ve put some capacitors on the board between ground and the power supply voltages to smooth out noise and glitches on the power “rails.” The capacitors are probably the kind that care about the polarity they see at their input leads: one particular lead must always be more positive than the other. (Reversing the polarity can cause the cap to explode, or catch fire, in a most dramatic fashion.) Investigate the following with your circuit: 1. 2. 3. 4. How close is the gain to what you’d expect? How close to being equal are the voltages on the two input pins? Change one of the resistors. Does the gain change in the way you expect? Change (increase) the frequency of the input signal. At what frequency do you begin to notice the amplifier’s gain dropping? 5. Change the input signal frequency again, keeping an eye on the phase difference between the input and output signals. When does the phase shift begin to change noticeably? Exercise 7.3: Mystery amplifier #2 What does the following (idealized) circuit do? (By this I mean what is Vout in terms of Vin?) C = .01µF Vin Vout = ? R = 100kΩ + Note that this circuit violates our third rule of thumb and is NOT A GOOD CIRCUIT! Physics 212, Spring 2003. Unit 7 Operational amplifier circuits 7.5 103 Physics 212, Spring 2003. Unit 7 Operational amplifier circuits 7.6 104 Discussion and hands-on time: An important design rule: due to imprecision in manufacturing, the point at which an opamp produces zero volts at its output will be when its inputs are held at slightly unequal voltages. This input offset voltage can be as large as several millivolts. Generally, it is necessary to have a DC feedback path from the opamp’s output to an input. As a result, a better version of the integrator circuit shown above would be the following: 22MegΩ .01µF Vin Vout 100kΩ + The 22Meg resistor is small compared to the LF411’s input impedance, but large compared to the 100k input resistor. The circuit’s behavior as an integrator will change to that of an ordinary inverting amplifier for times which are not short compared to RC = 22Meg × .01µF = 0.22 seconds. Build this circuit and see how it responds to sinusoidal, square, and triangle wave inputs. Vary the signal generator’s frequency and observe how the circuit’s response to a sinusoidal input changes (the circuit’s gain, in particular). Exercise 7.4: Mystery amplifier #3 What does this circuit do? R = 100kΩ Vin Vout = ? C = .01µF + Physics 212, Spring 2003. Unit 7 Operational amplifier circuits 7.7 105 Discussion The above circuit is vulnerable to high frequency noise (some of which is generated by the amplifier itself) since its gain increases with frequency. If you need to use one of these, a better version includes another (smaller) resistor and capacitor, as in the following circuit. 50pF 1kΩ Vin - 100kΩ Vout .01µF + Build one of these and see if it behaves as you’d expect. Exercise 7.5: Mystery amplifier #4 A diode’s current-vs.-voltage behavior is I (V ) ≈ I S e 40V . The parameter IS (the “reverse saturation current) depends on details of the diode’s construction and changes rapidly with temperature. Here are a couple of voltage vs. current curves for a 1N914 silicon junction diode. Since the current axis scale is logarithmic, the V vs. I curves are straight lines. Let’s assume that IS is 10-15 amperes for the diode in the following circuit. As drawn, the diode likes to pass current from left to right. Physics 212, Spring 2003. Unit 7 Operational amplifier circuits 7.8 106 In the following circuit what is Vout as a function of Vin? (Assume that Vin ≥ 0.) current flows in this direction diode - Vin Vout = ? R = 1kΩ + Discussion Since the diode will only pass current in one direction, a better scheme might be to use a pair of diodes in parallel, with the second diode passing current from right to left. Build the circuit with the pair of diodes as described above, and see how it behaves in response to triangle waves. Exercise 7.6: Do-it-yourself summing amplifier How would you design an opamp circuit which sums two voltages so that Vout = -(Vin1 + Vin2)? (Hint: one version of the circuit requires three resistors and one opamp.) Discussion Build an example of your circuit and make it work. (Figure out a clever way to have your function generator provide a second input to your circuit.) Summary and wrap-up discussion Physics 212, Spring 2003. Unit 7 Operational amplifier circuits 7.9 107 Comments about homework assigned for next week Physics 212, Spring 2003. Unit 7 Operational amplifier circuits 7.10 108 Reading and homework should be completed by 4pm Friday. Required reading: Read over (but don't worry about understanding) the Unit 8 material we'll tackle next week. Optional reading: Horowitz & Hill, chapter 4 through section 4.06. (The book is on reserve in the Physics Department library.) Problem 0 (5 points): Make your own hardcopies… Print (from the web) your own copy of the solutions to this week’s in-class exercises as well as last week’s problem set and write a (true!) statement on your problem set asserting that you have successfully printed your personal copies. Problem 1 (10 points): Diode circuit IS is 10-15 amperes for the diode in the following circuit. (As drawn, the diode likes to pass current from left to right.) What is Vout as a function of Vin? (Assume that Vin ≥ 0. There is a description of the current-voltage behavior of a diode in the in-class materials.) R = 1kΩ diode Vin Vout = ? + Problem 2 (10 points): Multiplying voltages You now know about opamp circuits which incorporate diodes and resistors. (You will need the results of the previous problem in order to work this problem.) Keeping in mind that V1 × V2 = e ln (V1 ×V2 ) = eln V1 + ln V2 , design a circuit whose output is Physics 212, Spring 2003. Unit 7 Operational amplifier circuits 7.11 109 Vout = Vin1 × Vin2 where Vin1 > 0 and Vin2 > 0 . You should assume that the (ideal) diode behaves as described previously, with IS = 10-15 amperes. (Your circuit will probably require several opamps.) Problem 3 (10 points): Mrs. Urkin commissions a prototype for something more terrible Anticipating a possible upturn in the market for horrible instruments of torture and punishment, scientists in the UD&HE Research Division are charged with the responsibility of developing a current-source poultry incinerator based on a high-voltage, high-power operational amplifier. The potentiometer (a variable resistor) at the opamp's input controls the amount of current passing through the dressed bird in a species-independent fashion. (For a given potentiometer setting the same current will flow through a chicken as a turkey.) 1000 V - pin 1 pin 2 + pin 3 potentiometer: Rpin 1 to pin 3 fixed at 1000 Ω Rpin 2 to pin 3 varies from 0 to 1000 Ω Rpin 1 to pin 2 = Rpin 1 to pin 3 - Rpin 2 to pin 3 10 Ω Determine the current I which flows through the bird as a function of the resistance Rpin 2 to pin 3 dialed into the potentiometer. (The utility of this circuit is that the current which flows through the "load" [the bird] is independent of the properties of the load.) Physics 212, Spring 2003. Unit 7 Operational amplifier circuits 7.12 110 Physics 212 Honors topics in electrodynamics Unit 8 Electronic models of mechanical systems; analog computers 1 Spring, 2003 George Gollin University of Illinois at Urbana-Champaign 2003 Physics 212, Spring 2003. Unit 8 Electronic models of mechanical systems; analog computers 1 8.1 111 Units 8 & 9: Electronic models of mechanical systems; analog computers Recap of last week Introduction: using opamp circuits to represent (differential) equations Last week you analyzed (and built) a number of circuits which can be used to add voltages, take logarithms, multiply voltages by constants, and so forth. I have installed a table of the circuits (some will probably be new to you) on the last page of the unit 8 material. As you may already know, it is possible to build circuits whose behavior mimics that of nonelectronic systems. I think it’s easiest to show this with an example. Imagine we apply a driving force F(t) to a mass immersed in a viscous liquid so that the drag force it experiences is proportional to velocity. We’ll have Fnet = F ( t ) + Fdrag or ma = F ( t ) − β v (β is a positive constant, and the drag force opposes the velocity) so that m dv dt = F ( t ) − β v . Rewrite this slightly so that the highest derivative is by itself on the left: dv 1 β = F (t ) − v . dt m m A circuit which “solves” dv dt = F ( t ) m − β v m can be arranged like this: 1. 2. 3. 4. 5. 6. 7. assume we have a voltage which correctly represents dv dt coming in from the left. integrate it to produce a voltage which represents -v. multiply the voltage corresponding to –v by the constant β/m. receive (from a function generator) a voltage which corresponds to F(t)/m. add the voltages to produce a voltage representing –(F(t)/m - vβ/m). invert –(F(t)/m - vβ/m) to produce (F(t)/m - vβ/m). bring this signal out from the right, inject it into the input of step 2’s integrator. For example, this circuit produces an integral of its input signal: 22 MegΩ Vin R C + Vout Vout = − 1 Vin dt ( R RC ∫ 22 MegΩ) . Other functions are shown in the table at the end of this unit. Physics 212, Spring 2003. Unit 8 Electronic models of mechanical systems; analog computers 1 8.2 112 Here’s what we can do when β/m < 1: 22 MegΩ R C - R1 -v + + R2 Integrate with RC=1 -vβ/m F(t)/m -vβ/m - Scale with R2 ( R1 + R2 ) = β m R R -F(t)/m + vβ/m - R + R R - F(t)/m - vβ/m (= dv/dt) dv/dt + Nillog Invert Add We could have combined steps 2 and 3 ( reducing our component count) by picking the integrator’s RC value to be m/β instead of 1: 22 MegΩ dv/dt R C - -vβ/m Integrate with RC=m/β -vβ/m R R F(t)/m - -F(t)/m + vβ/m + R Nillog Add Physics 212, Spring 2003. Unit 8 Electronic models of mechanical systems; analog computers 1 R R - F(t)/m - vβ/m (= dv/dt) + + Invert 8.3 113 If we are interested in seeing a representation of the object’s position we can add an extra integrator to calculate x = ∫ v dt : 22 MegΩ C - R -vβ/m R + C - x + Integrate with RC=m/β Integrate with RC=β/m -vβ/m R R F(t)/m -F(t)/m + vβ/m - R + R R - F(t)/m - vβ/m (= dv/dt) dv/dt 22 MegΩ + Nillog Invert Add By watching the circuit’s voltages on an oscilloscope we’ll see a good representation of the behavior of the mechanical system’s position and velocity. To make scope work easier, let’s use one millisecond as the unit of time. This is easy to do: instead of building the circuit so that RC = 1, or β/m, or m/β, we’ll use RC = 10-3, or β/m × 10-3, or m/β × 10-3 instead. (For example, use R = 100 kΩ and C =.01 µF to obtain RC = 10-3.) In addition, let’s imagine we’ve chosen a system where β/m ≈ 0.2. Defining f ( t ) ≡ F ( t ) m , our equation becomes dv = f ( t ) − 0.2v or dt d 2x dx = f ( t ) − 0.2 2 dt dt if you prefer. The circuit becomes F(t)/m – 0.2v (= dv/dt) dv/dt 22 MegΩ 500kΩ .01µF 100kΩ -0.2v + 100kΩ Integrate - -F(t)/m + 0.2v + Add and flip sign F(t)/m 100kΩ Nillog Physics 212, Spring 2003. Unit 8 Electronic models of mechanical systems; analog computers 1 100kΩ 100kΩ + Invert . 8.4 114 Exercise 8.1: Analytic solution Imagine that you drive your mass-in-viscous fluid with a time-dependent driving force as shown in the following figure. f(t) f0 t The equation of motion for the velocity of the mass is dv dt = −0.2v when the force is zero and dv dt = f 0 − 0.2v when the force is f0. (a) Let’s call the time at which the force goes to zero t = 0. If the velocity at the instant the force drops from f0 to zero is v0, prove that v(t ) = v0e −0.2t is a solution to the equation of motion during the time when the force remains zero. (b) Now let’s call the time at which the force changes from zero t = 0. If the velocity at the instant the force increases from zero to f0 is v1, prove that v(t ) = ( v1 − 5 f 0 ) e −0.2t + 5 f 0 is a solution to the eqution of motion as long as the force remains f0. Exercise 8.2: Build it! Build the three-opamp-circuit drawn above. Try driving it with 1 volt square waves of various frequencies. How does your result for the voltage corresponding to -vβ/m correspond to the exact solution from the prevuous exercise? Discussion You’ve just built an analog computer! Why might one want to do (to have done) that? Physics 212, Spring 2003. Unit 8 Electronic models of mechanical systems; analog computers 1 8.5 115 Optional material: Modeling a damped, driven simple harmonic oscillator Introduction A damped, driven harmonic oscillator experiences a net force with contributions from a spring ( Fspring = −kx ), a velocity-dependent drag ( Fdrag = − β v ), and a driving force (Fdrive). We can write its equation of motion as Fnet = Fspring + Fdrag + Fdrive . After a small amount of manipulation, the equation becomes d 2 x (t ) Fdrive ( t ) β dx ( t ) k = − − + . x t ( ) dt 2 m dt m m You may remember that the “natural frequency” of this oscillator is ω = k m radians/sec. Its natural frequency in Hz (cycles/sec) is ω ( 2π ) . For homework you designed a circuit which models this oscillator using the values k/m = 1 and β/m = 0.2. Defining f ( t ) ≡ F ( t ) m , your oscillator’s equation becomes d 2 x (t ) dx ( t ) = − 0.2 − x (t ) + f (t ) . dt 2 dt Build it! To make scope work easy, let’s use milliseconds as the unit of time so that your integrators will have RC values in the vicinity of 10-3 instead of 1. (R = 100 kΩ and C =.01 µF gives RC = 10-3.) The oscillator’s natural frequency is 1 ( 2π ) cycles per millisecond, or about 159 Hz. Build your circuit, and try driving it with 100 millivolt sine waves of various frequencies. Describe how the amplitudes of the voltages which correspond to x(t) and v(t) behave as you change the frequency of the driving “force.” Also take note of the relative phases of x(t) and f(t). Configure your oscilloscope to display an x-y plot of the voltages representing x(t) and v(t). What do you see? Discussion Summary and wrap-up discussion Physics 212, Spring 2003. Unit 8 Electronic models of mechanical systems; analog computers 1 8.6 116 Comments about homework assigned for next week Physics 212, Spring 2003. Unit 8 Electronic models of mechanical systems; analog computers 1 8.7 117 Opamp circuit function blocks add voltages and flip sign (R1=R2=R3=R4) R1 V1 - R2 R1 V1 Vout + R2 - V2 R4  ( R + R2 ) R4   R2  Vout =  1  V2 −   V1  R1   ( R3 + R4 ) R1  R R = −  3 V1 + 3 V2 R2  R1  Vout   Multiply by a positive constant < 1 Vin - R1 multiply by a positive constant > 1 R1 Vout + R2 Vout = + R2 Vin R1 + R2 Vin R2 - Vin R1 + R2 Vin R1 exponentiation R2 diodes - Vin Vout + Vin Vout = − differentiate − sign (Vin ) 40 C Vin R - Vout ⋅ ln Vin − ln ( I s R )  Vout = − RC • • diodes + 1 kΩ Vout 1 Vin dt ( R 22 MegΩ) RC ∫ take logarithm Vin Vout ≈ + Vout + 22 MegΩ C - R - Vout ≈ − sign (Vin ) ⋅ I s R ⋅ e 40Vin R Vout = − 2 Vin R1 integrate R Vout + Vout = + Multiply by a negative constant R1 Vout + R3 ... V2 subtract two voltages (R1=R2=R3=R4) R3 ( I s ∼ 10−15 ) dVin (R dt 50 pF + R Vout 1 kΩ, C 50 pF) Useful facts RC has units of time. If you’d like to have a millisecond act as your unit quantity of time (this is convenient for oscilloscope work), chose RC to be 10-3. For example, R = 100 kΩ and C =.01 µF should work fine Physics 212, Spring 2003. Unit 8 Electronic models of mechanical systems; analog computers 1 8.8 118 Physics 212, Spring 2003. Unit 8 Electronic models of mechanical systems; analog computers 1 8.9 119 Reading and homework should be completed by 4pm Friday. Required reading: Read over (but don't worry about understanding) the Unit 9 material we'll tackle next week. Optional reading: Horowitz & Hill, chapter 4 through section 4.06. (The book is on reserve in the Physics Department library.) Problem 0 (5 points): Make your own hardcopies… Print (from the web) your own copy of the solutions to this week’s in-class exercises as well as last week’s problem set and write a (true!) statement on your problem set asserting that you have successfully printed your personal copies. Problem 1 (10 points): No Venusians were found Research electricians at Urkin’s have shown the following circuit to be unable to detect signs of extraterrestrial life, though it has proved useful for other purposes. Calculate VA and VB in terms of Vin. Vin VA + R R VB + Problem 2 (20 points): Doing A/B electronically Physics 212, Spring 2003. Unit 8 Electronic models of mechanical systems; analog computers 1 8.10 120 Design a circuit whose output is Vout = Vin1 / Vin2 where Vin1 > 0 and Vin2 > 0 . You should assume that (ideal) diodes behave as described previously, with IS = 10-15 amperes. (Your circuit will probably require several opamps.) Physics 212, Spring 2003. Unit 8 Electronic models of mechanical systems; analog computers 1 8.11 121 Physics 212 Honors topics in electrodynamics Unit 9 Electronic models of mechanical systems; analog computers 2 Spring, 2003 George Gollin University of Illinois at Urbana-Champaign 2003 Physics 212, Spring 2003. Unit 9 Electronic models of mechanical systems; analog computers 2 9.1 122 Unit 9 is now included with the unit 8 writeup which covers both weeks. Physics 212, Spring 2003. Unit 9 Electronic models of mechanical systems; analog computers 2 9.2 123 Reading and homework should be completed by 4pm Friday. Required reading: Read over (but don't worry about understanding) the Unit 10 material we'll tackle next week. Optional reading: Horowitz & Hill, chapter 4 through section 4.06. (The book is on reserve in the Physics Department library.) Problem 0 (5 points): Make your own hardcopies… Print (from the web) your own copy of the solutions to this week’s in-class exercises as well as last week’s problem set and write a (true!) statement on your problem set asserting that you have successfully printed your personal copies. Problem 1 (20 points): We are so clever, and so frugal Design an opamp circuit which “solves” the equation d 2 x (t ) dx ( t ) = − 0.2 − x (t ) + f (t ) 2 dt dt using only four opamps. (f(t) [or -f(t) if it should prove convenient] is provided by a function generator.) Be sure to include explanatory comments on your circuit diagram so we can tell what you had in mind. Problem 2 (10 points): Yep, it subtracts them! Prove that the following circuit V1 V2 R1 R3 R2 - Vout + R4 produces an output voltage  ( R + R2 ) R4   R2  Vout =  1  V2 −   V1 .  R1   ( R3 + R4 ) R1  Physics 212, Spring 2003. Unit 9 Electronic models of mechanical systems; analog computers 2 9.3 124 Physics 212 Honors topics in electrodynamics Unit 10 Magnetic fields and the vector potential Spring, 2003 George Gollin University of Illinois at Urbana-Champaign 2003 Physics 212, Spring 2003. Unit 10 Magnetic fields and the vector potential 10.1 125 Unit 10: Magnetic fields and the vector potential Recap of last week Introduction: A and B Today we’ll investigate the connection between the vector potential and the magnetic field. It should be clear that there’s some sort of link between the two: if no currents are flowing, both the vector potential and the magnetic field are zero. In addition, both grow linearly with the magnitude of the current. (In addition, recall the problem set for unit 6 in which you used the Lorentz transformations to calculate the electric and magnetic fields of a moving charge by transforming the electric field of a stationary charge.) Since the vector potential is defined in terms of an integral over the current density J , there’s also some sort of direct connection between the B and J . You’ll discover that it is the curl of the magnetic field which is proportional to the current density. Here’s what you’ll do today: • Calculate the curl of the vector potential associated with a current-carrying wire • Compare ∇ × A with B • Calculate ∇ × A for a point charge moving with constant velocity • Discover that ∇ × B = µ0 J • Derive the Biot-Savart law • Derive Ampère’s law Discussion: A definition of the magnetic field, and a recap of your previous results From Physics 112 you know that the (Lorentz) force on a moving charge is F = qE + qv × B . Let’s use this as the definition of the magnetic field: it is what is responsible for the velocitydependent portion of the force acting on a charged particle. You can immediately see that the observers in different frames will disagree about the values of E and B since all the force in the rest frame of a particle must come from E . Both E and B contribute to the force in a frame in which the particle is moving. A few units back you worked up a calculation of the apparent charge density of a currentcarrying wire as seen by a moving test charge. Your calculation concerned a test charge moving Physics 212, Spring 2003. Unit 10 Magnetic fields and the vector potential 10.2 126 to the right with speed v parallel to a wire in which electrons flow to the right (as seen from the wire’s rest frame) with speed u . Your calculations should have revealed that the apparent net charge density on the wire (according to the test charge) was λ = ( quv ) ( ac 2 ) C/m. q is the charge on a metal ion: +1.6×10-19 C. Further calculation would/should have shown that the force on the test charge Q was F =λ Q quv Q . = ac 2 2π rε 0 2π rε 0 The magnitude of the current flowing in the wire is the amount of charge per second that flows past one point in the wire. Since the spacing between electrons (in the wire’s rest frame) is a while the average electron velocity is u, there will be u/a electrons per second flowing past one position in the wire. As a result, the magnitude of the current is I = qu a . We can use this to rewrite the force as F = Qv 1 1 I ≡ Qv × B . ε 0c 2 2π r From our definition of the magnetic field, we conclude that its magnitude a distance r from a straight current-carrying wire must be 1 1 I µ I B = = 0 2 ε 0c 2π r 2π r since µ 0 = 1 ( ε 0c 2 ) . This agrees with the Physics 112 result. (The direction also agrees if you work it all out correctly.) Physics 212, Spring 2003. Unit 10 Magnetic fields and the vector potential 10.3 127 Exercise 10.1: Line current vector potential A few weeks ago we found that the vector potential for a non-relativistic charge is µ qi vi µ q . A( r ) ≈ 0 v . Use superposition for several charges: A ( r ) ≈ 0 ∑ 4π r − r ′ 4π all charges i r − ri For a distribution of charge ρ ( r ) swirling around so that the distribution’s velocity v ( r ) depends on position, we can superpose the contributions from cells of volume dV containing charge dq = ρdV this way: A( r ) ≈ µ0 ρ ( r ′) v ( r ′) 3 µ0 J ( r ′) 3 d r d r ′ (d 3 r ′ is the same as dV ) . = ′ ∫ ∫ 4π r − r′ 4π r − r ′ all space all space The current density J is the number of Coulombs per second per square meter which flow through a surface perpendicular to J . (The above equation is exact only if J does not change with time.) If the current flow is confined to a wire, it is convenient to “do the integral” along the two coordinates perpendicular to the direction along the wire. For example, if we have a current I flowing in a thin wire along the z axis… z y x the vector potential created by the current is A ( r ) = Physics 212, Spring 2003. Unit 10 Magnetic fields and the vector potential z =+∞ µ0 I zˆ dz ′ . 4π r − r ′ z =−∞ ∫ 10.4 128 (a) In Cartesian coordinates, the vector potential created by this current is A ( x, y , z ) = µ0 4π +∞ ∫ −∞ I dz ′ x 2 + y 2 + ( z − z ′) 2 . Note that the integral diverges: for large z′ it behaves like ∫ dz′ z′ = ln ( z′ ) . We encounter the same difficulty when we calculate the electrostatic potential from a line charge: +∞ 1 ρ ( r ′) 3 1 λ V ( x, y , z ) = ∫ d r′ = dz ′ . ∫ 2 2 2 r r 4 4 − πε πε ′ 0 0 all space −∞ x + y + ( z − z ′) One way around this would be to calculate the electric field instead (which behaves like 1/r), then to do the integral to work with the potential difference between two points which are a finite distance from the line charge. Since we’re interested in showing that the curl of the vector potential is the same as the magnetic field, we can circumvent the difficulty here by taking the curl of A , and moving the curl inside the integral. This will yield a convergent integral:  µ ∇ × A ( x, y , z ) = ∇ ×  0  4π +∞ ∫ −∞ Idz ′ x 2 + y 2 + ( z − z ′) 2  µI = 0  4π  1 ∫ ∇ ×  2 2 2 −∞  x + y + ( z − z ′) +∞  zˆ  dz ′ .  Note that the partial derivatives of the curl act only on the unprimed variables x, y, z and not on the primed variables x′, y′, z ′ .  1 Calculate ∇ ×  2 2 2  x + y + ( z − z ′ )  ẑ  .  Physics 212, Spring 2003. Unit 10 Magnetic fields and the vector potential 10.5 129 (b) The vector potential is symmetric under rotations about the z axis, and is independent of the value of z. As a result, we can work with points on the x axis (so that y = z = 0) without loss of generality in order to simplify things. (We could do the calculation in cylindrical coordinates instead.) µI Calculate ∇ × A ( x, y, z ) = 0 4π (You may find that ∫ (x dz′ 2 + z′  1 ∫ ∇ ×  2 2 2 −∞  x + y + ( z − z ′) +∞ ) 2 32 = z′ x 2 Physics 212, Spring 2003. Unit 10 Magnetic fields and the vector potential x 2 + z′2  zˆ  dz ′ for points in the x axis.  is useful.) 10.6 130 (c) Compare your answer in part (b) with the magnetic field found using the relativity-based calculation described earlier. (The results should agree!) Discussion: is the magnetic field always the curl of the vector potential? It looks like the vector potential as we’ve defined it is a good candidate for what we want! (In a more advanced course you’ll probably encounter a proof that the definition we’re using for the vector potential is correct. I’ll just assert that it’s true and forget about the formalism.) Can we always write the magnetic field as the curl of the vector potential? There seem to be no isolated sources or sinks of magnetic field lines anywhere in the universe. (These would be the magnetic analogs of positive and negative electric charge.) Think of the field lines created by a solenoid: they don’t begin (or end) anywhere. Consequently, the magnetic version of Poisson’s equation, ∇i B = ρmagnetic ε 0 , becomes ∇i B = 0 . Remember Helmholtz’s theorem? We didn’t get to it during unit 5, but you may have read about it if you looked through the notes. Here’s how we can use it: For any vector field B which dies away very far from the origin (not just the magnetic field), we begin by defining the scalar and vector fields D and C so that D ( x, y, z ) ≡ ∇i B ( x, y, z ) and C ( x, y, z ) ≡ ∇ × B ( x, y, z ) . According to Helmholtz’s theorem we can then reconstruct B from D and C this way:  1 B ( r ) = −∇    4π  1 D ( r ′) 3  ′ d r + ∇ ×   ∫ r − r′   all space   4π C ( r′) 3  ∫ r − r ′ d r ′  . all space  If ∇i B ( x, y, z ) = 0 (so that D = 0), this reduces to  1  1 C ( r ′) 3  ∇′ × B ( r ′ ) 3  ′ B (r ) = ∇ × d r = ∇ ×   ∫ r − r′  ∫ r − r ′ d r ′  ≡ ∇ × A .  4π all space  4π all space     The prime on the curl inside the integral indicates that its derivatives are with respect to primed, not unprimed, coordinates. Physics 212, Spring 2003. Unit 10 Magnetic fields and the vector potential 10.7 131 We can always write a divergenceless function as the curl of some vector field. As a result, we can always find a vector potential A which satisfies the equation B = ∇ × A : A(r ) = 1 ∇′ × B ( r ′ ) 3 . d r′ 4π r − r ′ all space ∫ ( ) One last point: since the curl of a gradient is always zero (you showed that ∇ × ∇V = 0 and ( ) ∇i ∇ × A = 0 a few weeks ago), we are always free to add the gradient of any old scalar function to the vector potential without changing the magnetic field which results since ∇ × A + ∇V = ∇ × A + ∇ × ∇ V = ∇ × A = B . ( ) ( ) It turns out that this allows us to impose the condition on the vector potential that it has zero divergence: ∇i A = 0 . (This isn’t a requirement for A , but it is sometimes convenient to calculate a vector potential which satisfies this condition to simplify other calculations.) Exercise 10.2: Magnetic field caused by a point charge moving with constant velocity A point charge moves with constant velocity along the z axis so that its position is ( x′, y′, z′) = ( 0,0, vt ) . An observer positioned at ( x, y, z ) = ( x,0,0 ) measures the magnetic field associated with the particle. (Assume that the particle is moving slowly so that retardation effects can be ignored: during the time it would take light to travel from the charge to the observer the charge’s position doesn’t change significantly.) Make use of the vector potential to calculate the magnetic field as a function of time at the observer’s position. Physics 212, Spring 2003. Unit 10 Magnetic fields and the vector potential 10.8 132 discussion Note the 1/r2 dependence to the field as the charge passes through the origin. discussion: cylindrical coordinates We could have calculated B = ∇ × A using a non-Cartesian coordinate system. In fact, since the problems we’ve done so far have all been symmetric under rotations about the z axis, it would have made sense to use cylindrical coordinates, illustrated in the figure. Naturally, we’d find the same values for B , but they would be parameterized in terms of ( r ,θ , z ) instead of ( x, y, z ) . The unit vectors rˆ, θˆ, zˆ point in the direction of increasing r, θ, and z. z x r θ y In cylindrical coordinates the curl of the vector potential is  1 ∂Az ∂Aθ ∇× A =  − ∂z  r ∂θ 1  ∂ ( rAθ ) ∂Ar   ∂Ar ∂Az  ˆ − − θ +   rˆ +  ∂r  ∂θ r  ∂r   ∂z Physics 212, Spring 2003. Unit 10 Magnetic fields and the vector potential   zˆ .  10.9 133 discussion: retarded time We’ve concocted a couple of ways of calculating the magnetic field. The first uses relativity to describe how the effects of Lorentz contraction and non-simultaneity make observers in different frames disagree about charge density. The second involves a global calculation if we know the “current density” ρ ( r ) v ( r ) everywhere. It is common to use J ( r , t ) ≡ ρ ( r , t ) v ( r , t ) to represent the current density so that  J ( r ′, t ′ )  3 µ B ( r ,t) = ∇ × A( r ,t) = 0 ∫ ∇ ×  d r ′ . 4π all space  r − r′  Note that the curl only acts on unprimed coordinates. One subtle point is that the time t ′ inside the integral is the retarded time: at each value of r ′ that the integral touches, t ′ represents the present time t set back by the amount of time a light beam would have required to arrive at the observer’s position ( r , no prime). The connection between the two times is t ′ = t − r − r ′ c . Sometimes the need to use the retarded time inside the integral can make it very difficult to evaluate! Since the curl acts on unprimed coordinates, any time dependence in the current density will cause it to “react” to the curl’s partial derivatives due to the presence of unprimed spatial coordinates in the retardation term. If the current density J depends on position but not on time (the currents never change), things simplify somewhat. Exercise 10.3: a microscopic version of some of this Starting with the global version of Gauss’ law ∫ E idA = surface Qenclosed ε0 = 1 ∫ ε 0 volume ρ ( r ) dV , we were able to construct a microscopic (local) version ∇i E = ρ ε0 several weeks ago. Physics 212, Spring 2003. Unit 10 Magnetic fields and the vector potential 10.10 134 Let’s do the same thing for the magnetic field now: we’ll establish a connection between its curl and the current density, assuming the current density is constant in time. There’s a hard way to do this, which proceeds as follows: • ( ) ( ∇ × B ( r ) = ∇ × ∇ × A ( r ) . Show that, for example, the x component of ∇ × ∇ × A ( ) ( ) ) satisfies  ∇ × ∇ × A  = ∂ ∇i A ∂x − ∇ 2 Ax .  x • Since we are able to work with vector potentials for which ∇i A = 0 , ∇ × B ( r ) x = −∇ 2 Ax . There are similar equations for the y and z components which can be written this way: ( ∇ × B) ( ∇ × B) ( ∇ × B) xˆ = −∇ 2 Ax xˆ    yˆ = −∇ 2 Ay yˆ  y  2 zˆ = −∇ Az zˆ  z  x ⇒ ∇ × B = −∇ A . 2 Rewrite ∇ × B = −∇ 2 A so that the right side if the equation is expressed in terms of the current density J ( r ) . (You could find what you need in a previous unit.) (a) You should demonstrate this the easy way instead: compare the expression for the vector potential in the earlier discussion (which refers to the Helmholtz formula) with our original definition of the vector potential. From Helmholtz: A(r ) = 1 ∇′ × B ( r ′ ) 3 . d r′ 4π r − r ′ all space ∫ From our original definition of the vector potential: A( r ) = µ0 ρ ( r ′) v ( r ′) 3 1 µ0 J ( r ′) 3 d r = ′ ∫ 4π r − r ′ ∫ 4π r − r ′ d r ′ . all space all space By inspection, what’s ∇ × B in terms of the current density J ? Exercise 10.4: Biot-Savart law Let’s consider a current density which is confined to a thin wire. Earlier, when you calculated the vector potential for a wire running along the z axis, you found that Physics 212, Spring 2003. Unit 10 Magnetic fields and the vector potential 10.11 135 A ( x, y , z ) = µ0 J ( r ′ ) 3 µ d r′ = 0 ∫ 4π r − r ′ 4π all space +∞ ∫ −∞ I dz ′ . r − r′ Define dl to be a short distance along the wire in the direction that current is flowing so we can rewrite the integral more generally, for arbitrary wires: A ( x, y , z ) = ∫ along wire µ0 Idl . 4π r − r ′ This suggests that we write A( r ) = ∫ dA = so that dA contributions from entire wire µ0 Idl 4π r − r ′ and B = ∇× A = ∫ ∇ × dA so that dB = contributions from entire wire (a) Evaluate dB =  Idl  µ0 ∇× . 4π  r − r′   Idl  µ0 ∇×  for a small segment of current at the origin flowing in the z 4π  r − r′  direction: Idl = I zˆ dz . Assume that the observer is located at r as shown in the diagram (in the y-z plane), and that r ′ = 0 . (Keep in mind that the curl acts only on unprimed coordinates.) After grinding through the differentiation, reexpress your answer in terms of θ and r . z observer θ r I dl y x (b) Rewrite your expression for dB in terms of dl × rˆ , where r̂ is a unit vector which points from the small current element Idl (at the origin here) towards the observer’s location. Physics 212, Spring 2003. Unit 10 Magnetic fields and the vector potential 10.12 136 discussion • Written this way (using a cross product), your expression is independent of the choice of coordinate system: very convenient! • You should have obtained dB = µ0 I dl × rˆ . This is the Biot-Savart law, named for Jean4π r 2 Baptiste Biot (1774-1862) and Felix Savart (1791-1841), two French mathematicians. This is the magnetic equivalent of Coulomb’s law: the contribution to B from a small amount of current somewhere follows an inverse-square law. • If we have to contend with current densities instead of currents in thin wires, we’ll need to replace I dl with JdV in our expression for dB . Physics 212, Spring 2003. Unit 10 Magnetic fields and the vector potential 10.13 137 Exercise 10.5: Ampère’s law A short time ago you found that ∇ × B = µ 0 J . Using Stokes’ theorem, you’ll now derive the macroscopic version of this, known as Ampère’s law (named for the French mathematician and physicist André Ampère [1775-1836]). Imagine we have a current density J flowing through a loop, as shown in the diagram. We can integrate both sides of the equation ∇ × B = µ 0 J over the surface enclosed by the loop: z ∫ ∇ × BidA surface enclosed = ∫ µ0 J idA . J surface enclosed y x Use Stokes’ theorem to rewrite the left side as a line integral of the magnetic field around the loop. Note that the details of how the surface extends across the loop do not matter: it can be as lumpy as we want without spoiling the connection between the line integral and the surface integral guaranteed by Stokes’ theorem. Now rewrite the right side as an expression involving the total current I which flows through the loop. Ampère’s law! discussion Use of Ampere’s law with cylindrical symmetry Physics 212, Spring 2003. Unit 10 Magnetic fields and the vector potential 10.14 138 Summary and wrap-up discussion • • • • • Calculating the curl of the vector potential for a current-carrying wire ∇ × A is the same as B ∇ × A for a point charge moving with constant velocity µ I dl × rˆ The Biot-Savart law: dB = 0 4π r 2 Ampère’s law: ∫ B ⋅ dl = µ0 Ithrough perimeter surface Comments about homework assigned for next week Physics 212, Spring 2003. Unit 10 Magnetic fields and the vector potential 10.15 139 Reading and homework should be completed by 4pm Friday. Required reading: Purcell, chapter 2, sections 13 - 16; chapter 6, section 3. Optional reading: The Feynman Lectures on Physics (volume II), chapter 14. Problem 0 (5 points): Make your own hardcopies… Print (from the web) your own copy of the solutions to this week’s in-class exercises as well as last week’s problem set and write a (true!) statement on your problem set asserting that you have successfully printed your personal copies. Problem 1 (10 points): Les belles choses qu’on peut dessiner You sketched a vector potential, including retardation effects, as the last problem of unit 5’s homework assignment. Reproduce your sketch, and then draw on the sketch the vectors corresponding to the magnetic field associated with (the curl of ) this vector potential. (As in the unit 5 problem, only indicate the field on the x axis.) Problem 2: Mrs. Urkin’s industrial jell-o dispenser To speed its distribution of electrically charged jell-o, Urkin’s D&HE builds a pipeline from the Emporium to the central New Jersey rail head at which tank cars are to be filled with the product. You may approximate the pipe as a 2 kilometer long straight cylinder of radius 0.1 meter. The jell-o flows due north through the pipe at 10 m/s and carries uniform charge 1 µC per cubic meter. (a) Calculate the current density J inside and outside the pipe. (b) Calculate the curl of the magnetic field ∇ × B inside and outside the pipe. (Note that you do not need to know B to calculate ∇ × B here! See exercise 9.3 in the class materials if you’re stuck.) (c) Calculate the value of the vector potential A along the centerline (r = 0) of the pipe midway between its ends. (It’s easiest to do this problem in cylindrical coordinates. Keep in mind that the volume element in your integral will be dV = rdrdθdz, and not drdθdz.) (continued next page...) Physics 212, Spring 2003. Unit 10 Magnetic fields and the vector potential 10.16 140 (d) Recall Stokes’ theorem, true for any vector field B (not just a magnetic field): ∫ ( ∇ × B ) ⋅ dA = ∫ surface B ⋅ dl . perimeter of surface Use this to determine the magnetic field B as a function of distance from the center of the pipe. Physics 212, Spring 2003. Unit 10 Magnetic fields and the vector potential 10.17 141 Physics 212 Honors topics in electrodynamics Unit 11 Maxwell’s equations Spring, 2003 George Gollin University of Illinois at Urbana-Champaign 2003 Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.1 142 Unit 11: Maxwell’s equations Recap of last week Introduction: Establishing a connection between E and B We’ve worked up a number of connections between electric fields and charge distributions, magnetic fields and current distributions, potentials and fields, and so forth. In many ways our equations are still incomplete, however: some of them are only true for static charge distributions, or for currents which never change. In addition, we don’t have anything which provides a direct link between the electric and magnetic fields, even though we know that special relativity requires a connection between them. Here’s a table which summarizes what we know so far. Scalar and vector potentials in terms of charge and current distributions static charges, currents 1 ρ ( r ′) 3 V (r ) = ∫ d r′ 4πε 0 r − r ′ all space A( r ) = µ0 J ( r ′ ) 3 d r′ ∫ 4π r − r ′ all space static charges, currents Electric and magnetic fields in terms of charge and current distributions 1 ρ ( r ′ ) r − r ′ 3 (Coulomb’s law) static charges, currents E (r ) = d r′ 2 ∫ all space B ( r1 ) = 4πε 0 r − r ′ r − r ′ µ0 I J × rˆ2→1 3 d r2 (Biot-Savart law) 4π r22→1 space ∫ static charges, currents Electric and magnetic fields in terms of potentials static charges, currents E = −∇V always true B = ∇× A Divergences (flux integrals) of electric and magnetic fields Q ρ always true ∇i E = or ∫ E idA = enclosed (Gauss’ law) ε0 ε0 surface always true ∇i B = 0 (No magnetic charge) Curls (line integrals) of electric and magnetic fields static charges, currents ∇× E = 0 static charges, currents ∇ × B = µ0 J or ∫ ∇ × BidA = µ0 I through surface surface Today we’ll work on “correcting” the curl equations to make them accurate even when charges move and currents are time-dependent. Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.2 143 Here’s what you’ll be working on: • using Stokes’ theorem and Ampère’s law to investigate the connection between changing electric fields and the existence of non-zero magnetic fields • developing the connection between changing magnetic fields and the existence of non-zero electric fields. • Maxwell’s equations! Exercise 11.1: Stokes’ theorem, applied to two different surfaces A current I flows through a wire, as shown in the following figure. The wire passes through two different surfaces: the first is a flat circle, while the second resembles an already-opened can of tuna. first surface “tuna can” is open here I this loop encloses the (entire) surface surface B this loop encloses the (entire) surface surface A The two loops which bound the surfaces are identical in size and shape. The part of the second surface labeled “surface A” is a cylinder, while “surface B” is a flat circle. (a) Evaluate ∫ ∇ × BidA for surface A only in terms of I. (Divide it into two pieces to obtain a surface A pair of surfaces with simply-connected boundaries if you like. Stokes’ theorem will be helpful to you here.) Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.3 144 (b) Prove ∫ ∇ × BidA first surface (c) Evaluate ∫ = ∫ ∇ × BidA . (Making use of Stokes’ theorem should help.) second surface Bidl in terms of I. Do the same for first loop ∫ Bidl . second loop discussion The integrals are independent of details of shapes of surfaces and bounding curves. Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.4 145 Exercise 11.2: Another version of the last two surfaces A current I flows through a wire, charging a parallel-plate capacitor, as shown in the following figure. The surfaces shown are similar to those in the previous exercise: the first surface is a circular disk, perpendicular to the wire, seen edge-on. The second looks like a soup can with the top removed: the open end is on the left. The closed end of the soup can passes between the capacitor plates. As before, line integrals around the boundaries of the two surfaces involve traveling along circles of identical radii, indicated in the figure. You should assume that the capacitor plates are very close together so that the magnetic fields on the circles labeled “boundaries of surfaces” are identical. boundaries of surfaces parallel-plate capacitor I I first surface (a) Evaluate ∫ second surface Bidl in terms of I. (Since the magnetic field is parallel to the boundary boundary of first surface used in your integral, you could have used ∫ Bidl = 2π r B if you had known the value of boundary of first surface the magnetic field.) Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.5 146 (b) It is certainly true that ∫ Bidl = boundary of first surface ∫ Bidl since the magnetic fields are the same along boundary of second surface the two boundaries. Must it still be true that ∫ ∇ × BidA first surface = ∫ ∇ × BidA ? (Remember Stokes’ second surface theorem!) discussion But I through the surface is zero! What’s going on? Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.6 147 Exercise 11.3: The electric field gets into the act There must be a magnetic field between the capacitor plates while it is charging up. There’s no way to have a non-zero line integral along the boundary of that soup can without having a nonzero curl to the magnetic field: that would violate Stokes’ theorem, which is true for any vector field. If the magnetic field were zero over a finite (non-pointlike) volume of space, its curl would also be zero. We are forced to conclude that something going on between the capacitor plates is creating a magnetic field so that Stokes’ theorem is satisfied. This is an especially nice example of how mathematics can lead us to a better understanding of physical reality: it is impossible for Stokes’ theorem to be wrong. As a result, it is impossible for the magnetic field between the capacitor plates to be zero, even though there is no current flowing across the gap between the plates. We know there’s no current between the plates of an already-charged capacitor: with no charges in motion there can be no relativistic effects giving rise to a magnetic field due to different Lorentz contraction effects on (stationary) positive and (stationary) negative charges. It must be something associated with the change in the electric field between the plates that is giving rise to a magnetic field. (a) Let’s assume that our capacitor plates each have area A and are uncharged at time t = 0. Assume the current flowing onto the left-hand plate is I, and is constant. Calculate the electric field between the plates as a function of time, E(t). (If you don’t remember how to do this from Physics 112, use Gauss’ law, and keep in mind that the electric field is zero outside the plates.) (b) Now calculate ∂E . ∂t Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.7 148 (c) We know that: ∫ ∫ Bidl = boundary of second surface ∇ × BidA = 2π r B = µ 0 I second surface and also that ∂E I = ∂t Aε 0 so that ∫ second surface ∂E I idA = ε0 ∂t and also that ε 0 = 8.854 × 10−12 , µ 0 = 4π × 10−7 ≈ 1.257 ×10−7 , ε 0 µ 0 = 1 c 2 . Fill in the missing terms in curly brackets: ∫ ∇ × BidA = second surface  ∂E  expression in terms of  idA ∫ ∂t  second  surface so that  ∂E  ∇ × B = expression in terms of . ∂t   Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.8 149 Discussion In the absence of currents, you’ve just proposed a connection between the curl of the magnetic field and the time derivative of the electric field: ∇ × B = µ 0ε 0 ∂E 1 ∂E . = ∂t c 2 ∂t In integral form this can be written ∫ ∇ × BidA = surface ∫ Bidl = boundary of surface  1 ∂E 1 ∂  idA = 2  ∫ E i dA 2 c ∂t c ∂t  surface  surface ∫ so that ∫ boundary of surface Bidl = 1 ∂Φ E c 2 ∂t where ΦE is the flux of electric field through the surface. Imagine that we had the following situation, in which a current passes through part of our surface, while a changing electric field crosses another part of the surface, as shown in the figure. I1 I2 surface parallel-plate capacitor In this case we’ll expect the curl to contain contributions from both the current and the changing electric field: 1 ∂E . ∇ × B = µ0 J + 2 c ∂t Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.9 150 Exercise 11.4: Now to investigate the curl of E ... This one’s easier to get. We’ll start with a Physics 112 approach, then turn the integral expression into a differential one. (a) A rectangular conducting loop slowly glides with speed v into a region in which there is magnetic field B , pointing in the positive z direction as shown in the figure. The mobile charge carriers inside the conductor each have positive charge q. magnetic field z B a y b x v loop no magnetic field here: B=0 Calculate the force (magnitude and direction) on a charge q (due to its motion through the magnetic field) in the side of the loop which has just entered the magnetic field.. Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.10 151 (b) Somewhat later, half the moving rectangular loop has entered the magnetic field as shown. z y v x Calculate the direction of the magnetic force on charges q in the sides of the loop which are parallel to the loop’s direction of motion. (Assume that the charges are almost at rest with respect to the wire of the loop.) (c) In the rest frame of the loop, each mobile charge will ascribe the force it experiences to the presence of an electric field. Calculate ∫ E idl in the loop’s rest frame when it has conducting loop partially entered the magnetic field. Note that the line integral involves traveling around the loop in the positive sense, as given by the right hand rule. (The path around the loop is counterclockwise, when seen from above.) Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.11 152 (d) Notice that the effects of the induced electric field disappear as soon as the entire loop is inside the field: when d Φ B dt = 0 , ∫ E ⋅ dl = 0 . (The magnetic flux is Φ B = ∫ B ⋅ dA : positive loop flux is in the positive z direction.) Prove that ∂Φ B = − ∫ E idl for this loop. (This is Faraday’s law, named for English chemist ∂t loop Michael Faraday [1791 – 1867]. It is true in general, not just for our rectangular loop moving with constant velocity.)  ∂Φ B ∂  =  ∫ B ⋅ dA = − ∫ E idl . This also works in the rest frame of the ∂t ∂t  loop  loop loop, where the magnetic field sweeps across the (stationary) loop. In fact, it also holds if the magnetic field is spatially constant across the loop, but varies with time. As a result, we can pull  ∂Φ B ∂  ∂B =  ∫ B ⋅ dA = ∫ ⋅ dA = − ∫ E idl . the time derivative inside the integral to write ∂t ∂t  loop loop  loop ∂t (e) You know that Use Stokes’ theorem to rewrite the rightmost integral as a surface integral of the curl of the electric field: − ∫ { E idl = something involving ∇ × E } loop Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.12 153 (f) What should go between the curly brackets? ∂B = something involving ∇ × E ∂t { Physics 212, Spring 2003. Unit 11 Maxwell’s equations } 11.13 154 discussion We can correct the last several rows in the table at the start of this unit. The following two tables hold equivalent statements: the first are differential, the second integral equations which describe exactly the same physics. These are known as Maxwell’s equations, named for James Clerk Maxwell (1831 – 1879), Scottish physicist and mathematician. They appeared in his 1873 work Treatise on Electricity and Magnetism. They represent one of the most profound intellectual achievements that the human species has yet produced. Fully relativistic, these equations (and extensions of them) have served as the basis for much of modern physics. Divergences of electric and magnetic fields ρ always true (Gauss’ law) ∇i E = ε0 always true (No magnetic charge) Curls of electric and magnetic fields always true (Faraday’s law) ∂B ∇× E = − ∂t always true (Ampère’s law) 1 ∂E ∇ × B = µ0 J + 2 c ∂t ∇i B = 0 ∫ Flux integrals of electric and magnetic fields Q always true (Gauss’ law) E idA = enclosed ε0 surface ∫ always true (No magnetic charge) B ⋅ dA = 0 surface Line integrals of electric and magnetic fields ∂Φ B always true (Faraday’s law) E ⋅ dl = − ∫ ∂t perimeter ∫ perimeter B ⋅ dl = µ0 I through + surface 1 ∂Φ E c 2 ∂t always true (Ampère’s law) The differential versions of Maxwell’s equations relate the behavior of the electric and magnetic fields (and their derivatives) at all points in space. It does not matter what is done to produce the fields: once they are known to exist they MUST obey the Maxwell equations. Often one works with electric and magnetic fields external to the region where there are currents or free charges. (This way ρ = 0 and J = 0 .) When this is the case, Maxwell’s equations simplify to Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.14 155 Maxwell’s equations in the absence of charges and currents ∇i E = 0 ∇i B = 0 ∇× E = − ∂B ∂t ∇× B = 1 ∂E c 2 ∂t Note the symmetry between the electric and magnetic fields in Maxwell’s equations. The presence of 1/c2 in only one of the curl equations is a consequence of our ill-advised choice of units for the magnetic field. If we had used, say, the “Nillog” (where 1 Nillog = c × 1 Tesla), the revised curl equations would each have a 1/c coefficient in front of the time derivative. The negative sign in the ∇ × E equation is important: it’s why Lenz’s law works. Without it, a change in magnetic flux through a wire loop would induce a current in the wire which would increase the change in magnetic flux, inducing an even stronger current. The current would rise exponentially, vaporizing the wire. The positive sign in the ∇ × B equation is needed so that the changing electric field between capacitor plates will create a magnetic field whose curl matches that of the field made just upstream/downstream of the capacitor (where there is a current flowing in the circuit). As long as we include time retardation, we can still calculate the vector potential and use it to extract the magnetic field:  r − r′  J  r ′, t − c  3 µ0  A( r , t) = ∫ d r ′ and B = ∇ × A . 4π r − r′ all space When there are free charges in motion we can calculate a scalar potential in a similar fashion: V (r, t) = ∫ all space 1 4πε 0   r − r′  c  3 d r′ . r − r′ ρ  r ′, t − Note that the vector potential also lets us calculate the electric field when ρ = 0 (so that ∇ ⋅ E = 0 ) in our part of the universe: Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.15 156 ∇× E = − ∂∇ × A ∂A ∂B = −∇ × , yielding so ∇ × E = − ∂t ∂t ∂t E=− ∂A . ∂t The more general case (ρ ≠ 0) requires the inclusion of the −∇V term. To summarize: Scalar and vector potentials in terms of charge and current distributions   r − r′  r − r′  ρ  r ′, t − J  r ′, t −  c  3 c  3  µ0  1 V (r, t) = ∫ d r′ A( r , t) = ∫ d r′ 4 πε − − π r r 4 r r ′ ′ 0 all space all space Electric and magnetic fields in terms of potentials ∂A B = ∇× A E = −∇V − ∂t Divergences of electric and magnetic fields ρ ∇i B = 0 ∇i E = ε0 Curls of electric and magnetic fields 1 ∂E ∂B ∇ × B = µ0 J + 2 ∇× E = − c ∂t ∂t Summary and wrap-up discussion Here’s what you were working on: • using Stokes’ theorem and Ampère’s law to investigate the connection between changing electric fields and the existence of non-zero magnetic fields • developing the connection between changing magnetic fields and the existence of non-zero electric fields. • Maxwell’s equations! Comments about homework assigned for next week Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.16 157 Reading and homework should be completed by 4pm Friday. Required reading: nothing this week. Optional reading: Purcell, chapter 9 (you’ll have to borrow it from the library). The Feynman Lectures on Physics (volume II), chapter 4, section 1. The Feynman Lectures on Physics (volume II), chapter 18. Recall that the Maxwell equations are Sources (charges and currents) are present ρ ε0 ∂B ∂t 1 ∂E ∇i B = 0 ∇ × B = µ0 J + 2 c ∂t Sources are absent (ρ = 0, J = 0) ∂B ∇i E = 0 ∇× E = − ∂t 1 ∂E ∇i B = 0 ∇× B = 2 c ∂t ∇i E = ∇× E = − They are always true for electric and magnetic fields. It is impossible to find a time-varying electric field which is not accompanied by a magnetic field with non-zero curl. Problem 0 (5 points): Make your own hardcopies… Print (from the web) your own copy of the solutions to this week’s in-class exercises as well as last week’s problem set and write a (true!) statement on your problem set asserting that you have successfully printed your personal copies. Problem 1 (15 points): How can we make these fields satisfy Maxwell’s equations? The electric and magnetic fields that exist in some universe which is empty of charges and currents are known to be given by E ( x, y, z , t ) = E0 f ( kz − ω t ) xˆ and B ( x, y, z, t ) = B0 f ( kz − ω t ) yˆ where E0, B0, k, and ω are constants, and f is a function. Since these fields do exist, they must satisfy all four of the Maxwell equations. Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.17 158 (a) Prove that these fields satisfy the divergence equations ∇i E = 0 and ∇i B = 0 . (b) Show that it is also possible for the fields to satisfy the curl equations ∇ × E = − ∂B and ∂t 1 ∂E if the ratios E0 B0 and ω k are chosen properly, and determine the required c 2 ∂t values of E0 B0 and ω k . ∇× B = (c) Imagine that the function f has a peak when its argument is zero (so that the peak is at z = 0 at time t = 0, but is located elsewhere when t ≠ 0 ). Calculate the speed with which this peak travels, making use of your answers to part b. In an entirely different universe, Mrs. Urkin realizes that she had only partially folded the egg whites into the chocolate mousse that she has just placed before her dinner guests. Fortunately, the circular traces of egg in the dark chocolate background remind everyone of a simple electric field with non-zero curl, namely E ( x, y, z ) = E0 [ yxˆ − xyˆ ] . Her guests notice that the local magnetic field is zero at the instant that dessert is placed on the table and that, to their amazement, they are actually immersed in the time-independent electric field E ( x, y, z ) = E0 [ yxˆ − xyˆ ] . (There are no charges or currents in the vicinity of the dining room table.) (d) Sketch the electric field and calculate its curl ∇ × E . (e) Solve for the magnetic field B ( x, y, z , t ) in Mrs. Urkin’s dining room. Keep in mind that the Maxwell equations are always satisfied by real electric and magnetic fields. Problem 2 (15 points): Deriving the wave equation The Maxwell equations couple the electric and magnetic fields: a changing electric field creates a magnetic field with non-zero curl; a changing magnetic field creates an electric field with nonzero curl. In this problem you’ll manipulate the Maxwell equations to produce a pair of decoupled wave equations for the electric and magnetic fields. Imagine that an electric field E ( x, y, z , t ) = Ex ( x, y, z , t ) xˆ + E y ( x, y, z , t ) yˆ + Ez ( x, y, z, t ) zˆ is known to exist in space. Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.18 159 ( ) (a) With the help of the Maxwell equations, prove that ∇ × ∇ × E = − ( ) ( ∂ ∇× B ∂t ) =− 1 ∂ E. 2 c 2 ∂t 2 (b) Show that the z component of ∇ × ∇ × E satisfies the following: ( )  ∇ × ∇ × E  = −∇ 2 ( Ez ) +  z ( ) ( ∂ ∇⋅ E ∂z ). (The equations for the x and y components of ∇ × ∇ × E are similar to this, allowing us to ( ) ( ) ( ) combine the three equations to write ∇ × ∇ × E = −∇ 2 E + ∇ ∇⋅ E .) (c) If the charge density ρ is zero we’ll have ∇ ⋅ E = ρ ε 0 = 0 so that − ( ) 2 1 ∂ 2 Ez   = −∇ 2 ( Ez ) , yielding the wave equation ∇ 2 ( Ez ) − 1 ∂ Ez = 0 = ∇ × ∇ × E  z c 2 ∂t 2 c 2 ∂t 2 for Ez. (If you like, you can combine this with the wave equations for the x and y components 1 ∂2 E 2 into the single equation ∇ E − 2 2 = 0 .) c ∂t ( ) Derive the relationship between k and ω necessary to make E ( x, y, z , t ) = E0 cos ( kz − ω t ) xˆ a solution to the wave equations. (It should look familiar to you if you’ve already done the first problem!) Note that you can derive the wave equation for the magnetic field in a similar fashion: • • • ( ) ⇒ ( ) ∇ × ( ∇ × B ) = −∇ ( B ) + ∇ ( ∇ ⋅ B ) = −∇ ( B ) ⇒ ∇× ∇× B = + 1 ∂ ∇× E 1 ∂2 B = − c2 ∂t c 2 ∂t 2 2 2 1 ∂2 B ∇ B − 2 2 = 0. c ∂t 2 ( ) (d) Recall where that 1/c2 factor in front of the time derivative came from: 1/c2 is really just the product of µ0 and ε0. The constant ε0 comes from Coulomb’s law: it sets the scale for the strength of the Coulomb force between a pair charges. The constant µ0 comes from the connection between the amount of current flowing in a wire and the magnitude of the resulting magnetic field a fixed distance from the wire. Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.19 160 Imagine a different universe in which the speed of light is ten times greater, but in which the Coulomb force is the same as it is for us. How would magnetic fields created by currentcarrying wires compare in the two universes? Physics 212, Spring 2003. Unit 11 Maxwell’s equations 11.20 161 Physics 212 Honors topics in electrodynamics Unit 12 Visualizing fields and potentials with Mathematica Spring, 2003 George Gollin University of Illinois at Urbana-Champaign 2003 Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.1 162 Unit 12: Visualizing fields and potentials with Mathematica Recap of last week Introduction: Visualization and Mathematica We’ve been working with fairly abstract quantities—electric and magnetic fields and their derivatives, as well as scalar and vector potentials (and their derivatives). Let’s look into using the software package Mathematica to generate plots of potentials and fields in order to better visualize them. This week will serve as an introduction to Mathematica. Next week we’ll use the program as a tool to investigate the nature of the fields associated with a moving charge. We’ll be working in Loomis 257 where a large number of personal computers are available to run Mathematica. It’s likely that the PC’s are running Windows NT, and are loaded with Mathematica version 4. If you’ve never used Mathematica, you’re likely to find that its interface is fairly intuitive, and that the software behaves quite sensibly once you’re used to it. It’s a good tool, and you may find it useful in a number of other courses. It is able to do symbolic manipulation of equations and all sorts of other good stuff. Most of your interactions with Mathematica are through a window displaying a notebook, which will contain your instructions to Mathematica and the results of its efforts. I’ve been noticing a certain tendency for Mathematica to spit out a dozen error message when it corrupts some of its internal settings files. According to Wolfram Research, sometimes this can be fixed by restarting the program while holding down the ctrl and shift keys simultaneously. (I find that this doesn’t work.) In addition, the program has a tendancy to insert non-displaying formatting characters into notebook files in a manner which corrupts the files occasionally. Try this: copy the suspect text from your notebook into a text-only editor (MicroSoft Notepad, for example), remove things which don’t belong, then paste it back into your notebook. Alternatively, retype the misbehaving lines, then delete the original text. This is an annoying problem, and should not be present in software judged ready for commercial release. Sometimes stopping the kernel (do this from the kernel menu), then restarting it will fix problems. Screen shots are taken from my PC’s, so what you see will be slightly different. Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.2 163 Discussion: Introduction to Mathematica Fire up Mathematica on your PC. The path probably is something similar to the following: Start menu → Programs → Mathematica → Mathematica 4 You’ll see something like this: The large window labeled “Untitled-1” is a Mathematica notebook in which you can enter calculations and instructions to the Mathematica program. The palette to the right is useful for entering various quantities (Greek letters, integrals, and so forth) into your notebook. Mathematica’s online help facility is pretty good—you can get to it using the “Help browser...” item in the “Help” menu, as shown below. The Mathematica program is split into two parts: a “front end,” which manages the user interface, and a “kernel,” which actually performs various computations for you. You do not interact directly with the kernel. Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.3 164 Many of your interactions with Mathematica require you to type instructions into the notebook, then request Mathematica to evaluate them. To cause the program to evaluate your inputs, often you’ll type keypad-enter or shift-enter. Have Mathematica calculate 2 + 2 for you... ...then hit keypad-enter (or shift-enter) to tell it to evaluate what it sees. Notice that Mathematica has added the In[x]: and Out[x]: labels in the example below. In addition, the strange brackets at the right mark “cells,” subunits in the Mathematica notebook. Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.4 165 You can define constants like this: They can then be used in other expressions: The palette is useful for building a variety of functions. For example, the tile labeled “π” represents 3.141592653... Mathematica contains loads of functions, all of whose arguments must be placed between square brackets ( [ and ] ). Parentheses will not work! For example (the “π” came from clicking the appropriate tile in the palette): Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.5 166 You can define your own functions this way: Note the use of the underscore at the end of the argument name ( x_ ), the use of the palette to enter x3 (use the tab key to move from one input box to the next), and the := assignment symbol in the definition. Here are some other useful Mathematica symbols. Since Mathematica also does symbolic manipulation of equations, it sometimes does surprising things: take note of the difference between 8/5 and 8./5. for example. A few obscure things: • % means “the result of the previous calculation” • == means “logical test” • >= prints as ≥ I always seem to get burned by this: ALL (nearly all??) Mathematica built-in functions begin with an uppercase letter: Sin[π] works, but sin[π] doesn’t. The program is case-sensitive. It is easy to plot stuff in Mathematica but keep in mind that sometimes you’ll need to tell it to have the kernel load some “add-on” definitions. Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.6 167 No add-on stuff was needed here. Pay attention to the format: x2 Exp[-x] is the function to be plotted, while {x, 0, 10} is an argument list (lists are always enclosed in {} curly brackets). The arguments, as you can surely tell, inform Mathematica of the variable to plotted along the abscissa, as well as the minimum and maximum values to be plotted. Mathematica is good at symbolic manipulations. For example, the palette symbol represents a partial derivative. The two boxes are for you to fill in: the hollow one is requested first, the solid one is requested second. Use the tab key to move from one box to the next. ( ∂ x f ( x ) means ∂f ( x ) ∂x ; it’s a common, and convenient, notation used frequently in physics.) Here’s how this looks in your notebook (but note one way in which you must actually make use of a function defined this way): This is very convenient! (Note the upper case first letters in Tan[x] and Sec[x].) Because a reference to f[x] asks Mathematica to take a derivative of Sin[x], it is necessary to let the program do the differentiation before replacing x with a numerical value: asking it to evaluate d sin(π ) d sin( x) f[Pi] is the same as asking it to calculate instead of . (Thus the dπ dx x =π f[Pi]/.x->Pi format. There are other ways to do this also...) Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.7 168 Mathematica also does integrals, using the indefinite integral palette tile . Note the two ways I’ve used to enter x2: the first is with a “^” to indicate an exponent, while the second is done using the palette. Not surprisingly, Mathematica notebooks can be stored so that you can resume/correct/extend your work in a convenient fashion. To find more information about Mathematica functions that you might find useful, use the Help menus: There are lots of cool things available. A far-from-complete list: Plot[] Plot3D[] PlotVectorField[] PlotVectorField3D[] ContourPlot[] ContourPlot3D[] PlotGradientField[] PlotGradientField3D[] SphericalPlot3D[] Here are some common mistakes/problems: • • • Leaving out the underscore at the end of an argument in a function definition: Wombat[x]:=x^1.5; instead of Wombat[x_]:=x^1.5; Forgetting that built-in functions begin with upper case letters: sin[x] instead of Sin[x] Forgetting that arguments are surrounded by square brackets instead of parentheses: Sin(x) instead of Sin[x] Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.8 169 • • Forgetting to load a required software package, such as Graphics`PlotField` or leaving off the back-apostrophe(s) in the package name. Having Mathematica insert non-printing characters in places where it shouldn't. Retype, then delete the offendin text. We’ll work through some of the notebooks I’ve already set up so you can see some examples of Mathematica at work. Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.9 170 Exercise 12.1: Plotting potentials with Mathematica Let’s start with potentials for linear charges which are oriented parallel to the z axis. (This way we’ll be dealing with a system which can be drawn nicely in two dimensions since everything we’ll calculate is independent of z.) To avoid problems with singularities, we’ll make each of our charges a hollow cylinder. Recall that the potential varies logarithmically with distance from an infinite line charge or cylinder. Make a local copy of the file p212_unit12_mathematica_1.nb for yourself, then open it in Mathematica. Have Mathematica evaluate the entire notebook, then read through it to make sense of what’s there. If there are headphones available, plug them into the PC to hear the sounds generated by the cells at the end of the notebook. When you think you understand what’s going on, change the charge and/or position of one of the cylinders and reevaluate the cell which generates a contour plot. Here is a partial list of what’s in (an earlier version??) of the notebook. (* Comments are placed between (* and *). They can span multiple lines and be nested. Notice the long brace near the right edge of this window. A brace indicates the extent of a "cell," a subunit of a notebook. *) (*Generally good programming practice is to use lots of comments and to define functions only once, then refer to them in your work *) (* To cause part, or all, of the notebook to be evaluated go to the Kernel -> Evaluation -> Evaluate Cells (or Evaluate notebook) menu path. Another way is to select a cell (or cells), then hit keypad - enter. *) (* RealTime3D` will let us use the mouse to rotate some of our plots. An important thing to note is the back - apostrophe as the last character in"RealTime3D`" I'll also load other packages here. *) Needs["RealTime3D`"] Needs["Graphics`PlotField`"] Needs["Calculus`VectorAnalysis`"] Off[General::spell1] (* Define some constants here. The semicolons suppress printing of the values. Note that quantities separated by blank spaces are multiplied together : 4 and Pi for example (see below). *) Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.10 171 EpsilonNought = 8.854 10^-12; OneOver4PiEpsilonNought = 1. / (4 Pi EpsilonNought); (* Define the square of distance between {x1, y1, z1} and {x2, y2, z2}. Note the use of := and also the syntax concerning underscores at the ends of argument names. *) DistanceSquared[x1_, y1_, z1_, x2_, y2_, z2_] := (x1 - x2)^2 + (y1 - y2)^2 + (z1 - z2)^2; (* Now define the distance between {x1, y1, z1} and {x2, y2, z2} *) Distance[x1_, y1_, z1_, x2_, y2_, z2_] := Sqrt[DistanceSquared[x1, y1, z1, x2, y2, z2]]; (* Now define the Green's function for the potential at (x, y, z) associated with a hollow line charge of radius rmin at (xprime, yprime, zprime). Recall that the Green's function is the same as the potential caused at position r by a unit whatever - it - is placed at rprime. *) GreensFunctionLinePotential[x_, y_, z_, xprime_, yprime_, zprime_, rmin_] := 2 * OneOver4PiEpsilonNought * Log[Max[rmin, Distance[x, y, z, xprime, yprime, zprime]]]; (* specify the coordinates, radius, and charge per meter for the first cylinder *) x1 = -1.; y1 = 0.; rmin1 = 0.05; lambda1 = -1./ OneOver4PiEpsilonNought; (* Now do the same for the second cylinder *) x2 = 1.; y2 = 0.; rmin2 = 0.05; lambda2 = +1./ OneOver4PiEpsilonNought; (* Here's the contribution to the potential at {x, y} due to the first cylinder ... *) v1[x_, y_] := lambda1 * GreensFunctionLinePotential[x, y, 0., x1, y1, 0., rmin1] ; (* ... and due to the second cylinder *) v2[x_, y_] := lambda2 * GreensFunctionLinePotential[x, y, 0., x2, y2, 0., rmin2] ; (* The potential due to both of them is just the sum (superpositon) of the individual contributions *) v[x_, y_] := v1[x, y] + v2[x, y]; Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.11 172 (* Let's put the requests for plots into separate cells. Since you can ask the kernel to reevaluate a single cell instead of the entire notebook this'll save some execution time when we want to redo only one of the plots. *) (* Make a contour plot. None of the options (PlotPoints, and so on) are required. *) ContourPlot[v[x, y], {x, -5, 5}, {y, -5, 5}, PlotPoints -> 50, ColorFunction -> Hue, Contours -> 20, PlotLabel -> "Contour Plot"]; Contour Plot 4 2 0 -2 -4 -4 -2 0 2 4 (* Now make a 3 - d plot. You can click on the plot, then resize it, and rotate it with the mouse to inspect it from different angles. Note that loading the RealTime3D package will suppress the axis labels. *) Plot3D[v[x, y], {x, -2, 2}, {y, -2, 2}, ColorFunction -> Hue, PlotPoints -> 50, PlotLabel -> "3D plot", AxesLabel -> {"x", "y", "potential"}]; Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.12 173 (* Now make a field vectors plot. Note that we need to have already loaded the package Graphics`PlotField`. Also, notice that we're not getting field lines--only a bunch of vectors whose lengths and directions tell us about the electric field strength and direction. They are placed on a grid rather than being drawn so that the density of them would indicate the field strength. *) PlotGradientField[v[x, y], {x, -2., 2.}, {y, -2., 2.}, Frame -> True, PlotLabel -> "Electric field vectors", AxesLabel -> {"x", "y"}]; Electric field vectors 2 1 0 -1 -2 -2 -1 0 1 2 (* Now redraw that field vectors plot, but also draw in the positions of the charged cylinders. The use of % indicates "the previous thing." *) Show[{%, Graphics[Disk[{x1, y1}, rmin1]], Graphics[Disk[{x2, y2}, rmin2]]}]; Electric field vectors 2 1 0 -1 -2 -2 -1 0 1 2 Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.13 174 (*** * Another way to do this would have been as follows-MyPlot = PlotGradientField[v[x, y], {x, -2., 2.}, {y, -2., 2.}, Frame -> True]; Show[{MyPlot, Graphics[Disk[{x1, y1}, rmin1]], Graphics[Disk[{x2, y2}, rmin2]]}]; so that we wouldn't need to put this request to redisplay the plot immediately after the previous plot was generated. ****) (* Let's read a sound from a file. *) Needs["MiscellaneousÀudio`"] (* Construct the full file pathname for where the current notebook file lives. There has to be an easier way to do this but I haven't yet discovered it! *) thingone thingone thingone thingone thingone = = = = = Options[$FrontEnd, NotebookDirectory]; ToString[thingone[[1]][[2]][[1]]]; StringDrop[thingone, 3]; StringDrop[thingone, -1]; thingone <> ", "; NotebookDirectoryPath = StringReplace[thingone, {", " -> "\\"}]; (* Now concatenate the directory path onto the sound file name. *) SoundFile = NotebookDirectoryPath <> "sound1.wav" noise = ReadSoundFile[ SoundFile, PrintHeader -> True]; ListPlay[noise, PlayRange -> {-2^15, 2^15}, SampleRate -> 11025]; (* Now unload the audio package since it may conflict with the music package if that is requested later. *) Remove[Audio] Out[37]= "H:\\Physics_education\\PHYCS_212\\p212_unit12\\sound1.wav" From In[37]:= "Format: "\[InvisibleSpace]"Microsoft PCM WAVE RIFF" From In[37]:= "Duration: "\[InvisibleSpace]2.61279\[InvisibleSpace]" seconds" From In[37]:= "Channels: "\[InvisibleSpace]1 From In[37]:= "Sampling rate: "\[InvisibleSpace]11025 From In[37]:= "Bits per sample: "\[InvisibleSpace]8 From In[37]:= "Data size: "\[InvisibleSpace]28806\[InvisibleSpace]" bytes" From In[37]:= "Number of samples: "\[InvisibleSpace]28806 Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.14 175 Needs["Miscellaneous`Music`"] (* Names and frequencies (in Hz) for notes are available : for example, A4 is 440, C4 is 523.251, Bflat4 is 466.164. Lowest note is A0 (27.5 Hz), highest is Gsharp7 (6644.88 Hz). *) (* Define a wacko time - varying frequency (frequency in radians/sec) *) WeirdFrequency1[t_] := 2 Pi A3 Sin[5. t]; WeirdFrequency2[t_] := 2 Pi Aflat3 Sin[5. t]; (* Define left and right channel signal strength vs time *) LeftChannel[t_] := Sin[2 t]*Sin[WeirdFrequency1[t]]; RightChannel[t_] := Cos[2 t]*Sin[WeirdFrequency2[t]]; (* Now play it! Also, assign it to a sound object so we can replay it quickly.*) WeirdSound = Play[{LeftChannel[t], RightChannel[t]}, {t, 0, 5}, PlayRange -> {0, 4.}]; (* Now let's unload the package for music routines *) Remove[Music] discussion: grids of field vectors vs. display of field lines Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.15 176 Local vs. global concerns in graphical representations... If it’s not 1/r2, field lines don’t make sense (they fade out, disappear). Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.16 177 Exercise 12.2: Potentials in 3D It’s harder to represent fields and potentials in three dimensions: there are problems with surfaces obscuring other surfaces, making it hard for you to see what you’re working with, and calculations running slowly. Make a local copy of the file p212_unit12_mathematica_2.nb for yourself, then open it in Mathematica. Take a look at what it does: there are a few 3D plot-generating cells, as well as a couple of animations. Play with code in the notebook to familiarize yourself with how it works. Here’s a listing of (a preliminary version of) the notebook: (* Load required packages here. *) Needs["RealTime3D`"] Needs["Graphics`ContourPlot3D`"] Needs["Graphics`PlotField3D`"] Needs["Calculus`VectorAnalysis`"] (* Define some constants. *) EpsilonNought = 8.854 10^-12; OneOver4PiEpsilonNought = 1. / (4 Pi EpsilonNought); (* Define the square of distance between {x1, y1, z1} and {x2, y2, z2}. Note the use of := and also the syntax concerning underscores at the ends of argument names *) DistanceSquared[x1_, y1_, z1_, x2_, y2_, z2_] := (x1 - x2)^2 + (y1 - y2)^2 + (z1 - z2)^2; (* Now define the distance between {x1, y1, z1} and {x2, y2, z2} *) Distance[x1_, y1_, z1_, x2_, y2_, z2_] := Sqrt[DistanceSquared[x1, y1, z1, x2, y2, z2]]; (* Now define the Green's function for the potential at (x, y, z) associated with a hollow spherical charge of radius rmin at (xprime, yprime, zprime). Recall that the Green's function is the same as the potential caused at position r by a unit whatever - it - is placed at rprime. *) GreensFunctionPointPotential[x_, y_, z_, xprime_, yprime_, zprime_, rmin_] := OneOver4PiEpsilonNought / Max[rmin, Distance[x, y, z, xprime, yprime, zprime]]; Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.17 178 (* specify the coordinates, radius, and charge for the first charge *) x1 = -2.; y1 = 0.; z1 = 0.; rmin1 = 0.05; charge1 = -1. / OneOver4PiEpsilonNought; (* Now do the same for the second charge *) x2 = 2.; y2 = 0.; z2 = 0.; rmin2 = 0.05; charge2 = +1. / OneOver4PiEpsilonNought; (* Here's the contribution to the potential at {x, y} due to the first charge ... *) v1[x_, y_, z_] := charge1 * GreensFunctionPointPotential[x, y, z, x1, y1, z1, rmin1] ; (* ... and due to the second charge *) v2[x_, y_, z_] := charge2 * GreensFunctionPointPotential[x, y, z, x2, y2, z2, rmin2] ; (* The potential due to both of them is just the sum (superpositon) of the individual contributions *) v[x_, y_, z_] := v1[x, y, z] + v2[x, y, z]; (* Let's put the requests for plots into separate cells. Since you can ask the kernel to reevaluate a single cell instead of the entire notebook this'll save some execution time when we want to redo only one of the plots. *) (* Make a contour plot. None of the options (PlotPoints, and so on) are required. After the plot is finished, click on it, resize it, and rotate it around using the mouse. The list of values for the "Contours" option says for which values of the potential we'll draw our contours. *) ContourPlot3D[v[x, y, z], {x, -5, 5}, {y, -5, 5}, {z, -5, 5}, PlotPoints -> {5, 5}, Contours -> {-0.1, -0.5, 0., 0.5, 0.1}, Axes -> True ]; Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.18 179 (* Now make a field vectors plot. Note the z range I have chosen. You can resize the plot, also rotate it around.*) PlotGradientField3D[v[x, y, z], {x, -5., 5.}, {y, -5., 5.}, {z, 4., 10.}, VectorHeads -> True]; (* Now do an animation. *) Needs["GraphicsÀnimation`"]; Plot3D[Sin[x*y], {x, 0, 3}, {y, 0, 3}, Axes -> None, Mesh -> False, ColorFunction -> GrayLevel]; Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.19 180 SpinShow[%, RotateLights -> True]; "Double-click on a plot to see something spin!" (Lots of these get drawn) (* Now do an animation. *) dn = 0.1; nstart = 1; nstop = 3; Do[ Plot3D[Sin[n*x]*Sin[n*y], {x, 0, 2 Pi}, {y, 0, 2 Pi}], {n, nstart, nstop, dn}]; Do[ Plot3D[Sin[n*x]*Sin[n*y], {x, 0, 2 Pi}, {y, 0, 2 Pi}], {n, nstop - dn, nstart + dn, -dn}]; "Double-click on a plot to see something move!" Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.20 181 Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.21 182 Exercise 12.3: Going solo with Mathematica Go play! I suggest you try doing something of this sort: • • animate a scalar potential contour plot, also a field vectors plot for a pair of charges whose separation oscillates cook up the strangest sound you possibly can using Mathematica’s sound-generating tools Summary and wrap-up discussion Next week we’ll work on using Mathematica to plot vector potentials, and to investigate the associated magnetic field. Comments about homework assigned for next week Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.22 183 Reading and homework should be completed by 4pm Friday. Optional reading: The Mathematica Book, S. Wolfram, or any other Mathematica reference you might happen to find useful. Problem 0 (5 points): Make your own hardcopies… Print (from the web) your own copy of the solutions to last week’s problem set and write a (true!) statement on your problem set asserting that you have successfully printed your personal copies. Problem 1 (5 points): Some symbolic manipulations Have Mathematica evaluate the following integrals and derivatives for you. Please submit a hardcopy of an evaluated Mathematica notebook to show us your results for this problem. (I can give you a hand if you have problems generating the hardcopy or moving your files off the PC in Loomis 257.) To display useful palettes, follow this sort of menu path: File → Palettes → Complete Characters → Letter-like Forms → Technical Symbols (you’ll find ∞ there) (a) ∫ [ tan( x)] 2 dx (remember that Tan[x]and not tan(x) is what Mathematica uses to represent a tangent.) +∞ (b) −x 2 ∫ e dx 2 −∞ (c) (d) d ( ln x ) dx 4 ∂ 2 ( x3 y 4 z 2 ) ∂x∂y Problem 2 (10 points): An animation For the x range 0 ≤ x ≤ 2π please use the Plot[…] function inside a Do loop to plot sin(kx) using k as the animation variable, with k stepping over values from 0 to 10 in increments of 0.1 unit. When you’ve done it correctly, Mathematica will generate a large number of plots for you; double-clicking on one of them will cause the program to play all the plots as an animation. In order to keep Mathematica from adjusting the minimum, maximum values of the y axis you can Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.23 184 include the expression PlotRange->{{0, 2*Pi}, {-1, 1}} as a last argument to the Plot[…] function. Problem 3 (5 points): Plotting an implicit function One can define a function (or relation) implicitly by specifying a rule that is satisfied by the all points (x, f(x)). For example, the rule x2 + y2 = 4 is an implicit definition of a circle of radius 2 centered at the origin. Mathematica is able to plot functions defined this way, although you’ll need to load one of its “packages” using the statement Needs["GraphicsÌmplicitPlot`"] at the start of your notebook. (Be careful about the exact form for this: the single quotes are the back-apostrophes usually found near the upper-left corner of a standard keyboard.) Look up the ImplicitPlot[…] function in Mathematica’s online help facility and use it to generate plots of: (a) the Folium of Descartes, defined implicitly through the equation x3 + y 3 = 3xy 2 (a) a cardioid, defined implicitly through the equation  x 2 + y 2 − x  = x 2 + y 2 Your plots should look something like this: Folium of Descartes Cardiod 2 1 1 0.5 -3 -2 -1 1 2 3 -1 -2 0.5 1 1.5 2 -0.5 -3 -1 -4 Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.24 185 Problem 4 (10 points): Solving an algebraic equation Mathematica can solve a variety of algebraic equations, both through symbolic manipulation, and through numerical means. See the program’s online help documentation concerning the Solve[…] function. Here are a couple of examples: Note the required pair of equal signs “==” used to define the equation. (a) Use the Solve[…] function to generate the roots of the most general form of a cubic equation in one variable, ax 3 + bx 2 + cx + d = 0 . (b) Some equations do not lend themselves to solution through symbolic techniques built into Mathematica. If you encounter one of these, try using the FindRoot[…] function instead to search for roots to the equation. (See the online help information for the format of FindRoot’s arguments.) Use FindRoot to determine numerical values of the solutions to sin θ = θ 2 . plots of sin Htheta L and theta ê2 2 1 -4 -2 2 4 -1 -2 Physics 212, Spring 2003. Unit 12 Visualizing fields and potentials with Mathematica 12.25 186 Physics 212 Honors topics in electrodynamics Unit 13 Studying electromagnetic radiation with Mathematica Spring, 2003 George Gollin University of Illinois at Urbana-Champaign 2003 Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.1 187 Unit 13: Studying electromagnetic radiation with Mathematica Recap of last week Introduction: Those Liénard-Wiechert potentials for a point charge When a point charge is in motion, the scalar potential it creates is modified in two fundamental ways: V (r ) = 1 Q 4πε 0 r − rQ ⇒ V (r,t) =   Q 1   4πε 0 r − rQ ( t ′ )  (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c )  1 1. The potential depends, in part, on the charge’s retarded position-- its location at the retarded time t ′ , defined through the relation t ′ = t − r − rQ ( t ′ ) c . 2. The 1/r form of the potential is modified by a velocity-dependent factor 1 (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c ) where εˆ ( t ′ ) is a unit vector pointing from the charge’s retarded position to the observer’s position as shown in the following figure. line-of-sight to observer observer εˆ observer sees Q here (at its retarded position) Q v Q is really here now The charge’s vector potential is modified in similar fashion so that A( r , t ) = ( v ( t ′) 1 v ( t ′) 1 Q × = 2 V ( r ,t) . 2 c 4πε 0 r − rQ ( t ′ ) (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c ) c ) (Recall that µ0 = 1 ε 0c 2 .) This complicates things somewhat—the electric field is no longer guaranteed to be conservative, so it is no longer the case that we can always define it as the gradient of a potential. The scalar and vector potentials are still useful, however, since E = −∇V − ∂A ∂t and B = ∇ × A , so we can use them to calculate the fields if we’re willing to do all the differentiation. Here’s what you’ll work on, with the help of Mathematica: Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.2 188 • • generating contours of constant retarded time for observers watching a charged particle moving with constant velocity, and a separate particle which stops suddenly at the origin; graphing the behavior of the 1 (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c ) factor as a function of observer position for • the constant-velocity and stop-at-the-origin cases just mentioned studying the scalar potential including retardation, but neglecting the 1 (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c ) • • correction factor (there’ll be some surprises here) graphing the vector potential plotting the relative contributions to E of −∇V and − ∂A ∂t (more surprises!) Exercise 13.1: Retarded time for a moving charge Since Alpha Centauri is 4.3 light years away, something we see happening there at the present time actually took place several years ago. The retarded time we would use in calculating the potentials at Earth induced by electric charges on Alpha Centauri is set back from the present time by 4.3 years. It works the same way for something in motion: imagine that a moving object emitted a flash of light nine nanoseconds ago. Since c ≈ 0.3 m/nsec, observers who are three meters from the charge’s position at the time it emitted the flash will see the flash now. (Since emitting the flash the charge may have moved elsewhere.) Here’s a graphical example, involving an observer watching a charge moving at half the speed of light up the y axis. The true position of the charge is 0.5ct yˆ (it passes through the origin at t = 0) while the observer is stationary at x = 1.5 meters. At t = 5 nsec in the observer’s frame, the charge is really at y = 0.75 meters but the observer is seeing light that left it earlier, when it was at y = 0. y charge’s true position: x=0.75 m when t=5 nsec charge’s retarded position: x=0 x present time is t=5 nsec observer at x=1.5 m In this particular case, the retarded time corresponding to this observer’s t = 5 nsec is t ′ = t − robs − rQ ( t ′ ) c = 0 . It is a simple algebra problem to solve for the retarded time as a function of the observer’s time when the charge moves with constant velocity: to find the form of the function t ′ ( t ) . When the charged particle’s velocity isn’t constant, though, it can be nearly impossible to solve for t ′ ( t ) . Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.3 189 Consider one particle moving with constant velocity 0.5ct yˆ , and another moving with velocity 0.5ct yˆ for t < 0, which stops at the origin at t = 0. Graphs of the particles’ y positions as functions of time are shown below. (Remember that c ≈ 0.3 m/nsec.) y (0.3 m per division) y rvst1y rvst2y t (nsec) t charge is moving with v = 0.5c charge is moving with v = 0.5c until t = 0 (a) Imagine that the present time is now t = 10 nsec and that observers are watching the charge which moves with constant velocity. Sketch possible locations of observers who are seeing (at t = 10 nsec) light which had illuminated the moving particle at the earlier times t = 0 nsec and t = 5 nsec. (These observers would use retarded times t ′ = 0 nsec and t ′ = 5 nsec in their calculations of the potentials associated with the charge.) y charge here now (t = 10 nsec) charge was here at t = 0 nsec x charge was here at t = -10 nsec charge is moving with v = 0.5c grid cell (∆x, ∆y) is (0.3m, 0.3m) present time is t = 10 nsec (b) Imagine that the present time is now t = 5 nsec (not 10 nsec). Observers are watching the charge which moved with constant velocity for t < 0 nsec, but then stopped at the origin. Sketch possible locations of observers who are seeing (at t = 5 nsec) light which had illuminated the Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.4 190 moving particle at the earlier times t = 4, 2, 0, -5, -10 nsec. (These observers would use retarded times t ′ = 4, 2, 0, − 5, − 10 nsec in their calculations of the potentials associated with the charge.) y charge here since t = 0 nsec x charge was here at t = -10 nsec charge was moving with v = 0.5c but stopped at (0,0) 5 nsec ago present time is t = 5 nsec (c) Let’s do this with Mathematica now. I’ve set up a file of useful functions for you in a utility file p212_mathematica.m in the Physics 212 area. (They are automatically loaded when you evaluate the first cell in the notebook file p212_unit13_mathematica_1.nb, which you should use as a template for your work.) Here is a list of the functions defined for you: Distance[{x1,y1}, {x2,y2}] distance between points 1 and 2 Norm[{x,y}] length of the vector extending from {0,0} to {x,y} DotProd[{x1,y1}, {x2,y2}] dot product between two (2D) vectors CrossProd[{x1,y1}, {x2,y2}] cross product between two (2D) vectors UnitVector[{x,y}] unit vector parallel to the line from {0,0} to {x,y} rvst1[t] {x,y} at time t for a particle moving with constant velocity up the y axis so that x=0, y=0.5ct Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.5 191 rvst2[t] like rvst1[t], but the particle stops in 0.1 nsec at the origin at t=0. rvst3[t] like rvst2[t], but the particle takes 10 nsec to stop at the origin. NOTE: you must select one of these, or define your own function so that rvst[t] is defined. For example, to select rvst1 do this: rvst[time_] := rvst1[time] being careful to include the underscore at the end of time_ to the left of the “:=” symbol, but NOT to the right of the “:=” vvst[t] time derivative of rvst[t]. (assumes user has defined rvst) avst[t] time derivative of vvst[t]. (assumes user has defined rvst) TRetard[{x,y}, t] returns retarded time at charge's position for observer at {x,y}, t (uses rvst function to calculate this) VDotETerm[{x,y}, t] value of 1/(1 - v dot E) evaluated using the retarded velocity and line-of-sight vector e (not the electric field!) ScalarPotential[{x,y}, t] scalar potential when charge is at rvst[t] and observer is at {x,y} at time t. Calculation includes retardation effects. ScalarPotentialNoVDotE[{x,y}, t] GradV[{x,y}, t] like ScalarPotential, but omitting the velocity-dependent correction factor. gradient (in 2 dimensions) of scalar potential when charge is at rvst[t] and observer is at {x,y} at time t. Includes retardation effects. VectorPotential[{x,y}, t] vector potential when charge is at rvst[t] and observer is at {x,y} at time t. Calculation includes retardation effects. dAdt[{x,y}, t] ECoulomb[{x,y}, t] time derivative of vector potential when charge is at rvst[t] etc. Coulomb (1/r^2) portion of the electric field ERadiation[{x,y}, t] Radiation (1/r) portion of the electric field ETotal{x,y}, t] Total electric field (Coulomb + radiation) NonRadialAngle[{xobs,yobs}, {fx, fy}, t] returns the angle (in degrees) between the vector {fx, fy} and the line from the instantaneous (not the retarded) position of the charge to the observer’s location. Here's an example of how to use the predefined functions: (* As an example, assign rvst to be the same as rvst2, then plot its y component. *) (* define the (new) function rvst here *) rvst[timearg_] := N[rvst2[timearg]] (* print one value of it to remind everyone that rvst returns a vector, not a number *) Print["rvst at -20 nsec is ", rvst[-20*10^-9]]; (* Now plot its y component (thus the [[2]] index) from - 20 nsec to + 20 nsec *) Plot[rvst[t][[2]], {t, -20*10^-9, 20*10^-9}]; Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.6 192 rvst at -20 nsec is {0., -2.99792} -2 ×10 - 8 -1 × 10 - 8 1×10 - 8 2× 10 - 8 -0.5 -1 -1.5 -2 -2.5 -3 Graphs of the y components of rvst1 and rvst2 appeared in the writeup a few pages ago. Do the following: 1. Assign the constant velocity function rvst1 to rvst. 2. Assume the present time is 30 nsec after the charge has reached (or passed through) the origin in our frame of reference. 3. Make a contour plot of the retarded time in the region −20 ≤ x ≤ +20, − 20 ≤ y ≤ +20 . (The contour closest to the charge’s t = 30 nsec position corresponds to observers who see the charge as it is at t = 30 nsec. Contours further away correspond to observers who see the charge as it was at progressively earlier times.) 4. Assign the stop-at-the-origin velocity function rvst2 to rvst. 5. Make a contour plot with the same x,y limits. 6. Comment sensibly on what you see, in particular about the differences in the two plots. Your plots should look something like these: Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.7 193 d time vs. position , constant velocity 0.5 c, time now 3 20 10 0 10 20 -20 Retarded -10 0 10 20 time vs. position , stop -at -origin , time now 30 nsec 20 10 0 -10 -20 -20 -10 0 10 20 The concentric region in the second plot extending 9 meters ( = c × 30 nsec ) out from the origin expands as time passes: observers at greater distance become aware at later times that the charge is stopped. Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.8 194 Exercise 13.2: That v ( t ′ ) ⋅ εˆ ( t ′ ) correction factor Let’s investigate the effects of retardation and that 1 (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c ) factor on the scalar potential now. We’ll work with two different cases, both involving a point charge. The first charge is moving up the y axis at half the speed of light so that its position is rQ ( t ) = 0.5ct yˆ . The second is also moving up the y axis, but decelerates rapidly with constant acceleration after it passes through the origin, coming to rest in 0.1 nanosecond at rQ = 0.0075 ŷ . (These are the two cases rvst1, rvst2 used in the previous exercise.) While moving, our charge has v = 0.5c so the correction factor is 2 in forward direction, 0.5 in the backwards direction, and 1 at 90 degrees. Use Plot3D to make graphs of the 1 (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c ) factor using the same time (30 nsec) and x,y ranges for the plots. (The function is defined in the utility file as VDotETerm[{x,y}, t].) Do it for both velocity cases we’ve been using. Your plots should look something like the two which follow. Bear in mind that the boundary of the flat region within 9 meters ( = c × 30 nsec ) of the origin in the second plot expands outwards at c as more distant observer become aware that the charge has stopped. This means that the 1 (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c ) correction factor included in the scalar potential will change suddenly for observers as they realize that the charge is no longer moving. V dot E correction term vs. position , uniform velocity , t=30 nsec 2 20 1.5 10 1 -20 0 -10 -10 0 10 20 -20 Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.9 195 V dot E correction term vs. position , stop -at -origin , t=30 nsec 2 20 1.5 10 1 -20 0 -10 -10 0 10 20 -20 Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.10 196 Exercise 13.3: Scalar potential with retarded time, but no v dot e correction When retardation is included, but the velocity-dependent correction factor is omitted, the scalar caused by a moving charge at an observer's position is the same as that caused by a stationary charge at the moving charge's retarded position. The equipotential contours for a charge moving with constant velocity are circles centered on the retarded positions of the charge. Use ContourPlot to make graphs of the scalar potential without the 1 (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c ) factor using the same time (30 nsec) and x,y ranges for the plots as before. (The function is defined in the utility file as ScalarPotentialNoVDotE[{x,y}, t].) Do it for both velocity cases we’ve been using. Your plots should look something like the two which follow. Note the bunching of field lines towards the tops of the plots: this means the potential would change more rapidly in front of the moving charges if this were the correct expression for the scalar potential. (It's not, since we've left off the necessary 1 (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c ) factor.) As before, take note in the second plot of the change that occurs at the boundary 9 meters from the origin. calar potential NOT including V dot E correction term , constant v, t=30 nsec 20 10 0 -10 -20 -20 -10 0 10 Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 20 13.11 197 calar potential NOT including V dot E correction term , stop − at −0, t= 30 nsec 20 10 0 -10 -20 -20 -10 0 10 20 Exercise 13.4: Scalar (and vector) potential with retarded time and v dot e correction Now use ContourPlot to make graphs of the scalar potential including the 1 (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c ) factor using the same time (30 nsec) and x,y ranges for the plots as before. (The function is defined in the utility file as ScalarPotential[{x,y}, t].) Do it for both velocity cases we’ve been using. Your plots should look something like the two which follow. Note the surprising fact that the equipotential contours in the first plot are concentric, and are centered on the instantaneous position of the charge! Distant observers cannot know that the charge is really 4.5 meters up the y axis, since it might have stopped recently. This should be clear from the second plot-- observers far from the (now stopped) charge still see the same equipotentials as are seen by distant observers in the constant velocity plot shown first. The effects at the 9 meter boundary are especially dramatic in the second plot in the forward direction where the velocity correction factor changes from 2 to 1 suddenly. The rapid change in the potential at this point causes an impressive bunching of the equipotential contours. We haven't done anything with the vector potential yet. The constant velocity case is easy, since v ( t ′) A ( r , t ) = 2 V ( r , t ) . Except for the direction (along y) associated with the vector potential, it c behaves just like the scalar potential. The second case is also a snap-- the vector potential is zero close to the (stopped) charge where all observers realize that it is stationary. Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.12 198 Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.13 199 potential including V dot E correction term , uniform velocity , t= 30 20 10 0 10 20 -20 Scalar potential -10 including 0 V dot E correction 10 20 term , stop − at −0, t= 30 nsec 20 10 0 -10 -20 -20 -10 0 10 Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 20 13.14 200 Exercise 13.5: Vector potential for the stop-at-0 charge Use PlotVectorField to make a graph of the vector potential using the same time (30 nsec) and x,y ranges for the plots as before. (The function is defined in the utility file as VectorPotential[{x,y}, t].) Do it only for the stop-at-zero velocity case. Your plot should look something like the one which follows. Note that vector potential is always along y, and that the forward-backward symmetry has been broken by the velocity-denominator term. The boundary at 9 meters moves outwards at the speed of flight. Vector potential , stop − at −0, t = 30 ns 20 10 0 -10 -20 -20 -10 0 Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 10 20 13.15 201 Discussion: Liénard-Wiechert fields It’s always true that E = −∇V − ∂A ∂t and that B = ∇ × A . Calculating E and B is “just” a matter of taking some derivatives! After a fair amount of messy calculation it can be shown that: ( )  εˆ ′ 1 − v ′ 2 c 2 − v ′ c (1 − εˆ ′ ⋅ v ′ c )    −∇V ( r , t ) = 2 3 4πε 0 ˆ   1 ε v c − ⋅ ′ ′ ( ) ′ r − rQ  + 1 Q 1 Q 1 4πε 0 r − r ′ c 2 Q  εˆ ′ ( a ′ ⋅ εˆ ′ )    3  (1 − εˆ ′ ⋅ v ′ c )  Primed quantities are evaluated at the appropriate retarded time; a′ is the (retarded) acceleration. (The acceleration enters because of the subtleties involving functions of t ′ being differentiated with respect to t.) Note that the first term in the gradient is independent of acceleration, and is proportional to 1/r2. (This is the “Coulomb term.”) However, the second term is proportional to acceleration, and falls off as 1/r. It is possible to pound out the derivatives to calculate − ∂A ∂t too: the result is − ∂A ( r , t ) − v ′ 1 = c 4πε 0 ∂t   2 r − rQ ′  Q ( v ′ c ) ⋅ ( εˆ ′ − v ′ c)  (1 − εˆ ′ ⋅ v ′ c ) 3  + 1 Q 1 4πε 0 r − r ′ c 2 Q  − a ′ + εˆ ′ × ( a ′ × v ′ c )   .  (1 − εˆ ′ ⋅ v ′ c ) 3  As with the −∇V equation, the first (Coulomb) term in − ∂A ∂t is independent of acceleration, and is proportional to 1/r2. The second term, proportional to acceleration, falls as 1/r. Let’s study the “Coulomb terms” of −∇V and − ∂A ∂t first by looking at a charge moving with constant velocity. Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.16 202 Exercise 13.6: -Grad V: constant velocity (a) Use PlotVectorField to make a graph of the negative of the gradient of the scalar potential using the same time (30 nsec) and x,y ranges for the plots as before. (The function is defined in the utility file as GradV[{x,y}, t]; do it only for the constant velocity case.) To make it easier to see the directions of the vectors, also use the utility function UnitVector to replot −∇V . Your plots should look something like the first two which follow. It looks like the vectors point radially away from the instantaneous position of the charge, without retardation. How close is this to the truth? Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.17 203 Vectors... − grad V, constant v, t = 30 nsec 20 10 0 -10 -20 -20 -10 Unit vectors... Unit vetors 0 of − grad V, constant 10 20 v, t = 30 nsec 20 10 0 -10 -20 -20 -10 0 10 20 Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.18 204 (b) Is −∇V really radial? (Does it really point directly away from the instantaneous position of the charge?) Use Plot3D and the utility function NonRadialAngle to investigate, using the same observer time and plot boundaries as before. Your plot (which may take two or three minutes to appear) will look something like this: non − radialness angle H degrees L of − grad V, uniform velocity , t = 30 nsec 5 20 0 10 -5 -20 0 -10 -10 0 10 20 -20 What do you conclude? Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.19 205 Exercise 13.7: -dA/dt for constant velocity Use PlotVectorField to make a graph of the negative of the time derivative of the vector potential using the same time (30 nsec) and x,y ranges for the plots as before. (The function is defined in the utility file as dAdt[{x,y}, t]; do it only for the constant velocity case.) To make it easier to see the directions of the vectors, also use the utility function UnitVector to replot −∂A ∂t . Your plots should look something like the first two which follow. Not surprisingly, −∂A ∂t can only point along y. Vectors... − dA ê dt , constant Unit vectors... v, t= 0 unit vactors 20 20 10 10 0 0 -10 -10 -20 along − dA êdt, constant v, t= 0 -20 -20 -10 0 10 20 -20 -10 0 10 20 Note the sign flip at instantaneous charge position Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.20 206 Exercise 13.8: E = -Grad V - dA/dt for constant velocity Since E = −∇V − ∂A ∂t , we can investigate the electric field by combining the last few functions we've been using. Do it-- generate vector, and unit vector plots of E = −∇V − ∂A ∂t , then plot the degree to which the electric field points radially away form the instantaneous position of the charge. Here are my versions of the plots. Don't be mislead by the round-off errors which fool Mathematica's arrow-drawing routine when the fields are too weak. Vectors... − grad V − dA ê dt, constant v, t = 30 nsec 20 10 0 -10 -20 -20 -10 0 10 Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 20 13.21 207 Unit vectors... unit vector along − grad V − dA ê dt , constant v, t = 30 nsec 20 10 0 -10 -20 -20 -10 "Radialness..." non − radialness angle 0 10 20 Hdegrees L of − grad V − dA ê dt , constant velocity , t = 30 0.0002 0.0001 20 0 -0.0001 10 -0.0002 -20 0 -10 -10 0 10 20 Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica -20 13.22 208 Well, look at that! What can we conclude? Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.23 209 Discussion: Electromagnetic radiation! When do the terms in E = −∇V − ∂A ∂t proportional to acceleration dominate E ? Since the Coulomb terms are proportional to 1/r2, while the acceleration terms are proportional to 1/r, it's clear that at large distances the acceleration terms must dominate. But what constitutes a "large" distance? Very roughly, the ratio of the "radiation" (1/r) and Coulomb terms for E is ra c 2 = r c 2 a , where r represents the distance to the observer and a is the charge's ( ) acceleration. As a result, when the observer's distance is large compared to c 2 a the accelerationbased term will dominate. Exercise 13.9: Plotting the radiation terms Including only the terms proportional to 1/r gives E 1 r only =− 1 Q r − rQ ′ 4πε 0 c (1 − εˆ ′ ⋅ v ′ c ) 2 3 { a ′ − εˆ ′ ( a ′ ⋅ εˆ ′) − εˆ ′ × ( a ′ × v ′ c)} . To simplify matters, let’s only consider acceleration a (anti)parallel to v . (Clearly this will not work for circular motion!). In this case, ( a ′ × v ′ c ) = 0 so we can ignore the term with the cross products. { } Let’s look at the remaining terms in the curly brackets: a ′ − εˆ ′ ( a ′ ⋅ εˆ ′ ) . Here’s a diagram. observer a′ − ( a′ ⋅ εˆ′ ) εˆ′ a′ Q εˆ′ ( a′ ⋅ εˆ′) εˆ′ { } With some thought you should be able to see that a ′ − εˆ ′ ( a ′ ⋅ εˆ ′ ) is just the component of the (retarded) acceleration which is perpendicular to the observer’s line of site towards the (retarded) position of the charge. Let’s give this component the name a⊥′ (“a prime perp”). (Even if we Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.24 210 hadn’t asked that the acceleration be along the direction of the velocity we’d still have the field perpendicular to εˆ ′ since the ignored term εˆ ′ × ( a ′ × v ′ c ) is perpendicular to εˆ ′ due to the cross products.) With these simplifications, we can write E 1 r only =− Qa⊥′ . 3 r − rQ ′ 4πε 0 c (1 − εˆ ′ ⋅ v ′ c ) 1 2 Keep in mind that most everything in this expression is a retarded variable, and that the velocitycorrection denominator is cubed. Things to notice: 1. The electric field is perpendicular to the line-of-sight vector εˆ′ . 2. The electric field is proportional to the charge’s (retarded) acceleration. 3. The field falls off like 1/r. 4. There’s a negative sign in front: the radiation electric field of a positive charge with upwards acceleration points down. Use ContourPlot to make graphs of the magnitide of the radiation electric field using the same time (30 nsec) and x,y ranges for the plots as before. (The function is defined in the utility file as ERadiation[{x,y}, t].) Do it for the as-yet unused position-vs.-time function rvst3 which describes a particle which begins decelerating from v = 0.5c yˆ at t = -10 nsec, coming to a complete stop at t = 0. Your plot should look something like the sample plot which follows. The spikes are an artifact of the granularity of the plot I’ve chosen. Notice the following: • Since the radiation fields are blowing past an observer only while that observer sees the particle’s (retarded) acceleration as different from zero, the region in space containing a nonzero radiation field is only three meters thick. • The fields would be strongest in directions perpendicular to the particle’s acceleration if the 3 velocity-correction factor 1 (1 − εˆ ′ ⋅ v ′ c ) weren’t present. Because of it, the fields are somewhat stronger in the forward direction. (“Forward” means parallel to the particle’s velocity during acceleration.) Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.25 211 Radiation E field , stop -at -zero , 10 ns deceleration , t=30 ns 6×10 8 20 4×10 8 2×10 8 10 0 -20 0 -10 -10 0 10 20 -20 Now do the same thing, but using the position-vs.-time function rvst4 which describes a particle which begins decelerating from v = 0.01c yˆ at t = -10 nsec, coming to a complete stop at t = 0. Your plot should look something like the sample plot which follows. Because the particle is nonrelativistic, the radiation electric fields are nearly forward-backward symmetric, and are strongest perpendicular to the particle’s acceleration. In addition, the fields are considerably weaker due to the reduced acceleration. Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.26 212 Radiation E field , non -relativistic , 10 ns deceleration , t=30 ns 6×10 6 4×10 20 6 2×10 6 10 0 -20 0 -10 -10 0 10 20 -20 Discussion: What about the magnetic field? We haven’t said anything about the magnetic field! It’s just a matter of calculation to take the curl of the vector potential; what one finds is that the magnetic and electric fields obey B = E c for electromagnetic radiation. The magnetic field is perpendicular to the electric field, and also perpendicular to the line-of-sight direction. Summary and wrap-up discussion We’ve only made plots for particles which move with constant velocity, or stop rapidly. However, everything we’ve done would work for a particle moving with arbitrary acceleration. (Mathematica might gag, but that doesn’t mean that there’s anything wrong with our equations.) Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.27 213 Here are the main points. • When a point charge is in motion, the scalar and vector potentials it creates are modified in two fundamental ways: 1. The potential depends, in part, on the charge’s retarded position-- its location at the retarded time t ′ , defined through the relation t ′ = t − r − rQ ( t ′ ) c . 2. The 1/r form of the potential is modified by a velocitydependent factor 1 (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c ) where εˆ ( t ′ ) is a unit vector pointing from the charge’s retarded position to the observer’s position. • The electric field depends on both the gradient of the scalar potential and the time derivative of the vector potential. These two contributions conspire in a remarkable fashion so that the electric field of a charge moving with constant velocity is radial, always pointing exxactly away from the instantaneous position of the charge. • Observers far from a charge which has stopped cannot know instantaneously that it has stopped moving. As a result, they will continue to see the fields associated with a charge in motion until enough time has passed so that they become aware the charge has stopped. • The sudden change in the 1 (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c ) term in the potentials (which corresponds to the 1 (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c ) term in the fields) induces dramatic changes in the derivatives of 3 the potentials, giving rise to electromagnetic radiation: electric fields which are proportional to (retarded) acceleration, and propagate away from the charge, falling off like 1/r instead of 1/r2. Comments about homework assigned for next week Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.28 214 Reading and homework should be completed by 4pm Friday. Required reading: Read all of the in-class exercise material for this unit. Purcell, chapter 5, section 7. Optional reading: Purcell, chapter 9 and appendix B. The Feynman Lectures on Physics (volume II), chapters 20 and 21. Radiation review/reminder: When a point charge is in motion, the scalar and vector potentials it creates are modified in two fundamental ways: 3. The potentials depend, in part, on the charge’s retarded position-- its location at the retarded time t ′ , defined through the relation t ′ = t − r − rQ ( t ′ ) c . 4. The 1/r forms of the potentials are modified by a velocity-dependent factor 1 (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c ) where εˆ ( t ′ ) is a unit vector pointing from the charge’s retarded position to the observer’s position as shown in the following figure. (Sometimes I’ll use εˆ′ as an abbreviation for εˆ ( t ′ ) .) line-of-sight to observer observer εˆ observer sees Q here (at its retarded position) Q v Q is really here now The changes to the potentials work like this:   Q 1   4πε 0 r − rQ ( t ′ )  (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c )  v ( t ′) 1 v ( t ′) 1 Q × = 2 V ( r , t ) . (Recall that µ0 = 1 ε 0c 2 .) A( r , t ) = 2 c 4πε 0 r − rQ ( t ′ ) (1 − v ( t ′ ) ⋅ εˆ ( t ′ ) c ) c V (r ) = 1 Q 4πε 0 r − rQ ⇒ V (r,t) = 1 ( ) It is still true that E = −∇V − ∂A ∂t and B = ∇ × A so we can use the potentials to calculate the fields. The radiation terms in E and B will be proportional to 1/r. If you crank out the derivatives you’ll find that the radiation term in the expression for the electric field is this: Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.29 215 E 1 r only =− 1 Q 2 r − rQ ′ 4πε 0 c (1 − εˆ ′ ⋅ v ′ c ) 3 { a ′ − εˆ ′ ( a ′ ⋅ εˆ ′) − εˆ ′ × ( a ′ × v ′ c)} . The particle’s (retarded) acceleration is a′ . Don’t be intimidated by all the cross products! As described in the in-class exercise material, if we only consider acceleration a (anti)parallel to v we can write Qa⊥′ 1 E =− 3 2 1 r only r − r ′ 4πε 0 c (1 − εˆ ′ ⋅ v ′ c ) Q where a⊥′ (“a prime perp”) is the component of (retarded) acceleration perpendicular to the lineof-sight to the observer. Keep in mind that most everything in this expression is a retarded variable, and that the velocity-correction denominator is cubed. Calculating the magnetic field is “just” a matter of taking the curl of the vector potential; what one finds is that the radiation (1/r) magnetic and electric fields obey B = E c and that the magnetic field is perpendicular to the electric field, and also perpendicular to the line-of-sight direction. Things to keep in mind for the radiation fields: 5. The electric and magnetic fields are perpendicular to the line-of-sight vector εˆ′ . 6. The electric and magnetic fields are proportional to the charge’s (retarded) acceleration. 7. The fields fall off like 1/r. 8. B = E c . 9. There’s a negative sign in front of the expression for E : the radiation electric field of a positive charge with upwards (retarded) acceleration points down. Problem 0 (5 points): Make your own hardcopies… Print (from the web) your own copy of the solutions to last week’s problem set and write a (true!) statement on your problem set asserting that you have successfully printed your personal copies. Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.30 216 Problem 1 (15 points): Non-relativistic dipole radiation A particle with charge q oscillates about the origin with position rq = 10−3 sin(107 t ) zˆ . Since the particle is always moving non-relativistically, the electric field it generates at r , the location of a 1 qa⊥′ distant observer, is well-approximated by E ( r , t ) ≈ − . Note that a⊥′ = a ′ sin θ as r 4πε 0c 2 shown in the following figure, and that the retarded time is t ′ ≈ t − r c . z observer at r a′⊥ a′ εˆ′ θ q a′ You learned in Physics 112 that the power flux (energy per second per square meter) transported by an electromagnetic wave has magnitude S = E × B µ0 and that µ0ε 0 = 1 c 2 . (a) Calculate the RMS power flux (in watts per square meter) arriving at an observer located at the point (r, θ), as shown in the figure. (b) What is the total RMS power radiated by the oscillating charge? (You’ll need to calculate ∫ S ⋅ dA to answer this question. The integral is easiest to do in spherical all directions coordinates (r, θ, φ) where S = rˆ S , dA = rˆ r 2 sin θ dθ dφ , and r̂ is a unit vector pointing away from the origin.) (c) Assuming q = 1 Coulomb, what is the numerical value (in watts) for your answer to part (b)? Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.31 217 Problem 2 (15 points): Mrs. Urkin builds her own x-ray machine After seeing Marathon Man for the 97th time, Mrs. Urkin decides to teach herself dentistry. Knowing that x-ray machines employ bremsstrahlung (German for “braking radiation”) to produce short-wavelength electromagnetic radiation, she borrows the UD&HE electron linear accelerator to use as the electron gun for her device. When a fast-moving electron slams into a dense target its rapid deceleration will cause it to radiate x-rays. Assume that an electron from the “linac” initially travels upwards along the z axis (so that its initial velocity is v = v0 zˆ ), then decelerates with constant acceleration a = −azˆ to stop near the origin at t = 0. The teeth of an unfortunate patient are located at (r, θ ), as shown. Please assume that r is very large compared to the distance through which the electron moves while coming to rest so that r − re− ′ ≈ r always. z θ ex patient at r εˆ′ y a′ (a) Prove that the magnitude of the radiation electric field (for fixed r ) is greatest at θ = 90° when v c. (b) For constant acceleration as the electron slows to rest from a relativistic initial velocity prove that the intensity of x-rays at θ = 90° remains constant during the deceleration process, even though v ∼ c in the early stages of deceleration. (c) An observer whose teeth are at (r, θ ) sees the decelerating electron’s velocity as 0.5c at one particular instant in time. What value of θ, expressed in degrees, will yield the maximum electric field strength for this (retarded) velocity? (You will have to slog through a differentiation and a small amount of algebra to derive a useful quadratic equation, one of whose roots is the cosine of the desired angle. The fact that sin 2 θ = 1 − cos 2 θ might be helpful.) Physics 212, Spring 2003. Unit 13 Studying electromagnetic radiation with Mathematica 13.32 218 Physics 212 Honors topics in electrodynamics Unit 14 Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) Spring, 2003 George Gollin University of Illinois at Urbana-Champaign 2003 Physics 212, Spring 2003. Unit 14 Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) 14.1 219 Unit 14: Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) Recap of last week Introduction: The fields aren’t really fields, and you cannot know as much as you’d like You’ve probably heard of the Heisenberg Uncertainty Principle. Today we’ll investigate a plausibility argument for why it is true, as well as some of its consequences. We’re forced to head down this path because one of the reasonable assumptions we’ve made about electrodynamics is completely, totally, utterly wrong: the electromagnetic radiation streaming from an accelerating charge is not composed of fields which vary smoothly in space and time. Einstein’s Nobel Prize was not awarded for his work on relativity (which certainly did deserve one), but rather for his explanation of the photoelectric effect, one of the consequences of the “granularity” of electromagnetic radiation. Here’s what you’ll be working on: • a classical analysis of the rate at which electrons “should be” ejected from a conductor which is exposed to an intense light source. • a classical analysis of the relationship between the maximum kinetic energy of electrons ejected from a conductor and the intensity of the light source illuminating the conductor. • photons! (photon energy and momentum) • modeling the resolution of a simple magnifier • analyzing the “Heisenberg’s Microscope” thought experiment • estimating atomic sizes using the Uncertainty Principle We’ll also discuss the master equation of quantum mechanics, known as Schrödinger’s equation. Exercise 14.1: An estimate of the rate of photoelectron ejection from a metallic surface A charge above a classical conducting surface is attracted back towards the surface due to the fields created by the surface charge it attracts beneath it, as shown in the figure. The fields above the conductor are exactly the same as they would be if an image charge of equal magnitude, but opposite sign, were placed as shown, without the conductor being present. Physics 212, Spring 2003. Unit 14 Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) 14.2 220 electron + + + + ++ + + + + these field lines don’t really exist real surface charge conductor fictitious positive image charge (a) Since atomic scales are typically a tenth of a nanometer, it shouldn’t be too crazy to model an electron in the process of escaping from the conductor as sitting ~10-10 meters above the surface, fighting the attractive force of its image charge as it tries to wander off. Estimate the energy needed to fee an electron from its image charge, using this model. (Recall the expression for electrostatic potential energy you learned in physics 112.) Please express you answer in electron-volts (1 eV = 1.6×10-19 Joules). (b) A bright light illuminates the metal surface with an intensity of 1 watt per square centimeter. (This corresponds to approximately 6×1018 eV per second). Let’s make the entirely unreasonable assumption that electrons at the surface absorb all the incident light until they have accumulated sufficient energy to escape from the metal, and that there’s only one electron per atom that is capable of absorbing energy. How long after the light is switched on will electrons begin to escape the metal? Discussion Our ridiculous model has the virtue that it underestimates by many orders of magnitude how long it should actually take for electrons to begin escaping from the metal. The reality is quite different! It was known in 1928 that it took less than 3 nanoseconds to begin observing electrons after illumination; in 1955 it was known to take less than 10-10 seconds. I’m sure the present upper limit is substantially smaller. Physics 212, Spring 2003. Unit 14 Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) 14.3 221 Exercise 14.2: An estimate of the dependence of maximum photoelectron kinetic energy on illumination intensity Let’s continue to work with our flawed classical model. We illuminate our metallic surface with monochromatic light and crank up the intensity, measuring the kinetic energy of the ejected photoelectrons. Imagine that we find something like the following distribution for the kinetic energy spectrum of detected electrons: light source ejected electron number detected electron detector/spectrometer maximum energy conductor kinetic energy (eV) . maximum kinetic energy (eV) Propose a sensible-sounding (classical) model that relates the maximum kinetic energy of detected photoelectrons to the intensity of the light source, then sketch what you expect to find on the graph below. intensity of light source Physics 212, Spring 2003. Unit 14 Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) 14.4 222 Discussion: photons! Our ridiculous model gets this wrong too: the maximum kinetic energy is independent of the intensity of illumination. The number of ejected electrons does increase with intensity, but the photoelectron kinetic energy spectrum does not change with intensity! Einstein’s idea was that the energy of the electromagnetic field comes in discrete quanta, called photons. The energy carried by a single photon depends only on the classical wavelength associated with the light shining on our metal surface: the shorter the wavelength, the higher the photon’s energy. The exact expression is hc Eγ = λ red orange yellow green blue where h = 6.626 × 10 Joule-second so that hc = 1.986 × 10-25 Joule-meter. It is more useful to express this in terms of electron volt-nanometers: hc = 1,240 eV-nm. For example, the energy carried by a photon of green light (~600 nm wavelength) is about 2 eV. If you’re viewing this in color (on the web, for example), the following spectrum will let you see the correspondence between color and wavelength. violet -34 The photoelectric effect involves a single photon being absorbed by an electron, which uses the acquired energy to escape from the surface of the metal. A brighter light source sends more photons per second towards the surface, but each photon still carries the same amount of energy. Photons carry momentum too. As is the case with classical electromagnetic waves, a photon with energy E carries momentum p with p = E c = h λ . (See the brief discussion of relativistic kinematics in the second week’s materials: since E 2 = p 2 c 2 + m 2 c 4 , anything massless must satisfy the energy-momentum relation E = pc: if it carries energy, it also carries momentum.) Exercise 14.3: Resolution of a simple magnifier Imagine we build a simple optical system which images light from a small object onto a sensor. How well can it determine the position of a small object? Let’s begin with a very simple optical system, a pinhole camera. Note that determining the resolution isn’t a matter of “geometrical optics” since we’ll have to think about diffraction effects associated with the finite width of the pinhole. To further simplify the analysis, let’s use a horizontal slit instead of a pinhole so that the camera can only image light along the vertical direction. Here’s a diagram: Physics 212, Spring 2003. Unit 14 Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) 14.5 223 detector/sensor Do Dd/s slit, width A object emits light of wavelength λ image formed here opaque screen . If the object is a point source, how well localized is its image on the detector? Can we tell to arbitrary precision where the object was? Another way of asking this: is it possible to produce a single Euclidian point (or in this case Euclidian line) image at our detector/sensor? If we can’t (because of diffraction effects, for example), then the theoretical resolution of our optical system is limited by physical principles. It is tempting to think that the narrower the slit the better able we’ll be to localize the image of our point source on the detector surface. This is a geometrical-optics way of thinking! The problem is that we’ve neglected the role played by the opaque sheet in the propagation of the electromagnetic waves from the point source to the detector. The idea is simple: unless the opaque sheet does something to cancel the incident electromagnetic waves, they’ll go sailing straight through without being affected. The electric field in light waves from the point source accelerates electrons in the opaque screen, causing them to radiate their own fields which cancel the incident fields. It’s not hard to see how this works! The key is in the presence of the negative sign in front of the expression for the radiation electric field: Qa⊥′ 1 E =− 3 2 1 r only r − r ′ 4πε 0 c (1 − εˆ ′ ⋅ v ′ c ) Q An electron in the opaque screen is accelerated by the incident electric field so that it experiences an acceleration a ∼ QEincident . Neglecting fine points such as the distinction between a and a⊥ , the radiation field is produced along the direction −Qa ∼ −QEincident which is opposite to the incident field's direction. Think of the opaque screen with the slit as a solid plane of antennae which radiate electromagnetic waves (perfectly canceling the incident electromagnetic wave downstream of the screen) combined with a narrow strip of identical antennae which generate electromagnetic Physics 212, Spring 2003. Unit 14 Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) 14.6 224 waves of the opposite polarity to cancel the perfect cancellation in the region where the actual screen has a slit. Sounds confusing, yes? Here’s a diagram. To make it easier to draw, I’ll only draw the polarization component of incident radiation parallel to the slit. = + What this all comes down to is that we’ll do fine if think of the electromagnetic radiation passing through the slit as if it had all been generated by a strip of antennae as wide as the slit which is cut into the opaque screen, all radiating coherently (in phase). No other sources of light need contribute to the signal observed by our detector. Another diagram: Physics 212, Spring 2003. Unit 14 Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) 14.7 225 Dd/s enlarged view of slit, width A θ A/2 extra distance for lower ray opaque screen detector/sensor Since electromagnetic waves leaving all antennae are in phase, they can interfere constructively or destructively at the detector/sensor if the path lengths are different from antenna to antenna to a spot on the sensor. (a) Assuming the sensor is very far from the opaque screen (compared to the width of the slit) and that the wavelength of light is λ, at what angle θ will the two rays drawn in the diagram arrive at the sensor out of phase? (This angle corresponds to a half-wavelength difference in path A, length for the two rays, as indicated in the diagram.) You may assume that Dd / s Dd / s λ , and that θ is small enough so that sin θ ≈ θ . (b) In terms of λ, A, and Dd/s what is the half-width of the bright spot at the detector? Physics 212, Spring 2003. Unit 14 Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) 14.8 226 Discussion Notice that all pairs of antennae interfere destructively at the angle you calculated. Discussion Resolution of our optical system: it would be difficult to tell two point sources apart if their bright spots overlapped at the detector. It's reasonable to identify the half-width of the bright spot from a point source as the system's resolution. This holds true even when we replace the slit with a lens of aperture A. Except for fine points concerning difference in how the interference zeroes behave for circular and slit apertures, it works the same way. Small wavelengths and large apertures give better position resolution. Exercise 14.4: Heisenberg’s microscope gedanken (thought) experiment Imagine we build a simple magnifier to locate a small object illuminated by a light source of wavelength λ. Our sensor is so quiet and so efficient that we can determine the position of the object when a single photon scatters from it, passes through the lens, and arrives at the sensor. Naturally, we are unable to tell the path the photon took as it traveled through the lens to the detector; all we know is where the photon ultimately arrived at the face of the detector. photon’s path light source x θ z A/2 D Dd/s detector/sensor Physics 212, Spring 2003. Unit 14 Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) 14.9 227 (a) A photon, initially traveling along the z axis of our optical system, bounces off the object at an angle θ as shown. Calculate the x component of momentum transferred to the object when the photon scatters from it. (You may assume that the photon doesn’t lose any energy in the collision.) (b) Let's define the scale of momentum transfer to our object to be ∆px. The full range of possible x momentum transfers to the object, for all photon paths which pass through the lens, is ±∆px . Calculate ∆px assuming that D A and that θ is small enough so that sin θ ≈ θ . (c) Assuming that D = Dd/s for simplicity, calculate the position resolution ∆x for this optical system. (d) Calculate the product ∆px∆x for our locate-the-object optical system. Physics 212, Spring 2003. Unit 14 Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) 14.10 228 Discussion The Heisenberg Uncertainty Principle! It’s true in general, not just for this particular (optical) system. The exact expression, which would require us to define more precisely what we mean by a position and momentum uncertainty is ∆px ∆x ≥ h 4π . Since h 2π appears so frequently in the equations of quantum mechanics, the convention is to use a new symbol (pronounced "h-bar"): ≡ h 2π . = 1.0546 × 10−34 Joule ⋅ seconds = 6.5822 ×10−16 eV ⋅ seconds . Keep in mind that 1 Joule ⋅ second = 1 ( kg ⋅ m/sec ) ⋅ m . The smaller the value of the smaller the domain in which quantum mechanics will differ noticeably from the (fundamentally incorrect) classical description. In three-dimensional systems we have three separate expressions: ∆px ∆x ≥ ∆p z ∆ z ≥ 2 , ∆p y ∆y ≥ 2, 2. The uncertainty principle is built into the structure of our quantum mechanical universe. It holds true for ALL systems, not just those which employ photons in measurements. Exercise 14.5: Atomic stability, thanks to the uncertainty principle. An electron is known to be confined inside a hollow cubic of side length 2a so that a reasonable measure of its position uncertainty is ∆x = ∆y = ∆z = a . (a) In terms of a, the electron mass me, and , calculate the typical kinetic energy this electron will have. (You now know how to calculate typical magnitudes for the electron's momentum components. Use these to calculate the electron's kinetic energy.) You should find that K ∼ 3 2 8me a 2 ≈ (.029 eV ⋅ nm ) a 2 when a is expressed in nanometers. ( ) Physics 212, Spring 2003. Unit 14 Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) 14.11 229 (b) Here's a not-nearly-as-stupid-as-it-seems model for a hydrogen atom. Assume the electron in a hydrogen atom is confined by the Coulomb potential to rattle around inside a cube of side length 2a so that the electron's typical potential energy is − q 2 ( 4πε 0 a ) . (q is the fundamental charge; − q 2 ( 4πε 0 a ) ≈ − (1.44 eV ⋅ nm ) a when a is expressed in nanometers.) As a result, our simple model predicts that the electron's total energy should be .029 eV ⋅ nm 1.44 eV ⋅ nm . a2 a The size of the volume in which the electron resides will be that which minimizes its total energy. E = K +U ≈ Sketch a graph of the electron's total energy as a function of a. Physics 212, Spring 2003. Unit 14 Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) 14.12 230 (c) Calculate the value of a which minimizes the electron's energy. (d) Calculate the electron's total energy at this preferred value for a. Discussion Atoms don’t collapse: lower (more negative) potential energy is more than offset by the rising kinetic energy associated with better localizing the electrons! The full-blown quantum mechanical treatment of the hydrogen atom reveals that the probability to find the electron near the point ( r ,θ , φ ) is proportional to e − r .053 nm . (The value of .053 nm is called the Bohr radius.) The energy needed to remove a bound electron from a hydrogen atom is 13.6 eV. Your results for the atom's size and the energy of the electron should have been surprisingly close to these values! Physics 212, Spring 2003. Unit 14 Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) 14.13 231 Discussion: what’s really going on It is not possible to describe the world correctly if we insist on using a theory which describes a system in terms of the positions and momenta of its constituent particles. The predictions of any such theory are guaranteed to be wrong whenever the theory crashes into the limits on precision set by the uncertainty principle. It’s a deeper problem than not just being able to measure the position of a particle without disturbing its momentum: a variety of effects which are absolutely impossible in a classical theory become allowed in quantum mechanics. We are forced to abandon a theory which makes statements like these: dp ); dt • F = ma (which can be written −∇V = • p2 + U = E ). K + U = E (which can be written 2m In Newtonian mechanics we have the habit of solving directly for functions which tell us the values of the particle’s kinematic parameters. For example, in a uniform gravitational field, 1 x ( t ) = x0 + v0t + at 2 . 2 This doesn’t work for quantum systems! Our classical description assumes that we are able to say exactly where the particle is at a particular time. No good! We need new ways of thinking, described by a new equation. The idea is this: we need to give up an approach in which we talk directly about the values assumed by kinematic parameters as functions of position and time. Instead we need to work with the probabilities that we might find particular values for the various parameters if we happen to make a measurement. More exactly, we need to work with the probability amplitudes associated with quantities like position, momentum, and energy. We’ll use our new equation to calculate the form of a particles wave function: a function which contains information about the particle. (It’s like an owner’s manual for a stereo: the manual contains information about the stereo. The owner’s manual is NOT the stereo, just an information source about the stereo.) The way we extract information about the particle from its wavefunction ψ ( x, y, z , t ) is this: • ψ ( x, y, z, t ) dV is the probability to find the particle in a small volume element dV centered 2 at position ( x, y, z ) at time t. Physics 212, Spring 2003. Unit 14 Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) 14.14 232 •  ∫ (ψ )  i ∗ all space dψ   dV is the average energy we will obtain if we measure the energy of many dt  identical systems •  ∫ (ψ )  −i ∗ all space ∂ψ   dV is the x component of the average momentum we will obtain if we ∂x  measure the energy of many identical systems In general, ψ ( x, y, z , t ) can be complex: ψ ∗ is the complex conjugate of ψ . ∂ ∂ as the “energy operator” and −i as the “x-momentum operator.” ∂x ∂t  ∂ ∂ ∂ For three-dimensional systems, the momentum operator is −i  + +  = −i ∇ .  ∂x ∂y ∂z  We refer to i We can replace momentum and energy in our K + U = E equation: 2 2 d p2 ∂2 d − ∇2 + U = i in three. +U = E ⇒ − + U = i in one dimension and 2 2m 2m ∂x 2m dt dt Clearly, we need to add something on which the differentiations can act: the correct choice is the wavefunction. This yields the “master equation” for quantum mechanics, called Schrödinger’s equation: 2 2 d 2ψ dψ dψ U i ψ − + = in one dimension and − ∇ 2ψ + Uψ = i in three. 2 2m dx dt 2m dt Solving this for ψ will provide us with all the information about a nonrelativistic system that it is possible to obtain. Summary and wrap-up discussion Here’s what you were working on: • a classical analysis of the rate at which electrons “should be” ejected from a conductor which is exposed to an intense light source. • a classical analysis of the relationship between the maximum kinetic energy of electrons ejected from a conductor and the intensity of the light source illuminating the conductor. • photons! (photon energy and momentum) • modeling the resolution of a simple magnifier Physics 212, Spring 2003. Unit 14 Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) 14.15 233 • • analyzing the “Heisenberg’s Microscope” thought experiment estimating atomic sizes using the Uncertainty Principle We also discussed the master equation of quantum mechanics, known as Schrödinger’s equation. That's it! Physics 212, Spring 2003. Unit 14 Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) 14.16 234 Physics 212, Spring 2003. Unit 14 Oops! It doesn’t really work like that! (Quantum mechanics enters the picture) 14.17 235 Physics 212 Honors topics in electrodynamics Notes on special relativity Spring, 2003 George Gollin University of Illinois at Urbana-Champaign 2003 Physics 212, Spring 2003. Notes on special relativity .1 236 Notes on Special Relativity George Gollin University of Illinois Contents The speed of light is finite ............................................................................................................1 The speed of light is constant, regardless of relative motion of source and observer ..................4 Nonsimultaneity of events viewed from different reference frames ............................................7 Things on which we can agree when comparing notes with other observers.............................12 Time dilation...............................................................................................................................16 Doppler shifts..............................................................................................................................18 Lorentz contraction .....................................................................................................................22 Quantitative description of nonsimultaneity...............................................................................24 Some simple paradoxes and their resolution ..............................................................................27 Coordinates of an event, viewed from different reference frames..............................................28 Time dilation in terms of events .................................................................................................29 Lorentz contraction in terms of events .......................................................................................30 Lorentz transformations ..............................................................................................................31 Addition of relativistic velocities................................................................................................33 Causality .....................................................................................................................................35 The speed of light is finite All the strange features of relativity are consequences of the fact that the speed of light is exactly 2.99792458 × 108 meters per second. It doesn't matter whether the source of light moves with respect to the measuring apparatus: any device measuring c (the speed of light) will obtain, within experimental error, this value. Nothing else works this way: sound travels at fixed speed with respect to the air, for example. The value 2.99792458 × 108 meters per second really is exact: it serves as a definition, along with some time standard, of the length of one meter. The speed of light is very close to 1 foot per nanosecond, and I'll use that as a convenient approximation periodically. Let's start by discussing how moving objects look to a single, stationary observer. Because the speed of light is finite, an object moves between the time light rays leave it and the time they arrive at an observer. For example, here's a diagram of what happens when a light bulb, traveling at 0.2 c, is seen by an observer: Physics 212, Spring 2003. Notes on special relativity .2 237 v= 0.2c true position 2 feet apparent position 10 feet . Things are more complicated when the light comes from an extended source instead of a point source. Imagine we put lights on the corners of a transparent rectangular block and observe it from far away like this: 1 foot Light from the back of the block takes about a nanosecond longer to reach the observer than light from the front. If the block is in motion, the observer will see its back at an earlier time from its front: cube is moving up . As a result, the block looks like a parallelogram and not a rectangle: . Because the speed of the block is greater than the component of the light ray’s velocity along the cube’s direction of motion, light from the far, back corner doesn’t travel through the transparent Physics 212, Spring 2003. Notes on special relativity .3 238 cube on its way to the observer. If the block had been opaque instead of transparent, light from the front side would not have been visible while light from the back side would have been. (Imagine a camera shutter.) There's nothing exotic about this: the same effect would happen if you tried to locate airplanes in formation at an air show by listening for their engine sounds. Objects moving towards or away from an observer will also appear distorted. For example, consider what happens when an observer looks at a square plate, as shown below: 2 feet 4.243' 4.123' 2 feet 4.000' The center of the square is exactly 4 feet from the observer, while the centers of the sides and the corners are 4.123 feet and 4.243 feet away, respectively. Light rays leaving the square's center and corners simultaneously won't reach the observer simultaneously. If the square moves towards the observer, he'll see the corners of the square at an apparent position that corresponds to a time that's 0.243 nanoseconds earlier than the apparent position of the center of the square. A diagram, assuming the square moves with speed close to c (not really accurate-- the square looks curved in two dimensions): apparent shape of the square's surface side view 0.243' . The outline of the square isn't "square" anymore, either: Physics 212, Spring 2003. Notes on special relativity .4 239 . What does the square look like if it is moving away from the observer? What does a transparent rectangular solid look like when it moves towards the observer at a speed close to c? Keep in mind that the unusual appearance of the objects we’ve been discussing just arises from the finite speed of propagation of light. If several observers position themselves as shown below, they’ll all agree that each was passed by the closest part of the square at the same time (call it t=0). In a sense, the objects just “look funny” but aren’t really distorted due to the finite speed of light. The speed of light is constant, regardless of relative motion of source and observer This is the tricky part in relativity. Let’s first consider how sound works, to show the contrast with the way light propagates. Sound travels at a fixed speed with respect to still air. The speed of sound is about 331 meters per second; let’s approximate this as 1000 feet per Physics 212, Spring 2003. Notes on special relativity .5 240 second or one foot per millisecond. Imagine that an observer on the ground measures the propagation speed of the engine noise from a subsonic airplane: 1 2 100 feet Even though the airplane is moving, it will take about 100 milliseconds for any particular sound wave to travel between the two measuring stations labeled 1 and 2. The speed of sound is independent of the speed of the airplane. If we put the observer on a train, so that he’s moving towards the source of the sound with speed v, the time for a particular sound wave to travel from the first sensor to the second will be reduced. 1 2 v From practical experience, we know that snowballs don't work the same way sound does. A snowball thrown from a car hits a pedestrian a lot harder if it's thrown in the direction the car's traveling than if it's thrown in the opposite direction. Light has the (unusual??) property that any measurements made of its propagation speed always yield the same answer, regardless of relative motion between light source and observer. Each of the following observers will measure the same transit speed between the two sensors: Physics 212, Spring 2003. Notes on special relativity .6 241 v=0.9c laser beam from rocket 100 feet 1 2 (Here we see the source moving towards a fixed observer.) v=0.9c laser beam from rocket 100 feet 1 2 (Here we see the source moving away from a fixed observer.) light ray from a distant star 1 2 v = 0.8c with respect to star (Here we see the observer as moving towards a fixed source.) This is not what we’re used to-- either in the way sound travels or in the way snowballs thrown from cars behave. You can read about some experimental tests of this in a variety of textbooks. Here's another, based on every-day things at Fermilab. We know that many different kinds of unstable elementary particles can be produced in violent collisions between high energy protons and target nuclei. The lightest of the particles made of quarks and antiquarks which can be produced this way are call pi mesons, or pions. There are three different kinds of pions: π+ and π- have identical masses of 139.57 MeV/c2 while the lighter π0 has mass 134.96 MeV/c2. (The proton mass is 938.28 MeV/c2.) It is known that nearly 100% of the time a π0 will decay into a pair of energetic photons (the quantum Physics 212, Spring 2003. Notes on special relativity .7 242 mechanical version of electromagnetic waves-- light). Each gamma ray photon from a pion which decays at rest in the lab carries away 67.48 MeV of energy and travels travel with speed c (which isn't surprising). A source of energetic pions at Fermilab shoots π0's into my experiment, which measures the kinematic properties of their decays. These pions are traveling close to the speed of light-useful π0's have values of v/c which range from 0.9996 to 0.9999993-- and their decay photons can have energies as high as 100 GeV (1 GeV = 1000 MeV). A diagram comparing a typical π0 decay at rest with what we observe at Fermilab follows: pion at rest photons are 180° apart pion with v/c = .999990 angle between the photons is about 1/2° Even though the source of the photons is moving at high speed, we find that the speed with which these photons travel is exactly the same as the speed with which the photons from the stationary pion travel. Nonsimultaneity of events viewed from different reference frames The consequences of the constancy of the speed of light are dramatic. Here's the first of them. Consider the following hypothetical encounter between the rockets Nostromo and Exeter. Both are capable of traveling at speeds approaching c, and each carries a pair of periscopes. A spark jumps between the Nostromo’s and Exeter’s electrode as the two ships pass each other. Observers on each ship, equidistant from the electrode, see the light from the spark. Let’s view the creation of the spark from the perspective of passengers on the Nostromo. The spark is created at the electrodes when the Nostromo’s clocks read t = 0, as shown in the figure below. Physics 212, Spring 2003. Notes on special relativity .8 243 N Nostromo spark gap periscopes ? periscopes Exeter ? E As light from the sparks travels towards the Nostromo’s observer, the Exeter continues to move to the left. Here’s a “motion picture” of what happens, from the perspective of passengers on the Nostromo. Let’s assume that the Exeter is traveling with v = 0.5c. Physics 212, Spring 2003. Notes on special relativity .9 244 N Nostromo clocks read t=0 as the Exeter passes. Nostromo Light from spark leaves spark gap. ? Exeter ? E Nostromo crew thinks Exeter is moving to the left with v = 0.5c N Nostromo Nostromo clocks read t=100ns. light pulse Light from spark has traveled 100 feet, Exeter has traveled 50 feet. Light reaches aft Exeter periscope. ? ? Exeter ? E N Nostromo Nostromo clocks read t=200ns. Light from spark has traveled 200 feet, Exeter has traveled 100 feet. Light has already reached both Nostromo periscopes and aft Exeter periscope. Physics 212, Spring 2003. Notes on special relativity light pulse ? ? Exeter ? E .10 245 As you can see, the Nostromo’s observers see the spark simultaneously (light travels one foot per nanosecond, and the periscopes are equidistant from the spark gap). However, the Nostromo crew members think the Exeter’s aft observer sees the spark before the forward observer sees the spark. Let’s shift to the Exeter’s frame of reference now. As far as the Exeter’s crew is concerned, they are happily sitting around eating dehydrated food when the Nostromo glides past them, moving to the right with v = 0.5c. The Exeter’s periscopes are equidistant from the spark electrode, and will necessarily see the produced spark simultaneously, as far as the Exeter's crew is concerned. This is in disagreement with the conclusion of the crew of the Nostromo. Here is a "movie" of what things look like from the perspective of the Exeter’s crew. (There are certain inaccuracies in the drawing concerning the relative lengths of the ships that we'll neglect for the time being.) Physics 212, Spring 2003. Notes on special relativity .11 246 Exeter clocks read t=0 as the Nostromo passes. Light from spark leaves spark gap. N ? ? ? Exeter Exeter crew thinks Nostromo is moving to the right with v = 0.5c Nostromo ? E N ? ? ? Nostromo light pulse Light is about to reach aft Nostromo periscope. Exeter E N ? ? ? Nostromo light pulse Light has already reached both Exeter periscopes and aft Nostromo periscope. Exeter E Things are rather different in the Exeter's reference frame. So-- events which are simultaneous in one frame of reference are not necessarily simultaneous in another frame of reference!! Curious! Physics 212, Spring 2003. Notes on special relativity .12 247 In a ship's rest frame, its clocks read the same value when the spark is produced (since they're synchronized) and also read the same value when the light from the spark arrives at the periscope. (The light travels the same distance from the spark gap to the forward periscope as it does to the aft periscope.) Imagine that each clock stops (breaks??) when the light reaches it. In a ship's rest frame, its own clocks will read the same value when they break. An observer in any frame of reference will agree with the ship's crew that the broken clocks read the same (stopped) time. A ship's crew will conclude that the aft clock on the other ship breaks before the forward clock on the other ship. However, once both its clocks are broken, they will be seen to read the same time. Consequently, as seen from the rest frame of one ship, the aft clock on the other ship reaches this time before the forward clock on the other ship. The crew of one ship, looking at the clocks on the other ship (which is seen to be moving) will conclude that the clocks on the other ship are not synchronized. In particular, the aft clock is set ahead (reads a later time) relative to the forward clock. Both crews will agree that the clocks at the spark gap (the center clocks in the pictures) read zero when the spark was created. (Imagine that these clocks break when current flows through the spark electrodes.) Here's how the clocks will look in the rest frame of the Exeter: N Nostromo Exeter E Physics 212, Spring 2003. Notes on special relativity .13 248 Things on which we can agree when comparing notes with other observers The disagreement between the Exeter’s and Nostromo’s crews' observations about the (non)simultaneity of the observations of the spark comes about because of the fact that the observations take place at different points in space. Events which take place at the same point in space and at the same time will be seen by both crews as happening simultaneously. Here are some examples of pairs of events which work out this way: (Nostromo’s electrode passes Exeter’s electrode) .and. (Nostromo’s center clock reads t=0) (Nostromo’s rear clock breaks) .and. (light from spark arrives at Nostromo's rear periscope) (Exeter’s front clock breaks) .and. (light from spark arrives at Exeter's forward periscope) (I'm assuming that all clocks are tiny, and arbitrarily close to the electrodes or periscopes, etc. etc.) It’s easy to imagine how this works: imagine that one event (light arriving) causes some sort of explosion which breaks the clock immediately. All observers will agree on the value read by the (stopped) clock. Physics 212, Spring 2003. Notes on special relativity .14 249 Here is another example of a “reference frame-invariant” occurrence. I’ve underlined the statement with which all observers will agree. Left ruler is stationary, right ruler is moving. Rods at ends of right ruler pass through hoops at ends of left ruler when the rulers collide. Left and right rulers are moving towards each other with equal speed. Rods at ends of right ruler pass through hoops at ends of left ruler when the rulers collide. Note that this means that the two rulers will appear to have the same length, whether viewed from a frame in which one is at rest or in which both are moving (perpendicular to their lengths). Here’s another example: let’s say the Nostromo carries a pair of parallel mirrors. A crewmember sets a light beam bouncing between the mirrors. Nostromo, as seen by a crewmember floating outside the rocket. Light ray leaves bottom mirror N Nostromo Physics 212, Spring 2003. Notes on special relativity .15 250 Light ray bounces off top mirror N Nostromo Light ray bounces off bottom mirror N Nostromo The light beam bounces from mirror to mirror. Nostromo, as seen from the Exeter. Light ray leaves bottom mirror, moving up and to the right N Nostromo N Nostromo Light ray bounces off top mirror, now moving down and to the right. Light ray bounces off bottom mirror, moving up and to the right N Nostromo The light beam bounces from mirror to mirror as the Nostromo moves to the right. Time dilation Let’s examine the light-bouncing-off-a-mirror case in more detail. For convenience, let's assume that the mirrors are three feet apart. The roundtrip pathlength between the bottom and top mirrors is six feet; it takes a light beam about six nanoseconds to make the trip, as measured by a crewmember of the Nostromo. The path taken by the light is longer, when viewed by crewmembers of the Exeter who see the Nostromo as moving quickly to the right. Imagine that Physics 212, Spring 2003. Notes on special relativity .16 251 the relative velocity between the Nostromo and Exeter is 0.8c. Here's how the light beam seems to move: 3 feet Nostromo speed is 0.8c An important point: an observer in either rocket who measures the speed with which the light beam in Nostromo’s clock travels will always find that it travels 2.997...× 108 meters/second. Let's calculate the round trip time (assuming c = 1 foot per nanosecond) for the moving mirrors. Let's say that the time for a half-tick is ∆t', so that the round trip takes 2∆t' (note the prime here), while the time for a half-tick of a stationary clock is ∆t (no prime): In time ∆t', the mirrors travel ∆x ' = v∆t ' . In this time, the light ray travels L ' = (∆x ') 2 + (∆y ') 2 = (v∆t ') 2 + (3) 2 . Since c = 1ft/nsec in all frames, L' = c∆t' so... c∆t ' = (v∆t ') 2 + (3) 2 . It's true that c∆t = 3 feet (no primes here: in the frame in which the clock is at rest), so we can rewrite c∆t ' = (v∆t ') 2 + (c∆t ) 2 . Solve for ∆t' in terms of ∆t: (c∆t ') 2 − (v∆t ') 2 = (c∆t ) 2 or ∆t ' = ∆t 1 − v2 / c2 . Observers think the moving clock takes longer to tick by a factor of 1/ 1 − v 2 / c 2 than do observers at rest with respect to the clock. The clock seems to be running slowly compared to a "normal" clock. When v=0.8c, 1/ 1 − v 2 / c 2 = 5 / 3 . The Nostromo's and Exeter's crews will not agree about the time required for Nostromo's "light clock" to tick once. You can imagine how this analysis works if the light clock had been on the Physics 212, Spring 2003. Notes on special relativity .17 252 Exeter instead of the Nostromo: The Nostromo's crew would think the Exeter's light clock was ticking too slowly. There's complete symmetry between the Exeter and Nostromo. Each crew sees the other crew's rocket moving at high speed. In general, a light clock which requires time t for each tick (as viewed by observers stationary with respect to the clock) will require a time t ' = t 1 − v2 / c2 for each tick when viewed from a frame in which the clock seems to be moving with speed v. Here's a plot of t'/t as a function of v/c: 14 Time dilation factor vs. v/c 1 1 − v2 c2 12 10 8 6 4 2 v/ c 0 0 0.2 0.4 0.6 0.8 1 So-- moving light clocks seem to run slow! What about other moving clocks, for example mechanical or biological clocks? They all have to seem to run slowly! Here's a plausibility argument for this. Let's say we stretch the distance between the mirrors in the Nostromo's light clock so that it takes one second, as determined by the Nostromo's crew, for the clock to tick. Imagine that one crewmember, whose pulse rate is 60 beats per minute, covers the bottom mirror with a black cloth, removing the cloth briefly each time his/her heart beats. The light ray bouncing back and forth will always arrive at the bottom mirror when it is uncovered, and the "clock" will continue to tick. A crewmember of the Exeter will see the Nostromo’s crewmember uncover the mirror in time with his/her heartbeat, even though the time between bounces will be longer than one second. The Exeter’s crewmember will conclude that the Nostromo’s crewmember’s pulse has slowed down by the same factor as Physics 212, Spring 2003. Notes on special relativity .18 253 the rate of ticking of the light clock. All clocks, whether mechanical, biological, or electrical must slow down by the same factor. This “time dilation” effect is entirely consistent with observations of changes in average lifetimes of unstable particles which are moving at high speed. Here’s an example, based on what we see at Fermilab. K mesons are unstable particles which can be produced in collisions between protons and target nuclei at particle accelerators like Fermilab (which is in Batavia, about an hour’s drive west of Chicago.) A K meson weighs about half as much as a proton. There are two kinds of neutral K mesons and two kinds of charged K mesons; a “long-lived” neutral K, or KL, has an average lifetime of 51.83 nanoseconds. KL‘s can decay into various combinations of lighter particles such as pions, electrons, neutrinos, and muons. Fermilab shoots a beam of 1011 800 GeV protons per second into a beryllium rod located 128 meters “upstream” of my experiment. About 106 KL per second leave the target and travel towards the experiment. The K mesons are traveling close to the speed of light; typical values for their time dilation factors range from 50 to 300. If time dilation didn’t slow down the KL‘s internal clocks, the average number surviving after a time interval T would be e −T 51.83 . Traveling at one foot per nanosecond, only 0.03% of the kaons (or about 300 per second) would survive to reach my experiment. In reality, more than 90% of the kaons manage to travel the 128 meters without decaying in flight. Doppler shifts What would happen if we tried to make a “sound clock” which worked by bouncing the sound of a click between two reflectors mounted on a railroad flat car? Would we get the same sort of time dilation phenomena? (Of course we wouldn’t.) Let’s do the analysis. In the picture below, the reflectors are moving about half the speed of sound: 331m/sec stationary reflectors click! Physics 212, Spring 2003. Notes on special relativity 331m/sec moving reflectors click! .19 254 Because the speed of sound is 331m/sec with respect to still air, it will take longer for the sound of the click to arrive at the upper reflector. Everyone will agree that the sound clock moving with respect to still air is ticking slowly. This time dilation effect will make Doppler shifts for relativistic motion work somewhat differently from Doppler shifts for nonrelativistic systems. Remember about Doppler shifts? Here’s how they work for sounds coming from nonrelativistic sources. Imagine a train emits periodic clicks, as in the following diagram: 1 2 3 4 5 331 m/sec 1 2 34 5 The wavefronts from the clicks travel away from their points of creation at 331 m/sec. A listener in front of the train will hear a shorter period between clicks than a listener behind the train. Let’s say the train is moving at half the speed of sound, emitting 331 clicks per second, and that the first click is emitted at x=0. Here’s a table describing what observers at x = +10 and x = -10 hear: click 1 2 3 4 5 (x,t) for emission of click x=0, t=0 x=0.5, t=.003 x=1, t=.006 x=1.5, t=.009 x=2, t=.012 distance to x = +10 observer 10m 9.5m 9m 8.5m 8m arrival time at +10 observer .0302 .0317 .0332 .0347 .0363 distance to x = -10 observer 10m 10.5m 11m 11.5m 12m arrival time at -10 observer .0302 .0347 .0393 .0438 .0483 Because the source of the clicks is moving, the frequency with which the clicks arrive is shifted, due to the different distance traveled by each click. For a stationary observer listening to a Physics 212, Spring 2003. Notes on special relativity .20 255 source which moves a distance D between clicks, the extra distance traveled by successive clicks is -Dcos(θ): D θ extra distance is Dcos(θ) observer If the source emits clicks separated by time t, it will move a distance D = vt between clicks. The difference in the transit time of successive clicks will be -Dcos(θ) / c where c is the speed of sound. As a result, the ratio of the apparent time interval between clicks to the “true” time interval will be (t - Dcos(θ)/c) / t = (t - vtcos(θ)/c) / t = 1 - vcos(θ)/c. If you prefer to deal in frequencies instead of periods, you can use the fact that frequency is the same thing as 1/period to conclude that fapparent / fsource = c / (c - vcos(θ)). The same thing holds true for sinusoidal waves-- we’d just be describing the time between peaks, instead of time between clicks. There’s one other effect that has to be included when considering relativistic motion-- time dilation. Imagine the Nostromo transmits flashes of light, evenly spaced by a time interval t as measured on the Nostromo’s clocks. An observer on the Exeter will measure a different time interval between the flashes due to the same effect as described above for sound, but with an additional complication: the Nostromo’s clocks seem to be running slowly by a factor (1 - v2/c2)1/2. Naturally, c now represents the speed of light. The time interval between flashes is increased, or the frequency decreased, by this amount. As a result, the apparent frequency of flashes seen by the Exeter is Physics 212, Spring 2003. Notes on special relativity .21 256 f apparent = f source × 1 − v 2 c 2 × c . c − v cos(θ ) Some algebra reveals that this is equivalent to c−v c+v for the case that the source is moving directly away from the observer and c+v f apparent = fsource × c−v for the case that the source is moving directly towards the observer. f apparent = fsource × Notice that many of the algebraic expressions describing the effects associated with motion at high speeds misbehave when v = c. The fact that any observer measuring the speed of a light ray will determine it to be 2.997...× 108 m/sec makes it impossible for any massive object to travel faster than the speed of light. Here’s an illustration of why this is the case. Imagine that a nasty spaceship carrying a large bomb is constructed, as shown in the diagram below. The bomb will detonate if the attached photosensor is struck by light from the rocket’s internal laser. The laser shoots its beam upwards through a tube with black interior walls, from the perspective of the crew of the rocket. hollow tube with black walls Dr. Death Lasers, Inc. Bombs 'r' Us Someone traveling on the rocket ship will get blown up a few nanoseconds after the laser is turned on, as measured by the ships clocks. If the rocket can travel faster than the light beam from the laser, when both are viewed from some other reference frame, the laser will never be able to detonate the bomb: the top of the tube will have moved horizontally farther than the laser beam, even if the beam travels horizontally, and the beam will be absorbed by the walls of the black tube. However, in the rest frame of the spaceship, the bomb certainly will go off. We can’t reconcile these two descriptions of events: either the ship blows up on its way to Alpha Physics 212, Spring 2003. Notes on special relativity .22 257 Centauri, or it doesn’t. Later, we’ll see why it takes an infinite amount of energy to boost an object at rest to the speed of light. Lorentz contraction Things we know: 1. A moving clock seems to run slow. 2. Two clocks synchronized in their rest frame appear to be out-of-synch when viewed from another frame. In particular, the front clock of a pair of moving clocks seems to be set back while the rear clock seems to be set ahead. What happens to (apparent) lengths of relativistic objects? Careful: you must state exactly what you mean to do when you say you'll measure something's length. One way to measure the length of a moving object: N v Nostrom Solaris see which clocks are adjacent to the "moving ship's" nose and tail when t = 0 on all your clocks; call the distance between these two clocks the length of the moving object. Another way: use one clock and the fact that distance = speed × time; the moving ship is known to travel with speed v. Physics 212, Spring 2003. Notes on special relativity .23 258 N v Nostromo Solaris start finish Solaris From the Nostromo's perspective: N Nostromo v ... ... Solaris First method (two clocks): Solaris' clocks are not synchronized so their measurement is not in agreement with the Nostromo's. Second method (one clock): Solaris' single clock runs slow by a factor of 1 − v 2 c 2 so its measurement is "inaccurate" due to this: it measures a length which is shorter than the Nostromo's "rest length" by this same factor. Both methods still have to agree, although they will produce a value which is smaller than the length determined by the Nostromo's crew. Physics 212, Spring 2003. Notes on special relativity .24 259 Quantitative description of nonsimultaneity Let's calculate the amount of missynchronization in a pair of moving clocks (which are synchronized in their rest frame). The rest length of the ship is Lship. 1 (in ship's rest frame) v v A B 2 (in ship's rest frame) v A v B 3 (in ship's rest frame) v A v B Clocks A and B are synchronized in their rest frame. Their spacing is chosen so that in their rest frame each reads zero when passed by the corresponding end of the space ship. Let's call their separation in their rest frame LAB. We will do this: 1. calculate the distance from clock A to the tail of the ship in picture 1, in the ship's rest frame 2. calculate how long it took in the ship's rest frame for clock A to travel this distance (this is how much time has passed on the ship between pictures 1 and 2) 3. calculate the change in the reading on clock A between picture 1 and picture 2. Since we know clock A reads zero in picture 2, this will let us calculate what it read in picture 1. Physics 212, Spring 2003. Notes on special relativity .25 260 1. The clocks are positioned so that their rest-frame separation LAB matches the apparent length of the ship. In the clocks' rest frame, the ship is Lorentz contracted so LAB = Lship 1 − v 2 / c 2 . In the rest frame of the ship, the separation between the (moving) clocks is Lorentz contracted. Call this L'AB: we have 2 L ' AB = LAB 1 − v 2 / c 2 = Lship  1 − v 2 / c 2  = Lship 1 − v 2 / c 2 .   ( ) As a result, the remaining distance to be traveled by clock A is Lship − L ' AB = Lship v 2 c2 . An illustration: 1 (in ship's rest f ) A v B v L'AB Lship - LAB ' Lship 2. →2 = In the ship's rest frame, the clock takes an amount of time t1ship Lship − L ' AB v = Lship v c2 to travel this distance. 3. The amount of time which passes on the face of clock A between pictures 1 and 2 is reduced, because of time dilation: t 1→2 AB =t 1→2 ship 1− v / c 2 Physics 212, Spring 2003. Notes on special relativity 2 {L = ship } 1 − v2 / c2 v c 2 = LAB v . c2 .26 261 LAB v in picture 1. Recall that LAB is c2 the separation between the clocks in their rest frame, and that clock B read zero in picture 1. As a result, the amount of "missynchronization" between two clocks separated by a distance ∆x in their rest frame, when viewed from a frame in which the clicks are in motion, is v∆x/c2. The front clock reads an earlier time than the rear clock. Here's a summary of what we know about the effects of relative motion on distance and time: Since clock A reads zero in picture 2, it must have read − x=-L x=0 t=0 x=+L t=0 t = -vL/c2 L t=0 t=0 t = +vL/c2 v t = +vL/c2 t=0 t = -vL/c2 v From the perspective of the TOP ruler The top ruler is stationary with respect to us. The middle ruler is moving to our left, the lowest ruler is moving to our right. In each ruler's rest frame, the separation between clocks is L and the three clocks are synchronized. 1. Effect on distance between objects: The separation between adjacent clocks seems reduced. Clocks on the moving ruler will seem to be a distance apart given by L × 1 − v2 c2 . 2. Effect on passage of time on a single clock: The rate of passage of time on moving clocks seems slowed. If a clock on the stationary ruler advances by an interval T, a clock on the moving ruler will advance by an amount of time Physics 212, Spring 2003. Notes on special relativity .27 262 T × 1 − v2 c2 . 3. Effect on synchronization of clocks: Moving clocks which are synchronized in their rest frame will not appear synchronized when viewed from another frame. In general, a clock that is "in front" will seem to be set to an earlier time than a clock that is "in back". In the example with the rulers illustrated above, a clock at x = L (as measured in its rest frame) will read t = − Lv c 2 when the moving clock at x = 0 reads t = 0. Note that v < 0 corresponds to motion to the left. Some simple paradoxes and their resolution I. Relativistic train/tunnel Train: rest length 1000 feet, travels with v/c = 0.8. Tunnel: rest length = 800 feet. Stationmasters try to slam entrance and exit doors to the tunnel simultaneously while the train is inside, trapping it in the tunnel. IC Illinois Central Illinois Central Illinois Central 1000 foot rest length door door 800 foot rest length a. Why does this seem like a paradox? b. What happens? Describe from both the train frame and the tunnel frame. II. Clock accelerators A "clock accelerator" consists of a vacuum pipe and a series of accelerating structures, as shown in the diagram. The vacuum pipe contains two rows of clocks; the upper row is moving to the Physics 212, Spring 2003. Notes on special relativity .28 263 right at speed v = 0.8c while the lower row is stationary with respect to the accelerating structures. The accelerating structures (and stationary clocks) are 1000 feet apart. All clocks in a given rest frame are synchronized. The rightmost moving clock reads 0 as it passes the stationary clock, which also reads 0. Just as the stationary clocks read 0, all the accelerating structures fire, boosting the lower row of clocks into the same frame of reference as the upper row of moving clocks. Because the acceleration takes place in an extremely short period of time (let's say a picosecond), none of the clocks have moved an appreciable distance before the acceleration ends. In addition, the readings on the faces of all clocks are virtually unchanged compared with their pre-acceleration values. As a result, the apparent spacing between clocks in the (now moving) lower row will STILL be 1000 feet, and all these clocks will STILL read 0. How is this possible? What happened to all that Lorentz contraction stuff we've been discussing? ? ? Beam pipe accelerating structures 1000 feet Coordinates of an event, viewed from different reference frames An event is something that happens at one point in space, and one instant in time. Examples: collision of clocks on board space ships Exeter forward clock reads 0 as the clock passes the Nostromo's aft clock Not an event: Exeter crew measures the length of the Nostromo (takes a finite time interval or finite distance interval ) Why is this event business useful? Time dilation, Lorentz contractions, etc. etc. can be described in terms of their effects on the separation between events when one shifts reference frames. These are special cases where either the space or time interval between events is zero in ONE frame of reference. For the general case where ∆x ≠ 0 and ∆t ≠ 0, describing measurements in terms of space-time intervals between events will make it easier for us to calculate correctly. Physics 212, Spring 2003. Notes on special relativity .29 264 Time dilation in terms of events event 1: Nostromo nose and Solaris clock coincide and Solaris clock reads 0. event 2: Nostromo tail and Solaris clock coincide. Notice that these events happen in the same place, according to Solaris ... v Event 1 Event 2 N Nostromo Solaris start finish Solaris ...but occur in different places, according to the Nostromo: N Nostromo Event 1 v Solaris Physics 212, Spring 2003. Notes on special relativity .30 265 N Nostromo Event 2 Solaris If the time interval between two events in the frame in which the events occur at the same place is T, the time interval between the events in a different frame (in which the events occurred at different locations) is larger, namely T 1 − v2 c2 . Lorentz contraction in terms of events Event 1: Nostromo's tail passes by Solaris' forward clock. Event 2: Nostromo's nose passes by Solaris' aft clock. From the perspective of the Solaris, the events occur at the same time... N v event 1 Nostromo event 2 Solaris ...but from the perspective of the Nostromo, they don't. Physics 212, Spring 2003. Notes on special relativity .31 266 If the distance interval between two events in the frame in which the events occur at the same time is L, the distance interval between the events in a different frame (in which the events occurred at different times) is larger, namely L 1 − v2 c2 . Why bother? •Avoid sloppy thinking this way: describe all measurements in terms of intervals between events (back of rocket coasts past one part of my tape measure, front of rocket coasts past another part of my tape measure). •Also: gives us a natural language with which to describe things which happen neither at the same time nor at the same place in our reference frame, when viewed from other reference frames. Lorentz transformations Let's work up a description of this: two events are separated by distance x, and time t in one frame; separated by a distance x' and a time t' in another frame. Imagine we are at rest with respect to a string of clocks as shown below. A second string glides past the first, moving with speed v. Two clocks in each string are attached to spark electrodes; a spark is created as the corresponding electrodes align. The left electrode in the (to us) stationary string is at x=0, the right electrode at x=L. In the rest frame of the moving string of clocks the left and right electrodes in this string of clocks are at x'=0 and x'=L'. In general, L and L' can be different. (For example, the electrodes are placed 11 clocks apart on the lower string, but 10 clocks apart on the upper string.) Event 1: the left clocks' spark electrodes touch and a spark is created. x'=L', t'=? x'=0, t'=0 ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? v Event 1 x=0, t=0 x=L, t=0 Coordinates of event 1 from the perspective of the lower string: x=0, t=0. Coordinates of event 1 from the perspective of the upper string: x'=0, t'=0. Physics 212, Spring 2003. Notes on special relativity .32 267 From the perspective of the lower clock string, the upper string's right clock is at x = L′ 1 − v 2 c 2 (note that I am using x, not x' here), and it reads t ′ = − vL′ c 2 (note that I am using t', L', not t, L here). Moving clocks look to be out-of-synch, and moving objects look shorter, right? Event 2: the right clocks' spark electrodes touch and a spark is created. x'=0, t'=? x'=L', t'=?? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? Event 2 x=0, t=T v x=L, t=T Coordinates of event 2 from the perspective of the lower string: x=L, t=T. Coordinates of event 2 from the perspective of the upper string: x'=L', t'=??. Let's sort out what the upper string's right clock is reading. We know it was reading -vL'/c2 when event 1 occurred. We know that a time interval T has passed on the lower string's clocks. We know the upper string's right clock is ticking more slowly than the lower string's clocks. As a result, the right clock must be reading t ′ = − vL′ c 2 + T 1 − v 2 c 2 . Do some work to replace L' with quantities known to the bottom clock string observers: Distance between bottom clocks is the same as the sum of the Lorentz-contracted rest-frame separation between top clocks and the distance the top clock string moves in time T. In L − vT = L′ . equations: L = L′ 1 − v 2 c 2 + vT or 1 − v2 c2 Substitute this into the previous equation to get  L − vT  2 T − vL c 2 after some algebra. t ′ = −v  c + T 1 − v2 c2 =  1 − v2 c2  1 − v 2 c 2  Collecting what we know, we can conclude the following about the space/time interval between the events, when viewed from either frame: Physics 212, Spring 2003. Notes on special relativity .33 268 interval space time bottom string's frame ∆x = x2 − x1 = L ∆t = t2 − t1 = T top string's frame L − vT ∆x′ = x′2 − x1′ = L′ = 1 − v2 c2 ∆t ′ = t2′ − t1′ = T − vL c 2 1 − v2 c2 Slightly more compactly (and after some algebra to express ∆x, ∆t in terms of ∆x', ∆t'), we have the Lorentz Transformations: ∆x′ = ∆x = ∆x − v∆t 1 − v2 c2 ∆x′ + v∆t ′ 1 − v2 c2 ∆t ′ = ∆t = ∆t − v∆x c 2 1 − v2 c2 ∆t ′ + v∆x′ c 2 1 − v2 c2 It takes a bit of thought to keep the sign straight in front of the term proportional to velocity. Note that v refers to the velocity of the reference frame which measures "primed" quantities when viewed from the reference frame which measures "unprimed" quantities. Intervals along directions perpendicular to the relative velocity between the frames agree in both frames: ∆y' = ∆y, ∆z' = ∆z. A useful shorthand is to define γ ≡ 1 1 − v 2 c 2 and β ≡ v c . Using this, we can write ∆x′ = γ ( ∆x − β c∆t ) ∆t ′ = γ ( ∆t − β∆x c ) ∆x = γ ( ∆x′ + β c∆t ′ ) ∆t = γ ( ∆t ′ + β∆x′ c ) Given the space-time interval between two events as measured in one frame, we can calculate what observers in another frame will measure if we know the relative velocity of that frame with respect to the other frame. Addition of relativistic velocities Let's use the Lorentz transformation formulas to derive a velocity addition formula. Here's the sort of problem we'll need this for: Imagine the Nostromo's shuttle craft leaves the Nostromo traveling at speed v, with respect to the Nostromo. Physics 212, Spring 2003. Notes on special relativity .34 269 Nostromo v An observer on the Solaris sees the Nostromo coasting to the right with velocity u. What is the shuttle's apparent velocity according to this same observer? u Nostromo ? Solaris To use Lorentz transformations to sort this out, let's define two events like so: event 1: Shuttle leaves Nostromo and clock at Nostromo's nose reads t' = 0. event 2: Shuttle passes Nostromo crewmember on spacewalk who is at rest with respect to the Nostromo, a distance x' to the right of the Nostromo. The crewmember's watch reads t' = x'/v. The interval between the two events is ∆x' = x'-0 and ∆t' = x'/v - 0 From the perspective of the Solaris, the Nostromo rest frame is moving to the right with speed u, so we can calculate the separation between the two events using the Lorentz transformations Physics 212, Spring 2003. Notes on special relativity .35 270 (with β = u/c here, not v/c): ∆x = γ ( ∆x′ + β c∆t ′ ) = γ ( x′ + β c x′ v ) and ∆t = γ ( ∆t ′ + β∆x′ c ) = γ ( x′ v + β x′ c ) . The shuttle's speed, according to the Solaris, is ∆x/∆t: (1 + u v ) = ( v + u ) . ∆x γ ( x′ + β c x′ v ) = = ∆t γ ( x′ v + β x′ c ) 1 v + u c 2 1 + vu c 2 ( ) ( ) In the limit that u → c and v → c, the apparent speed of the shuttle tends towards c, not 2c. Again-- we find that c is the maximum velocity for any object. Causality Nonrelativistic version: Event 1 can only cause event 2 if event 1 happened before event 2. Problem with this version: nothing can go faster than c. Event 1: You pick up the phone at 9am on January 1, 3000AD and place a LONG distance call while you are coasting through space on the starship Alpha. Alpha Omega Event 1: x = 0, t = 9:00 am, Jan. 1, 3000AD. Alpha places call. Event 2: The starship Omega is 24 trillion miles (about four light years) away from the Alpha, but at rest with respect to the Alpha. Clocks on the two ships are synchronized. A phone rings on the Omega at 9:00am, January 1, 3003AD, waking up the Omega's captain. Alpha Omega Event 2: x = 24 trillion miles, t = 9:00 am, Jan. 1, 3003AD. Omega phone rings. Is it possible that you woke up the Omega's captain? Of course not-- the signal from your telephone can't travel faster than c; it'll take slightly more than 4 years for your phone call to arrive at the Omega. Physics 212, Spring 2003. Notes on special relativity .36 271 Relativistic version of causality: event 1 can only cause event 2 if event 1 happened sufficiently early with respect to event 2 so that a light beam leaving x1 at time t1 could arrive at x2 no later than time t2: x2 − x1 < c t2 − t1 . We expect: events which are "causally connected" when described using x, t coordinates measured in one frame of reference will be causally connected when viewed from any other frame of reference. Is this really true? (We can Lorentz-transform to check that ∆x < c∆t → ∆x' < c∆t'.) How does the Alpha-Omega phone call look when viewed from a third ship, which sees both ships in motion? Let's say the ships seem to be moving to the right with speed v equal to βc. Use Lorentz transformations. Event 1 Alpha Omega 1 Solaris cosmonauts tethered to Solaris Event 2 Alpha Omega 2 Solaris You might think that the Solaris could see the events as causally connected if the Alpha-Omega distance is Lorentz-contracted to less than three light years. Shouldn't a light beam be able to make it from the Alpha to the Omega before the Omega's phone rings, in this frame of reference? NO! It won't work. Non-simultaneity is at it again. Sure, The Omega's phone rings when its clocks read 9am, Jan. 1, 3003AD, but the Omega's clock didn't read 9am, Jan. 1, 3000AD when you placed the call from the Alpha. Use Lorentz transformations to work out the ratio of the space and time intervals between the events as seen from Solaris (primed coordinates are measured by the crew of the Solaris): ∆x′ = γ ( ∆x − β c∆t ) and ∆t ′ = γ ( ∆t − β∆x c ) . After some algebra, one finds that Physics 212, Spring 2003. Notes on special relativity .37 272 ( c∆t′) − ( ∆x′ ) 2 2 = ( c∆t ) − ( ∆x ) : 2 2 two events that are causally disconnected in one frame of reference will be causally disconnected 2 2 in ALL frames of reference. The quantity ∆τ 2 ≡ ( ∆t ) − ( ∆x c ) is an invariant under Lorentz transformations: calculate it using values for the separation between two events as measured in one frame of reference and you'll obtain exactly the same value that someone else using measurements made in a different frame of reference will obtain. ∆τ2 is sometimes called the "invariant interval squared" or the "proper time squared" between two events. Note that ∆τ2 can be negative (as it is in the case we've been discussing): don't be mislead by the square. Events whose separations yield a positive ∆τ2 can be causally connected; events with negative ∆τ2 cannot be causally connected. A pair of events with ∆τ2 = 0 can only be causally linked if the first event used a light beam to make the second event take place. Some comments on ∆τ2: 1. If ∆τ2 > 0: • Event 2 always takes place after event 1, regardless of your frame of reference (t2 > t1 always). • It is possible to find a frame of reference in which event 1 and event 2 occur at the same point in space. • The interval between the events is said to be timelike (|∆t| > |∆x/c|). 2. If ∆τ2 < 0: • In some frames, event 2 takes place after event 1, in some frames event 2 takes place before event 1. There is one particular frame of reference in which the two events occur simultaneously. • It is impossible to find a frame of reference in which event 1 and event 2 occur at the same point in space. • The interval between the events is said to be spacelike (|∆x/c| > |∆t|). ²²²²²²²²²² Physics 212, Spring 2003. Notes on special relativity .38 273 Physics 212, Spring 2003. Notes on special relativity .39 274 Physics 212, Spring 2003. Notes on special relativity .40

Honors Topics in Electrodynamics

Related documents

Products

Support

Honors Topics in Electrodynamics

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib