cse141: Introduction to Computer Architecture Steven Swanson Hung-Wei Tseng 1 Today’s Agenda is architecture? • What is it important? • Why the highest level, where is architecture today? • At Where is it going? • What’s in this class? 2 What is architecture? • How do you build a machine that computes? • Quickly, safely, cheaply, efficiently, in technology X, for application Y, etc. Civilization advances by extending the number of important operations which we can perform without thinking about them. -- Alfred North Whitehead Orientation The internet Orientation The internet Orientation IO Memory Power Memory System Bus (PCI) Memory Power Memory Architecture begins about here. Orientation IO Memory Power Memory System Bus (PCI) Memory Power Memory Architecture begins about here. You are here You are here cse141 The processors go here… The processors go here… Abstractions of the Physical World… Physics/Materials Devices Micro-architecture Processors Architectures Abstractions of the Physical World… Physics/ Chemistry/ Material science cse241a/ ECE dept Physics/Materials Devices This Course Micro-architecture Processors Architectures …for the Rest of the System JVM Processor Architectures Abstraction Compilers Languages Software Engineers/ Applications …for the Rest of the System cse121 cse131 cse130 cseEverythingElse JVM Processor Architectures Abstraction Compilers Languages Software Engineers/ Applications Why study architecture? • As CEs or CSs you should understand how computers work are the basis for everything in CS (except theory) • Processors • They are where the rubber meets the road. is important • •Performance Faster machines make applications cheaper hardware is essential to understanding how • Understanding systems behave • • It’s cool! • • Microprocessors are among the most sophisticated devices manufactured by people How they work (and even that they work) as reliably and as quickly as they do is amazing. Architecture is undergoing a revolution • • The future is uncertain Opportunities for innovation abound. 11 Performance and You! • Live Demo 12 Processor are Cool! are made of silicon • Chips Aka “sand” • • • • The most adundant element in the earth’s crust. Extremely pure (<1 part per billion) This is the purest stuff people make Building Chips Building Chips • Photolithography Silicon Wafer Building Chips • Photolithography Silicon Wafer SiO2 Silicon Wafer Grow silicon dioxide Building Chips • Photolithography Silicon Wafer SiO2 Silicon Wafer Resist SiO2 Silicon Wafer Grow silicon dioxide Apply photo resist Building Chips • Photolithography Mask Silicon Wafer Mask SiO2 Silicon Wafer Resist SiO2 Silicon Wafer Resist SiO2 Silicon Wafer Grow silicon dioxide Apply photo resist Expose to UV Building Chips • Photolithography Mask Silicon Wafer SiO2 Silicon Wafer Patterned resist Mask SiO2 Silicon Wafer Resist SiO2 Silicon Wafer Resist SiO2 Silicon Wafer Grow silicon dioxide Apply photo resist Expose to UV Building Chips • Photolithography Mask Silicon Wafer SiO2 Silicon Wafer Patterned resist Mask SiO2 Silicon Wafer Resist SiO2 Silicon Wafer Resist SiO2 Silicon Wafer Grow silicon dioxide Apply photo resist Expose to UV Silicon Wafer Etch SiO2 Building Chips • Photolithography Mask Silicon Wafer SiO2 Silicon Wafer Patterned resist Mask SiO2 Silicon Wafer Resist SiO2 Silicon Wafer Resist SiO2 Silicon Wafer Grow silicon dioxide Apply photo resist Expose to UV Silicon Wafer Etch SiO2 Met Silicon Wafer Deposit metal Building Chips • Photolithography Mask Silicon Wafer SiO2 Silicon Wafer Patterned resist Mask SiO2 Silicon Wafer Resist SiO2 Silicon Wafer Resist SiO2 Silicon Wafer Grow silicon dioxide Apply photo resist Expose to UV Silicon Wafer Etch SiO2 Met Silicon Wafer Deposit metal Met Silicon Wafer Etch SiO2 (Or not) Building Blocks: Transistors Building Blocks: Wires State of the art CPU Billion xtrs • 1-2 features • 45nm • 3-4Ghz 100 designers • Several years • >5 fab • $3Billion • 70 GFLOPS 18 Current state of architecture Since 1940 Since 1940 • Plug boards -> Java • Hand assembling -> GCC • No OS -> Windows Vista Since 1940 • 50,000 x speedup • >1,000,000,000 x density (Moore’s Law) • Plug boards -> Java • Hand assembling -> GCC • No OS -> Windows Vista Flexible performance is a liquid asset Moore’s Law: Raw transistors The Importance of Architecture • We design smarter and smarter processors • Process technology gives us about 20% performance improvement per year • Until 2004, performance grew at about 40% per year. • The gap is due to architecture! (and compilers) Computer Performance 23 Computer Performance 10000 specINT95 specINT2000 specINT2006 Relative Performance 1000 100 10 1 1990 1992 1994 1996 1998 2000 2002 2004 2006 2008 2010 Year 23 Computer Performance 10000 specINT95 specINT2000 specINT2006 47% per year Relative Performance 1000 100 10 1 1990 1992 1994 1996 1998 2000 2002 2004 2006 2008 2010 Year 23 Computer Performance 10000 specINT95 specINT2000 specINT2006 47% per year 39% per year Relative Performance 1000 100 10 1 1990 1992 1994 1996 1998 2000 2002 2004 2006 2008 2010 Year 23 Computer Performance 10000 specINT95 specINT2000 specINT2006 47% per year 39% per year 25% per year Relative Performance 1000 100 10 1 1990 1992 1994 1996 1998 2000 2002 2004 2006 2008 2010 Year 23 The clock speed addiction specINT2000 specINT2006 5000 Clock speed (Mhz) 4000 3000 2000 1000 0 1996 1998 2000 2002 2004 2006 2008 2010 Year speed is the biggest contributor to power • •Clock Chip manufactures (Intel, esp.) pushed clock speeds very • • hard in the 90s and early 2000s. Doubling the clock speed increases power by 2-8x Clock speed scaling is essentially finished. 24 Power Watts/cm 2 1000 100 10 1 1.5µ 1µ 0.7µ 0.5µ 0.35µ 0.25µ 0.18µ 0.13µ 0.1µ 0.07µ 25 Power Watts/cm 2 1000 100 10 1 1.5µ 1µ 0.7µ 0.5µ 0.35µ 0.25µ 0.18µ 0.13µ 0.1µ 0.07µ 25 Power Watts/cm 2 1000 100 10 1 1.5µ 1µ 0.7µ 0.5µ 0.35µ 0.25µ 0.18µ 0.13µ 0.1µ 0.07µ 25 Power Watts/cm 2 1000 100 10 1 1.5µ 1µ 0.7µ 0.5µ 0.35µ 0.25µ 0.18µ 0.13µ 0.1µ 0.07µ 25 Power Watts/cm 2 1000 100 10 1 1.5µ 1µ 0.7µ 0.5µ 0.35µ 0.25µ 0.18µ 0.13µ 0.1µ 0.07µ 25 Power Watts/cm 2 1000 100 10 1 1.5µ 1µ 0.7µ 0.5µ 0.35µ 0.25µ 0.18µ 0.13µ 0.1µ 0.07µ 25 What’s Next: Brainiacs the clock rate steady. • Hold smarter in silicon • •BeMore sophisticated processors • • • More clever algorithms This continues to deliver about 25% per year. But for how long? 26 What’s Next: Parallelism is all the rage right now • This probably own a multi-processor, they used to • You be pretty exotic. provide some performance, but it’s hard to • They use. • • • There aren’t that many threads Remember, flexible performance is a liquid asset Remember or look forward to cse121 27 Intel P4 1 core Intel Core 2 Duo 2 cores Intel Nahalem 4 cores SPARC T1 Intel Prototype Cell BE AMD Barcelona 8 cores 4 cores 8 + 1 cores 80 cores 28 Computer Performance 29 Computer Performance specINT2000 specINT2006 39% per year 25% per year Relative Performance 10000 1000 100 1996 1998 2000 2002 2004 2006 2008 2010 Year 29 Course Staff Steven Swanson • Instructor: Lectures Tues + Thurs • Hung-Wei Tseng • TA:Discussion sec: Wed. • • (but not this week) the course web page for • See contact information and office hours. 30 What’s in this Class outline • Course Instruction sets • • • • The basics of silicon technology Measuring performance How processors work • • • Basic pipelining Data and control hazards Branch prediction and speculation memory system • The • Introduction to multiprocessors technology digressions • •Weekly How various technologies actually work. 31 Your Tasks • • Read the text! • • Computer Organization and Design: The Hardware/Software Interface (4th Edition) -- previous editions are not supported I’m not going to cover everything in class, but you are responsible for all the assigned text. Come to class! • • I will cover things not in the book. You are responsible for that too. Class participation (5%) throughout the course. (10%) • Homeworks quizzes on Thursdays (10%) • Weekly midterm. (25%) • One cumulative final. (35%) • One project (15%) • •One Design your own ISA! 32 The Link to 141L do not need to take 141L along with 141, • You but you may need both to get your degree. are mostly independent, except • TheTheclasses results of the project will be used in 141L • • You can earn extra credit by licensing your ISA groups in 141L who are not in 141 33 Grading is on a 13 point scale -- F through A+ • Grading You will get a letter grade on each assignment • • Your final grade is the weighted average of the assignment grades. spreadsheet calculates your grades • •AnWeexcel will post a sanitized version online once a week. • • It will tell you exactly where you stand. It specifies the curves used for each assignment etc. • OpenOffice doesn’t run it properly. 34 Academic Honesty cheat. • Don’t Cheating on a test will get you an F in the class and no • • option to drop, and a visit with your college dean. Cheating on homeworks means you don’t have to turn them in any more, but you don’t get points either. You will also take at least 25% penalty on the exam grades. solutions of the internet or a solutions • Copying manual is cheating. the UCSD student handbook • Review in doubt, ask. Honest mistakes will be • When forgiven. 35