Presentation is loading. Please wait.

Presentation is loading. Please wait.

Lectures Slides and Figures from MKP and Sudhakar Yalamanchili

Similar presentations


Presentation on theme: "Lectures Slides and Figures from MKP and Sudhakar Yalamanchili"— Presentation transcript:

1 Lectures Slides and Figures from MKP and Sudhakar Yalamanchili
Introduction Lectures Slides and Figures from MKP and Sudhakar Yalamanchili

2 Reading Sections 1.1, 1.2, 1.3, 1.4, 1.5, 1.7, 1.8 Key Ideas
Types of systems  implications for computer architecture The impact of technology New rules  the power wall and parallelism

3 Historical Perspective
ENIAC built in World War II was the first general purpose computer Used for computing artillery firing tables 80 feet long by 8.5 feet high and several feet wide Each of the twenty 10 digit registers was 2 feet long Used 18,000 vacuum tubes Performed 1900 additions per second Since then Moore’s Law Transistor density doubles every months Modern version #cores double every months

4 The Modern Era blogs.intel.com Hundred’s of $$ Battery operated Internet capable Contrast with warehouse scale computing, e.g., Google and Amazon Software as a service Backend for mobile devices Power consumption limited

5 Morgan Kaufmann Publishers
June 30, 2018 Opening the Box Capacitive multitouch LCD screen 3.8 V, 25 Watt-hour battery Computer board Chapter 1 — Computer Abstractions and Technology

6 Morgan Kaufmann Publishers
June 30, 2018 Inside the Processor Apple A5 Chapter 1 — Computer Abstractions and Technology

7 Warehouse Scale Computers
SUN MD S20: Water cooled containers 187.5KW Google data center in Oregon Power densities of KW/m2  footprint  cost From R. Katz, “Tech Titans Building Boom,” IEEE Spectrum, February 2009,

8 Morgan Kaufmann Publishers
June 30, 2018 Inside the Core (CPU) Datapath: performs operations on data Control: sequences datapath, memory, ... Cache memory Small fast SRAM memory for immediate access to data (IBM HS20 blade server) Chapter 1 — Computer Abstractions and Technology

9 Morgan Kaufmann Publishers
June 30, 2018 Inside the Processor AMD Barcelona: 4 processor cores Chapter 1 — Computer Abstractions and Technology

10 Morgan Kaufmann Publishers
June 30, 2018 Reminder ECE 2035 High-level language Level of abstraction closer to problem domain Assembly language Textual representation of instructions Hardware representation Encoded instructions and data ECE 3056 How does this work? Chapter 1 — Computer Abstractions and Technology

11 Moore’s Law Goal: Sustain Performance Scaling From wikipedia.org

12 Uniprocessor Performance
Morgan Kaufmann Publishers June 30, 2018 Uniprocessor Performance §1.8 The Sea Change: The Switch to Multiprocessors Constrained by power, instruction-level parallelism, memory latency Chapter 1 — Computer Abstractions and Technology

13 Memory Wall “Moore’s Law” Time µProc 1000 60%/yr. 100 Processor-Memory
CPU registers ALU registers ALU 100 Processor-Memory Performance Gap: (grows 50% / year) L1D$ L1I$ L1D$ L1I$ 10 DRAM 7%/yr. L2$ L2$ L2$ L2$ DRAM 1 L3$ L3$ Time

14 New Rules: The End of Dennard Scaling
GATE SOURCE DRAIN tox L Voltage is no longer scaling at the same rate Feature size Slower scaling in power per transistor  increasing power densities From R. Dennard, et al., “Design of ion-implanted MOSFETs with very small physical dimensions,” IEEE Journal of Solid State Circuits, vol. SC-9, no. 5, pp , Oct

15 Post Dennard Performance Scaling
W. J. Dally, Keynote IITC 2012 15

16 Morgan Kaufmann Publishers
June 30, 2018 Power Wall New Metrics Power efficiency Energy Efficiency In CMOS IC technology ×30 5V → 1V ×1000 Chapter 1 — Computer Abstractions and Technology

17 X2: 300mm wafer, 117 chips, 90nm technology
Understanding Cost X2: 300mm wafer, 117 chips, 90nm technology X4: 45nm technology What happens if you simply port a design across technology generations? What about design costs? Hardware and software

18 Integrated Circuit Cost
Morgan Kaufmann Publishers June 30, 2018 Integrated Circuit Cost Nonlinear relation to area and defect rate Wafer cost and area are fixed Defect rate determined by manufacturing process Die area determined by architecture and circuit design Chapter 1 — Computer Abstractions and Technology

19 Average Transistor Cost Per Year
Source: Courtesy H.H. Lee, ECE 3055

20 3D Packaging New packaging technologies to increase processor- memory bandwidth What problems does this create? Images from techweekeurope.co.uk

21 Morgan Kaufmann Publishers
June 30, 2018 A Safe Place for Data Volatile main memory Loses instructions and data when power off Non-volatile secondary memory Magnetic disk Flash memory Optical disk (CDROM, DVD) Chapter 1 — Computer Abstractions and Technology

22 Morgan Kaufmann Publishers
June 30, 2018 Networks Communication and resource sharing Local area network (LAN): Ethernet Within a building Wide area network (WAN: the Internet Wireless network: WiFi, Bluetooth Chapter 1 — Computer Abstractions and Technology

23 Morgan Kaufmann Publishers
June 30, 2018 Parallelism Multicore microprocessors More than one processor per chip Parallel programming Compare with instruction level parallelism Hardware executes multiple instructions at once Hidden from the programmer Hard to do Programming for performance Load balancing Optimizing communication and synchronization Chapter 1 — Computer Abstractions and Technology

24 Multicore, Many Core, and Heterogeneity
NVIDIA Keplar Performance scaling via increasing core count The advent of heterogeneous computing AMD Trinity Intel Ivy Bridge Different instruction sets

25 Eight Great Ideas Design for Moore’s Law
Use abstraction to simplify design Make the common case fast Performance via parallelism Performance via pipelining Performance via prediction Hierarchy of memories Dependability via redundancy

26 Morgan Kaufmann Publishers
June 30, 2018 Concluding Remarks New Rules Power and energy efficiency are driving concerns Cost is an exercise in mass production Relationship to instruction set architecture (ISA)? Instruction set architecture (ISA) The hardware/software interface is the vehicle for portability and cost management Multicore Core scaling vs. frequency scaling Need for parallel programming  need to think parallel! Chapter 1 — Computer Abstractions and Technology

27 Study Guide Moore’s Law Technology Trends Understanding Cost
What is it? What are the cost and performance consequences? Technology Trends Explain the reason for the shift to power and energy efficient computing Understanding Cost What are the major elements of cost? Multicore processor Distinguishing features Basic Components of a Modern Processor

28 Glossary Energy efficiency Dennard Scaling Die yield Feature size
Heterogeneity Moore’s Law Multicore architecture Memory Wall Performance scaling Parallel programming Power efficiency Power Wall Wafer


Download ppt "Lectures Slides and Figures from MKP and Sudhakar Yalamanchili"

Similar presentations


Ads by Google