CS 61 C Great Ideas in Computer Architecture

  • Slides: 28
Download presentation
CS 61 C: Great Ideas in Computer Architecture Case Studies: Server and Cellphone microprocessors

CS 61 C: Great Ideas in Computer Architecture Case Studies: Server and Cellphone microprocessors Instructors: Krste Asanovic, Randy H. Katz http: //inst. eecs. Berkeley. edu/~cs 61 c/fa 12 Fall 2012 -- Lecture #38 1

Today: Intel Haswell and smartphone/tablet processors • This material is not on final exam!

Today: Intel Haswell and smartphone/tablet processors • This material is not on final exam! • Intended for you to see modern day computer architectures. Fall 2012 -- Lecture #38 2

Intel Haswell Core • Not yet in production, the next core after Ivy Bridge!

Intel Haswell Core • Not yet in production, the next core after Ivy Bridge! • Acknowledgements: Slides include material from Intel and David Kanter at Real World Technologies – Recommend site realworldtech. com for reading about new microprocessor architectures 3

Fall 2012 -- Lecture #38 4

Fall 2012 -- Lecture #38 4

Fin. FETs are a Berkeley EECS innovation! Fall 2012 -- Lecture #38 5

Fin. FETs are a Berkeley EECS innovation! Fall 2012 -- Lecture #38 5

Fall 2012 -- Lecture #38 6

Fall 2012 -- Lecture #38 6

How to run x 86 code fast? • x 86 architecture evolved from 16

How to run x 86 code fast? • x 86 architecture evolved from 16 -bit microprocessor designed for CISC microcoded implementation – 8086 introduced in 1978 (34 years old!) – Only older widely used ISA is IBM 360 architecture family introduced in 1964 (48 years old!) • Typical instruction: Reg = Reg op M[Reg] – Two-address format – Register-memory operations – Few general-purpose registers (8 initially, 16 in 64 -bit extension) • Many complex instructions with repeat prefixes – String move in one instruction • Variable-length instructions up to 16 bytes long • Added one instruction/week over lifetime!! 7

Convert CISC to RISC Dynamically! • Translate complex x 86 instructions into RISC-like microoperations

Convert CISC to RISC Dynamically! • Translate complex x 86 instructions into RISC-like microoperations (µops) during instruction decode – e. g. , “R R op Mem” translates into – – load T 1, Mem # Load from Mem into temp reg R R op T 1 # Operate using value in temp • Execute µops using speculative out-of-order superscalar engine with register renaming – Both architectural and temporary registers are renamed from same pool • Reconstruct whole x 86 instructions during commit process to report exceptions precisely • µop translation introduced in Pentium Pro family architecture (P 6 family) in 1995, used in all subsequent x 86 out-of-order processors

Haswell Front-End [Kanter] Fall 2012 -- Lecture #38 9

Haswell Front-End [Kanter] Fall 2012 -- Lecture #38 9

Haswell Rename/Reorder [Kanter] Fall 2012 -- Lecture #38 10

Haswell Rename/Reorder [Kanter] Fall 2012 -- Lecture #38 10

Haswell Execution [Kanter] ALU Memory Fall 2012 -- Lecture #38 11

Haswell Execution [Kanter] ALU Memory Fall 2012 -- Lecture #38 11

Fall 2012 -- Lecture #38 12

Fall 2012 -- Lecture #38 12

Fall 2012 -- Lecture #38 13

Fall 2012 -- Lecture #38 13

Fall 2012 -- Lecture #38 14

Fall 2012 -- Lecture #38 14

Fall 2012 -- Lecture #38 15

Fall 2012 -- Lecture #38 15

Fall 2012 -- Lecture #38 16

Fall 2012 -- Lecture #38 16

Fall 2012 -- Lecture #38 17

Fall 2012 -- Lecture #38 17

Fall 2012 -- Lecture #38 18

Fall 2012 -- Lecture #38 18

Administrivia • Final review session with TAs – Wednesday December 5 – 12: 00

Administrivia • Final review session with TAs – Wednesday December 5 – 12: 00 pm-3: 00 pm – 1 Pimentel Fall 2012 -- Lecture #38 19

Smartphone Processors • Many companies and parts but some common features: – ARM ISA

Smartphone Processors • Many companies and parts but some common features: – ARM ISA for application processors – Lots of dedicated accelerator blocks, especially image processors for cameras and GPUs for graphics – Only ~2 W max power dissipation! Fall 2012 -- Lecture #38 20

NVIDIA Tegra 3 • Used in Nexus 7 and many other phones, tablets, and

NVIDIA Tegra 3 • Used in Nexus 7 and many other phones, tablets, and Audi cars… Fall 2012 -- Lecture #38 21

Tegra 3 Block Diagram Fall 2012 -- Lecture #38 22

Tegra 3 Block Diagram Fall 2012 -- Lecture #38 22

Tegra 3 “ 4 plus 1” operation Fall 2012 -- Lecture #38 23

Tegra 3 “ 4 plus 1” operation Fall 2012 -- Lecture #38 23

ARM Cortex A 9 Fall 2012 -- Lecture #38 24

ARM Cortex A 9 Fall 2012 -- Lecture #38 24

Qualcomm Snapdragon MSM 8960 Fall 2012 -- Lecture #38 25

Qualcomm Snapdragon MSM 8960 Fall 2012 -- Lecture #38 25

Apple A 6 and A 6 X (32 nm) [Chipworks. com, 2012] Fall 2012

Apple A 6 and A 6 X (32 nm) [Chipworks. com, 2012] Fall 2012 -- Lecture #38 26

Apple A 6 X [chipworks. com] In i. Pad 4 76. 8 GFLOPS Peak!

Apple A 6 X [chipworks. com] In i. Pad 4 76. 8 GFLOPS Peak! Fall 2012 -- Lecture #38 27

Summary • Continual rapid change in architecture – Mobile and server processors include large

Summary • Continual rapid change in architecture – Mobile and server processors include large and increasing number of processors on single chip – More specialized processors common – New architectural concepts (transactional memory) • Covered basic ideas behind architectures in CS 61 C, but to learn more take CS 152 Fall 2012 -- Lecture #38 28