HS 06 on the last generation of CPU

  • Slides: 33
Download presentation
HS 06 on the last generation of CPU for HEP server farm Michele Michelotto

HS 06 on the last generation of CPU for HEP server farm Michele Michelotto 1

The HEP server for CPU farm � Two socket � Rack mountable: 1 U,

The HEP server for CPU farm � Two socket � Rack mountable: 1 U, 2 U, dual twin, blade � Multicore � About 2 GB per logical cpu � x 86 -64 � Intel or AMD 2

AMD 3

AMD 3

Interlagos 4

Interlagos 4

Intel roadmap 5

Intel roadmap 5

New Intel Naming After Several generation of Xeon 5 n xx 51 xx (Woodcrest

New Intel Naming After Several generation of Xeon 5 n xx 51 xx (Woodcrest /Core 2 c 65 nm) 53 xx (Clovertown / Core 4 c 65 nm) 54 xx (Harpertown / Penryn 4 c 45 nm) 55 xx (Gainestown / Nehalem 4 c/8 t 45 nm) 56 xx (aka Gulftown / Nehalem 6 c/12 t 45) Now Xeon E 5 26 xx “Sandy Bridge” EP 8 c/16 t ( @32 nm ) 6

The dual proc Xeon E 5 26 xx 7

The dual proc Xeon E 5 26 xx 7

Configuration Software � Operating System: SL release 5. 7 (Boron) � Compiler: gcc version

Configuration Software � Operating System: SL release 5. 7 (Boron) � Compiler: gcc version 4. 1. 2 20080704 (Red Hat 4. 1. 2 -51) � HEP-SPEC 06 based on SPEC CPU 1. 2 (32 bit) � HEP-SPEC 06 64 bit (default config + remove “– m 32”) � 2 GB per core unless explicitly stated 8

AMD 6272 2 x 16 core 64 GB at 2. 1( up to 2.

AMD 6272 2 x 16 core 64 GB at 2. 1( up to 2. 6) GHz 9

At 64 bit 10

At 64 bit 10

Dynamic clock 11

Dynamic clock 11

Opteron 6272: from 32 to 64 bit 12

Opteron 6272: from 32 to 64 bit 12

Intel Xeon E 5 2660 2 x 8 c/16 t 64 GB at 2.

Intel Xeon E 5 2660 2 x 8 c/16 t 64 GB at 2. 2 (up to 2. 8) GHz 13

Xeon E 5 at 64 bit 14

Xeon E 5 at 64 bit 14

Several slopes due to Turbo Mode 15

Several slopes due to Turbo Mode 15

From 32 to 64 bit 16

From 32 to 64 bit 16

Intel vs AMD Running 64 bit application AMD is better than what one would

Intel vs AMD Running 64 bit application AMD is better than what one would expect if one measures it with a 32 bit benchmark like HS 06 17

Xeon E 5 Memory effect 18

Xeon E 5 Memory effect 18

Intel Xeon E 5 2660 HT ON vs HT OFF Hep. Spec 32 bit

Intel Xeon E 5 2660 HT ON vs HT OFF Hep. Spec 32 bit 350, 00 300, 00 250, 00 hepmark 200, 00 32 cores – HT – 64 GB 150, 00 16 cores – NO HT – 64 GB 100, 00 50, 00 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 threads 19

Xeon E 5 – HT ON vs HT OFF 20

Xeon E 5 – HT ON vs HT OFF 20

AMD Opteron 21

AMD Opteron 21

New 6272 2 x 16 core vs old 6174 2 x 12 core 22

New 6272 2 x 16 core vs old 6174 2 x 12 core 22

HS 06 Normalize on the nominal clock 64 bit 23

HS 06 Normalize on the nominal clock 64 bit 23

Architectures compared A Bulldozer core contains the equivalent of 2 cores of the previous

Architectures compared A Bulldozer core contains the equivalent of 2 cores of the previous generation relative performance They have about the same performances at 24 threads (6174 full loaded) With less threads better performances due to dynamic clock increase From 24 to 32 better performances due to the increased number of cores 24 Increase in performance more visible at 64 bit

Intel Xeon E 5 25

Intel Xeon E 5 25

Intel Xeon 2660 @ 2. 2 GHz vs Old Xeon 5650 @ 2. 66

Intel Xeon 2660 @ 2. 2 GHz vs Old Xeon 5650 @ 2. 66 GHz 26

relative performance Sandy Bridge 2. 2 GHz vs Nehalem 2. 66 GHz 27

relative performance Sandy Bridge 2. 2 GHz vs Nehalem 2. 66 GHz 27

Clock normalization 28

Clock normalization 28

Architectures compared A Sandy Bridge core has about 40% more throughput at same clock

Architectures compared A Sandy Bridge core has about 40% more throughput at same clock relative performance Increase in performance slightly better at 64 bit With more than 12 cores better performances due to the added cores 29

Intel vs. AMD 30

Intel vs. AMD 30

Intel architecture vs. AMD architecture One AMD core (not a Bulldozer core) gives 70%

Intel architecture vs. AMD architecture One AMD core (not a Bulldozer core) gives 70% to 77% of the performances of a Intel Xeon 26 xx core Even less (55%) when the server is mostly idle but our servers usually aren’t An economic comparison should take in account of cost of procurements (Euro/HS 06). The list price of Intel processors is higher than the AMD processor We didn’t compare the Power consumption (Watt/HS 06) 31

To do � Redo all the measurement with SL 6. x � Redo all

To do � Redo all the measurement with SL 6. x � Redo all the measurement with RH 7 o SL 7 � Make measurements of power consumption 32

Thank you. Q&A 33

Thank you. Q&A 33