Delivering Performance While Conserving Power AMD Brent Kerby
Delivering Performance While Conserving Power (AMD) Brent Kerby Product Manager, Servers & Workstations Microprocessor Solutions Sector AMD www. amd. com
Did You Know? The combined total of data centers in California are estimated to require 250 MW – 375 MW of energy. That’s equivalent to 3, 495 – 5, 242 barrels of oil a day! SOURCE: California Energy Commission http: //www. energy. ca. gov/reports/2004 -04 -07_500 -04 -004. PDF
IT Managers Have A Problem Power Consumption/Cooling Issues Tracked By Company: 71% Power Consumption Increased Amount Of Power Supplied To Data Center 12% 21% Cooling Both Equally Important Suspect That Both are Issues but do not Track at this Time Neither Presents An Issue at this Time How Are Companies Addressing These Issues 38% Stopped Buying More Servers And/Or Consolidated Existing Equipment 26% Implemented "Cool Aisle/Hot Aisle“ Layout 25% Increased Size Of Data Center 12% 17% 44% 23% Other 16% None Of The Above 15% Base: 1, 177 IT Decision Makers November 2005 Strategy Group/Ziff-Davis Average of 18% of total rack space wasted due to power and cooling issues
Microprocessor Innovations Reducing power and cooling Processor Power Management Virtualization Technology Memory Sub-System Dual-Core native design Efficient Architecture
OS-Directed Power Management P-State P 0 AMD Power. Now!™ technology with Optimized Power Management Multiple performance states for optimized power management Dynamically reduces processor power based on workload Lowers power consumption without compromising performance Leverages industry standards Up to 75% CPU power savings at idle HIGH 2600 MHz 1. 40 V ~95 watts P 1 2400 MHz 1. 35 V ~90 watts P 2 2200 MHz 1. 30 V ~76 watts P 3 PROCESSOR UTILIZATION 2000 MHz 1. 25 V ~65 watts P 4 1800 MHz 1. 20 V ~55 watts P 5 1000 MHz 1. 10 V ~32 watts LOW EXAMPLE ONLY
OS-Directed Power Management -7% MAX -43% -55% -73% -75% -69% MIN AMD Power. Now! technology can provide up to 75% power savings! Benchmarks show no measurable performance impact!
Efficiency Through Instruction Sets ‘AMD 64 Technology’ ‘Virtualization’ Virtual Machine Application OS 32 -bit to 64 -bit OS Application OS ~25% increase Getting more done without increased power and heat Can Reduce number of power consuming servers http: //www. amd. com/us-en/Processors/Product. Information/0, , 30_118_8796_8800~100357, 00. html
Dual-Core Increasing performance without facilities upgrades AMD Opteron™ (Single-Core) (Dual-Core) Power Drop of 9 KVA OR 29 VS. Room for more growth 41% Less Power Less Heat Less Servers Less Space 17 Dual-Core Processors Increasing throughput within the same power and cooling envelope as singlecore processors Performance based on SPECweb® 99_SSL 82, 244 connections http: //www. amd. com/us-en/Processors/Product. Information/0, , 30_118_8796_8800~102056, 00. html
Architectures Effects On Power CORE 190 watts 8 GB/S 300 watts. CPU CORE Native Dual-Core MCP Dual-Core CORE CPU SRQ (150 w per CPU) MCP (95 w per CPU) Crossbar Mem. Ctrlr SRQ Crossbar HT Mem. Ctrlr HT 8 GB/S I/O Hub PCI-E Bridge 22 watts Memory Controller Hub 8 GB/S PCI-E Bridge 8 GB/S USB I/O Hub PCI Legacy x 86 Architecture 300 watts for processors (150 w each) 22 watts for external memory controller ~$451 per year (1 server) ~$225, 698 per year (500 servers) ~69% More 322 watts AMD 64 Technology with Direct Connect Architecture 190 watts for processors (only 95 w each) Integrated memory controllers ~$266 per year (1 server) ~$133, 000 per year (500 servers) Energy includes power input & cooling, Power Utility cost: $0. 10/KW-hr, Publicly available processor & chipset specifications. The examples contained herein are intended for informational purposes only. Other factors will affect real-world power consumption and cost 190 watts
Architectures Effects On Power CORE 692 watts CORE MCP(173 w per CPU) MCP CORE MCP SRQ Crossbar Mem. Ctrlr HT Mem. Ctrlr 380 watts HT (95 w per CPU) 14 watts USB Memory Controller Hub I/O Hub PCI 8 GB/S PCI-E Bridge SRQ Crossbar Mem. Ctrlr HT 8 GB/S 8. 5 watts XMB 8. 5 XMB watts 8. 5 watts XMB 8 GB/S PCI-E Bridge XMB PCI-E Bridge 8 GB/S USB I/O Hub PCI Legacy x 86 Architecture 692 watts for processors (173 w each) 48 watts for external memory controller ~$1037 per year (1 server) ~$518, 592 per year (500 servers) ~95% More 740 watts AMD 64 Technology with Direct Connect Architecture 380 watts for processors (only 95 w each) Integrated memory controllers ~$533 per/year (1 server) ~$266, 304 per/year (500 servers) Energy includes power input & cooling, Power Utility cost: $0. 10/KW-hr, Publicly available processor & chipset specifications. The examples contained herein are intended for informational purposes only. Other factors will affect real-world power consumption and cost. 380 watts
Challenges Efficient Power Supplies Typically only 72% efficient (at full load) That’s 280 watts in a 1000 watt load! Efficient Voltage Regulator Modules Processor, Memory Technologies DDR 2 DIMM = ~6 watts each FBDIMM = ~12 watts each Infrastructure Support of OS-Directed Power Management
Driving Innovation And Awareness To Make A Difference…
Delivering Performance While Conserving Power The mobile perspective Frank P. Helms Principal Member Technical Staff Lead Mobile System Architect Microprocessor Solutions Sector AMD www. amd. com
Agenda AMD Mobile Power Management Philosophy AMD Turion™ 64 X 2 Mobile Technology General Characteristics Power Management Mechanisms Dual Core Processor Benefits for Notebook PCs Windows Vista Power Management Demo
Power Management Philosophy Both OS and hardware important To optimize battery life and performance both the operating system (OS) directed and Hardware (HW) autonomous mechanisms must be involved in power management
Power Management Philosophy Mechanisms controlled by the OS have the following characteristics Involve user policy/preference Interactions with Applications and Drivers Knowledge of CPU usage by the applications which are running Examples of Mechanisms controlled by the OS include Transitions in and out of ACPI S-states Processor P-state control Halt (Processor C-state control)
Power Management Philosophy Controlled Autonomously by HW Mechanisms that are controlled autonomously by Hardware have the following characteristics Transparent to the OS, Software, and the User Mechanisms that are on microsecond and finer time scales Examples Using pre-charge power down mode (memory) Fine grain clock gating (processor, GPU, etc. ) Hyper. Transport™ Link power management
AMD Turion™ 64 X 2 Mobile Technology General characteristics Dual Core (512 KB L 2 per core) Dual Channel DDR 2 Hyper. Transport @ 800 MHz (6. 4 GByte/s bandwidth total) Manufacturing process that is optimized for mobile (to minimize leakage current) 35 x 35 mm lidless package and low profile socket
AMD Turion™ 64 X 2 Mobile Technology Core power management C 0 P-states Dynamic fine grain clock gating of idle logic Pre-charge power down mode for system memory Cores can be halted (C 1) individually Core by core power management C 1 E (Enhanced): When both cores are halted, additional power savings mechanisms can be applied Hyper. Transport™ link can be disconnected System Memory in self refresh On-die northbridge clock grid ramps down
AMD Turion™ 64 X 2 Mobile Technology Power management overview SODIMM Deeper Sleep: Core 0 Core 1 Core 0 VDD plane C 1 C 0 voltage is reduced further Northbridge SODIMM HT Disconnected HT Connected LCD Chipset with GPU Both cores • Both cores Core 1 Halted executing (C 1) clock halted code (C 0) grid ramped enables down hardware C 1 E Cores clock grids ramped down HT Disconnected System Memory in self refresh On-die Northbridge clock grid ramped down Chipset clock gating etc Deeper Sleep (Alt. VID)
Dual Core CPUs Can Improve Performance And Battery Life Higher performance per watt Example TDP Breakdown for 2 GHz AMD mobile processors 2005 2006 Single Core (2 GHz) Core 0 (2 GHz) + + Core 1 (2 GHz) + On Die Northbridge = 32. 8 W On Die Northbridge = 32 W I/O Power with 64 bit DDR interface = 2. 2 W I/O Power with 128 bit DDR 2 interface = 3. 0 W Fixed 35 W TDP
Windows Vista Power Management Processor power management AMD will be evaluating all Windows Vista configurable power management options AMD will publish recommendations for the use of Windows Vista power management options to optimize performance and battery life
Dual Core Processor Frank P. Helms Lead Mobile System Architect AMD Japan Engineering Lab
Call To Action Use AMD Turion™ 64 X 2 Mobile Technology for the best mobile experience with Windows Vista Contact AMD regarding Recommendations for optimized Windows Vista Power Policy settings for notebooks later this year Additional information on AMD Turion™ 64 X 2 Mobile Technology
Delivering Performance While Conserving Power The desktop perspective Paul C. Stanley Desktop Strategic Platform Planning Microprocessor Solutions Sector AMD www. amd. com
Agenda Desktop Power Consumption The Energy Efficient Desktop The Microprocessor Contribution Desktop Power Management Platform Strategies for Reducing Power Windows Vista Power Management on the Desktop Call to Action
Desktop Power Consumption “Enthusiast” Class platform can easily exceed 800 W - 1000 W of total system power High-Performance CPU + Dual GPU can exceed 60 A on 12 V rails ATX 2. 0 Specification = two rails at 20 A per rail 8 -phase VRM, 4 GB System Memory, GPU Physics Card, 4 x. SATA RAID, 2 x. Gb. E, USB, 1394, … Extreme Overclocking… High-End Home Theater can exceed 400 W+ A/V tuner(s), external speakers, 7. 1 Surround Sound, HD DVD, “big” LCD Monitor, etc. Let’s face it – higher performance = higher power
The Energy Efficient Desktop Typical Mainstream PC, measured at the wall At Idle: 150 W - 200 W http: //www. tomshardware. com/2005/06/03/dual_core_stress_test/ Under Load: 200 W (UMA) – 350 W (discrete) Energy Efficient Desktop: ~50 W to ~100 W http: //www. silentpcreview. com/article 313 -page 1. html Typical Commercial usage model 8 hours per day, 90%+ Idle Energy Efficient Desktop (estimated) electricity cost savings alone can easily exceed $100+ per year per platform
IDC Prognostication On SFF http: //www. eweek. com/article 2/0, 1759, 1948894, 00. asp? kc=EWRSS 03119 TX 1 K 0000594 Platforms include Small Form Factor (SFF), Ultra-Small Form Factor (USFF), All-In-One (AIO), Living Room PC (LRPC), Blade PC. . . 2005 U. S. Markets: ~31 M Minitowers (77% of desktop market), ~5 M SFF (12%) 2009 U. S. Markets: ~11 M Minitowers (31%), ~15 M SFF (42%) 2005 to 2009 Global Markets: ~93 M Minitower units (70%) drops to ~78 M (< 50%) 2005 to 2009 Global Markets: SFF ~12 M units (9%) rises to ~48 M (30%) SFF produces less heat and noise Dual Core CPU provides more performance for less energy cost SFF in standard form factors, with standard components, keeps costs low
e. Week On SFF http: //www. eweek. com/article 2/0, 1895, 1854790, 00. asp http: //www. eweek. com/article 2/0, 1895, 1934114, 00. asp Enterprise will drive the move to SFF Management costs, electricity, acoustics, all call for smaller, quieter desktop Expandability gives way to integration Shrinking the size (down to 6 -9 liters) uses less power Standard desktop parts minimizes platform cost Data Center electricity usage is at the top of IT managers' list of worries Balance and tradeoff between performance and power ". . . the lowest-power system. . . pays me back over the life of the system" Desire for objective ways to measure power consumption, to be developed by third parties
e. Week On SFF Trends Out To 2015 http: //www. eweek. com/article 2/0, 1895, 1949079, 00. asp? kc=ewnws 041306 dtx 1 k 0000599 Full-size tower is marginalized Minitower (dominates U. S. retail today) is diminished SFF (7 -20 liters) becomes most popular, uses low-cost desktop components SFF acoustics become unobtrusive, form factor “fits anywhere” (LRPC, etc. ) USFF (1 -7 liters) uses higher priced notebook components (miniaturization) All-In-One (U)SFF a la Apple i. Mac Blade PC Client for the Enterprise
The Microprocessor Contribution AMD Athlon™ 64 Processors AMD Athlon 64 X 2 vs AMD Athlon 64 (V)EE CPU AMD Athlon 64 X 2 4800+ @ 89 W standard TDP AMD Athlon 64 EE 4800+ @ 65 W, VEE 3800+ @ 35 W AMD Athlon 64 X 2 drops to ~9 W at Idle (V)EE drops as low as ~1 W (matching AMD Turion™ 64) AMD Athlon 64 embeds TCase. Max and TDP data AMD Athlon 64 TDP has steadily declined Examples (CPU samples are widely divergent, your mileage may vary) CPU TCase. Max A 64 X 2 3800+ (Toledo) 63°C A 64 4800+ (Toledo) 59°C A 64 3000+ (Venice) 57°C A 64 4000+ (San Diego) 59°C TDP 65. 6 W [standard 89 W] 85. 0 W [standard 110 W] 44. 1 W [66% of standard 67 W] 50. 0 W [56% of standard 89 W]
AMD Athlon™ 64 Energy Efficient CPUs Energy-efficient processors drive new class of platforms Thin Clients and PC Blades for Enterprise SFF Desktops for Home and for Business Smaller footprint reaches further into the Home Better Acoustics are Required Accurate Temperature Monitoring enables improved fan control Total-System Energy Efficiency is Required Regulatory Programs (Energy. Star, Blue Angel, Nordic Swan…) are driving challenging new requirements for Energy Efficiency Building Blocks Energy efficient components Improved power management leveraging mobile technology Robust implementation of Cool’n’Quiet™ Technology Accurate Temperature Monitoring infrastructure
Desktop Power Management It’s not about power management, per se It’s really about delivering a better user experience, lowering costs, and enabling new usage scenarios Lower thermals = fewer/slower system fans Less noise = improved acoustics Desktops are consuming more power, electricity costs are rising – Do the math Home use scenarios demand quiet systems 7 of 10 IT Shops track power consumption http: //enterprise. amd. com/Downloads/Ziff_Power_and_Cooling_IT_survey_en. pdf
AMD Power Management AMD Cool‘n’Quiet™ on AMD Athlon™ 64 CPU Performance On Demand Up to 75% or more CPU power savings at Idle No impact on the performance of other platform subsystems (Memory Controller, Hyper. Transport. . . ) Fully synchronized Dual Core power management Load Balancing across dual cores can lower total CPU power consumption by as much as 50% AMD Dual Core architecture optimizes chip-level interconnects for faster, lower-power operation AMD Direct. Connect Architecture eliminates bus interface silicon and other overhead signals Saves “tens of watts per system” http: //enterprise. amd. com/Downloads/34146 A_PC_WP_en. pdf
Platform Strategies For Lowering Power Undervolting and underclocking components Lower voltage chips running at slower clock speed consume less power Deliberately choosing low-end components Often lower performance and lower voltage, but also lower power Selecting more efficient components Higher performance-per-watt ratio Using notebook PC components and subsystems But Desktop CPU is preferred over Mobile CPU Ease of implementation, infrastructure cost model…
Platform Impact Trends And Potentials AMD Athlon™ 64 CPU Roadmap now includes AMD Athlon 64 Energy Efficient (EE) CPU at 65 W and Very Energy Efficient (VEE) CPU at 35 W Enhanced Power Management Possibilities C 1 E Enhanced Sleep Clock gating can be beneficially implemented in virtually any silicon component GPU performance scaling is gaining traction Away Mode/Instant On for Media Center PC Enhanced Thermal and Acoustics Management Multiple thermal sensors to optimize fan control Instrumented PSU to enable PSU fan control
Additional Hardware Options CPU, Motherboard, VRM, Chipset, PSU, System Memory, Storage (HDD, ODD…)… Select power efficient components Example: Notebook HDD consumes ~1 W Discrete Graphics adds 15 W+ at Idle and 30 W+ under load Many and varied platform power conversions waste as much as 50% of AC power input Improve PSU efficiency Today’s “better” PSU = ~70%, Energy. Star proposal = 80% Other global programs: Blue Angel, Nordic Swan. . . Improve VRM efficiency (today ~80%) Implement more highly power-optimized “sleep” components and algorithms
Windows Vista Desktop power management Robust support for Sleep states encourages Windows Vista Power Management usage on Desktop PC Desktop Default is Hybrid Sleep: S 3 Standby → S 4 Hibernate Focus on Desktop power consumption Group Policy control for IT usage Optimized S 3 Resume performance (< 2 seconds)
Call To Action The entire PC ecosystem must cooperate and contribute to maximize the potential of the Energy Efficient Desktop
Other Resources Download A 64 Max. Temp. zip (TCase. Max and TDP readout) freeware at http: //www. thecoolest. zerobrains. com/forums/viewtopic. php? t=83
- Slides: 42