A SYSTEM DYNAMICS APPROACH TO DATA CENTER CAPACITY




















































- Slides: 52

A SYSTEM DYNAMICS APPROACH TO DATA CENTER CAPACITY PLANNING: A CASE STUDY Thesis by: Kaveh Dianati Supervisor: Pål Davidsen Summer 2012

AGENDA Introduction Problem Definition Model Structure Model Analysis and Validation Policy and Scenario Analysis Conclusions & Areas for Future Improvement 2

INTRODUCTION 3

INTRODUCTION Subject: System Dynamics modeling project for a Norwegian cloud computing company. Cloud computing is the provisioning of centralized IT services and infrastructure, such as processing or storage capacity, to businesses in a flexible, reliable, and inexpensive fashion. Two-fold purpose of this thesis: � Help the client in medium-term capacity planning � Establish the usefulness of the System Dynamics methodology in data center planning and cloud computing business fields. Model was built in close client interaction with the CTO of the company. 4

PROBLEM DEFINITION 5

PROBLEM DEFINITION Design a tool to estimate when the limit of the company’s main data center, in terms of power capacity, would be reached. Main consumers of power: Physical servers We need to model the development of servers. 6

REFERENCE MODE OF BEHAVIOUR 7

MODEL STRUCTURE 8

MODEL BASE: AGING CHAINS AND CO-FLOWS 9

MODEL STRUCTURE PHYSICAL SERVERS Acquisition, Inventory, Installation. 10

MODEL STRUCTURE PHYSICAL SERVERS Disengagement of physical servers 11

MODEL STRUCTURE CO-FLOWS OF HOSTS AND STAND-ALONES Physical servers are two types: - Stand-alones - Hosts 12

MODEL STRUCTURE VIRTUAL SERVERS 13

MODEL STRUCTURE CPUS AND CORES 14

MODEL STRUCTURE POWER DEMAND 15

MODEL STRUCTURE USERS User types: � Full-desktop users (FD) � Distributed-services users (DS) 16

MODEL STRUCTURE USER PROCESSOR REQUIREMENTS 17

MODEL STRUCTURE REVENUES 18

MODEL STRUCTURE DATA CENTER 19

MODEL STRUCTURE ACCOUNTING 20

MODEL STRUCTURE FINANCE 21

MODEL STRUCTURE FINANCE 22

MODEL ANALYSIS AND VALIDATION 23

PARTIAL MODEL TESTING Partial model testing: validating the structure of the model part by part Validating each part by simulating with all other parts driven by data 24

VALIDATION OF PHYSICAL SERVER INSTALLATION 25

VALIDATION OF PHYSICAL SERVER INSTALLATION 26

VALIDATION OF PHYSICAL SERVER INSTALLATION 27

CLD FORINVENTORY OSCILLATIONS

REFERENCE MODE COMPARISON TESTS Comparing the simulated behavior of the whole model with actual time-series data 29

SIMULATION VS. DATA TOTAL PHYSICAL SERVERS RMSE divided by run average = 2. 9% correlation coefficient = 0. 998 30

SIMULATION VS. DATA HOSTS • • RMSE divided by run average = 17. 7% correlation coefficient = 0. 991 31

SIMULATION VS. DATA VIRTUAL SERVERS • • RMSE divided by run average = 4. 4% correlation coefficient = 0. 997 32

SIMULATION VS. DATA TOTAL LOGICAL SERVERS 33

SIMULATION VS. DATA LOGICAL SERVERS PER PHYSICAL SERVER 34

SIMULATION VS. DATA USERS PER LOGICAL SERVER 35

SIMULATION VS. DATA TOTAL CPUS 36

SIMULATION VS. DATA TOTAL VIRTUAL CORES 37

SIMULATION VS. DATA CORES PER PHYSICAL SERVER 38

SIMULATION VS. DATA POWER DEMAND 39

SIMULATION VS. DATA AVERAGE POWER DEMAND OF PHYSICAL SERVERS 40

POLICY AND SCENARIO ANALYSIS 41

INTO THE FUTURE – BUSINESS AS USUAL User Growth: 5. 9% per year Share of Distributed Services Users in New Users: 50% Ø Power limit reached in: 2019 42

INTO THE FUTURE – HIGHER GROWTH User Growth rate: 9% per year Ø Power limit reached in: 2017 43

SENSITIVITY OF DATA CENTER POWER RUNOUT TIME TO FUTURE GROWTH RATE Future Growth: Normal distribution Expected value: 5. 9% per year Standard deviation: 3% per year 44

SENSITIVITY OF DATA CENTER POWER RUN-OUT TIME TO PROCESSOR REQUIREMENTS OF NEW FD USERS Users per logical server for new FD users: Normal distribution Expected Value: 1. 6 [user/server] Standard deviation: 1 [user/server] 45

Sensitivity of Data Center Power Run-out Time to Future Share of DS Users in New contracts Future share of distributed –services users in new users: Normal distribution Expected value: 60% Standard deviation: 30% 46

LIKELY FUTURE SCENARIO: COUNTER-INTUITIVE RESULT Growth: 10% All new servers installed as hosts. Each server hosts 4 virtual servers. 47


CONCLUSION AND AREAS FOR IMPROVEMENT 49

CONCLUSION The main research question: to estimate the time at which the data center would run out of power capacity, under different policies and environmental outcomes. We observed that the answer is sensitive to several parameters: � future growth, � future market policy, � future desired level of customer service. This thesis demonstrates the usefulness of the System Dynamics methodology in policymaking for data centers. 50

AREAS FOR IMPROVEMENT The concept of capacity of hosts in terms of number of virtual machines supported is not modeled. RAM sector not calibrated server installations dependent only on processor requirements of users. Storage spindles are not fully characterized. I/O capacity of spindles for instance is not modeled. Market sector is preliminary: flow of clients not modeled, types of clients not included, etc. The data center sector can be modeled with much more detail: space, racks, rack density, etc. Accounting & Finance and HR sector are quite crude. User interface can be developed.

Thank you for your attention.