iPOEM: A GPS Tool for Integrated Management in Virtualized Data Centers Hui Zhang1 , Kenji Yoshihira1, Ya-Yunn Su2 , Guofei Jiang1, Ming Chen3, Xiaorui Wang3 1. NEC Laboratories America 2. National Taiwan University 3. University of Tennessee Virtualized data centers: server consolidation and green IT • Server consolidation - virtualization facilitates consolidation of several physical servers onto a single high end system — Reduces management costs/overheads — Increases overall utilization Resource Pool • Green IT - computing more, consume less — Improving infrastructure efficiency —Increasing IT productivity Future Today IT load power DCiE = DCpW = Total data center Input power Total facility power DCiE: Data center infrastructure efficiency iPOEM Data center useful work DCpW: Data center performance per Watt ICAC2011 2 Virtualized data center management • Server utilization based performance and power management mechanisms – VMware DPM, NEC SSC, IBM Tivoli… Overload threshold CPU utilization CPUhigh CPUlow Power-saving mode Management Configuration iPOEM ICAC2011 3 iPOEM: a middleware for integrated power and performance management •• A GPS tool declarative is a good metaphor. Features management methodology • How can I get the driver 1. accepts higher-level managementoperator objectives system to 20% less • How can I go to NYC? • target system status set on individual management components power cost? iPOEM 2. generates low-level management configurations. system • Configuration settings of individual management components car 1. Map 2. Direction iPOEM GPS device ICAC2011 1. System status 2. Management decisions 4 Research Goal Management challenges Data center administrator Human-friendly management interfaces iPOEM System Complexity Workload Dynamics System Dynamics Performance management t t t t t t Power management t t t t migration t t t t t t t failure t Application management remove/add Virtualized data center Thermal management iPOEM APIs API 1 : get_position() API 2 : put_position() Input: Management Configuration Input: Target Performance & Power (Time start, Time end) (Time start, Time end) Workload (reshaping-scheme) Workload (reshaping-scheme) VM-server map, resource inventory VM-server map, resource inventory Output: Management Configuration Output: System Status System status is described in 3 metrics Load threshold CPU utilization time Performance cost: March server2010 overloading time in percentage. iPOEM Power cost: KWatts, total power consumed ICAC2011 Operation cost: VM migrations 6 iPOEM architecture iPOEM ICAC2011 7 iPOEM management configuration engine API 1 : get_position() Management Configuration CPU High CPU High, Low 8 iPOEM ICAC2011 8 System status as a function of management configurations 9 iPOEM ICAC2011 9 Formal description of system status functions Assume a homogeneous system, and the workload remains the same for different configuration settings. Theorem 1. Performance-cost(CPUhigh) is a non-decreasing function of CPUhigh. Theorem 2. Power-cost(CPUlow) is a non-increasing function of CPUlow. 10 iPOEM ICAC2011 10 iPOEM configuration searching algorithm • A O(logR) searching algorithm owhere R =CPUmax-CPUmin, the allowable load range Binary search 11 iPOEM ICAC2011 11 iPOEM prototype implementation 12 iPOEM ICAC2011 12 iPOEM System positioning services • Position reporting iPOEM • Destination searching ICAC2011 13 iPOEM 1 14 27 40 53 66 79 92 105 118 131 144 157 170 183 196 209 222 235 248 261 274 287 300 313 326 339 352 365 378 391 404 417 430 443 456 469 482 495 508 521 534 547 560 573 586 599 612 625 638 651 664 1 14 27 40 53 66 79 92 105 118 131 144 157 170 183 196 209 222 235 248 261 274 287 300 313 326 339 352 365 378 391 404 417 430 443 456 469 482 495 508 521 534 547 560 573 586 599 612 625 638 651 664 1 14 27 40 53 66 79 92 105 118 131 144 157 170 183 196 209 222 235 248 261 274 287 300 313 326 339 352 365 378 391 404 417 430 443 456 469 482 495 508 521 534 547 560 573 586 599 612 625 638 651 664 1 14 27 40 53 66 79 92 105 118 131 144 157 170 183 196 209 222 235 248 261 274 287 300 313 326 339 352 365 378 391 404 417 430 443 456 469 482 495 508 521 534 547 560 573 586 599 612 625 638 651 664 Evaluation: data center workload traces • Traces on 2525 servers from 10 IT systems – Each is regarded as a VM in the simulations. • • Monitoring data: CPU utilization. 1 week length, 15 minute monitoring frequency – 672 time points 100 50 0 150 100 50 0 100 50 0 150 100 50 0 ICAC2011 14 Evaluation: methodology • Run the iPOEM prototype as an offline engine. – • It is driven by the data traces stored in the monitoring database, and emulates the integrated management in a virtualized data center hosting the 2, 525 servers as VMs. The system and management configuration settings – Performance manager and power manager • Implementation of the simplified schemes in NEC SigmaSystemCenter middleware – The default <CPUlow,CPUhigh> setting is <40%, 80%>. – The physical servers are homogeneous with the same CPU specs • 3GHZ Quadra-core (the most common CPU model in the traces). – Performance cost: number of performance violation in a time epoch • A server has a performance violation at a time point when its CPU utilization is larger than a threshold (90% in the paper). – Power cost: we assume power consumption per server is either 0 (power-off mode) or 200Watts (power-on mode), simplified on the power model profiled in the local testbed. – Operation cost: the number of VM migrations that the performance and power managers need to execute for the server load configuration enforcement. iPOEM ICAC2011 15 iPOEM engine performance iPOEM engine response time to service requests iPOEM ICAC2011 16 iPOEM auto-piloting service • Sensitivity based optimization [Markovic et al. 2004] where Pth is the upper bound of the performance cost. 17 iPOEM ICAC2011 17 iPOEM auto-piloting evaluation Comparison of Auto-piloting and three static configuration schemes 18 iPOEM Auto-piloting management configuration evolution ICAC2011 18 Conclusions & Future Work • iPOEM, an integrated power and performance management middleware in an virtualized infrastructure. – human-friendly interfaces for multi-objective management. • Future work – Meta-management integrating more objectives. • explosive growth of the system state space – mashup services for customized tenant management • new API designs iPOEM ICAC2011 19 Thank you. • Questions? iPOEM ICAC2011 20 Appendix iPOEM ICAC2011 21