towards eco-friendly database management systems w. lang, j. m. patel (u wisconsin), cidr 2009...

22
Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

Upload: kerry-welch

Post on 24-Dec-2015

222 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

Towards Eco-friendly Database Management SystemsW. Lang, J. M. Patel (U Wisconsin), CIDR 2009

Shimin Chen

Big Data Reading Group

Page 2: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

Introduction Energy consumption is important for data

centers: 2005: 1.2% of total US energy consumption is

attributed to powering and cooling servers, ~ $2.7B If current methods for powering data centers continue,

the consumption will nearly double by 2011

For DBMS: Previously large ignored energy efficiency Must start considering energy as a critical metric

Page 3: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

This paper: ecoDB New project: energy efficient data processing

techniques Two broad classes of techniques:

“global”: change how entire system is managed or used E.g. job scheduling

“local”: improve methods of processing data at individual nodes (focus of the paper)

Page 4: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

Idea

Page 5: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

Two Questions (1) “How does a system generate graphs as

shown in Figure 1?” DMBS must know HW capabilities and operating

characteristics Accurately estimate / continuously measure energy

consumption (2) “How can such a graph be used?”

Systematic method to change settings Service level agreements (SLAs)

This paper focuses on mechanisms for creating graphs

Page 6: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

Outline Introduction Processor Voltage/Frequency Control (PVC) Improved Query Energy-efficiency by

Introducing Explicit Delays (QED) Opportunities for Energy Efficiency Summary

Page 7: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

Techniques CPU freq = front side bus (FSB) freq * CPU

multiplier

DVFS (dynamic voltage and frequency scaling) Each p-state defines a CPU multiplier CPU voltage is based on CPU multiplier

Under-clocking (Focus of this paper) Reduce FSB freq Finer granularity Also changes RAM freq

Page 8: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

System Under Test System components:

ASUS P5Q3 Deluxe Wifi-AP motherboard Intel Core2-Duo E8500 2×1GKingston DDR3 main memory ASUS GeForce 8400GS 256M Western Digital Caviar SE16 320G SATA disk Power supply unit (PSU): a Corsair VX450W PSU

System power draw measured by a Yokogawa WT210 unit (suggested by SPEC Power benchmark)

MS Windows Server 2008 JDBC (Java 1.6)

Page 9: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

Power CPU power sensors on motherboard:

ASUS motherboard has an EPU processor that directly measures the CPU power.

ASUS P5Q3 Deluxe 6-Engine software displays information gathered from this hardware sensor.

Current CPU wattage displayed in GUI: The authors sample the GUI every second Compute CPU joules using the average CPU wattage

and the execution time of a workload

Page 10: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

Component powers

No hard disk, no operating system Focusing on CPU power:

CPU power consumption is often about 25% of the total system power consumption in the experiments

Page 11: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

DB test Workload

Use a commercial DBMS and MySQL 5.1.28 TPC-H (ad-hoc decision support), scale factor 1.0 (1GB data) Only run Query 5: six table join and a group by A run consists of ten queries with different parameters

FSB underclocking (allowed by ASUS 6-engine software) Stock (normal) Reduce FSB freq by 5%, 10%, and 15%

CPU voltage downgrade “small” and “medium” downgrade

7 settings: Stock, 3 FSB freq reductions X 2 CPU voltage downgrades

Page 12: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

Equal Energy delay product

Page 13: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

With the same voltage level, larger frequency the better EDP

Equal Energy delay product

Page 14: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

Theoretical Modeling EDP= joules x times = power x time2

= power / freq2

Power=CV2F EDP = CV2/F

Page 15: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

Disk Energy Measured separately for stock setting Warm database

CPU: 1228.7 Joules Disk: 214.7 Joules

Cold database CPU: 2146.0 Joules Disk: 1135.4 Joules

Page 16: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

Outline Introduction Processor Voltage/Frequency Control (PVC) Improved Query Energy-efficiency by

Introducing Explicit Delays (QED) Opportunities for Energy Efficiency Summary

Page 17: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

Idea Explicitly delay queries look for commonalities among multiple queries Group multiple queries into a single query After execution, split query results

Page 18: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

Setting DB clients repeatedly issue single table select

queries with different selection predicate. For example:

SELECT *FROM lineitemWHERE l_quantity=X

DBMS processes one query at a time QED: buffer queries in a queue, merge them,

send the merged query, split results In the experiments, X is different for the queries,

so no overlaps

Page 19: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

As batch size increases, diminishing decrease in energy consumption.

Page 20: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

Outline Introduction Processor Voltage/Frequency Control (PVC) Improved Query Energy-efficiency by

Introducing Explicit Delays (QED) Opportunities for Energy Efficiency Summary

Page 21: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

Opportunities in (DBMS) Software Traditional DB investigations into improving

query response times Energy vs. performance tradeoffs

Operator-level: rethink join algorithms Query-level: energy-efficient query plans Workload management per server Workload management for the entire collection of

servers: scheduling and using techniques to turn entire servers off

Page 22: Towards Eco-friendly Database Management Systems W. Lang, J. M. Patel (U Wisconsin), CIDR 2009 Shimin Chen Big Data Reading Group

Summary Energy-efficient data processing Studied two techniques

Processor Voltage/Frequency Control (PVC) Improved Query Energy-efficiency by Introducing

Explicit Delays (QED)

Designing a DBMS to balance the response time vs. energy consumption opens a wide range of research issues