Gear-Shifting Power-Aware RAID

advertisement
PARAID: A Gear-Shifting
Power-Aware RAID
Charles Weddle, Mathew Oldham, Jin Qian, An-I Andy Wang – Florida St. University
Peter Reiher – University of California, Los Angeles
Geoff Kuenning – Harvey Mudd College
1
Motivation

Energy costs are rising

An increasing concern for servers


Energy consumption of disk drives




No longer limited to laptops
24% of the power usage in web servers
27% of electricity cost for data centers
More energy  more heat  more cooling  lower
computational density  more space  more costs
Is it possible to reduce energy consumption
without degrading performance while
maintaining reliability?
PARAID: A Gear-Shifting Power-Aware RAID
2
Challenges

Energy


Performance


Not enough opportunities to spin down RAIDs
Essential for peak loads
Reliability

Server-class drives are not designed for
frequent power switching
PARAID: A Gear-Shifting Power-Aware RAID
3
Existing Work

Most trade performance for energy
savings directly


e.g. vary speed of disks
Most are simulated results
PARAID: A Gear-Shifting Power-Aware RAID
4
Observations

RAID is configured for peak performance


Unused storage capacity



RAID keeps all drives spinning for light loads
Over-provision of storage capacity
Unused storage can be traded for energy savings
Fluctuating load


Cyclic fluctuation of loads
Infrequent on-off power transitions can be effective
PARAID: A Gear-Shifting Power-Aware RAID
5
Performance vs. Energy
Optimizations

Performance benefits


Realized under heavy loads
Energy benefits

Realized instantaneously
6
Power-Aware RAID

Skewed striping for energy savings

Preserving peak performance

Maintaining reliability

Evaluation

Conclusion

Questions
PARAID: A Gear-Shifting Power-Aware RAID
7
Skewed Striping for Energy Saving

Use over-provisioned spare storage

Organized into hierarchical overlapping subsets
1
2
3
4
5
RAID
PARAID: A Gear-Shifting Power-Aware RAID
8
Skewed Striping for Energy Saving

Each set analogous to gears in automobiles
1
2
3
4
5
RAID
Gears
1
2
3
PARAID: A Gear-Shifting Power-Aware RAID
9
Skewed Striping for Energy Saving

Soft states can be reclaimed for space

Persist across reboots
1
2
3
4
5
Soft
States
RAID
Gears
1
2
3
PARAID: A Gear-Shifting Power-Aware RAID
10
Skewed Striping for Energy Saving


Operate in gear 1
Disks 4 and 5 are powered off
1
2
3
4
5
Soft
States
RAID
Gears
1
2
3
PARAID: A Gear-Shifting Power-Aware RAID
11
Skewed Striping for Energy Saving


Approximate the workload
Gear shift into most appropriate gear

Minimize the opportunity lost to save power
Conventional RAID PARAID
Energy
( Powered
On Disks )
workload
Workload
( Disk Parallelism )
PARAID: A Gear-Shifting Power-Aware RAID
12
Skewed Striping for Energy Saving


Adapt to cyclic fluctuating workload
Gear shift when gear utilization threshold is met
load
utilization threshold
gear shift
PARAID: A Gear-Shifting Power-Aware RAID
time
13
Preserving Peak Performance

Operate in the highest gear



Maximize parallelism within each gear



When the system demands peak performance
Uses the same disk layout
Load is balanced
Uniform striping pattern
Delay block replication until gear shifts

Capture block writes
PARAID: A Gear-Shifting Power-Aware RAID
14
Maintaining Reliability

Reuse existing RAID levels (RAID-5)


Also used in various gears
Drives have a limited number of power
cycles

Ration number of power cycles
PARAID: A Gear-Shifting Power-Aware RAID
15
Maintaining Reliability


Busy disk stay powered on, idle disks stay powered off
Outside disks are role exchanged with middle disks
role exchange
Disk 1
Disk 2
busy
disks
Gear 1
Disk 3
Disk 4
power
cycled
disks
Disk 5
Disk 6
idle
disks
Gear 2
Gear 3
PARAID: A Gear-Shifting Power-Aware RAID
16
Logical Component Design
User space
Admin tool
Linux kernel
File system
RAID
Reliability manager
PARAID block mapping
Load monitor
Soft RAID
Gear manager
Disk device driver
PARAID: A Gear-Shifting Power-Aware RAID
17
Data Layout


Resembles the data flow of RAID 1+0
Parity for 5 disks does not work for 4 disks

For example, replicated block 12 on disk 3
Gear 1
RAID-5
Gear 2
RAID-5
Disk 1
Disk 2
Disk 3
Disk 4
(1-4)
8
12
((1-4),8,12)
16
20
(16,20,_)
_
1
2
3
4
(1-4)
5
6
7
(5-8)
8
9
10
(9-12)
11
12
13
(13-16)
14
15
16
(17-20)
17
18
19
20
PARAID: A Gear-Shifting Power-Aware RAID
Disk 5
18
Data Layout

Cascading parity updates

For example, updating block 8 on disk 5
Gear 1
RAID-5
Gear 2
RAID-5
Disk 1
Disk 2
Disk 3
Disk 4
(1-4)
8
12
((1-4),8,12)
16
20
(16,20,_)
_
1
2
3
4
(1-4)
5
6
7
(5-8)
8
9
10
(9-12)
11
12
13
(13-16)
14
15
16
(17-20)
17
18
19
20
PARAID: A Gear-Shifting Power-Aware RAID
Disk 5
19
Update Propagation

Up-shift propagation (e.g. shifting from 3 to
5 disks)
Full synchronization
 On-demand synchronization



Need to respect block dependency
Downshift propagation

Full synchronization
PARAID: A Gear-Shifting Power-Aware RAID
20
Asymmetric Gear-Shifting Policies

Up-shift (aggressive)


Moving utilization average + moving standard
deviation > utilization threshold
Downshift (conservative)


Modified utilization moving average + moving
standard deviation < utilization threshold
Moving average modified to account for fewer drives
and extra parity updates
PARAID: A Gear-Shifting Power-Aware RAID
21
Implementation

Prototyped in Linux 2.6.5




Open source, software RAID
Implemented block I/O handler, monitor,
disk manager
Implemented user admin tool to configure
device
Updated Raid Tools to recognize PARAID
level
PARAID: A Gear-Shifting Power-Aware RAID
22
Evaluation

Challenges
Prototyping PARAID
 Commercial machines
 Conceptual barriers
 Benchmarks designed to measure peak
performance
 Trace replay
 Time consuming

PARAID: A Gear-Shifting Power-Aware RAID
23
Evaluation

Measurement framework
client
RAID
USB cable
P4 2.8 Ghz, 1 GB RAM
160 GB 7200 RPM SATA
RAID
crossover
cable
RAID
multimeter
RAID
server
Xeon 2.8 Ghz, 512 MB RAM
36.7 GB 15k RPM SCSI
power
measurement
probes
RAID
BOOT
SCSI
cable
PARAID: A Gear-Shifting Power-Aware RAID
power
supply
12v & 5v
power lines
24
Evaluation

Three different workloads using two different
RAID settings

Web trace - RAID level 0 (2-disk gear 1, 5-disk gear 2)


Cello99 - RAID level 5 (3-disk gear 1, 5-disk gear 2)


I/O-intensive workload with writes
PostMark - RAID level 5


Mostly read activity
Measure peak performance and gear shifting overhead
Speed up trace playback


To match hardware
Explore range of speed up factors and power savings
PARAID: A Gear-Shifting Power-Aware RAID
25
Web Trace



UCLA CS Dept Web Servers (8/11/2006 – 8/14/2006)
File system: ~32 GB (~500k files)
Trace replay: ~95k requests with ~4 GB data (~260 MB unique)
0.6
0.5
0.4
GB/hour 0.3
0.2
0.1
0
1
6 11 16 21 26 31 36 41 46 51 56 61 66 71 76 81 86 91 96
hours
PARAID: A Gear-Shifting Power-Aware RAID
26
Web Trace Power Savings
64x – 60 requests/sec
60
Energy Savings
50
40
RAID-0
watts 30
64x - 34%
PARAID-0
20
128x - 28%
10
256x - 10%
0
0
5
10
15
20
25
30
hours
128x – 120 requests/sec
256x – 240 requests/sec
60
60
50
50
40
40
RAID-0
watts 30
PARAID-0
20
20
10
10
0
RAID-0
watts 30
PARAID-0
0
0
5
10
15
20
25
30
hours
PARAID: A Gear-Shifting Power-Aware RAID
0
5
10
15
20
25
30
hours
27
Web Trace Latency
256x
1
Overhead
0.8
0.6
RAID-0
0.4
PARAID-0
256x - within 2.7%
64x - 240%
80ms vs. 33ms
0.2
0
1
10
100
1000
10000
100000
msec
128x
64x
1
1
0.8
0.8
0.6
RAID-0
0.6
RAID-0
0.4
PARAID-0
0.4
PARAID-0
0.2
0.2
0
0
1
10
100
1000
10000
100000
msec
PARAID: A Gear-Shifting Power-Aware RAID
1
10
100
1000
10000
100000
msec
28
Web Trace Bandwidth
256x
180
160
140
Overhead
120
MB/sec
256x - within
1.3% in high
gear
RAID-0
100
80
PARAID-0
60
40
20
0
0
5
10
15
20
25
30
hours
128x
64x
180
180
130
130
RAID-0
MB/sec 80
PARAID-0
PARAID-0
30
30
-20 0
RAID-0
MB/sec 80
5
10
15
20
25
30
hours
PARAID: A Gear-Shifting Power-Aware RAID
-20 0
5
10
15
20
25
30
hours
29
Cello99 Trace

Cello99 Workload




HP Storage Research Labs
50 hours beginning on 9/12/1999
1.5 million requests (12 GB) to 440MB of unique blocks
I/O-intensive with 42% writes
PARAID: A Gear-Shifting Power-Aware RAID
30
Cello99 Power Savings
32x – 270 requests/sec
50
Energy Savings
40
30
RAID-5
20
PARAID-5
watts
32x - 13%
64x - 8.2%
10
128x - 3.5%
0
0
10
20
30
40
50
hours
64x – 550 requests/sec
128x – 1000 requests/sec
50
45
40
35
30
watts 25
20
15
10
5
0
50
40
RAID-5
PARAID-5
30
RAID-5
20
PARAID-5
watts
10
0
0
10
20
30
40
50
hours
PARAID: A Gear-Shifting Power-Aware RAID
0
10
20
30
40
50
hours
31
Cello99 Completion Time
128x
1
Overhead
0.98
0.96
RAID-5
0.94
PARAID-5
0.92
0.9
1
10
100
1000
10000
100000
32x - 1.8ms,
26% slower
due to time
spent in low
gear
msec
64x
32x
1
1
0.98
0.98
0.96
RAID-5
0.96
RAID-5
0.94
PARAID-5
0.94
PARAID-5
0.92
0.92
0.9
0.9
1
10
100
1000
10000
100000
msec
PARAID: A Gear-Shifting Power-Aware RAID
1
10
100
1000
10000
100000
msec
32
Cello99 Bandwidth
128x
1000
Overhead
100
RAID-5
MB/sec
PARAID-5
10
< 1% degradation during
peak hours
1
0
500000
1000000
1500000
request number
64x
32x
1000
1000
100
100
RAID-5
MB/sec
PARAID-5
10
RAID-5
MB/sec
PARAID
10
1
1
0
500000
1000000
1500000
requests
PARAID: A Gear-Shifting Power-Aware RAID
0
500000
1000000
1500000
request number
33
PostMark Benchmark



Popular synthetic benchmark
Generates ISP-style workloads
Stresses peak read/write performance of storage
device
PARAID: A Gear-Shifting Power-Aware RAID
34
Postmark Performance
200
150
seconds 100
50
0
1K files, 50K trans
RAID-5
20K files, 50K trans
PARAID-5 high gear
PARAID: A Gear-Shifting Power-Aware RAID
20K files, 100K
trans
PARAID-5 low-gear
35
Postmark Power Savings
80
70
60
watts
50
R A ID 5
40
PA R A ID
30
20
10
0
1
11
21
31
41
51
61
71
81
91
10
111
12
13
14
151
16
171
seconds
PARAID: A Gear-Shifting Power-Aware RAID
36
Related Work
Pergamum
 EERAID
 RIMAC
 Hibernator
 MAID
 PDC
 BlueFS

PARAID: A Gear-Shifting Power-Aware RAID
37
Future Work





Try more workloads
Optimize PARAID gear configuration
Explore asynchronous update propagation
Speed up recovery
Live testing
PARAID: A Gear-Shifting Power-Aware RAID
38
Lessons Learned





Third version of design, early design too
complicated
Data alignment problems
Difficult to measure system under normal load
Hard to predict workload transformations due to
complex system optimizations
Challenging to match trace environments
PARAID: A Gear-Shifting Power-Aware RAID
39
Conclusion

PARAID reuses standard RAID-levels without
special hardware while decreasing their energy
use by 34%.


Optimized version can save even more energy
Empirical evaluation important
PARAID: A Gear-Shifting Power-Aware RAID
40
Research Theme

Data flow management
Storage
 MANETs


Current state

Reminiscent of plumbing industry 200 years
ago
Limited interchangeable parts
 Poorly understood interactions

41
Research Areas





Power-Aware RAID
Electric-field-based routing for MANETs
Conquest disk-persistent-RAM hybrid file
system
Optimistic replication
Real-time systems
42
Questions
PARAID: A Gear-Shifting
Power-Aware RAID

Contact


Andy Wang – awang@cs.fsu.edu
http://www.cs.fsu.edu/~awang/conquest-2
43
PARAID Recovery

2.7 times slower than conventional raid

For example, 2 gear PARAID device




First, the soft state must recover
Second, data must be propagated
Third, conventional raid must recover
Recovery not as bad for read intensive
workloads
44
PARAID Gear-Shifting
Web Trace Gear-Shifting Stats
256x
128x
64x
Number of gear switches
15.2
8.0
2.0
% time spent in low gear
52%
88%
98%
% extra I/Os for update
propagations
0.63%
0.37%
0.21%
Cello99 Gear-Shifting Stats
128x
64x
32x
Number of gear switches
6.0
5.6
5.4
% time spent in low gear
47%
74%
88%
% extra I/Os for update
propagations
8.0%
15%
21%
45
Download