TRAP-Array: A Disk Array Architecture Providing Timely Recovery to Any Point-in-time Authors: Qing Yang,Weijun Xiao,Jin Ren University of Rhode Island Presented By: Anuradharthi T 6/5/2008 1 Outline Introduction Background Related Work TRAP- 4 Architecture Results Conclusion 6/5/2008 2 Introduction RAID Architecture - Most prominent architecture advance in disk I/O systems - In use for more than two decades - Types of RAID: 6/5/2008 RAID1- provides 2N data redundancy to protect data RAID3, 4 & 5- store data in parity stripes across multiple disks to improve space efficiency & performance 3 RAID Architecture Advantages: - Can recover data from more than one disk failure - Improves data reliability Disadvantages: - Recovery not possible when damaged data are not confined to 1 or 2 disks - Accounts for 60% to 80% of data losses - Examples of such damages are: software defects, virus attacks, power failure or site failure 6/5/2008 4 Solution Timely Recovery to Any Point-in-time(TRAP) - Keep log of all previous versions of changed data blocks in time sequence - Utilizes a fast & simple encoding scheme => less space - Provides faster data recovery to any-point-in-time due to drastically smaller amount of storage space used =>improved performance - Thus achieves an optimal space & performance characteristics 6/5/2008 5 Background Recovery of data in real world is measured by ‘2’ key parameters: 1. RPO (Recovery Point Objective) - measures maximum acceptable age of data at time of outage 2. RTO (Recovery Time Objective) - maximum acceptable length of time to resume normal data processing operations after an outage 6/5/2008 6 Classification of storage architecture Storage architectures are capable of recovering data upon an outage Based on the 2 key parameters 6/5/2008 TRAP-1 TRAP-2 TRAP-3 TRAP-4 7 Related work TRAP-1 TRAP-2 Uses periodical backups & snapshots Time consuming & degrades application performance Data transferred to tapes or disk for backup Performs file versioning that records a history of changes to files Versioning has to be done manually Have controllable RTO & RPO File system dependent TRAP-3 6/5/2008 Keep a log of changed data for each data block in a time sequence (time stamps) Continuous Data Protection (CDP) Huge amount of storage space required 8 TRAP-4 Architecture Keeps a log of parities as a result of each write on the block Suppose a host writes into data block with logic address Ai that belongs to a data stripe (A1,A2….Ab….An) RAID controller perform the parity calculation as follows: Where, PT(k) – new parity for corresponding stripe Ai(k) – new data for data block Ai Ai(k-1) – old data of data block Ai PT(k-1) – old parity of the stripe 6/5/2008 9 Optimizing the parity P’T(k)= is appended to the parity log stored in the TRAP disk after a simple encoding Only 5% to 20% of bits inside a data block actually change on a write operation Parity P’T(k) reflects the exact changes at bit level of new write operation on the existing block As a result, this parity block contains mostly zeros with a very small portion of bit stream that is nonzero Thus it can be easily encoded to a small size parity block to be appended to the parity log reducing the amount of storage space 6/5/2008 10 Block Diagram of TRAP-4 design 6/5/2008 11 Recovery on outage Consider the parity log corresponding to a data block, Ai after a series of write operations. Log contains P’T(k) P’T(k-1) …. P’T(2) P’T(1) with timestamps T(k), T(k-1), …. T(2) and T(1) associated with parities When an outage occurs at time t1, and we would like to recover data as at time t0 (t0 < t1) Note that for all l = 1,2, … r 6/5/2008 12 System Architecture of TRAP-4 6/5/2008 13 Experimental Setup 6/5/2008 14 File System micro benchmarks 6/5/2008 15 Data size comparison 6/5/2008 16 Data Reduction ratio 6/5/2008 17 Retrieval time comparison 6/5/2008 18 Conclusions A new disk array architecture capable of providing timely recovery to any point-in-time for user data stored in array Up to 2 orders of magnitude improvements in terms of storage efficiency Has quick recovery time Provides continuous data protection capability 6/5/2008 19 Thank You 6/5/2008 20