Download presentation
Presentation is loading. Please wait.
Published byClement Fletcher Modified over 9 years ago
1
Three-Dimensional Redundancy Codes for Archival Storage J.-F. Pâris, U. of Houston D. D. E. Long, U. C. Santa Cruz W. Litwin, U. Paris-Dauphine
2
Background Archival files Must be kept a long time At lowest possible cost Emphasis on Providing highest reliability at lowest cost Update speed is less important Focus on multi-dimensional RAID arrays Highly reliable Very space-efficient
3
A two-dimensional RAID array D11 D12 P1 D21 D22 P2 Q1Q2 Four parity disks Four parity stripes Four data disks
4
A better array P1 D13 D14 D34 D23 D24 D12 P2 P3 P4 Four parity disks Four parity stripes Six data disks
5
Can we do better? Use a three-dimensional organization Replace parity stripes by parity planes Each parity plane will contain one parity disk Place data disks will at the intersections of three parity planes
6
Parity planes α, β, γ and δ Four data disks αβγ,αβδ, αγδ and βγδ Example
7
With n p parity disks, we can protect data disks against all triple failures 2-D organizations with same number of parity disks could only protect data disks and only against all double failures Advantages
8
More data disks per parity disk
9
More protection at a lower cost
10
Drawback of 3-D arrays More complex update procedure Each time we modify a data block, we have to update three parity blocks Not an issue for data that are rarely updated Archives, media
11
Handling quadruple failures Only a few specific quadruple failures are fatal We show that array can tolerate fraction of all quadruple failures
12
Selected results Compared the MTTDL of a 3-D array with 20 data disks and 6 parity disks with those of Two RAID arrays with 10 data disks and 3 parity disks each 60 disks using three-way mirroring to store the equivalent content of 20 data disks A 2-D array with 21 data disks and 7 parity disks under standard stochastic assumptions
13
System Parameters Disk mean time to fail was assumed to be 100,000 hours (11 years and 5 months) Corresponds to a failure rate of 8 to 9 percent per year High end of failure rates observed by Schroeder and Gibson and Pinheiro et al. Disk repair times varied between 12 hours and one week
14
Comparing MTTDLs
15
Conclusion 3-D RAID arrays require Fewer parity disks than comparable RAID array organizations to achieve Higher MTTDLs Sole limitation is cost of updates
16
Work in Progress Can we build zero-maintenance disk arrays? Start with a 3-D RAID array Add enough spares to last several years Critical factor is failure rate of unused spares Potential for one or two MS theses Require willingness to learn Python
17
Extra Slides
18
Our Model Device failures are mutually independent and follow a Poisson law A reasonable approximation Device repairs can be performed in parallel Device repair times follow an exponential law Not true but fairly robust H.-W. Kao, J.-F. Paris, T. Schwarz, S. J., and D. D. E. Long, A Flexible Simulation Tool for Estimating Data Loss Risks in Storage Arrays, Proc. MSST Symposium, May 2013.
19
State Diagram State 0 is initial state is the fraction of quadruple disk failures that result in a data loss
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.