Download presentation
Presentation is loading. Please wait.
Published byClemence Baldwin Modified over 9 years ago
1
1© Copyright 2013 EMC Corporation. All rights reserved. Characterization of Incremental Data Changes for Efficient Data Protection Hyong Shim, Philip Shilane, & Windsor Hsu Backup Recovery Systems Division EMC Corporation
2
2© Copyright 2013 EMC Corporation. All rights reserved. Data Protection Environment SAN or LAN WAN Application Servers Primary Storage Data Protection Storage High I/O per sec. Medium Capacity Large Capacity Medium I/O per sec. Virtual Machines
3
3© Copyright 2013 EMC Corporation. All rights reserved. Contributions Detailed analysis of data change characteristics from enterprise customers Design for replication snapshots to lower overheads on primary storage. Evaluation of overheads on data protection storage Rules-of-thumb for storage engineers and administrators
4
4© Copyright 2013 EMC Corporation. All rights reserved. EMC Symmetrix VMAX Traces Trace Set#Volume# Storage Systems Duration hrs Estimated Capacity (GB) 1hr_1Wrt109,26312530.4 [78.3]71 [203] 1hr_1GBWrt16,1001207.7 [6.7]132 [262] 24hr_1GBWrt5081324.4 [1.2]318 [439] Collected from enterprise customer sites
5
5© Copyright 2013 EMC Corporation. All rights reserved. Capacity and Write Footprint Analysis for 1hr_1GBWrit Not collected: applications using each volume
6
6© Copyright 2013 EMC Corporation. All rights reserved. I/O Properties Trace Set#Write reqs (1000s) Write size (GB) #Read reqs (1000s) Read size (GB) 1hr_1Wrt72 [510] 2 [31] 167 [1963] 5 [66] 1hr_1GBWrt429 [1270] 11 [80] 796 [4987] 25 [166] 24hr_1GBWrt1803 [4839] 51 [338] 7824 [23875] 242 [763] 1.9-4.3X more read I/Os than write I/Os 2.3-4.7X more GB read than written High variability More analysis in the paper
7
7© Copyright 2013 EMC Corporation. All rights reserved. Sequential vs. Random Write I/O We measure how much data are written, on average, after seeking to a non-consecutive sector. Selected most sequential and most random for analysis Storage Volume wwww Trace Timeline (w = Write I/O, r = Read I/O) rw Sequential Write I/O (5+ 1 + 3)/ 3 = 3
8
8© Copyright 2013 EMC Corporation. All rights reserved. rwwwrwrwwwwwwrw … Replication Interval 1 Transfer Period may require snapshot storage and I/O Trace Timeline (w = Write I/O, r = Read I/O) Storage Volume Sectors Replication Interval 2 Block Trace Analysis Methodology Create a snapshot to protect block data
9
9© Copyright 2013 EMC Corporation. All rights reserved. Replication Snapshot 0 Storage Volume state before transfer takes place 1234 Block: = Modified block to be transferred Trace Timeline (w = Write I/O) Goal: Create a snapshot technique that is integrated with replication that decreases overheads on primary storage Change block tracking records modified blocks for next replication interval, possibly with a bit vector. A snapshot has to maintain block values against overwrites.
10
10© Copyright 2013 EMC Corporation. All rights reserved. Replication Snapshot Baseline Snapshot: All writes cause copy-on-write 0 Storage Volume state before transfer takes place 1234 Block: = Modified block to be transferred Snapshot Area Trace Timeline (w = Write I/O) www Baseline Transfer in progress
11
11© Copyright 2013 EMC Corporation. All rights reserved. Replication Snapshot Changed Block Replication Snapshot (CB): Only writes to tracked blocks cause copy-on-write 01234 Block: Snapshot Area www Baseline Transfer in progress CB
12
12© Copyright 2013 EMC Corporation. All rights reserved. Replication Snapshot Changed Block with Early Release Replication Snapshot (CBER): Only writes to tracked blocks cause copy-on-write, and blocks are released once transferred 01234 Block: Snapshot Area www Baseline Transfer in progress CB CBER
13
13© Copyright 2013 EMC Corporation. All rights reserved. Replication Snapshot 01234 Block: Snapshot Area www Baseline CB CBER Baseline Snapshot: All writes cause copy-on-write Changed Block Replication Snapshot (CB): Only writes to tracked blocks cause copy-on-write Changed Block with Early Release Replication Snapshot (CBER): Only writes to tracked blocks cause copy-on-write, and blocks are released once transferred = Modified block to be transferred
14
14© Copyright 2013 EMC Corporation. All rights reserved. Snapshot Storage Overheads Rule-of-thumb: Over-provision primary capacity by 8% for snapshots
15
15© Copyright 2013 EMC Corporation. All rights reserved. Snapshot I/O Overheads Rule-of-thumb: Over-provision primary I/O by 100% to support copy-on-write related write-amplification
16
16© Copyright 2013 EMC Corporation. All rights reserved. Snapshot I/O Overheads Rule-of-thumb: Over-provision primary I/O by 100% to support copy-on-write related write-amplification
17
17© Copyright 2013 EMC Corporation. All rights reserved. Transfer Size to Protection Storage Rule-of-thumb: 40% of written bytes are transferred to protection storage
18
18© Copyright 2013 EMC Corporation. All rights reserved. IOPS Requirements for Protection Storage Rule-of-thumb: Protection storage must support 20% of the I/O per second capabilities of primary storage
19
19© Copyright 2013 EMC Corporation. All rights reserved. Related Work Trace analysis –Numerous publications Most closely related is Patterson [2002] Snapshots –Common paradigm for storage but rarely integrated with incremental transfer techniques –Storage overheads Azagury [2002] and Shah [2006] Synchronous Mirroring –Effective when change rates are low and geographic distance is small –We are focused on periodic, asynchronous replication
20
20© Copyright 2013 EMC Corporation. All rights reserved. Conclusion SAN or LAN WAN Application Servers Primary Storage Data Protection Storage High I/O per sec. Medium Capacity Large Capacity Medium I/O per sec.
21
21© Copyright 2013 EMC Corporation. All rights reserved. Conclusion Trace analysis shows diversity of storage characteristics Snapshot overheads on primary storage can be decreased by improved integration with network transfer Sequential versus random access patterns affect incremental change patterns on both primary and protection storage
22
22© Copyright 2013 EMC Corporation. All rights reserved. Rules-of-Thumb Over-provision primary capacity by 8% for snapshots Over-provision primary I/O by 100% to support copy-on-write related write-amplification A write buffer decreases snapshot I/O overheads but has little impact on storage overheads 40% of written bytes are transferred to protection storage Schedule at least 6 hours between transfers to minimize clean data in transferred blocks Schedule at least 12 hours between transfers to minimize peak network bandwidth requirements Protection storage must support 20% of the I/O per second capabilities of primary storage
23
23© Copyright 2013 EMC Corporation. All rights reserved. Questions?
25
25© Copyright 2013 EMC Corporation. All rights reserved. Trace Analysis: Replication of Snapshots The amount of data to replicate drops in half with 12 hours between snapshots. 4KB results are compared to Patterson 2002.
26
26© Copyright 2013 EMC Corporation. All rights reserved. I/O Per Second (IOPS) Request Rate Trace SetAverage Write Rate Peak Write Rate 10ms Average Read Rate Peak Read Rate 10 ms 1hr_1Wrt0.7 [8] 1762 [2602] 2 [25] 1693 [2457] 1hr_1GBWrt15 [37] 4360 [4379] 29 [118] 3603 [4135] 24hr_1GBWrt20 [55] 9004 [8165] 89 [269] 5647 [7012] Peak Values: IOPS are calculated every 10ms period, and the peaks for each volume are averaged. More analysis in the paper
27
27© Copyright 2013 EMC Corporation. All rights reserved. Snapshot I/O Overheads Rule-of-thumb: Over-provision primary I/O by 100% to support copy-on-write related write-amplification
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.