Low-Cost Data Deduplication for Virtual Machine Backup in Cloud Storage Wei Zhang, Tao Yang, Gautham Narayanasamy University of California at Santa Barbara.

Presentation transcript:

Low-Cost Data Deduplication for Virtual Machine Backup in Cloud Storage Wei Zhang, Tao Yang, Gautham Narayanasamy University of California at Santa Barbara Hong Tang Alibaba Inc. USENIX HotStorage’2013

Motivation: Virtual machines in the cloud can use frequent backup to improve service reliability. Used in Alibaba's Aliyun, the largest public cloud service in China. High storage demand and large content duplicates. Daily backup workload: hundreds of Aliyun. Number of VMs per cluster: tens of thousands. Seek inexpensive solutions.

Architecture Consideration: An external and dedicated backup storage system incurs high network traffic for transferring undeduplicated data and is expensive. A decentralized and co-hosted backup system with full deduplication has lower cost and traffic.

Requirements: Nondedicated resources, cohosted with existing cloud services. Resource friendly: small memory footprint and CPU usage. Compute and back up tens of thousands of VMs within a few hours each day during light cloud workload.

Focus and Related Work: Previous work: inline chunk-based deduplication has a high cost for fingerprint lookup; fingerprint comparison can be sped up with approximation (e.g. subsampling, Bloom filter, stateless routing). Focus of this paper: not inline (shorten the overall backup time of many VM images rather than that of an individual request); not offline (multi-stage parallel backup with small storage overhead and limited computing resources). Work in progress.

Key Ideas: Separation of duplicate detection and data backup, which differs from inline deduplication. Buffered data redistribution in parallel duplicate detection. Stage 1: collect fingerprints in parallel. Stage 2: detect duplicates in parallel. Stage 3: perform the actual VM backup in parallel.

VM Snapshot Representation: Data blocks are variable-sized. Segments are fixed-size.

Stage 1: Deduplication request accumulation ➔ Scan dirty data blocks ➔ Exchange and accumulate dedup requests ➔ Map data from VM-based to fingerprint-based distribution
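The Stage 1 flow above can be sketched roughly as follows. This is a minimal single-process illustration, not the authors' implementation: the SHA-1 fingerprint, the modulo partitioning rule, the function names, and the `buffer_limit` parameter are all assumptions introduced for the example, and a dictionary stands in for the per-partition request files that would be written to local disk.

```python
import hashlib
from collections import defaultdict

def fingerprint(block: bytes) -> bytes:
    # Assumption: SHA-1 as the block fingerprint; the slides do not name a hash.
    return hashlib.sha1(block).digest()

def partition_of(fp: bytes, num_partitions: int) -> int:
    # Assumption: route a fingerprint to a partition by its leading 8 bytes.
    return int.from_bytes(fp[:8], "big") % num_partitions

def accumulate_requests(vm_blocks, num_partitions, buffer_limit):
    """Turn VM-ordered dirty blocks into fingerprint-partitioned dedup requests.

    vm_blocks: iterable of (vm_id, offset, data) for dirty blocks.
    Returns {partition: [(fingerprint, vm_id, offset), ...]}.
    """
    buffers = defaultdict(list)   # small in-memory buffer per partition
    flushed = defaultdict(list)   # stands in for per-partition files on disk
    for vm_id, offset, data in vm_blocks:
        fp = fingerprint(data)
        q = partition_of(fp, num_partitions)
        buffers[q].append((fp, vm_id, offset))
        if len(buffers[q]) >= buffer_limit:
            # Flush a full buffer so memory use stays bounded.
            flushed[q].extend(buffers[q])
            buffers[q].clear()
    for q, pending in buffers.items():
        flushed[q].extend(pending)  # final flush of partially filled buffers
    return dict(flushed)
```

Because the partition depends only on the fingerprint, identical blocks from different VMs end up in the same partition, which is what lets the next stage compare them one partition at a time.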

Stage 2: Fingerprint comparison and summary output. Load the global index and dedup requests one partition at a time. Compare fingerprints in parallel. Output dedup summaries, mapping from fingerprint-based back to VM-based distribution.
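A minimal sketch of the per-partition comparison in Stage 2, under stated assumptions: the index partition is modeled as a plain dictionary from fingerprint to a stored location, requests are `(fingerprint, vm_id, offset)` tuples, and the function name is hypothetical. The sketch only checks requests against the existing index; how repeated new fingerprints within one batch are treated is left out.

```python
from collections import defaultdict

def compare_partition(index, requests):
    """Compare one partition of dedup requests against its index partition.

    index: dict mapping fingerprint -> stored location (assumed layout).
    requests: list of (fingerprint, vm_id, offset).
    Returns {vm_id: [(offset, is_duplicate), ...]}, i.e. a VM-based summary.
    """
    summaries = defaultdict(list)
    for fp, vm_id, offset in requests:
        # A block is a duplicate if its fingerprint is already indexed.
        summaries[vm_id].append((offset, fp in index))
    return dict(summaries)
```

Grouping the output by `vm_id` mirrors the move from fingerprint-based back to VM-based distribution, so each machine can later read only the summaries for the VMs it hosts.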

Stage 3: Non-duplicate data backup. Load dup summaries. Read dirty segments. Output non-duplicate data blocks.

Memory Usage per Machine at Different Stages. Stage 1 (request accumulation): 1 I/O buffer to read dirty segments; p network send and p recv buffers for p machines; q dedup request buffers for local disk write of q partitions. Stage 2 (fingerprint comparison): space for hosting 1 partition index and the corresponding requests; p network send and p recv buffers; v local summary buffers for disk write. Stage 3 (non-duplicate backup): an I/O buffer to read dirty segments and write non-duplicates; the duplicate summary within dirty segments.
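The buffer counts above translate into a simple per-machine memory estimate. The sketch below only adds up the listed buffers; every buffer size passed in is a hypothetical value chosen for illustration (only the 2MB segment size, p=100 machines, and 25 VMs per machine appear elsewhere in the slides), and treating v as the number of local VMs is an assumption.

```python
KIB = 1024
MIB = 1024 * KIB

def stage1_bytes(p, q, io_buf, net_buf, req_buf):
    # 1 I/O buffer + p send and p recv network buffers + q request buffers.
    return io_buf + 2 * p * net_buf + q * req_buf

def stage2_bytes(p, v, index_bytes, request_bytes, net_buf, summary_buf):
    # One partition index with its requests + 2p network buffers + v summary buffers.
    return index_bytes + request_bytes + 2 * p * net_buf + v * summary_buf

# Hypothetical sizes: 2 MiB I/O buffer (one segment), 64 KiB for every other buffer.
s1 = stage1_bytes(p=100, q=100, io_buf=2 * MIB, net_buf=64 * KIB, req_buf=64 * KIB)
s2 = stage2_bytes(p=100, v=25, index_bytes=8 * MIB, request_bytes=4 * MIB,
                  net_buf=64 * KIB, summary_buf=64 * KIB)
print(f"stage 1: {s1 / MIB:.2f} MiB, stage 2: {s2 / MIB:.2f} MiB")
```

The point of the estimate is that memory grows with p, q, and v times a small buffer size rather than with the size of the global index, since only one index partition is resident at a time.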

Issues with Incidental Redundancy: Two VM blocks with the same fingerprint are created in parallel on different machines. Both are identified as new blocks. The remaining occurrences are detected as duplicates and logged. The inconsistency is repaired periodically during index update.
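The slides do not say how the periodic repair works. One plausible approach, shown here purely as an assumption, is to keep the first stored copy of each fingerprint as canonical during index update and record a remapping for the redundant copies; the function name and the `(fingerprint, location)` entry layout are hypothetical.

```python
def repair_index(new_entries):
    """Collapse redundant new-block entries that share a fingerprint.

    new_entries: list of (fingerprint, location) logged as "new" blocks.
    Returns (index, remap): index keeps one canonical location per
    fingerprint; remap sends each redundant location to the canonical one.
    """
    index = {}
    remap = {}
    for fp, loc in new_entries:
        if fp in index:
            remap[loc] = index[fp]   # redundant copy: point at the canonical block
        else:
            index[fp] = loc          # first occurrence becomes canonical
    return index, remap
```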

Snapshot Deletion: Mark-and-sweep: a block can be deleted if its reference count is zero. Similar to the deduplication stages: scan the metadata and accumulate block reference pointers; compute the reference count of each index entry, partition by partition; log deletion instructions. Periodically perform a compact operation when the deletion log grows too big.
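The reference-counting step for one partition can be sketched as below. This is an illustration under assumptions: fingerprints are used directly as index keys, the references accumulated from snapshot metadata are given as a flat list, and the function name is hypothetical.

```python
from collections import Counter

def sweep_partition(index_fps, referenced_fps):
    """Return the index entries of one partition that can be deleted.

    index_fps: fingerprints present in this index partition.
    referenced_fps: fingerprints referenced by remaining snapshots,
        with repeats (one per reference pointer found in the metadata).
    """
    refcount = Counter(referenced_fps)           # mark: count references
    # Sweep: an entry with zero references is logged for deletion.
    return [fp for fp in index_fps if refcount[fp] == 0]
```

The returned list plays the role of the logged deletion instructions; actual space reclamation would happen later, in the compact operation.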

Evaluation: Evaluated on a cluster of dual quad-core Intel Nehalem 2.4GHz E5530 machines with 24GB memory. Test data from the Alibaba Aliyun cloud: 41TB, 10 snapshots per VM. Segment size: 2MB. Avg. block size: 4KB. Evaluation objectives: 1) Analyze the deduplication throughput and effectiveness for a large number of VMs. 2) Examine the impact of buffering during metadata exchange.

Data Characteristics: Each VM uses 40GB of storage space on average. OS and user data disks: each takes ~50% of the space. OS data: 7 mainstream OS releases: Debian, Ubuntu, Redhat, CentOS, Win bit, win bit and win bit. User data: from 1323 VM users.

Setting & Resource Usage per Machine: P=100 machines, 25 VMs per machine. Disk: 8GB metadata usage; 10ms local disk seek cost; 50MB/second I/O per machine, < 16.7% of local I/O bandwidth usage. Memory usage: ~35MB. CPU: single-thread execution per machine, 10-13% of a single core.

Parallel Time When Memory Limit Varies

Performance when 35MB of memory is used per machine. Option 1: unoptimized data redistribution.

Conclusions: Low-cost multi-stage parallel deduplication for simultaneous backup of many VM images, co-hosted with other cloud services. Tradeoff: not optimized for an individual backup request; dirty data is read twice. Work in progress. Evaluation: backup throughput of 100 machines is about 8.76GB per second for 2500 VMs; resource friendly to the existing cluster services.
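As a rough consistency check on the reported numbers, the arithmetic below combines figures from the slides (100 machines, 25 VMs per machine, 40GB per VM on average, 8.76GB per second aggregate throughput). It assumes decimal units and that the throughput is measured against the full VM data size; neither is stated in the slides, so the result is only indicative.

```python
machines = 100
vms_per_machine = 25
gb_per_vm = 40            # average storage per VM
throughput_gb_s = 8.76    # aggregate backup throughput of 100 machines

total_vms = machines * vms_per_machine                # 2500 VMs
total_gb = total_vms * gb_per_vm                      # 100,000 GB of VM data
hours = total_gb / throughput_gb_s / 3600             # time for one full pass
per_machine_mb_s = throughput_gb_s * 1000 / machines  # share per machine

print(f"{total_vms} VMs, {total_gb} GB, {hours:.2f} hours, "
      f"{per_machine_mb_s:.1f} MB/s per machine")
```

Under these assumptions a full pass takes a little over three hours, which is in line with the stated requirement of finishing within a few hours each day.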

Questions?