Data Deduplication in Virtualized Environments Marc Crespi, ExaGrid Systems

Slides:



Advertisements
Similar presentations
© 2009 VMware Inc. All rights reserved vCenter Site Recovery Manager 5.1.
Advertisements

Why Virtual Machine Backups Are Different David Davis Blog:
Eric Siebert vExpert, Author, Blogger Blog: Core Technologies.
RETHINK BACKUP & ARCHIVE. 2 Backup and Archive are Top IT Priorities Which of the following would you consider to be your org’s most important IT priorities.
Basic principles of backup policies Andrea Mauro vExpert and VCDX.
Protect Your Business and Simplify IT with Symantec and VMware Presenter, Title, Company Date.
Your all-in-one backup appliance
1 Storage Today Victor Hatridge – CIO Nashville Electric Service (615)
Empowering Business in Real Time. © Copyright 2009, OSIsoft Inc. All rights Reserved. Virtualization and HA PI Systems: Three strategies to keep your PI.
Windows Azure Conference 2014 Hybrid Cloud Storage: StorSimple and Windows Azure.
TechTarget Backup School exagrid.com | 1 Backup School ExaGrid Stress-free backup storage
Barracuda Backup Service Data Backup and Disaster Recovery.
Symantec De-Duplication Solutions Complete Protection for your Information Driven Enterprise Richard Hobkirk Sr. Pre-Sales Consultant.
Microsoft Virtual Server 2005 Product Overview Mikael Nyström – TrueSec AB MVP Windows Server – Setup/Deployment Mikael Nyström – TrueSec AB MVP Windows.
NetApp The perfect fit for Virtual Desktop Infrastructure
VIRTUALIZATION AND YOUR BUSINESS November 18, 2010 | Worksighted.
vSphere 5 Changes for Backups and Administration Rick Vanover MCITP vExpert VCP Veeam Software.
Barracuda Networks Confidential1 Barracuda Backup Service Integrated Local & Offsite Data Backup.
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
1 Virtualization Services. 2 Cloud Hosting –Shared Virtual Servers –Dedicated Servers Managed Server Options Multiple Access Methods –EarthLink Business.
Hyper-V 3.0 – What’s New in Windows Server 2012? Brien Posey
1 © Copyright 2009 EMC Corporation. All rights reserved. Agenda Storing More Efficiently  Storage Consolidation  Tiered Storage  Storing More Intelligently.
1© Copyright 2013 EMC Corporation. All rights reserved. November 2013 Oracle Backup and Recovery.
November 2009 Network Disaster Recovery October 2014.
1© Copyright 2013 EMC Corporation. All rights reserved. EMC and Microsoft SharePoint Server Data Protection Name Title Date.
COMPANY AND PRODUCT OVERVIEW Russ Taddiken Director of Principal Storage Architecture.
PRESIDIO.COM MARCH  Presidio Overview  What’s New in VDP and VDPA  VDPA Features  Backup and Restore Job Creation  Q&A.
Virtualization Lab 3 – Virtualization Fall 2012 CSCI 6303 Principles of I.T.
Windows Server 2012 R2: What’s New Mike Resseler.
STEALTH Content Store for SharePoint using Caringo CAStor  Boosting your SharePoint to the MAX! "Optimizing your Business behind the scenes"
Virtualization. Virtualization  In computing, virtualization is a broad term that refers to the abstraction of computer resources  It is "a technique.
Chapter 8 Implementing Disaster Recovery and High Availability Hands-On Virtual Computing.
Meeting the Data Protection Demands of a 24x7 Economy Steve Morihiro VP, Programs & Technology Quantum Storage Solutions Group
Demystifying Deduplication. Global SMB Event Marketing 2 APPROACH: What is deduplication? Eliminate redundant data Start with the backup environment as.
Storage Trends: DoITT Enterprise Storage Gregory Neuhaus – Assistant Commissioner: Enterprise Systems Matthew Sims – Director of Critical Infrastructure.
Virtualization for Disaster Recovery Panel Discussion May 19, 2010 Ed Walsh EMC vSpecialist EMC Corporation Cell Chris Fox.
1 Week #10Business Continuity Backing Up Data Configuring Shadow Copies Providing Server and Service Availability.
David Davis Blog: Disaster Recovery of VMware Workloads.
Physical vs. Virtual Backups Rick Vanover MCITP vExpert VCP Veeam Software.
VMware Backup Integrity Eric Siebert vExpert, Author, Blogger Blog:
@Veeam How to do a backup wrong: Top mistakes Rick Vanover MCITP vExpert VCP Veeam Software.
Best Practices for VMware Backups Rick Vanover MCITP vExpert VCP Software Strategy Specialist Veeam.
Emerging Technologies Understanding Deduplication Kevin Carpenter Account Manager Upstate NY Phil Benincasa System Engineer Upstate NY.
Eric Siebert vExpert, Author, Blogger Restore capabilities of VMware backups Blog:
| nectar.org.au NECTAR TRAINING Module 4 From PC To Cloud or HPC.
ExaGrid Stress-free backup storage
TechTarget Backup School exagrid.com | 1 Backup School Backup School Backup School
TechTarget Backup School Truth in IT Backup School exagrid.com | 1 Backup School Backup School Backup School
TechTarget Backup School exagrid.com | 1 Backup School ExaGrid Stress-free backup storage
1 © Copyright 2009 EMC Corporation. All rights reserved. Backup Challenges in VMware Environments.
TechTarget Backup School exagrid.com | 1 Backup School ExaGrid / Commvault Stress-free backup storage.
TechTarget Backup School exagrid.com | 1 Backup School ExaGrid Stress-free Backup Storage Veeam Training
Practical IT Research that Drives Measurable Results Mitigate Costs & Maximize Value with a Consolidated Network Storage Strategy.
Practical IT Research that Drives Measurable Results 1Info-Tech Research Group Get Moving with Server Virtualization.
Solving Today’s Data Protection Challenges with NSB 1.
Extending Auto-Tiering to the Cloud For additional, on-demand, offsite storage resources 1.
TechTarget Backup School exagrid.com | 1 ExaGrid Stress-free backup storage
Commvault and Nutanix October Changing IT landscape Today’s Challenges Datacenter Complexity Building for Scale Managing disparate solutions.
PHD Virtual Technologies “Reader’s Choice” Preferred product.
Barracuda Backup Easy Cloud-Connected Backup Version 5.4 | July 2014.
ExaGrid Stress-free backup storage
Secure Data – a safe place in an unsafe world!
Demystifying Deduplication
Agenda Backup Storage Choices Backup Rule
Briefing: Leverage HPE Storage Solutions in Windows/Hyper-V
SAN and NAS.
Storage Trends: DoITT Enterprise Storage
Outline Virtualization Cloud Computing Microsoft Azure Platform
Specialized Cloud Architectures
Enterprise Class Virtual Tape Libraries
Presentation transcript:

Data Deduplication in Virtualized Environments Marc Crespi, ExaGrid Systems

About the speaker  Marc has over 20 years of software and hardware experience in the high technology sector  He is part of the ExaGrid team that drives product strategy and execution and is responsible for managing product operations.  Prior to joining the company, Marc was director of product management for security management products at Altiris.

Objective of This Program What is Deduplication? Why Use Deduplication in Backup and Recovery? Challenges of Deduplication in Virtualized Environments Deduplication approaches (two camps) Summary ‒ Deduplication’s Role in Data Protection and Disaster Recovery

 Enhanced Speed/Performance ●Faster backup times due to lower volume of data to be backed up ●Data lands faster because it is targeted at disk  Dramatic Savings in Disk Costs ●20:1 Reduction in amount of disk space required to store backups  Scalability ●Backup higher data volumes while maintaining backup window  Offsite Disaster Recovery ●Efficient use of bandwidth via WAN-efficient replication Why Use Deduplication in Backup and Recovery?

VM Reduced storage footprint with deduplication  Reduce total amount of storage by as much as 1000:1  Store only the bytes that change in your VMware virtual servers  Eliminate redundancy of typical VMware backups  Restore quickly from most recent VMware backup  Each virtual server image gets backed up in its entirety  Large amount of storage consumed  Deduplicate backups to changed bytes  Dramatic savings in disk and bandwidth  Integrated Replication Eliminate Redundancies for More Efficient Virtual Server Backups VM

Specific Challenges of Backups/Restores in Virtualized Environments  Management of backups ●Growing number of virtual machines/ sprawl ●Inability to monitor backups on individual virtual machines  Handling the volume of backup data efficiently ●More data to store as virtual machines proliferate ●Each change means entire virtual server is backed up These challenges are driving a need for better tools to more reliably and easily back up and restore virtual machines Example: 10 guest OS instances x 50GB = 500GB of backed-up virtual images daily

How Dedupe Works: Store Only Changed Bytes Standard Disk Total 500GB Total 3.4GB 2.5GB 100MB Oldest Backup Most Recent Backup 50GB Oldest Backup Most Recent Backup Stored Optimized for Read 100MB Data Deduplication 50GB VM 500GB 3.4GB

 2011 ExaGrid Systems, Inc. Where to Deploy Deduplication PROS  Reduces impact on VM  Shortens BU window/less data  Reduced bandwidth needed to the backup target  Reduction in storage usage CONS  Can be slower for large (multiple TB) amounts of data  Increased workload on servers PROS  Shortens BU window/less data  Reduced replication bandwidth  Reduction in storage usage CONS  Must transfer the entire dataset to the device  Don’t get reduced bandwidth needed to the backup target Target Based Data Reduction Removes data redundancies after transmission to the backup target Source Based Data Reduction Removes data redundancies before transmission to the backup target

 2011 ExaGrid Systems, Inc.  Achieves an additional 80% data reduction (98% total) ●Further reduction in bandwidth ●Further reduction in storage usage ●Further reduction in backup window  Integrated replication of virtual servers Source Based PLUS Target Based Data Deduplication Removes data redundancies before and after transmission to the backup target Using Both Deduplication Techniques Provides Complementary Benefits

 2011 ExaGrid Systems, Inc. Architectural Considerations Scalable GRID Architecture Multiple Deduplication Engines Legacy Architecture - Single Controller One Deduplication Engine Backup Window X TB/hr 20 TB 30 TB 40 TB 50 TB 60 TB Disks Deduplication Engine X TB/hr 2X TB/hr 3X TB/hr 4X TB/hr 5X TB/hr 6X TB/hr 20 TB 30 TB 40 TB 50 TB 60 TB 10 TB Deduplication Engine Backup Window

 2011 ExaGrid Systems, Inc. Architectural Considerations Scalable GRID Architecture Multiple Deduplication Engines Legacy Architecture – Single Controller Legacy Architecture – Appliance Sprawl One Deduplication Engine  Linear performance as data grows, stable backup window  Capacity is virtualized across nodes  Deduplication is shared across nodes  Simplified management through single UI  System can be right-sized to current data size  Avoids forklift upgrades Scalable GRID Features Individual appliances Deduplication Engine

Benefits  One-time division of data during installation (15 to 30 minutes)  GRID software manages placement of data  Revisit only during expansion (additional 15 to 30 minutes)  Eliminates the challenges of monolithic, primary storage like architectures GRID Architecture for Deduplication Performance Backup Servers Wire Speed Node 1 – System Capacity – RAID6 Landing Zone Node 2 – System Capacity – RAID6 Repository Landing Zone Deduplication Process Load Balancing Backup Job VM

What We Covered What is Deduplication? Why Use Deduplication in Backup and Recovery? Challenges of Deduplication in Virtualized Environments Overview Diagram of Major Components Deduplication approaches (two camps) Summary ‒ Deduplication’s Role in Data Protection and Disaster Recovery

Enjoy and share this material  Feel free to promote this material  Recommend your peers to pass certification  Blog, Tweet and share this material and your experience on Facebook  You’re an Expert? We will be happy to have you as Backup Academy contributor. Apply here.here Web: Twitter: BckpAcademyBckpAcademy Facebook: backup.academybackup.academy