Demystifying Deduplication

Slides:



Advertisements
Similar presentations
Tivoli SANergy. SANs are Powerful, but... Most SANs today offer limited value One system, multiple storage devices Multiple systems, isolated zones of.
Advertisements

Data Storage Solutions Module 1.2. Data Storage Solutions Upon completion of this module, you will be able to: List the common storage media and solutions.
RAID Oh yes Whats RAID? Redundant Array (of) Independent Disks. A scheme involving multiple disks which replicates data across multiple drives. Methods.
NAS vs. SAN 10/2010 Palestinian Land Authority IT Department By Nahreen Ameen 1.
RETHINK BACKUP & ARCHIVE. 2 Backup and Archive are Top IT Priorities Which of the following would you consider to be your org’s most important IT priorities.
Real-World Customer Deployments of StorSimple and Microsoft Azure
Your all-in-one backup appliance
REDUNDANT ARRAY OF INEXPENSIVE DISCS RAID. What is RAID ? RAID is an acronym for Redundant Array of Independent Drives (or Disks), also known as Redundant.
LANs and WANs Network size, vary from –simple office system (few PCs) to –complex global system(thousands PCs) Distinguish by the distances that the network.
© Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. HP StoreOnce How to win.
Windows Azure Conference 2014 Hybrid Cloud Storage: StorSimple and Windows Azure.
1 D2D Data Protection Solution Evaluation Project #552.
Hitachi Sepaton and Symantec NetBackup Powerful Protection Together
TechTarget Backup School exagrid.com | 1 Backup School ExaGrid Stress-free backup storage
Barracuda Backup Service Data Backup and Disaster Recovery.
Symantec De-Duplication Solutions Complete Protection for your Information Driven Enterprise Richard Hobkirk Sr. Pre-Sales Consultant.
Microsoft hybrid cloud backup: … differentiated … cost effective … for private/public cloud deployments 123.
Information Means The World.. Enhanced Data Recovery Agenda EDR defined Backup to Disk (DDT) Tape Emulation (Tape Virtualization) Point-in-time Copy Replication.
Agenda Symantec Enterprise Vault 1 Today’s Management Challenges 1 Why Management? 2 The Solution: Symantec Enterprise Vault 3 Benefits & Closing.
Servers Redundant Array of Inexpensive Disks (RAID) –A group of hard disks is called a disk array FIGURE Server with redundant NICs.
IBM TotalStorage ® IBM logo must not be moved, added to, or altered in any way. © 2007 IBM Corporation Break through with IBM TotalStorage Business Continuity.
1© Copyright 2011 EMC Corporation. All rights reserved. EMC Data Domain and F5 Networks Seamlessly Integrating Deduplicated Storage into a Tiered Storage.
Data Deduplication in Virtualized Environments Marc Crespi, ExaGrid Systems
LAN / WAN Business Proposal. What is a LAN or WAN? A LAN is a Local Area Network it usually connects all computers in one building or several building.
Understanding the Benefits and Costs of Deduplication Mahmoud Abaza, and Joel Gibson School of Computing and Information Systems, Athabasca University.
STEALTH Content Store for SharePoint using Caringo CAStor  Boosting your SharePoint to the MAX! "Optimizing your Business behind the scenes"
NetBackup PureDisk Kris Hagerman Sr. Vice President, Data Center Management.
Offsite Backup Solutions Justin Paul Senior Virtualization Engineer / VMware vExpert –
Quantum Overview A proven global expert in data protection and big data management.
DATA DEDUPLICATION By: Lily Contreras April 15, 2010.
Virtualization for Storage Efficiency and Centralized Management Genevieve Sullivan Hewlett-Packard
Confidential1 Introducing the Next Generation of Enterprise Protection Storage Enterprise Scalability Enhancements.
Demystifying Deduplication. Global SMB Event Marketing 2 APPROACH: What is deduplication? Eliminate redundant data Start with the backup environment as.
Storage Trends: DoITT Enterprise Storage Gregory Neuhaus – Assistant Commissioner: Enterprise Systems Matthew Sims – Director of Critical Infrastructure.
Microsoft Azure Storage. Networking Compute Storage Virtual Machine Operating System Applications Data & Access Runtime Provision.
| Basel Cloud Integrated Storage – StorSimple (e) Paul Chiola Technology Solutions Professional – Incubation WE
RevDedup: A Reverse Deduplication Storage System Optimized for Reads to Latest Backups Chun-Ho Ng, Patrick P. C. Lee The Chinese University of Hong Kong.
Virtual Tape Library
Emerging Technologies Understanding Deduplication Kevin Carpenter Account Manager Upstate NY Phil Benincasa System Engineer Upstate NY.
IBM Systems and Technology Group © 2009 IBM Corporation IBM System Storage – Tape Part 1 This document is for IBM and IBM Business Partner use only. It.
TechTarget Backup School exagrid.com | 1 Backup School Backup School Backup School
TechTarget Backup School Truth in IT Backup School exagrid.com | 1 Backup School Backup School Backup School
1 © Copyright 2009 EMC Corporation. All rights reserved. Backup Challenges in VMware Environments.
TechTarget Backup School exagrid.com | 1 Backup School ExaGrid / Commvault Stress-free backup storage.
© 2009 IBM Corporation Statements of IBM future plans and directions are provided for information purposes only. Plans and direction are subject to change.
CDP Competitive analysis of FalconStor CONFIDENTIAL DO NOT REDISTRIBUTE.
DXi Solution Presentation Casey Burns Field Marketing.
Enhanced Availability With RAID CC5493/7493. RAID Redundant Array of Independent Disks RAID is implemented to improve: –IO throughput (speed) and –Availability.
Solving Today’s Data Protection Challenges with NSB 1.
TechTarget Backup School exagrid.com | 1 ExaGrid Stress-free backup storage
CDP Technology Comparison CONFIDENTIAL DO NOT REDISTRIBUTE.
The Economics of Notes and Domino 8.5: How to Decrease Cost & Increase Productivity by Optimizing your Infrastructure.
Commvault and Nutanix October Changing IT landscape Today’s Challenges Datacenter Complexity Building for Scale Managing disparate solutions.
PHD Virtual Technologies “Reader’s Choice” Preferred product.
Integrating Disk into Backup for Faster Restores
Secure Data – a safe place in an unsafe world!
Network Attached Storage Overview
Indexing and hashing.
Agenda Backup Storage Choices Backup Rule
SAN and NAS.
9/11/2018 4:02 PM BRK2161 Maximize storage efficiency and conquer distributed file access with Windows Server and Azure Files Will Gries, PM Fabian Uhse,
Data Protection Suite Family Overview
Storage Trends: DoITT Enterprise Storage
Hybrid Storage Competitive Sales Guide INTERNAL ONLY
PRESENTER GUIDANCE: These charts provide data points on how IBM BaaS mid-market benefits a client with the ability to utilize a variety of backup software.
Move your data to the cloud with Azure and {Partner Company Name}
Presenter name goes here Presenter title goes here
IBM Tivoli Storage Manager
Fan Ni Xing Lin Song Jiang
Presentation transcript:

Demystifying Deduplication

Deduplication eliminates redundant copies of data What is deduplication? Deduplication eliminates redundant copies of data by leveraging pointers to point duplicate files or blocks to a single object APPROACH: Eliminate redundant data Start with the backup environment as the first phase Maintain references to single instances of data across data store Deduplication can decrease disk capacity requirements by up to 98% and decrease bandwidth requirements for data transfer by up to 50 times.

Dell’s point of view on deduplication Data Deduplication is a capacity optimization feature – not a capacity optimization solution Need to understand what problem you are trying to fix Dell can help find the right solution to your storage challenges As deduplication matures it will be ubiquitous across a wide range of storage products Deduplication integrated into software functionality provides the greatest benefits Deduplication technology will expand beyond backup to include static archive data and inactive primary data

Deduplication – Confusion abounds Different Architectures Source Target Single Instance Storage VTL File Block Sub-block Inline Processing Post Processing Different technologies

Unique data saved to disk Types of deduplication A B C D E Data object #1 Data object #2 F Unique data saved to disk Data deduplication eliminates common data at a file, block, or sub-block level. File aka Single Instance Store or SIS Typically primary storage Block aka Sub-file or Fixed block Typically secondary storage Better dedupe ratios Sub-block aka Variable block or Byte-level Best dedupe ratios Most processor intensive Disk Capacity Required

Deduplication Enables Cost Effective Disk To Disk Backup Shorten backup window and restore faster with B2D – At a cost similar to tape Reduce storage capacity, power, cooling and space requirements Centralize data protection and archive, reducing the burden on offices not staffed to manage it Enable cost-effective DR Backup to Disk with Dedupe 2 Primary Disk Deduplication Replication Backup Archive Secondary Disk

Why optimize disk-based backup with deduplication? B2D without deduplication B2D with deduplication 10:1 ratio Total capacity needed after 3 years (TB’s) 34.6 3.46 # drives needed 36 5 Total storage cost ~$34k ~$8k Example – 20TB of data growing 20% per year How long until the deduplicated storage capacity required equals 3 years without dedupe? Just over 15 years

How deduplication fits into the backup environment Application Servers Backup Server Deduplication Appliance JBOD/NAS/SAN OR Deduplication Here here or Deduplication Server-based (Source) & Integrated (Hybrid) Advantages: Common management Ease of use Can be less expensive solution (lower TCO) Reduces network traffic Global deduplication opportunity Appliance-based (Target) Advantages: Ease of implementation Works with variety of backup SW Disadvantages: Can be more expensive solution Replication target restrictions Greater network traffic overhead Often on proprietary hardware