Deploying disk deduplication for Hyper-V 3.0
Žigmund Maťašovský



Content
- Deduplication in Windows Server 2012
- Hyper-V and files
- How Hyper-V and deduplication work together
- Performance
- "Eye-opener"
- Questions and example

What deduplication is
- Goal: use less storage space
- Method: ensure that identical content is stored only once per volume
- How it works:
  - Post-process
  - Based on variable block size
  - Transparent to applications
  - Selective compression
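The "stored only once" idea can be illustrated with a minimal sketch. It is not how the Windows feature is implemented: the `ChunkStore` class, the fixed 4-byte chunk size, and the use of SHA-256 as the chunk identifier are all assumptions chosen to keep the example small.

```python
import hashlib


class ChunkStore:
    """Toy chunk store: every distinct chunk is kept exactly once."""

    def __init__(self):
        self.chunks = {}  # digest -> chunk bytes
        self.files = {}   # file name -> ordered list of chunk digests

    def add_file(self, name, data, chunk_size=4):
        digests = []
        for offset in range(0, len(data), chunk_size):
            piece = data[offset:offset + chunk_size]
            digest = hashlib.sha256(piece).hexdigest()
            # Store the chunk only if this content has not been seen before.
            self.chunks.setdefault(digest, piece)
            digests.append(digest)
        self.files[name] = digests

    def read_file(self, name):
        # Reassemble the file from its chunk references.
        return b"".join(self.chunks[d] for d in self.files[name])

    def stored_bytes(self):
        return sum(len(c) for c in self.chunks.values())


store = ChunkStore()
store.add_file("a.bin", b"AAAABBBBCCCC")
store.add_file("b.bin", b"AAAABBBBDDDD")
print(store.stored_bytes())  # 16 bytes stored for 24 bytes of file data
```

The two files share their first two chunks, so only four distinct chunks are kept, while each file is still read back unchanged. That is the sense in which deduplication is transparent to applications.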

"Saving space" in previous versions
- NTFS compression
  - Works on a single file
  - Real compression; writes are CPU intensive
- Single Instance Store (SIS)
  - File-based
  - (WDS)
- NTFS hard links
  - File-based
  - Not transparent to applications

Dedup modes
- Where dedup works:
  - Source (RDC)
  - Target
- When dedup works:
  - Inline (NTFS compression; slower writes)
  - Post-process
- Which object is deduplicated:
  - File (SIS)
  - Block (fixed size, cluster, ...)
  - Chunk (variable block size, ...)

Dedup architecture
- File system driver
- Dedup services:
  - Data Deduplication Service
  - Data Deduplication VSS
- Jobs
- Not supported:
  - Removable devices
  - CSV
  - System disk
  - Remotely mounted devices

How dedup works
- Post-process
  - Preserves latency and throughput of primary (on-the-fly) data access
  - Flexibility in scheduling background jobs on cold data
  - Jobs: optimize, delete, check/repair
  - The optimize job needs no input/output operations on already deduplicated files
- Based on variable block size (chunks)
  - Calculates a Rabin fingerprint hash over a sliding window
  - Declares chunk boundaries
- Selective compression
- 48 bytes of metadata per chunk
- Organized in a log structure
- Also uses a heuristic algorithm
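The sliding-window chunking step can be sketched as follows. This is an illustration of content-defined chunking in general, not the algorithm Windows uses: a simple polynomial rolling hash stands in for the Rabin fingerprint, and `WINDOW`, `MASK`, `MIN_CHUNK`, and `MAX_CHUNK` are assumed values picked for the demo.

```python
import hashlib
import random

WINDOW = 48            # sliding-window width in bytes (assumed)
BASE = 257
MOD = (1 << 61) - 1
MASK = (1 << 12) - 1   # a boundary roughly every 4 KiB on average (assumed)
MIN_CHUNK = 1024       # assumed lower bound on chunk size
MAX_CHUNK = 16384      # assumed upper bound on chunk size


def chunk_boundaries(data):
    """Return the end offset of every chunk in data."""
    top = pow(BASE, WINDOW - 1, MOD)  # weight of the byte leaving the window
    cuts, start, h = [], 0, 0
    for i, byte in enumerate(data):
        if i >= WINDOW:
            # Drop the byte that slid out of the window.
            h = (h - data[i - WINDOW] * top) % MOD
        h = (h * BASE + byte) % MOD
        length = i + 1 - start
        # Declare a boundary when the window hash matches the pattern,
        # or when the chunk has reached the maximum size.
        if length >= MAX_CHUNK or (length >= MIN_CHUNK and (h & MASK) == 0):
            cuts.append(i + 1)
            start = i + 1
    if start < len(data):
        cuts.append(len(data))
    return cuts


def split(data):
    """Split data into variable-size chunks."""
    chunks, prev = [], 0
    for cut in chunk_boundaries(data):
        chunks.append(data[prev:cut])
        prev = cut
    return chunks


rng = random.Random(0)
original = bytes(rng.getrandbits(8) for _ in range(100_000))
edited = b"X" + original  # insert a single byte at the front

ids_before = {hashlib.sha256(c).digest() for c in split(original)}
ids_after = {hashlib.sha256(c).digest() for c in split(edited)}
print(len(ids_before), "chunks,", len(ids_before & ids_after), "unchanged after the insert")
```

Because a boundary depends only on the bytes inside the window, an insertion usually disturbs the chunks near the edit and the later boundaries fall back into the same places. With fixed-size blocks, the same one-byte insert would shift every following block.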

How dedup works – detail I.
- Identify duplicate data:

How dedup works – detail II.
- Optimize target files:

How to manage dedup
- GUI
  - Server Manager (RSAT)
- Command line
  - Ddpcli.exe
  - DdpEval.exe
- PowerShell:
  - Disable-DedupVolume
  - Enable-DedupVolume
  - Get-DedupStatus
  - Get-DedupVolume
  - Set-DedupVolume
  - Update-DedupStatus
  - Get-DedupMetadata
  - Measure-DedupFileMetadata
- Jobs:
  - Get-DedupJob
  - Start-DedupJob
  - Stop-DedupJob
- Schedule:
  - Get-DedupSchedule
  - New-DedupSchedule
  - Remove-DedupSchedule
  - Set-DedupSchedule

Dedup – known problems
- How do you create a duplicate of a file (only metadata is created)?
- Scrubbing
  - Repair from a redundant copy
  - Hotspots
- Garbage collection
- Space needed for the deduplication process
- Event log
- Monitoring using SCOM 2012 (MP File Server 2012)
- Backup problems

Hyper-V & files
- Formats: .vhd, .vhdx
- Types:
  - Fixed size: thin provisioning is possible on Storage Spaces
  - Dynamically expanding: needs compacting if space has to be freed
  - Differencing: read-only parent disk(s)

Hyper-V & storage
- Local
- iSCSI
- SMB 3
- CSV
- Storage Spaces

Hyper-V & dedup I.
- Dedup on the HOST
  - Not recommended
    - Because of the performance impact (for standard disks)
    - Because deduplication requires that the file is not in use
- Dedup in the GUEST
  - Problem with compacting a dynamic .vhd

Hyper-V & dedup II.
- Approach for TESTING environments & LABS:
  - VHD parent file (golden image)
    - Deduplicated volume, high read performance
    - Low-cost or old-model SSD
  - Use differencing disks
    - Non-deduplicated storage
    - With high write IOPS / throughput -> (intel DS...)
    - Over time the diff file grows too fast (changing one byte requires saving the whole file in the diff)
    - Needs manually scripted management
  - Separate SWAP disk (or switch off swap): no dedup
  - Separate data disk (mssql, exch, ...): no dedup because of the performance impact
  - Do not use BitLocker in the GUEST

Hyper-V & dedup III.
- Possibilities:
  - Use an iSCSI .vhd (fixed) file on a deduplicated volume
    - On a "standard" volume
    - On a Storage Spaces thin-provisioned volume
  - .VHD on dedup storage
    - DedupJob with VMs in saved state
    - ...

Performance – capacity / resources
- Optimizing:
  - Memory: 30 MB to 1 GB (depends on chunk count)
  - CPU: 30-40%
  - Disk usage: median disk queue = 0
- Saved space (parent .VHD):
  - Capacity: GB
  - FreeSpace: 8.08 GB
  - UsedSpace: GB
  - UnoptimizedSize: GB
  - SavedSpace: GB
  - SavingsRate: 86 %
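The relationship between these fields can be sketched with a little arithmetic. The sketch assumes that UnoptimizedSize equals UsedSpace plus SavedSpace and that SavingsRate is SavedSpace as a percentage of UnoptimizedSize. The sizes used below are hypothetical, since the slide's own values did not survive in the transcript.

```python
def savings_rate(used_gb, saved_gb):
    """Savings rate in percent, assuming UnoptimizedSize = UsedSpace + SavedSpace."""
    unoptimized_gb = used_gb + saved_gb
    if unoptimized_gb == 0:
        return 0.0
    return 100.0 * saved_gb / unoptimized_gb


# Hypothetical volume: 14 GB still used on disk, 86 GB saved by deduplication.
print(savings_rate(14, 86))  # 86.0
```

Under that assumption, a rate of 86 % means the deduplicated parent image occupies roughly one seventh of its unoptimized size.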

Performance I.
- My performance-test reality:
  - 3 different virtual machines
  - About 10% faster (VMM Library store), read-only
  - About 30% slower (WDS), read-write
- Problems:
  - Non-deterministic (partially heuristic algorithm); a repeated operation can give a different result
  - The cache stores chunks
  - Highly dependent on the device's IOPS characteristics
  - Non-deduplicated file -> sequential operations are possible
  - Deduplicated file -> chunks are fragmented (partially compressed)
  - Average chunk size: 80 KB

Performance – Data access

Eye-opener
- Dedup in Windows 8
  - The "same" kernel as Windows Server 2012
  - Licensing (?)
- Install using DISM
  - dism /online /add-package /packagepath:.....
  - dism /online /enable-feature /featurename:Dedup-Core /all
- Manage using PowerShell
  - Import-Module Deduplication
  - Enable-DedupVolume S:
  - Set-DedupVolume S: -MinimumFileAgeDays 0
  - Start-DedupJob S: -Type Optimization
  - Get-DedupJob
  - Get-DedupStatus
  - Get-DedupVolume S: | FL
- Back up using wbadmin

Summary
- Do not use in a production environment
- Good for labs, testing environments, and notebooks
- Good for saving space on SSDs
- (Wait for the next version)

Questions and/or example
- Dedup on volume S:
  - Ddpeval S:\
  - Enable-DedupVolume S:
  - Get-DedupVolume S: | FL (note MinimumFileAgeDays)
  - Set-DedupVolume S: -MinimumFileAgeDays 0
  - Start-DedupJob S: -Type Optimization
  - Get-DedupJob
  - Get-DedupVolume S: | FL


Thank you for your attention