Download presentation
Presentation is loading. Please wait.
Published byAmbrose Armstrong Modified over 8 years ago
1
ddn.com © 2015 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. Replication and Migration Tool 2016/09/07 DataDirect Networks Japan, Inc. Shuichi Ihara
2
ddn.com © 2015 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. Background: Replication and Migration ► Data backup is important Many backup and replication requirements Lustre needs to be migrated from old to new system Time to think backup for few PB Lustre system Lustre Replication is not ready yet. ► Lustre tiering is possible Data Migration from Fast HDD (or SSD) to slow disks is possible o Different from fully automated HSM Lack of user space utilities 2
3
ddn.com © 2015 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. Lustre data replication and migration ► Data replication (Backup) Two independent and different namespaces (file system) Asynchronous file level replication Client can mount both filesystems and copy the data ► Data migration Migrate data from a storage device to another type of device (e.g. SSD or fast HDD to slow HDD) Keep same metadata and namespace Application access data transparently even after migration Similar idea of HSM, but no file stub and data recall Introduce two utilities (ldsync and ldmigrate) for data replication and migration in Lustre. 3
4
ddn.com © 2015 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. Challenges on Backup and Replication ► Two major challenges on Backup and Replication at large file system "Delta" detection between file systems o Determined by file attributes (mtimes, size, checksum, etc) o Depends on how many files are in the file systems Data copy speed o Copying many large files, as well large single shared file o Must have dynamic process allocation for copy 4 Primary Lustre File system Primary Lustre File system Backup Lustre File system Backup Lustre File system Data Copy Delta Detection
5
ddn.com © 2015 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. Major Copy Tool : Rsync and DCP ► Rsync Has been be maintained more than 10 years and packaged in Linux distribution. Supports many features, but lack of parallelization ► Have a look http://fileutils.io File utilities designed for scalability and performance Started as collaboration efforts among several large US laboratories and DDN, that was involved at the beginning ► DCP Part of fileutils and parallel copy tool with MPI Support any POSIX file system Manage chunk of file and efficient MPI rank allocation for copy and maximize resource utilization 5
6
ddn.com © 2015 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. Accelerate File system's delta detection ► "Diff" detections of two file systems Rsync takes few hours for delta detection at tens of millions of inode in the file system DCP (DCMP) in fileutils is much faster, but still consume a lot of time and metadata pressure ► Lustre Changelog rescue Lustre Changelog records events the file system namespace or file metadata. (Timestamp, FID and operation) Keep in MDT and it can fetch from Lustre client No more file system scanning except initial copy! 6
7
ddn.com © 2015 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ldsync : Parallel synchronization tool based on Lustre changelog ► Similar tool lustre_rsync is exist, but... Single thread and still based on rsync Partial changelog support ► ldsync is a replacement of lustre_rsync A parallel synchronization framework includes Lustre changelog analyzer Changelog analyzer walk through Changelog and invoke minimum stat() call to determine files to be synced Flexible backend copy tool support (Use DCP for now, but any native copy tool possible) 7
8
ddn.com © 2015 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. Architecture of ldsync 1. Fetch Changelog from MDT and analyze to determine what file needs to be synchronized 2. Minimum stat() call to MDS to get additional metadata information with FID in Changelog 3. Copy files by "dcp" and also unlink files from backup file system by "drm" (same idea of --delete" in rsync) 4. Clear old Changelog 8 Primary Lustre File system Primary Lustre File system Backup Lustre File system Backup Lustre File system Data Mover Data Mover mounts both primary(ro) and backup file system
9
ddn.com © 2015 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. Delta detection speed in two file systems 9 25x reduction
10
ddn.com © 2015 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ldsync : Experimental performance (Many small files) 10 Changelog analyzer is still serialized. We are improving this right now. 1M x 4KB file creations and synchronize two file systems
11
ddn.com © 2015 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ldsync : Experimental performance (1TB single shared file) 11 Primary System : 1 x DDN ES7K(140 x NLSAS), 2 x FDR Backup System : 1 x DDN ES7K(140 x NLSAS), 2 x FDR DDN Lustre 3.0 (IEEL3.0 + Extra patches) 9x reduction ~80% BW of H/W(10GB/sec)
12
ddn.com © 2015 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ldmigrate : Parallel data migration tool for Lustre ► Keep metadata, but change OST object placement e.g) Migrate data from SSD OST pool to HDD "lfs migrate" can do it, but limited scalability ► ldmigrate Migrate OST objects to another OSTs (OST pool) based on Lustre data layout (determines how to place data to OSTs) Parallelization and scalability Integration with Job Scheduler is possible 12
13
ddn.com © 2015 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. Architecture of ldmigrate 13 OST Pool SSD MDT Acquire grouplock and start copy data to tmp files by dcp OST OST Pool HDD OST FileA /scratch/Tier1/ object TmpFile object OST Pool SSD MDT OST OST Pool HDD OST FileA /scratch/Tier1/ object TmpFile object Swap Layout and FileA objects are now in OSTs of OST pool HDD and remove tmp files After finish copy Parallel data copy Swap Lustre layout
14
ddn.com © 2015 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. How Setup Tiered OST Pool and Migration ► Creating OST pool for different type of device [root@mds ~]# lctl pool_new scratch.SDD [root@mds ~]# lctl pool_new scratch.HDD [root@mds ~]# lctl pool_add scratch.SSD OST[0-9] [root@mds ~]# lctl pool_add scratch.HDD OST[a-13] ► Assassin OST pool to directory [root@client ~]# lfs setstripe –p SSD /scratch/tier1 [root@client ~]# lfs getstripe -p /scratch/tier1/file* SSD ► Copy and layout change [root@client ~]# dcp2 -g 100 -m -o SSD /scratch/tier1 /scratch/tmp [root@client ~]# lfs getstripe -p /scratch/tier1/file* HDD 14
15
ddn.com © 2015 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. Conclusions ► Introduced ldsync and ldmigrate for backup and data migration. ► Demonstrated huge performance improvements compared to existing tools and techniques. ► Still investigating several performance optimization and stability. It also require more tests ► These tools still under private repository at Github, but we plan to publish as open source or push patches to fileutils. 15
16
ddn.com © 2015 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. 16 Thank you!
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.