Scalable sync-and-share service with dCache

Slides:



Advertisements
Similar presentations
Data Storage Solutions Module 1.2. Data Storage Solutions Upon completion of this module, you will be able to: List the common storage media and solutions.
Advertisements

Windows Server ® 2008 File Services Infrastructure Planning and Design Published: June 2010 Updated: November 2011.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Enigma Data’s SmartMove.
Chapter 9 Chapter 9: Managing Groups, Folders, Files, and Object Security.
File Management Systems
Oracle 10g Database Administrator: Implementation and Administration
11 SHARING FILE SYSTEM RESOURCES Chapter 9. Chapter 9: SHARING FILE SYSTEM RESOURCES2 CHAPTER OVERVIEW  Create and manage file system shares and work.
Lesson 1: Configuring Network Load Balancing
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 7 Configuring File Services in Windows Server 2008.
NovaBACKUP 10 xSP Technical Training By: Nathan Fouarge
11 SHARING FILE SYSTEM RESOURCES Chapter 9. Chapter 9: SHARING FILE SYSTEM RESOURCES2 CHAPTER OVERVIEW Create and manage file system shares and work with.
Configuring File Services Lesson 6. Skills Matrix Technology SkillObjective DomainObjective # Configuring a File ServerConfigure a file server4.1 Using.
Experiences Deploying Xrootd at RAL Chris Brew (RAL)
Chapter 10 : Designing a SQL Server 2005 Solution for High Availability MCITP Administrator: Microsoft SQL Server 2005 Database Server Infrastructure Design.
Database Services for Physics at CERN with Oracle 10g RAC HEPiX - April 4th 2006, Rome Luca Canali, CERN.
San Diego Supercomputer Center National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center National Partnership for Advanced.
Module 11: Implementing ISA Server 2004 Enterprise Edition.
1 Administering Shared Folders Understanding Shared Folders Planning Shared Folders Sharing Folders Combining Shared Folder Permissions and NTFS Permissions.
Evolution, by tackling new challenges| CHEP 2015, Japan | Patrick Fuhrmann | 16 April 2015 | 1 Patrick Fuhrmann On behave of the project team Evolution,
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Introduction to HDFS Prasanth Kothuri, CERN 2 What’s HDFS HDFS is a distributed file system that is fault tolerant, scalable and extremely easy to expand.
Configuring File Services. Using the Distributed File System Larger enterprises typically use more file servers Used to improve network performce Reduce.
Owen SyngeTitle of TalkSlide 1 Storage Management Owen Synge – Developer, Packager, and first line support to System Administrators. Talks Scope –GridPP.
WP19 DESY Development Plan Frank Schlünzen Jürgen Starek.
Introduction to HDFS Prasanth Kothuri, CERN 2 What’s HDFS HDFS is a distributed file system that is fault tolerant, scalable and extremely easy to expand.
CASTOR project status CASTOR project status CERNIT-PDP/DM October 1999.
1 5/4/05 Fermilab Mass Storage Enstore, dCache and SRM Michael Zalokar Fermilab.
An Introduction to GPFS
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
OSIsoft High Availability PI Replication Colin Breck, PI Server Team Dave Oda, PI SDK Team.
Configuring the User and Computer Environment Using Group Policy Lesson 8.
Open Object Storage as the Foundation for Open Sync and Share in Science Simon Traill – © SwiftStack All.
CERN IT-Storage Strategy Outlook Alberto Pace, Luca Mascetti, Julien Leduc
dCache Paul Millar, on behalf of the dCache Team
Parallel Virtual File System (PVFS) a.k.a. OrangeFS
SharePoint 101 – An Overview of SharePoint 2010, 2013 and Office 365
File Syncing Technology Advancement in Seafile -- Drive Client and Real-time Backup Server Johnathan Xu CTO, Seafile Ltd.
That this presentation about ?
Configuring File Services
Reducing Risk with Cloud Storage
WP18, High-speed data recording Krzysztof Wrona, European XFEL
Lesson 4: Configuring File and Share Access
Vincenzo Spinoso EGI.eu/INFN
Unified Data Access and MGMT. in Distributed hybrid Cloud
Section 6 Object Storage Gateway (RADOS-GW)
dCache “Intro” a layperson perspective Frank Würthwein UCSD
StoRM Architecture and Daemons
dCache Scientific Cloud
Introduction to Data Management in EGI
Patrick Fuhrmann (DESY) Benjamin Ertl (KIT) Maciej Brzezniak (PSNC)
VIDIZMO Deployment Options
Introduction to Computers
Introduction to Networks
Introduction to Networks
DCache things Paul Millar … on behalf of the dCache team.
Computing Infrastructure for DAQ, DM and SC
dCache: new and exciting features
Research Data Archive - technology
Storage Virtualization
Get z0-499 Actual Tests - 1z0-499 Actual Dumps PDF - Dumps4download.us
Real IBM C exam questions and answers
2018 Huawei H Real Questions Killtest
File System Management and Fault Tolerance
The Onedata platform Konrad Zemek, Krzysztof Trzepla ACC Cyfronet AGH
O.S Lecture 14 File Management.
WebDAV Design Overview
THE GOOGLE FILE SYSTEM.
Distributed Systems and Concurrency: Distributed Systems
Presentation transcript:

Scalable sync-and-share service with dCache Paul Millar, on behalf of the dCache team Workshop on Cloud Services for Synchronisation and Sharing (CS3) Amsterdam, The Netherlands 2017-01-30 .. 2017-02-01 https://cs3.surfsara.nl/ Direct NFS mount of dCache Looks like a regular file-system; in use at DESY-Cloud Direct dCache WebDAV support: Similar to NFS, but supporting more authn options (e.g., X.509, OpenID-Connect) Tie QoS to directory; e.g., 'important' folder – multiple disk copies, 'archive' folder – contents written to tape.

What is this talk about? Web-client Web-client Sync-client Sync-client 𝑥Cloud 𝑥Cloud NFS Filesystem dCache

What is this 'dCache' cloud thing? Managed distributed (aggregation of heterogeneous) storage. Mature open-source project, core team based at three organisations. In production for over 15 years and deployed throughout the world. Instances with tens-of-Petabytes of data, O(10^8) files. Different QoS within a single dCache instance: Support different media types: random-IO (SSD, HDD, CEPH), whole-file copying (HSM/tape, S3, …) Support different replication policies. Data placement / caching policies. Through INDIGO-DataCloud (Horizon-2020) project: Working within RDA to standardise QoS definitions, Working within SNIA to standardise QoS access models, Working within dCache to improve QoS support.

Media locality QoS in dCache Different QoS chosen by directory; e.g., Different users have different QoS, A user chooses which QoS by writing file into different directory. Single copy: Redundant copies: … + + All QoS options available in 𝑥Cloud: Media placement, geographical locality, replication factor, … Principally admin configured: User chooses into which directory they write. Target directory chooses which policy is used: 'tape' directory goes to tape, 'redundant' gets multiple copies Distinct nodes | racks | buildings | campuses | … Disk + tape copy: +

QoS media type support: CEPH dCache can use any POSIX storage Typically RAID storage with some local filesystem. Admins wanted to be able to use CEPH. Supported: 1x storage-node per CEPH pool. Adding redundant storage-nodes Also benefits cluster file-systems: luster, GPFS, …

Geographic placement QoS, part #1

Geographic placement QoS, part #2

Geographic placement QoS, part #3

Updates in dCache: hardening service Matching NFS semantics to 𝑥Cloud expectations: Allow file truncation (for when 𝑥Cloud overwriting existing files), Better support for multi-open (multiple sync- clients), Implementing more advanced features from the NFS specification. Real-life: conditions / operation intervention: Better support for planned storage-node intervention, Better user experience after unintended storage-node outage/restart, More stable IO when network is flaky. High Availability dCache: All dCache components may be deployed redundantly, Load-balancing for core components, Users unaware of (patch-level) upgrades and (planned) hardware intervention.

Looking towards the future

dCache vision 𝑥Cloud & dCache duplicated namespace: Pain to keeping these in sync. Shared files knowledge only in 𝑥Cloud layer: Shared files are not visible when interacting directly with dCache

Teaching dCache about sharing Users can share files or directories Can share what you own + those you've been given permission (e.g., ACL ADMINISTRATE bit). SHARE object is written in other user(s) 'incoming' directory (e.g., home directory) Only receiving user sees the SHARE Can rename, move, delete, cd/read; similar to regular sym-link. SHARE has share-mode for access control: share can be read-only or read-write. Independent of the SHARE's filesystem permissions. Renaming source does not affect SHARE. Unshare a subset of recipients removes their SHAREs. Removing the source also removes all SHAREs

Teaching dCache about sharing Users can share files or directories Can share what you own + those you've been given permission (e.g., ACL ADMINISTRATE bit). SHARE object is written in other user(s) 'incoming' directory (e.g., home directory) Only receiving user sees the SHARE Can rename, move, delete, cd/read; similar to regular sym-link. SHARE has share-mode for access control: share can be read-only or read-write. Independent of the SHARE's filesystem permissions. Renaming source does not affect SHARE. Unshare a subset of recipients removes their SHAREs. Removing the source also removes all SHAREs

QoS improvements Standardising QoS description: Increasing choice: Work coming through the INDIGO-DataCloud EU Horizon-2020 project. Standardisation (SNIA, RDA), Improved flexibility: more QoS options, Moves decision from admin into users. What it looks like: RESTful interface to dCache An 𝑥Cloud application to drive RESTful interface Examples: User chooses desired QoS for a directory, Change QoS later on: this data is actually important (multiple copies, store on tape, etc). Expose choice to sync-n-share users:

Summary NFS hardened from extensive use in real-world conditions Extending QoS options in dCache. Plan to: Expose QoS options in 𝑥Cloud, Implementing SHARE in dCache, Reduce dependency on 𝑥Cloud database, Investigate getDirectDownload for direct- from-dCache client HTTP transfers Finish sync-client support within dCache.

Backup slides

Solving them: teaching dCache to share Users can share files or directories Can share what you own + those you've been given permission (e.g., ACL ADMINISTRATE bit). SHARE object is written in other user(s) 'incoming' directory (e.g., home directory) Only receiving user sees the SHARE Can rename, move, delete, cd/read; similar to regular sym-link. SHARE has share-mode for access control: share can be read-only or read-write. Independent of the SHARE's filesystem permissions. Renaming source does not affect SHARE. Unshare a subset of recipients removes their SHAREs. Removing the source also removes all SHAREs