CTA: CERN Tape Archive Rationale, Architecture and Status

Slides:



Advertisements
Similar presentations
Object Persistency & Data Handling Session C - Summary Object Persistency & Data Handling Session C - Summary Dirk Duellmann.
Advertisements

October Dyalog File Server Version 2.0 Morten Kromberg CTO, Dyalog LTD Dyalog’13.
CASTOR Project Status CASTOR Project Status CERNIT-PDP/DM February 2000.
Grid and CDB Janusz Martyniak, Imperial College London MICE CM37 Analysis, Software and Reconstruction.
Caching the MDSPlus Data via Hibernate By Ajith M Jose Comp6703 Project Client: Raju Karia Supervisor: Dr. Henry Gardner (Development of “WebScope”)
What is it? Hierarchical storage software developed in collaboration with five US department of Energy Labs since 1992 Allows storage management of 100s.
Magda – Manager for grid-based data Wensheng Deng Physics Applications Software group Brookhaven National Laboratory.
Operational Dataset Update Functionality Included in the NCAR Research Data Archive Management System 1 Zaihua Ji Doug Schuster Steven Worley Computational.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 7 Configuring File Services in Windows Server 2008.
File System. NET+OS 6 File System Architecture Design Goals File System Layer Design Storage Services Layer Design RAM Services Layer Design Flash Services.
Distributed File Systems Concepts & Overview. Goals and Criteria Goal: present to a user a coherent, efficient, and manageable system for long-term data.
Experiences and Challenges running CERN's High-Capacity Tape Archive 14/4/2015 CHEP 2015, Okinawa2 Germán Cancio, Vladimír Bahyl
Experiences Deploying Xrootd at RAL Chris Brew (RAL)
Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.
CERN IT Department CH-1211 Genève 23 Switzerland t Tape-dev update Castor F2F meeting, 14/10/09 Nicola Bessone, German Cancio, Steven Murray,
Chapter Oracle Server An Oracle Server consists of an Oracle database (stored data, control and log files.) The Server will support SQL to define.
STEALTH Content Store for SharePoint using Caringo CAStor  Boosting your SharePoint to the MAX! "Optimizing your Business behind the scenes"
7/2/2003Supervision & Monitoring section1 Supervision & Monitoring Organization and work plan Olof Bärring.
RECALL THE MAIN COMPONENTS OF KIM Functional User Interfaces We just looked at these Reference Implementation We will talk about these later Service Interface.
Personal Computer - Stand- Alone Database  Database (or files) reside on a PC - on the hard disk.  Applications run on the same PC and directly access.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
Introduction to DFS. Distributed File Systems A file system whose clients, servers and storage devices are dispersed among the machines of a distributed.
Author - Title- Date - n° 1 Partner Logo EU DataGrid, Work Package 5 The Storage Element.
4/5/2007Data handling and transfer in the LHCb experiment1 Data handling and transfer in the LHCb experiment RT NPSS Real Time 2007 FNAL - 4 th May 2007.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware: gLite Data Management EGEE Tutorial 23rd APAN Meeting, Manila Jan.
Policy Based Data Management Data-Intensive Computing Distributed Collections Grid-Enabled Storage iRODS Reagan W. Moore 1.
CASTOR: CERN’s data management system CHEP03 25/3/2003 Ben Couturier, Jean-Damien Durand, Olof Bärring CERN.
INTRODUCTION TO DBS Database: a collection of data describing the activities of one or more related organizations DBMS: software designed to assist in.
Online Software 8-July-98 Commissioning Working Group DØ Workshop S. Fuess Objective: Define for you, the customers of the Online system, the products.
CASTOR evolution Presentation to HEPiX 2003, Vancouver 20/10/2003 Jean-Damien Durand, CERN-IT.
Process Architecture Process Architecture - A portion of a program that can run independently of and concurrently with other portions of the program. Some.
Jens G Jensen RAL, EDG WP5 Storage Element Overview DataGrid Project Conference Heidelberg, 26 Sep-01 Oct 2003.
Scientific Storage at FNAL Gerard Bernabeu Altayo Dmitry Litvintsev Gene Oleynik 14/10/2015.
Data & Storage Services CERN IT Department CH-1211 Genève 23 Switzerland t DSS New tape server software Status and plans CASTOR face-to-face.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
A proposal: from CDR to CDH 1 Paolo Valente – INFN Roma [Acknowledgements to A. Di Girolamo] Liverpool, Aug. 2013NA62 collaboration meeting.
10 May 2001WP6 Testbed Meeting1 WP5 - Mass Storage Management Jean-Philippe Baud PDP/IT/CERN.
Andrea Valassi (CERN IT-DB)CHEP 2004 Poster Session (Thursday, 30 September 2004) 1 HARP DATA AND SOFTWARE MIGRATION FROM TO ORACLE Authors: A.Valassi,
CASTOR project status CASTOR project status CERNIT-PDP/DM October 1999.
G.Govi CERN/IT-DB 1GridPP7 June30 - July 2, 2003 Data Storage with the POOL persistency framework Motivation Strategy Storage model Storage operation Summary.
ALICE experiences with CASTOR2 Latchezar Betev ALICE.
CERN IT Department CH-1211 Genève 23 Switzerland t The Tape Service at CERN Vladimír Bahyl IT-FIO-TSI June 2009.
CMS: T1 Disk/Tape separation Nicolò Magini, CERN IT/SDC Oliver Gutsche, FNAL November 11 th 2013.
LHCC Referees Meeting – 28 June LCG-2 Data Management Planning Ian Bird LHCC Referees Meeting 28 th June 2004.
(re)-Architecting cloud applications on the windows Azure platform CLAEYS Kurt Technology Solution Professional Microsoft EMEA.
Distributed File System. Outline Basic Concepts Current project Hadoop Distributed File System Future work Reference.
CERN IT Department CH-1211 Genève 23 Switzerland t Increasing Tape Efficiency Original slides from HEPiX Fall 2008 Taipei RAL f2f meeting,
Tape write efficiency improvements in CASTOR Department CERN IT CERN IT Department CH-1211 Genève 23 Switzerland DSS Data Storage.
Industrial Control Engineering Session 1 Introduction  What is RADE  Technology  Palette  Tools  Template  Combined Example  How to get RADE 
CASTOR new stager proposal CASTOR users’ meeting 24/06/2003 The CASTOR team.
CERN IT-Storage Strategy Outlook Alberto Pace, Luca Mascetti, Julien Leduc
Federating Data in the ALICE Experiment
Databases and DBMSs Todd S. Bacastow January 2005.
Jean-Philippe Baud, IT-GD, CERN November 2007
Managing, Storing, and Executing DTS Packages
Simulation Production System
CASTOR Giuseppe Lo Presti on behalf of the CASTOR dev team
CTA: CERN Tape Archive Adding front-ends and back-ends Status report
Ákos Frohner EGEE'08 September 2008
The INFN Tier-1 Storage Implementation
CTA: CERN Tape Archive Overview and architecture
Computing Infrastructure for DAQ, DM and SC
DUCKS – Distributed User-mode Chirp-Knowledgeable Server
A Web-Based Data Grid Chip Watson, Ian Bird, Jie Chen,
Operational Dataset Update Functionality Included in the NCAR Research Data Archive Management System Zaihua Ji Doug Schuster Steven Worley Computational.
Support for ”interactive batch”
Tiers vs. Layers.
CASTOR: CERN’s data management system
Production Manager Tools (New Architecture)
Presentation transcript:

CTA: CERN Tape Archive Rationale, Architecture and Status Germán Cancio, Daniele Kruse, Eric Cano, Steven Murray CASTOR face to face meeting, June 2016 CTA Project

Rationale & objectives CTA :CERN Tape Archive Stand alone tape system Decoupled from disk storage First target EOS, potentially others systems Straightforward architecture with minimum number of elements Session preemption: optimized use of drives Target is LHC Run 3: EOS+CTA as the DAQ interface for all experiments CASTOR face to face meeting, June 2016 CTA Project

Architecture Shared storage concept New queueing system Only 2 daemons: front end for CLI (xrootd based) & taped. New queueing system Based on Ceph Only for transient data Each queue (per tape/tapepool) is an independent object Avoids a single huge queue table Allows storage of rich objects Separate file catalogue Based on usual relational DB For persistent data cta-taped an adapted tapeserverd from CASTOR CASTOR face to face meeting, June 2016 CTA Project

Architecture CTA Catalogue CTA tape server CTA front-end CTA All of the CTA business logic and data management code is in the client interface to metadata CTA Catalogue Evolved from CASTOR Files Routing Commands/control Tape pools Mount policies CTA tape server CTA front-end CTA tape server CTA front-end CTA tape server CTA Queues Drive status Queues Tape Tape Tape Tape Tape Tape Two prototypes Ceph Local file system Data CTA command-line tools CTA command-line tools CTA CLI Disk Disk Remote storage EOS w/workflow Disk Disk CASTOR face to face meeting, June 2016 CTA Project

EOS integration EOS triggers archivals and retrievals Handled by workflow engine Through CLI interface of CTA EOS maintains a stub for each file Namespace belongs to EOS CTA files referenced by unique numeric ID Path only interpreted in EOS EOS does not query CTA CTA maintains Disaster Recovery (DR) data File metadata stored in the catalogue as a DR blob CASTOR face to face meeting, June 2016 CTA Project

Status “Passive EOS” integration achieved end 2015 CTA transfers file in/out of an EOS instance Hand triggered via the CLI Prototype with a full namespace, now removed Some persistent data on Ceph side Namespace based on shared filesystem “Active EOS” integration targeted for 2016 Integration with EOS’s workflow engine Changes needed on both sides New catalogue replaces namespace Queuing system adapted to new catalogue and performance validated Refactoring of cta-taped code Longer term targets Repack Verify Session preemption instead of dedication Saturate the otherwise idle drives with low-priority, high-volume sessions (repack, verify) Yield to higher priority sessions (user access) Disaster recovery CASTOR import (metadata) cta-taped improvements in parallel (low priority) Recommended Access Order (drive-recommended read order) Out of order archiving (writing files in reception from disk order) CASTOR face to face meeting, June 2016 CTA Project