Data Management: US Focus Kaushik De, Armen Vartapetian Univ. of Texas at Arlington US ATLAS Facility, SLAC Apr 7, 2014.

Slides:



Advertisements
Similar presentations
Management of User Requested Data in US ATLAS Armen Vartapetian University of Texas, Arlington US ATLAS Distributed Facility Workshop UC Santa Cruz, November.
Advertisements

 Management has become a multi-faceted complex task involving:  Storage Management  Content Management  Document Management  Quota Management.
15/07/2010Swiss WLCG Operations Meeting Summary of the last GridKA Cloud Meeting (07 July 2010) Marc Goulette (University of Geneva)
1 of 7 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
Chapter 10 Chapter 10: Managing the Distributed File System, Disk Quotas, and Software Installation.
Sharepoint Portal Server Basics. Introduction Sharepoint server belongs to Microsoft family of servers Integrated suite of server capabilities Hosted.
Tier-0: Preparations for Run-2 Armin NAIRZ (CERN) ADC Technical Interchange Meeting Chicago, 29 October 2014.
19 February CASTOR Monitoring developments Theodoros Rekatsinas, Witek Pokorski, Dennis Waldron, Dirk Duellmann,
December 17th 2008RAL PPD Computing Christmas Lectures 11 ATLAS Distributed Computing Stephen Burke RAL.
Alexandre A. P. Suaide VI DOSAR workshop, São Paulo, 2005 STAR grid activities and São Paulo experience.
The National Grid Service User Accounting System Katie Weeks Science and Technology Facilities Council.
Designing Group Security Designing security groups Designing user rights.
Integration Program Update Rob Gardner US ATLAS Tier 3 Workshop OSG All LIGO.
Module 9 Configuring Messaging Policy and Compliance.
Support.ebsco.com My EBSCOhost Tutorial Tutorial.
YAN, Tian On behalf of distributed computing group Institute of High Energy Physics (IHEP), CAS, China CHEP-2015, Apr th, OIST, Okinawa.
InstantGMP: Electronic Batch Records System for GMP Manufacturing InstantGMP™ Inventory Control Module for GMP Manufacturing.
Introduction: Distributed POOL File Access Elizabeth Gallas - Oxford – September 16, 2009 Offline Database Meeting.
Tier 3 Data Management, Tier 3 Rucio Caches Doug Benjamin Duke University.
K. De UTA Grid Workshop April 2002 U.S. ATLAS Grid Testbed Workshop at UTA Introduction and Goals Kaushik De University of Texas at Arlington.
Large Scale Test of a storage solution based on an Industry Standard Michael Ernst Brookhaven National Laboratory ADC Retreat Naples, Italy February 2,
Tier 1 Facility Status and Current Activities Rich Baker Brookhaven National Laboratory NSF/DOE Review of ATLAS Computing June 20, 2002.
Your university or experiment logo here Caitriana Nicholson University of Glasgow Dynamic Data Replication in LCG 2008.
PanDA A New Paradigm for Computing in HEP Kaushik De Univ. of Texas at Arlington NRC KI, Moscow January 29, 2015.
DDM-Panda Issues Kaushik De University of Texas At Arlington DDM Workshop, BNL September 29, 2006.
PanDA Summary Kaushik De Univ. of Texas at Arlington ADC Retreat, Naples Feb 4, 2011.
А.Минаенко Совещание по физике и компьютингу, 03 февраля 2010 г. НИИЯФ МГУ, Москва Текущее состояние и ближайшие перспективы компьютинга для АТЛАСа в России.
MAGDA Roger Jones UCL 16 th December RWL Jones, Lancaster University MAGDA  Main authors: Wensheng Deng, Torre Wenaus Wensheng DengTorre WenausWensheng.
ATLAS in LHCC report from ATLAS –ATLAS Distributed Computing has been working at large scale Thanks to great efforts from shifters.
Module 7 : Configuration I Jong S. Bok
US ATLAS Computing Operations Kaushik De University of Texas At Arlington US ATLAS Distributed Facility Workshop at SLAC October 13, 2010.
Architecture and ATLAS Western Tier 2 Wei Yang ATLAS Western Tier 2 User Forum meeting SLAC April
09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.
Chapter 10 Chapter 10: Managing the Distributed File System, Disk Quotas, and Software Installation.
US ATLAS Computing Operations Kaushik De University of Texas At Arlington U.S. ATLAS Tier 2/Tier 3 Workshop, FNAL March 8, 2010.
Nurcan Ozturk University of Texas at Arlington US ATLAS Transparent Distributed Facility Workshop University of North Carolina - March 4, 2008 A Distributed.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Ricardo Rocha CERN (IT/GS) EGEE’08, September 2008, Istanbul, TURKEY Experiment.
AliEn AliEn at OSC The ALICE distributed computing environment by Bjørn S. Nilsen The Ohio State University.
SLACFederated Storage Workshop Summary For pre-GDB (Data Access) Meeting 5/13/14 Andrew Hanushevsky SLAC National Accelerator Laboratory.
PD2P The DA Perspective Kaushik De Univ. of Texas at Arlington S&C Week, CERN Nov 30, 2010.
EGI-InSPIRE EGI-InSPIRE RI DDM solutions for disk space resource optimization Fernando H. Barreiro Megino (CERN-IT Experiment Support)
PanDA & BigPanDA Kaushik De Univ. of Texas at Arlington BigPanDA Workshop, CERN October 21, 2013.
CD FY09 Tactical Plan Status FY09 Tactical Plan Status Report for Neutrino Program (MINOS, MINERvA, General) Margaret Votava April 21, 2009 Tactical plan.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Data Management Highlights in TSA3.3 Services for HEP Fernando Barreiro Megino,
ATLAS Distributed Computing perspectives for Run-2 Simone Campana CERN-IT/SDC on behalf of ADC.
U.S. ATLAS Facility Planning U.S. ATLAS Tier-2 & Tier-3 Meeting at SLAC 30 November 2007.
The National Grid Service User Accounting System Katie Weeks Science and Technology Facilities Council.
Shifters Jamboree Kaushik De ADC Jamboree, CERN December 4, 2014.
Distributed Physics Analysis Past, Present, and Future Kaushik De University of Texas at Arlington (ATLAS & D0 Collaborations) ICHEP’06, Moscow July 29,
Dynamic staging to a CAF cluster Jan Fiete Grosse-Oetringhaus, CERN PH/ALICE CAF / PROOF Workshop,
CMS: T1 Disk/Tape separation Nicolò Magini, CERN IT/SDC Oliver Gutsche, FNAL November 11 th 2013.
Dynamic Data Placement: the ATLAS model Simone Campana (IT-SDC)
Future of Distributed Production in US Facilities Kaushik De Univ. of Texas at Arlington US ATLAS Distributed Facility Workshop, Santa Cruz November 13,
Main parameters of Russian Tier2 for ATLAS (RuTier-2 model) Russia-CERN JWGC meeting A.Minaenko IHEP (Protvino)
ATLAS Distributed Computing ATLAS session WLCG pre-CHEP Workshop New York May 19-20, 2012 Alexei Klimentov Stephane Jezequel Ikuo Ueda For ATLAS Distributed.
PanDA & Networking Kaushik De Univ. of Texas at Arlington ANSE Workshop, CalTech May 6, 2013.
ATLAS Computing Wenjing Wu outline Local accounts Tier3 resources Tier2 resources.
PD2P Planning Kaushik De Univ. of Texas at Arlington S&C Week, CERN Dec 2, 2010.
PD2P, Caching etc. Kaushik De Univ. of Texas at Arlington ADC Retreat, Naples Feb 4, 2011.
BigPanDA Status Kaushik De Univ. of Texas at Arlington Alexei Klimentov Brookhaven National Laboratory OSG AHM, Clemson University March 14, 2016.
Computing Operations Roadmap
Simone Campana CERN-IT
U.S. ATLAS Tier 2 Computing Center
BNL Tier1 Report Worker nodes Tier 1: added 88 Dell R430 nodes
The Data Lifetime model
The ADC Operations Story
ADC Requirements and Recommendations for Sites
ATLAS STEP09 UK T2 Activity
Roadmap for Data Management and Caching
Presentation transcript:

Data Management: US Focus Kaushik De, Armen Vartapetian Univ. of Texas at Arlington US ATLAS Facility, SLAC Apr 7, 2014

Introduction  We are at midway point of LS1  Numerous improvements in computing underway  Need to be ready for new challenges during Run 2  New systems: ProdSys2, Rucio, Grid->Clouds, HPC  Are we ready for Run 2 Data Management?  Rucio migration coming soon – need extensive testing  Biggest challenge will be Tier 1 storage shortage  Tier 2 storage should be ok – to start with  The most important items?  Need new ADC driven data management and data distribution plans  Need automated tools for managing US user data Apr 7, 2014 Kaushik De 2

Apr 7, 2014 Kaushik De 3

Apr 7, 2014 Kaushik De 4

Apr 7, 2014 Kaushik De 5

Space Tokens  Maybe we will be able to simplify tokens after Rucio  For now they are necessary – for accounting, deletions…  ADC managed:  DATADISK/DATATAPE  GROUPDISK  SCRATCHDISK  Locally managed:  PRODDISK  USERDISK  LOCALGROUPDISK Apr 7, 2014 Kaushik De 6

Lack of Cache Space on DATADISK Apr 7, 2014 Kaushik De 7

BNL DATADISK Apr 7, 2014 Kaushik De 8

US Tier 2 DATADISK Apr 7, 2014 Kaushik De 9

DATADISK at US Tier 2’s Apr 7, 2014 Kaushik De 10

TAPE Will be Crucial for Run 2 Apr 7, 2014 Kaushik De 11

SCRATCHDISK Apr 7, 2014 Kaushik De 12

GROUPDISK Apr 7, 2014 Kaushik De 13

PRODDISK  Migration from pandamover  Migration to Rucio Apr 7, 2014 Kaushik De 14

USERDISK  Managed locally in the US  No change to current policy Apr 7, 2014 Kaushik De 15

Proposed US LOCALGROUPDISK Policy  Standard Policy on Total Used Space:  Allow 3 TB (TBD) per user per site in US facilities  If > 3 TB used, send automated warning s  Exceptions Policy for Total Used Space:  User needs to fill web form if they need more than standard limit  Automatic exceptions granted for 20 TB at one site, 30 TB total US  Exceptions will expire after duration specified by user  Exceptional cases (outside above policies):  Must be approved by RAC  Last Access Time Policy:  If data not used for more than 1 year (TBD), send warning  Multiple Replicas Policy:  If more than 7 replicas grid-wide, send warning s  Group Usage Policy:  If data is appropriate for placement on DATA/GROUPDISK, send Apr 7, 2014 Kaushik De 16

Current Status  Total used – 1.1 PB  Extensive cleaning done recently (>700 TB)  Need to automate management  Quotas per user/site  Deletions and exceptions Apr 7, 2014 Kaushik De 17

Current Tools Apr 7, 2014 Kaushik De 18 Developed by H. Ito

Future Tools  Need a new Localgroupdisk management system  Under development – extending Hiro’s tools  Database backend to keep historical space usage by user  Database backend to keep track of allowed exceptions  Web frontend for users  System to send warning s  Provide summary statistics and monitoring Apr 7, 2014 Kaushik De 19