Data Management TEG Status Dirk Duellmann & Brian Bockelman WLCG GDB, 9. Nov 2011.

DM TEG status
The working group has the stakeholder representation we need.
– The only apparent omission from the first meeting is representatives from dCache and StoRM; we will contact both projects explicitly.
– We are also already in contact with the Storage TEG to achieve an additional balance between experiments and infrastructure providers.
The mandates are clear and agreed.
– Some members have expressed concern about the efficacy of such working groups; does the MB have any guidance on how the resulting strategy documents will be used after the TEGs have disbanded?
The working group has had its initial kickoff phone meeting.
– We will be holding biweekly meetings and a face-to-face meeting (collocated with the Storage TEG) in late January (possibly also one in early December).
– We plan to focus initially on the needs and status of each experiment, then move on to examining the roadmaps of the infrastructure providers.
Other issues:
– We would like guidance on how best to coordinate between the different TEGs. There is significant overlap with storage management, and some overlap with security and operations.
– We believe it could be difficult to have a fully formed long-term strategy document in January. Our current goal is to put together a strong statement of the status quo by the end of the face-to-face meeting.

TEG Logistics
Twiki page –
list –
Bi-weekly phone meetings – = =3772
In-person meetings being planned close to Storage TEG

Disclaimer
The following few slides contain the interpretation of the mandate by only one of the chairs (Dirk), together with a proposal for guiding the input collection with a (still incomplete) set of questions. They have not yet been widely discussed in the working group, due to lack of time before the GDB meeting.

The Story so far..
We do have a working system
– We do not design a new s/w system
– Its cost and complexity exceed what is strictly needed to do the job
We should rather look critically at which elements of the current system are used, and how. Which have been added, and why?
– Which (larger) functionality did not make it and could therefore be declared obsolete (time scale 2-5 years)?
What is the impact on the experiment / site / dev project side of progressively dropping complexity?
This review needs a simple layered model of data management as we do it today, and a list of boxes labeled “crucial” or “not required”
And one or a few future evolutions of this model which we see as a desirable target after the 2-5 years
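
A minimal sketch of how such a labeled inventory could be recorded and queried. This is only an illustration, not something proposed in the slides: the layer names, the box names ("box A" to "box D") and the labels assigned to them are invented placeholders, and `Box`, `Label` and `droppable` are hypothetical helpers.

```python
from dataclasses import dataclass
from enum import Enum


class Label(Enum):
    """The two labels named on the slide."""
    CRUCIAL = "crucial"
    NOT_REQUIRED = "not required"


@dataclass(frozen=True)
class Box:
    layer: str    # hypothetical layer name in the layered model
    name: str     # hypothetical box (component) name
    label: Label  # outcome of the review for this box


def droppable(boxes):
    """Group the names of boxes labeled 'not required' by layer."""
    by_layer = {}
    for box in boxes:
        if box.label is Label.NOT_REQUIRED:
            by_layer.setdefault(box.layer, []).append(box.name)
    return by_layer


# Placeholder inventory: layers, boxes and labels are invented for illustration.
MODEL = [
    Box("catalogue", "box A", Label.CRUCIAL),
    Box("transfer", "box B", Label.CRUCIAL),
    Box("storage access", "box C", Label.NOT_REQUIRED),
    Box("storage access", "box D", Label.CRUCIAL),
]

if __name__ == "__main__":
    for layer, names in droppable(MODEL).items():
        print(layer, "->", ", ".join(names))
```

Keeping the labels as data makes it easy to derive a candidate future model by filtering out everything marked "not required" and checking what each layer still offers.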

The E stands for Evolution
We should not start by collecting bug reports or feature requests for existing systems
We should rather step back and look at the main concepts behind recent strategy changes/additions on the experiment side
– E.g. data placement and federation
Which concepts allow us to implement storage federation?
– What are the implicit assumptions? E.g. namespace congruence, read-only files
– The upcoming Federation workshop in Lyon will help the TEG to collect input
How and where can new technology come in, and which of our “special needs” are preventing this today?
– Clustered file systems at smaller sites, but we need SRM for disk?
– Cloud storage, but we need a global namespace?
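
To make the two implicit assumptions concrete, here is a small sketch under one possible reading of them. Namespace congruence is taken to mean that the same logical file name is valid at every site and differs only by a site prefix; read-only files mean any replica can serve a read. The site names, prefixes and the `resolve` helper are hypothetical and do not describe any existing federation software.

```python
def resolve(lfn, site_prefixes, site_contents, local_site):
    """Return (site, physical_path) for a replica of lfn, local site first.

    lfn           -- logical file name, identical at all sites (congruence)
    site_prefixes -- site name -> storage prefix at that site
    site_contents -- site name -> set of logical file names held there
    local_site    -- site where the job runs; tried before any remote site
    """
    remote = sorted(s for s in site_prefixes if s != local_site)
    for site in [local_site] + remote:
        if lfn in site_contents.get(site, set()):
            # Congruent namespace: the physical path is just prefix + lfn.
            return site, site_prefixes[site] + lfn
    raise FileNotFoundError(lfn)


# Invented example data for two sites.
PREFIXES = {"site_a": "/pool/a", "site_b": "/pool/b"}
CONTENTS = {
    "site_a": {"/exp/run1/f.root"},
    "site_b": {"/exp/run1/f.root", "/exp/run2/g.root"},
}

if __name__ == "__main__":
    print(resolve("/exp/run1/f.root", PREFIXES, CONTENTS, "site_a"))
    print(resolve("/exp/run2/g.root", PREFIXES, CONTENTS, "site_a"))
```

Without congruence the lookup would need a per-site name translation, and without read-only files the choice of replica would require a consistency check; both would add to the complexity the review is trying to reduce.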

Requirements vs Strategy
Just collecting a list of additional requirements will not be sufficient
Need to document a few strategic target scenarios
– main advantages in sustainability
– allow storage providers to align / point out risks
– define crucial validation tests
Possible scenarios
– POSIX: storage pools look like local file storage with publishing / deletion
– Cloud storage: storage has constrained semantics à la S3

DM Model and Evolution
A few key questions to guide input collection
– Is the archive / disk split an agreed strategy? In other words: can we drop HSM? If yes, several simplifications can be made and their benefits evaluated. E.g. why SRM for disk-only pools?
– Do we see “naked” clustered file systems or cloud storage coming in during the reviewed time scale? Can we take our abstract DM model and fill its conceptual boxes with technology options for a given class of sites? Are the experiment data services prepared for this?
– What role do we see for Federation vs Data Push? E.g. Federation complements Push at small sites (T>1), but Push is still required to maintain control. E.g. Federation/Push is far better and will be the strategic target

Summary
DM TEG has (just) started
– First meeting was mainly on logistics and scope
– Mandate and environment for it are OK
We should start by abstracting from what we learned and come up with DM model(s) and their expected evolution
– .. rather than repeating the sometimes painful experience of low-level functionality definition groups
The process will start driven by the experiment view and progressively include storage provider and site feedback
– In close contact with other TEGs
– Regular MB and GDB reports