TeraGrid & XD approach to long-term archival Phil Andrews et al The Moving Finger writes; and having writ, Moves on; nor all your Piety nor Wit Shall lure.

Slides:



Advertisements
Similar presentations
© 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.
Advertisements

User Services Transition To XD TG Quarterly Management Meeting, San Juan 12/7/2010 Amit & Sergiu.
The transition to Finch: implications for the REF 29 November 2012 Paul Hubbard Head of Research Policy, HEFCE.
JISC funded Project The Language Box: Rethinking Teaching and Learning Repositories Dave Millard Faroes Project Learning Societies Lab University of Southampton.
1 Indiana University Advanced User Concepts August, 2012.
TeraGrid Archival Migration of Data to the XD Era Phil Andrews et al The Moving Finger writes; and having writ, Moves on; nor all your Piety nor Wit Shall.
National Soil Carbon Network Breakout Thanks to our rapporteur: Mark Waldrop participants.
A policy perspective: the role of higher education in meeting the needs of business and the community Mary-Anne Sakkara ACPET Symposium: Raising productivity,
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
York Extra User survey. York Extra Origins –University Communications Audit –Plans to generalise Computing Service Message of the day for multiple providers.
John Falkingham Canadian Ice Service, Environment Canada International Polar Year Ice Information Portal.
1 Supplemental line if need be (example: Supported by the National Science Foundation) Delete if not needed. Supporting Polar Research with National Cyberinfrastructure.
Peer-to-peer archival data trading Brian Cooper and Hector Garcia-Molina Stanford University.
Israel Cluster Structure. Outline The local cluster Local analysis on the cluster –Program location –Storage –Interactive analysis & batch analysis –PBS.
TeraGrid Gateway User Concept – Supporting Users V. E. Lynch, M. L. Chen, J. W. Cobb, J. A. Kohl, S. D. Miller, S. S. Vazhkudai Oak Ridge National Laboratory.
Cloud Tiering The recovery of primary storage by tiering inactive data.
RECRUITMENT AND RETENTION One thing that makes our lives easier as leaders is who we hire and how we do it!
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
Alexandre A. P. Suaide VI DOSAR workshop, São Paulo, 2005 STAR grid activities and São Paulo experience.
Magnet Lab User Portal August 2010.
Dissemination to support Research & Analysis John Cornish.
Software Engineering General architecture. Architectural components:  Program organisation overview Major building blocks in a system Definition of each.
November 2005 Advanced Research Networks Conference BCNET UVic is Wired Leverage the Power.
1 Preparing Your Application for TeraGrid Beyond 2010 TG09 Tutorial June 22, 2009.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Objectives.
CCSM DATA MANGEMENT POLICY The Community Climate System Model (CCSM) Data Management Policy documents the procedures for the management of model data produced.
TeraGrid Privacy Policy: What is it and why are we doing it… Von Welch TeraGrid Quarterly Meeting March 6, 2008.
State Public Transportation Partnership Conference Reauthorization of the Surface Transportation Programs Jack Basso Director of Program Finance and Management.
TeraGrid Archival and Service Proposal Status Phil Andrews, Patricia Kovatch Cast thy bread upon the waters: for thou shalt find it after many days Ecclesiastes.
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
Going Google… Drive Eric Yamoah and Haris Azmi August 14, 2015.
June 20, 2007ESRI Intl. User Conference Dawn Wright - Oregon State University Val Cummins - Coastal & Marine Resources Centre, IRELAND Liz O’Dea - Coastal.
Author: Omar Khayyám (trans. By Edward FitzGerald in 1859)
LSST VAO Meeting March 24, 2011 Tucson, AZ. Headquarters Site Headquarters Facility Observatory Management Science Operations Education and Public Outreach.
Karsten Köneke October 22 nd 2007 Ganga User Experience 1/9 Outline: Introduction What are we trying to do? Problems What are the problems? Conclusions.
TeraGrid Quarterly Meeting Arlington, VA Sep 6-7, 2007 NCSA RP Status Report.
1 NSF/TeraGrid Science Advisory Board Meeting July 19-20, San Diego, CA Brief TeraGrid Overview and Expectations of Science Advisory Board John Towns TeraGrid.
TeraGrid Gateway User Concept – Supporting Users V. E. Lynch, M. L. Chen, J. W. Cobb, J. A. Kohl, S. D. Miller, S. S. Vazhkudai Oak Ridge National Laboratory.
Distributed Data for Science Workflows Data Architecture Progress Report December 2008.
User-Facing Projects Update David Hart, SDSC April 23, 2009.
Neighbourhood Development Plan December 1 st, 2013.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
Data Area Report Chris Jordan, Data Working Group Lead, TACC Kelly Gaither, Data and Visualization Area Director, TACC April 2009.
Semantics-based Distributed I/O for mpiBLAST P. Balaji ά, W. Feng β, J. Archuleta β, H. Lin δ, R. Kettimuthu ά, R. Thakur ά and X. Ma δ ά Argonne National.
PRESENTED TO: ENERGY FACILITY CONTRACTORS GROUP SAFETY ANALYSIS WORKING GROUP SAFETY ANALYSIS WORKSHOP BY: CHRIS CHAVES NSR&D PROGRAM OFFICE OF NUCLEAR.
NICS Update Bruce Loftis 16 December National Institute for Computational Sciences University of Tennessee and ORNL partnership  NICS is the 2.
Introduction to AFS IMSA Intersession 2003 An Overview of AFS Brian Sebby, IMSA ’96 Copyright 2003 by Brian Sebby, Copies of these slides.
Attribute-based Authentication for Gateways Jim Basney Terry Fleury Stuart Martin JP Navarro Tom Scavo Nancy Wilkins-Diehr.
David P. Anderson Space Sciences Laboratory University of California – Berkeley Public Distributed Computing with BOINC.
U.S. ATLAS Facility Planning U.S. ATLAS Tier-2 & Tier-3 Meeting at SLAC 30 November 2007.
GridShell/Condor: A virtual login Shell for the NSF TeraGrid (How do you run a million jobs on the NSF TeraGrid?) The University of Texas at Austin.
Advanced User Support Amit Majumdar 8/12/10. Outline  Three categories of AUS  Operational Activities  AUS.ASTA  ASTA examples  AUS.ASP  AUS.ASEOT.
OSG Area Coordinator’s Report: Workload Management February 9 th, 2011 Maxim Potekhin BNL
COST as a network instrument: Actions in Sustainable Construction and Energy Efficient Buildings S3 Platform on smart specialisation Workshop “Towards.
Aalto Research Data Management Policy Ella Bingham 8 April 2016 This work is licensed under the Creative Commons Attribution 4.0 International License.
Proposed DataONE TeraGrid Joint Initiative John Cobb, TeraGrid, and DataONE Presentation to TeraGrid Quarterly Management Meeting August 31, 2010 Seattle,
Data Infrastructure in the TeraGrid Chris Jordan Campus Champions Presentation May 6, 2009.
The Challenges of Digital Preservation in a Changing Environment Andrew Pitt Pfizer eArchive Service Team Global Records Management Services DPC Digital.
TeraGrid’s Process for Meeting User Needs. Jay Boisseau, Texas Advanced Computing Center Dennis Gannon, Indiana University Ralph Roskies, University of.
TeraGrid Science Advisory Board Arlington, VA  10 Dec
INTRODUCTION TO XSEDE. INTRODUCTION  Extreme Science and Engineering Discovery Environment (XSEDE)  “most advanced, powerful, and robust collection.
User Representation in TeraGrid Management Jay Boisseau Director, Texas Advanced Computing Center The University of Texas at Austin.
1 This slide indicated the continuous cycle of creating raw data or derived data based on collections of existing data. Identify components that could.
Barracuda Backup Easy Cloud-Connected Backup Version 5.4 | July 2014.
Amazon Storage- S3 and Glacier
Active Directory Administration
Jeffrey P. Gardner Pittsburgh Supercomputing Center
OGSA Data Architecture Scenarios
Long-Lived Data Collections
Urban Infrastructure: Analysis and Modeling for Their Optimal Management and Operation NSF Workshop NSF Award #: Nada Marie Anid, Ph.D. Professor.
Presentation transcript:

TeraGrid & XD approach to long-term archival Phil Andrews et al The Moving Finger writes; and having writ, Moves on; nor all your Piety nor Wit Shall lure it back to cancel half a Line, Nor all your Tears wash out a Word of it. -Omar Khayyam The output of our labour is now predominantly large datasets. There is currently no unified mechanism to ensure its continued safety and availability

2 Data Archival: no current long- term funding Earlier this year, TeraGrid (Andrews, PI) submitted a $12M proposal for data- replication within TG. No money was available this fiscal year. Plan to resubmit a more general proposal in late ’10. Short term SDSC and NCSA archival issues resolved. It’s Later Than You Think! – A Tale of Two Cities, Charles Dickens Science Advisory Board Meeting, December'09

3 Looking ahead … this will be a recurring and growing issue Science Advisory Board, December, 2009 Notional projections for illustration only! Estimated 3PB/yr/PF(peak) for T2 systems, 5PB/yr for Blue Waters. Current transition

4 What are the high-end users doing? Many users need data for 3-5 years Experimental data must be stored long-term, dual copies Climate data recently in the news: used to motivate $Trillions of policy decisions; should not be deleted COLA group currently generating ~1PB/year of data that needs to be saved and disseminated Single runs can generate 100+TB, 1 yr to generate, 1-2 years to analyze, need easy access during period Data Access profile: high speed I/O, then dissemination, then continued archival Science Advisory Board, December, 2009

5 No Concerted NSF plan for Archival Two NSF DataNet awards at ~$25M each, but no provision for actual data storage Need general NSF plan for data archival-we feel TG/XD is uniquely positioned to lead Will propose joint data responsibility: multiple copies across TG/XD, no individual RP dependence: “Lloyds of London” approach Science Advisory Board Meeting, December'09 I spent half of my money on women, booze, and gambling-the rest of it, I just frittered away! - George Best

6 How should it be done? Science Advisory Board Meeting, December'09 Many possible ways, expect linked global file systems for data transport and dissemination with integrated archival mechanisms over multiple sites. Ongoing development in response to user requirements and technological advances A merry road, a mazy road, and such as we did tread. The night we went to Birmingham by way of Beachy Head! – G.K Chesterton

7 Would like: The endorsement of the SAB for a TG/XD proposal that would provide general archival services, inc. dissemination and replication To submit new proposal to the NSF in late ’10 Recognition of data storage importance Science Advisory Board Meeting, December'09 “Rome wasn’t built in a day: but I wasn’t on that particular job! – Brian Clough, English Soccer Manager