Data Conservancy and the US NSF DataNet Initiative Fourth Workshop on Data Preservation and Long-Term Analysis in HEP Sayeed Choudhury Johns Hopkins University.

NSF DataNet
Science and engineering research and education are increasingly digital and data-intensive
New methods, management structures and technologies necessary
NSF DataNet solicitation addresses challenge by creating exemplar data infrastructure organizations

NSF recent actions
Five DataNet partners funded at $20 million each for 5 years – seed funding
Data Conservancy and DataONE are first two awards – up to three more awards in next round
Part of broader initiatives at NSF including requirement for data management plans and (separate) Johns Hopkins grant for feasibility study of open access repository

Data Curation
The Data Conservancy embraces a shared vision: data curation is a means to collect, organize, validate and preserve data so that scientists can find new ways to address the grand research challenges that face society.

Goal
The goal of the Data Conservancy is to support new forms of inquiry and learning that address grand research challenges. The Data Conservancy will accomplish this goal through the creation, implementation and sustained management of an integrated and comprehensive data curation strategy.

…not a rigid road map but principles of navigation. There is no one way to design cyberinfrastructure, but there are tools we can teach the designers to help them appreciate the true size of the solution space – which is often much larger than they may think, if they are tied into technical fixes for all problems.

Principles
Our strategy focuses on connection of systems into infrastructure through a program informed by user-centered design and research, sustained through a portfolio of funding streams, and managed through a shared, coordinated governance structure.
Build on existing exemplar scientific projects, communities and virtual organizations that have deep engagement with citizen scientists and extensive experience with large-scale, distributed system development

Partner institutions
Johns Hopkins University (Lead institution)
Cornell University
DuraSpace
Marine Biological Laboratory
National Center for Atmospheric Research
National Snow and Ice Data Center
Portico
Tessella, Inc.
University of California Los Angeles
University of Illinois at Urbana-Champaign

Objectives
Infrastructure research and development
  – Technical requirements
Information science and computer science research
  – Scientific or user requirements
Broader impacts
  – Educational requirements
Sustainability
  – Business requirements

Domain coverage/methods
Multi-site user research methods are a blend of:
  – Case study & domain comparisons
  – Depth & breadth
  – Local & global
Domains: Astronomy, Earth Sciences, Life Sciences, Social Sciences
UCAR: Task-based design and usability testing → Use cases, data requirements, system recommendations
UCLA: Ethnography, virtual ethnography, oral histories → Use cases, data requirements
UIUC: Interviews, surveys, worksheets, content analysis → Curation requirements, taxonomy, metadata/provenance framework

Data Framework
Start with a common conceptualization that applies across scientific domains
Exploit semantic technologies
Leverage existing work
Prototype the framework in target communities
  – Iteratively refine, learn from experience
  – Demonstrate success, measured in terms of new science

Common Conceptualization
Observations are the foundation of all scientific studies, and are the closest approximation to facts.
Wiens, J. A. (1992). Cambridge studies in ecology: The ecology of bird communities. Foundations and Patterns, 1; Processes and Variations, 2

Data Model using OAI-ORE
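
The transcript preserves only this slide's title, so the Data Conservancy's actual model is not shown here. As a rough, hypothetical sketch of what an OAI-ORE description looks like in general: a resource map describes an aggregation, which in turn aggregates individual resources (for example, the files of a dataset). The example.org URIs and the build_resource_map helper below are illustrative assumptions; only the ResourceMap, Aggregation, describes and aggregates terms come from the OAI-ORE vocabulary.

```python
import json

# Namespace of the OAI-ORE vocabulary terms.
ORE = "http://www.openarchives.org/ore/terms/"


def build_resource_map(rem_uri, aggregation_uri, resource_uris):
    """Return a minimal ORE-style description as a plain dict.

    Hypothetical helper for illustration only; it is not part of any
    Data Conservancy software.
    """
    return {
        "@id": rem_uri,
        "@type": ORE + "ResourceMap",
        # The resource map describes exactly one aggregation.
        ORE + "describes": {
            "@id": aggregation_uri,
            "@type": ORE + "Aggregation",
            # The aggregation lists the resources it groups together.
            ORE + "aggregates": [{"@id": uri} for uri in resource_uris],
        },
    }


# Placeholder URIs (assumptions, not real Data Conservancy identifiers).
example = build_resource_map(
    "http://example.org/rem/1",
    "http://example.org/aggregation/1",
    [
        "http://example.org/data/observations.csv",
        "http://example.org/data/readme.txt",
    ],
)

print(json.dumps(example, indent=2))
```

Keeping the resource map and the aggregation as separate URIs mirrors the ORE distinction between the thing being described and the document that describes it.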

Acknowledgements
Carole Palmer (information science slides)
Carl Lagoze (Data Framework slides)
Tim DiLauro (OAI-ORE)
Office of Cyberinfrastructure DataNet Award #
Office of Cyberinfrastructure EAGER Award #