Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University.

Slides:



Advertisements
Similar presentations
DRIVER Building a worldwide scientific data repository infrastructure in support of scholarly communication 1 JISC/CNI Conference, Belfast, July.
Advertisements

The Data Conservancy: A Digital Research and Curation Virtual Organization D4Science World User Meeting November 25, 2009.
Research Councils ICT Conference Welcome Malcolm Atkinson Director 17 th May 2004.
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Mobile Data from a Research Perspective Institute of Educational Technology The Open University Agnes Kukulska-Hulme JISC/CNI conference, Edinburgh, 1-2.
Can We Talk? MICHAEL Conference London May 23, 2008Joyce Ray.
Digital Repositories: interoperability & common services Closing Remarks Dr Liz Lyon, UKOLN, University of Bath, UK
Maines Sustainability Solutions Initiative (SSI) Focuses on research of the coupled dynamics of social- ecological systems (SES) and the translation of.
Contouring Curation in Research Libraries: Defining “Working” Data Units and Communities Carole L. Palmer Center for Informatics Research in Science &
ICT 2010: "Global Information Structures for Science & Cultural heritage: The Interoperability Challenge" Networking Session Coordination Action on Digital.
Joint CASC/CCI Workshop Report Strategic and Tactical Recommendations EDUCAUSE Campus Cyberinfrastructure Working Group Coalition for Academic Scientific.
Presentation at WebEx Meeting June 15,  Context  Challenge  Anticipated Outcomes  Framework  Timeline & Guidance  Comment and Questions.
Libraries in the New Research Environment Joyce Ray NAS/BRDI Symposium Associate Deputy for Libraries June 3, 2010.
Introduction to Research Data Management Services, January 2013 Research Data Management Infrastructure The Current Context.
Background Chronopolis Goals Data Grid supporting a Long-term Preservation Service Data Migration Data Migration to next generation technologies Trust.
EInfrastructures (Internet and Grids) US Resource Centers Perspective: implementation and execution challenges Alan Blatecky Executive Director SDSC.
Using Sakai to Support eScience Sakai Conference June 12-14, 2007 Sayeed Choudhury Tim DiLauro, Jim Martino, Elliot Metsger, Mark Patton and David Reynolds.
Data Conservancy: A Life Sciences Perspective Sayeed Choudhury Johns Hopkins University
The "Earth Cube” Towards a National Data Infrastructure for Earth System Science Presentation at WebEx Meeting July 11, 2011.
Active Data Curation in Libraries: Issues and Challenges ASEE ELD Presentation June 27, 2011 William H. Mischo & Mary C. Schlembach.
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
The NIH Roadmap for Medical Research
1 Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure EPSCoR Meeting May 21,
Computing in Atmospheric Sciences Workshop: 2003 Challenges of Cyberinfrastructure Alan Blatecky Executive Director San Diego Supercomputer Center.
Advances in Cyberinfrastructure with a Focus on Data: a U.S. National Science Foundation Overview Alliance for Permanent Access to Records of Science in.
The Natural Resources Digital Library Needs, Partners, and Challenges Bonnie Avery, Janine Salwasser, & Janet Webster Oregon State University.
1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University.
Presenter: Karla Strieb Assistant Executive Director Transforming Research Libraries June 3, 2010 Supporting E-science: Progress at Research Institutions.
Data Conservancy: A Blueprint for Libraries in the Data Age Sayeed Choudhury Johns Hopkins University
The Data Conservancy: Lessons from Astronomy Third Workshop on Data Preservation and Long Term Analysis in HEP December 7, 2009.
The Data Conservancy: A Digital Research and Curation Virtual Organization Karon Kelly National Center for Atmospheric Research – NCAR Library Special.
Data Infrastructures Opportunities for the European Scientific Information Space Carlos Morais Pires European Commission Paris, 5 March 2012 "The views.
A River Runs Through It ARL Membership Meeting Sayeed Choudhury Sheridan Libraries, Johns Hopkins October 15, 2009.
Proposition: Digital Collections Are Easier to Find and Use through DLF Aquifer’s American Social History Online Katherine Kott, Aquifer Director Library.
Session Chair: Peter Doorn Director, Data Archiving and Networked Services (DANS), The Netherlands.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Sept. 5, 2012 Kevin T. Gallagher and Linda C. Gundersen September 5, 2012 CDI Science.
Geosciences - Observations (Bob Wilhelmson) The geosciences in NSF’s world consists of atmospheric science, ocean science, and earth science Many of the.
Business Models and Economics of Sustainable Data Infrastructures Patricia Cruse University of California Curation Center California Digital Library.
Education and Outreach Overview Susan Van Gundy Core Integration NSDL Central Office, UCAR.
Finding Partners, Creating Impact Rusty Low Poles Together Workshop NOAA Boulder, CO July 20-22, 2005.
Framework for the Creation of Digital Knowledge Resources to meet the Challenges for Digital Future: A Librarian’s Perspective Dr. Harish Chandra Librarian.
S2I2: Enabling grand challenge data intensive problems using future computing platforms Project Manager: Shel Swenson (USC & GATech)
1 The NSDL Program Stephen Griffin National Science Foundation.
Block 7: Reports Back to Plenary Group on CE and CI Working Group Activities Tasks and Activities -- October 22 DataONE Kick-off Meeting October 20-22,
“A Library outranks any other one thing a community can do to benefit its people.” --Andrew Carnegie.
Site-Based Data Curation at Yellowstone National Park PI: Carole L. Palmer, GSLIS, CIRSS Co-PIs: Bruce Fouke, Geology, Microbiology, Institute for Genomic.
Open Access from Digital Library Viewpoint Berlin 7 Conference Sayeed Choudhury December 4, 2009.
Award # funded by the National Science Foundation Award #ACI Jetstream: A Distributed Cloud Infrastructure for.
ARL Workshop on New Collaborative Relationships: The Role of Academic Libraries in the Digital Data Universe September 26-27, 2006 ARL Prue.
Implementing a National Data Infrastructure: Opportunities for the BIO Community Peter McCartney Program Director Division of Biological Infrastructure.
April 14, 2005MIT Libraries Visiting Committee Libraries Strategic Plan Theme III Work to shape the future MacKenzie Smith Associate Director for Technology.
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
Digital Data Collections ARL, CNI, CLIR, and DLF Forum October 28, 2005 Washington DC Chris Greer Program Director National Science Foundation.
Data Conservancy and the US NSF DataNet Initiative Fourth Workshop on Data Preservation and Long-Term Analysis in HEP Sayeed Choudhury Johns Hopkins University.
Preliminary Findings Baseline Assessment of Scientists’ Data Sharing Practices Carol Tenopir, University of Tennessee
The Global Scene Wouter Los University of Amsterdam The Netherlands.
Data Infrastructure Building Blocks (DIBBS) NSF Solicitation Webinar -- March 3, 2016 Amy Walton, Program Director Advanced Cyberinfrastructure.
Digital Data Collections in Biology Collaborative Expedition Workshop November 8, 2005 Arlington, Virginia Chris Greer Program Director National Science.
Source: Paul Hanson. Collaboration in Environmental Science Global Lake Ecological Observatory Network A grassroots network of –People: lake scientists,
Institutional Repositories: The Beginning of the Journey Sayeed Choudhury Utah State IR Conference September 30, 2009.
Joslynn Lee – Data Science Educator
PV 2009 December 3, 2009 The Data Conservancy: Building Sustainable Infrastructure for Interdisciplinary Scientific Data Curation and Preservation.
NSDL: A New Tool for Teaching and Learning.
Packaging Specification Package Ingest Service
Mentoring the Next Generation of Science Gateway Developers and Users
Research on Data Curation and Repositories
Briefing to ARL Membership
Brian Matthews STFC EOSCpilot Brian Matthews STFC
Bird of Feather Session
BCoN Data Integration Workshop, University of Kansas, Feb 13-14, 2018
Presentation transcript:

Data Conservancy and the US NSF DataNet Initiative 2010 JISC/CNI Conference July 1, 2010 Sayeed Choudhury Johns Hopkins University

Difficult times… Sub-theme for this conference states: Policies, strategies, technologies and infrastructure to manage research and teaching data in a fast changing technological and economic environment

Difficult times… Sub-theme for this conference states: Policies, strategies, technologies and infrastructure to manage research and teaching data in a fast changing technological and economic environment

…not a rigid road map but principles of navigation. There is no one way to design cyberinfrastructure, but there are tools we can teach the designers to help them appreciate the true size of the solution space – which is often much larger than they may think, if they are tied into technical fixes for all problems.

Central points Infrastructure development occurs because of fast changing technological and economic environments Yet the words we associate typically with infrastructure include reliable, persistent, ubiquitous…stable

NSF DataNet Science and engineering research and education are increasingly digital and data- intensive New methods, management structures and technologies necessary NSF DataNet solicitation addresses challenge by creating exemplar data infrastructure organizations

NSF recent actions Five DataNet partners funded at $20 million each for 5 years – seed funding Data Conservancy and DataONE are first two awards – up to three more awards in next round Part of broader initiatives at NSF including requirement for data management plans and (separate) Johns Hopkins grant for feasibility study of open access repository

Data Curation The Data Conservancy embraces a shared vision: data curation is a means to collect, organize, validate and preserve data so that scientists can find new ways to address the grand research challenges that face society.

Goal The goal of Data Conservancy is to support new forms of inquiry and learning that address grand research challenges. The Data Conservancy will accomplish this goal through the creation, implementation and sustained management of an integrated and comprehensive data curation strategy.

Principles Our strategy focuses on connection of systems into infrastructure through a program informed by user- centered design and research, sustained through a portfolio of funding streams, and managed through a shared, coordinated governance structure. Build on existing exemplar scientific projects, communities and virtual organizations that have deep engagement with citizen scientists and extensive experience with large-scale, distributed system development

Partner institutions Johns Hopkins University (Lead institution) Cornell University DuraSpace Marine Biological Laboratory National Center for Atmospheric Research National Snow and Ice Data Center Portico Tessella, Inc. University of California Los Angeles University of Illinois at Urbana-Champaign

Objectives Infrastructure research and development – Technical requirements Information science and computer science research – Scientific or user requirements Broader impacts – Educational requirements Sustainability – Business requirements

Domain coverage/methods Multi-site user research methods are a blend of: – Case study & domain comparisons – Depth & breadth – Local & global AstronomyEarth SciencesLife SciencesSocial Sciences UCAR Task-based design and usability testing Use cases, data requirements, system recommendations UCAR UCLAEthnography, virtual ethnography, oral histories Use cases, data requirements Interviews, Surveys, Worksheets, Content analysis Curation requirements, taxonomy, metadata/provenance framework UIUC

Data Framework Start with a common conceptualization that applies across scientific domains Exploit semantic technologies Leverage existing work Prototype the framework in target communities – Iteratively refine, learn from experience – Demonstrate success, measured in terms of new science

Common Conceptualization Observations are the foundation of all scientific studies, and are the closest approximation to facts. Wiens, J. A. (1992). Cambridge studies in ecology: The ecology of bird communities. Foundations and Patterns, 1; Processes and Variations, 2

Emergence Emergence: The Connected Lives of Ants, Brains, Cities, and Software by Steven Johnson The movement from low-level rules to higher- level sophistication is what we call emergence.

Data Model using OAI-ORE

Concerning Infrastructure Infrastructure is not about system building, but rather the rich, comprehensive set of human and technology interactions within the Data Conservancy Embrace the diversity of cultures Embrace the chaos before imposing order

Acknowledgements Carole Palmer (information science slides) Carl Lagoze (Data Framework slides) Tim DiLauro (OAI-ORE) Office of Cyberinfrastructure DataNet Award # Office of Cyberinfrastructure EAGER Award #