Mind the Gap: Finding Data Across Decades and Disciplines with the SSDB Stephen P. Miller 1, P. Dru Clark 1, Jacob M. Perez 1, Aaron D. Sweeney 1, John.

Slides:



Advertisements
Similar presentations
Panel 2 – Promoting Re-Use of Scientific Collections John Harrison SHAMAN Project University of Liverpool
Advertisements

National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids for Collection Federation Reagan W. Moore University.
Rolling Deck to Repository: Transforming the United States Academic Fleet Into an Integrated Global Observing System Suzanne M. Carbotte, Robert Arko,
DSpace: the MIT Libraries Institutional Repository MacKenzie Smith, MIT EDUCAUSE 2003, November 5 th Copyright MacKenzie Smith, This work is the.
Connecticut State Data Center at the Map and Geographic Information Center - MAGIC Connecticut State Data Center Data Collaborator for Planning, Analysis,
The Frame NSF-funded national supercomputer centers Centers have hosted significant projects: TeraGrid, NPACI, GEON, SCEC, Chronopolis Fostered development.
Ocean Data Interoperability Platform EU-US-Australia collaborative project Grant Number: Call: FP7-INFRASTRUCTURES INFSO Activity: INFRA :
Background Chronopolis Goals Data Grid supporting a Long-term Preservation Service Data Migration Data Migration to next generation technologies Trust.
Data, Cyberinfrastructure, and Interoperability: Highlights from Infrastructure Studies Florence Millerand, Karen S. Baker, David Ribes *Florence:
The Data Curation Profile IASSIST 2010 Jake Carlson Data Research Scientist Purdue University Libraries.
Chronopolis: Preserving Our Digital Heritage David Minor UC San Diego San Diego Supercomputer Center.
Rutgers University Libraries What is RUcore? o An institutional repository, to preserve, manage and make accessible the research and publications of the.
SCORM-NSDL Workshop May 18, Educational Materials are Scattered across the Internet NASA Math Forum State standards Scientific American Ask.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
Dogan Seber, PhD San Diego Supercomputer Center University of California, San Diego I. DLESE Library II. DISCOVER OUR EARTH Earth Science Resources for.
Tyler O. Walters, Associate Director, Technology & Resource Services Library & Information Center, Georgia Institute of Technology For NSF Site Visit to.
1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University.
Beyond a Data Portal: A Collaborative Environment for the Deep Carbon Science Communities Han Wang, Yu Chen, Patrick West, John Erickson, Xiaogang Ma,
Multi-Institution Testbed for Scalable Digital Archiving NSF CISE/Library of Congress DIGARCH Award Stephen Miller Scripps Institution of Oceanography.
Managing the Record of Research At the Smithsonian Using SIdora SAA Research Forum August 12, 2014.
The Marine Metadata Interoperability Project A Model for Community Collaboration September 23, 2010 Nan Galbraith WHOI.
Helen Glaves (NERC- BGS), Dick Schaap (MARIS), Robert Arko (LDEO) and Roger Proctor (IMOS)
San Diego Supercomputer CenterUniversity of California, San Diego Preservation Research Roadmap Reagan W. Moore San Diego Supercomputer Center
Project Planning Workshop Woods Hole July 11-13, 2005 Multi-Institution Testbed for Scalable Digital Archiving NSF CISE/Library of Congress DIGARCH Award.
Research Data Management At the Smithsonian Using SIdora Nano Tech Working Group May 15, 2014.
Updates from EOSDIS -- as they relate to LANCE Kevin Murphy LANCE UWG, 23rd September
ACCESS for VALIDITY ACCESS for INNOVATION. Starting January 2011 for NEW proposals Not voluntary – “integral part” of proposal and FastLane Required for.
“A Library outranks any other one thing a community can do to benefit its people.” Andrew Carnegie Mary R. Marlino, Ed.D. DLESE Program Center Presentation.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Sept. 5, 2012 Kevin T. Gallagher and Linda C. Gundersen September 5, 2012 CDI Science.
Deep Sea Drilling Project and Google Earth. A LITTLE BIT ABOUT THE PROGRAM GLOMAR CHALLENGER June 24, 1966, that the Prime Contract between the National.
What is Cyberinfrastructure? Russ Hobby, Internet2 Clemson University CI Days 20 May 2008.
Mind the Gap: Finding Data Across Decades and Disciplines with the SSDB Stephen P. Miller 1, P. Dru Clark 1, Jacob M. Perez 1, Aaron D. Sweeney 1, John.
Life Cycle Models & Principles Jake Carlson Associate Professor of Library Science Data Services Specialist Purdue University Libraries.
Recommend SSDB FY06 Priorities Oct – Sep Provide access 2.Respond to reviews 3.Add new capabilities 4.Action items SSDB Advisory Board.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
Rolling Deck to Repository II: Getting Control of Provenance and Quality AGU Poster IN43A-1169 AGU Fall Meeting December 17, Stephen.
WHOI and SIO (II): Next Steps Towards Multi-Institution Archiving of Shipboard and Deep Submergence Vehicle Data (IN51A-0306) The Woods Hole Oceanographic.
June 20, 2007ESRI Intl. User Conference Dawn Wright - Oregon State University Val Cummins - Coastal & Marine Resources Centre, IRELAND Liz O’Dea - Coastal.
1 The NSDL Program Stephen Griffin National Science Foundation.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
We take the argument of emergence very seriously: the elements which we have defined here are analytic resources rather than causal factors. They have.
GEOSCIENCE NEEDS & CHALLENGES Dogan Seber San Diego Supercomputer Center University of California, San Diego, USA.
Research Data Management At the Smithsonian Using Sidora CNI December 10, 2013.
1Mobile Computing Systems © 2001 Carnegie Mellon University Writing a Successful NSF Proposal November 4, 2003 Website: nsf.gov.
29 March 2004 Steven Worley, NSF/NCAR/SCD 1 Research Data Stewardship and Access Steven Worley, CISL/SCD Cyberinfrastructure meeting with Priscilla Nelson.
The Collaborative Semantic Grid David De Roure University of Southampton, UK
April 14, 2005MIT Libraries Visiting Committee Libraries Strategic Plan Theme III Work to shape the future MacKenzie Smith Associate Director for Technology.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
GeoLink Overview Goal: Develop Semantic Web technologies that facilitate discovery (and reuse) of geoscience data.Goal: Develop Semantic Web technologies.
1 1 NOAA Office of Ocean Exploration End-to-End Data Management: A Success Story NOAA Tech Conference November 2005 Susan Gottfried National Coastal Data.
SIOExplorer Stephen Miller Scripps Institution of Oceanography USA International Data Exchange Workshop Building a Global Data Network for Studies of Earth.
National Archives and Records Administration1 Integrated Rules Ordered Data System (“IRODS”) Technology Research: Digital Preservation Technology in a.
Semantic Concepts in Expedition Metadata Semantic Concepts in Expedition Metadata Bob Arko Lamont-Doherty Earth Observatory OOSSI Workshop Nov. 18, 2008.
SSDB Progress Report Site Survey Panel Meeting CIRE, Sapporo, Japan July 22, 2006 John Weatherford San Diego Supercomputer Center Subcontract to IODP-MI.
The launching of an expedition has its own brand of excitement, with the sound of the main engines firing up, and the lifting of the gangway in a foreign.
Working prototype Multi-Institution Testbed for Scalable Digital Archiving Three institutions are working together to rescue at-risk media, establish interoperability,
Rolling Deck to Repository (R2R): How to Systematically Document Quality for the New Era of Data Re-Usability? AGU Poster IN21B-1048 AGU Fall Meeting December.
Biological and Chemical Oceanography Data Management Office slide 1 of 10 U.S. GEOTRACES Data Management Cyndy Chandler BCO-DMO ~ WHOI 23 September 2008.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
SIOExplorer: Digital Library Projects R/V Alexander Agassiz November, 1907 UCSD Libraries Scripps Institution of Oceanography San Diego Supercomputer Center.
Biological and Chemical Oceanography Data Management Office slide 1 of 10 The Biological and Chemical Oceanography Data Management Office (BCO-DMO) Cyndy.
Store and exchange data with colleagues and team Synchronize multiple versions of data Ensure automatic desktop synchronization of large files B2DROP is.
Human Social Dynamics: Interoperability Strategies for Scientific Cyberinfrastructure: The Comparative Interoperability Project ( ) initiates a.
JCU Australian Marine Science Data Network.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Digital Collections Update
Presentation transcript:

Mind the Gap: Finding Data Across Decades and Disciplines with the SSDB Stephen P. Miller 1, P. Dru Clark 1, Jacob M. Perez 1, Aaron D. Sweeney 1, John Helly 1,2, Karen I. Stocks 2, and Donald W. Sutton 2 1 Scripps Institution of Oceanography, University of California, San Diego, La Jolla, CA, USA 2 San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA, USA The fully electronic IODP Site Survey Data Bank (SSDB; is primarily used to support Site Survey Panels as they evaluate each proposal for drilling, site by site. The SSDB has been in operation since 2005, and has now grown to hold more than 7000 digital objects, such as seismic sections, bottom photographs, bathymetric maps and background reports. By design, the SSDB allows for the secure holding of proprietary data when necessary. Since every object is quality-controlled, geo-referenced, indexed with standard metadata, and stored in a long-term searchable digital library, the Data Bank serves a wider purpose, from conception of proposal ideas, through site evaluation, operations, publication, and future research and education. With this investment in information technology, the system reliably preserves data acquired over many decades. Its flexible and extensible infrastructure makes it applicable to a range of data types and disciplines, allowing it to respond to changing IODP science priorities. To enable searches across all aspects of drilling, metadata are now being mapped for harvest in the next generation Scientific Earth Drilling Information Service (SEDIS) format. In addition, an “SSDB-in-a-Box” development will allow the entire collection to be transported to vessel or a remote office on a laptop. The SSDB shares technology with the SIOExplorer Digital Library of Scripps expeditions dating back to the 1950’s. The most recent related project is the “Rolling Deck to Repository” initiative that seeks to archive all standard underway shipboard data from US research vessels, with up to 500 expeditions per year conducted by 18 operating institutions ( Hosted at SDSC machine room for high bandwidth 24/7 operations. System replicated at SIO and deep archived at 3 remote sites across the U.S. The SSDB also archives 33,381 legacy physical objects, including maps, reports, seismic sections, disks, and tapes. Complete Site Characterization: Proponents upload the data they need to support a proposal. Objects on proprietary hold are protected by username/password security. Persistence: Preserving data for the long term as a digital library with metadata, so they can always be discovered. The Site Survey Data Bank Quality Control: Every upload must meet IODP data and metadata requirements, including use of certified site names. Select from a community-accepted list of standard categories. Seafloor features from sidescan data Location map to relate activities, sites, and data sets Subsurface profile from interpreted seismic section Flexible design allows for new data types and domains as needed in the future. SSDB internal QC checklist QC map viewer checks spatial relationships. Data used with permission of Greg Mountain, Rutgers. Go to

The SSDB Team: The SSDB is developed by a team from the Scripps Institution of Oceanography and the San Diego Supercomputer Center, at the University of California, San Diego, under a contract with IODP-MI, funded by the NSF. Stephen Miller, John Helly, Don Sutton, Jake Perez, Caryn Neiswender, Chris Massell Symons (top) Dru Clark, Aaron Sweeney, Karen Stocks, students Andrea Cardenas, Goldy Thach, Jenny Smith, Katie Foster (bottom) Review Panels: Decision support system provides access to site-specific data for each proposal. INTViewer enables interactive display of seismic data across the web. Seismic INTViewer Mobile Support: SSDB-In-a-Box is a laptop for remote use at meetings or on vessels, independent of Internet connection. Researchers: Furthering science through data discovery and download. Search by Proposal, Expedition, Leg, data type, format, lat/lon, date, etc. Supporting IODP Science Operations and Expeditions: Packaging the data needed to make on-site operational decisions. Integrating into SEDIS: Entire SSDB Collection can be searched by SEDIS. SSDB Related Projects Sharing technology and resources SIOExplorer Digital Library SIO cruises since 1950’s Rolling Deck To Repository (R2R) - Archiving all US routine underway shipboard data, ~ 30 vessels, hundreds of cruises/year Marine Metadata Interoperability Project (MMI) – Community collaborations advancing marine data integration and re-use Data used with permission of Peter Clift, University of Aberdeen.