Working prototype Multi-Institution Testbed for Scalable Digital Archiving Three institutions are working together to rescue at-risk media, establish interoperability,

Slides:



Advertisements
Similar presentations
Panel 2 – Promoting Re-Use of Scientific Collections John Harrison SHAMAN Project University of Liverpool
Advertisements

Data Publishing Service Indiana University Stacy Kowalczyk April 9, 2010.
High Performance Wireless Research and Education Network
Prototype Phase SIO Accomplishments
Rolling Deck to Repository: Transforming the United States Academic Fleet Into an Integrated Global Observing System Suzanne M. Carbotte, Robert Arko,
Digital Repositories: interoperability & common services Closing Remarks Dr Liz Lyon, UKOLN, University of Bath, UK
Background Chronopolis Goals Data Grid supporting a Long-term Preservation Service Data Migration Data Migration to next generation technologies Trust.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Chronopolis: Preserving Our Digital Heritage David Minor UC San Diego San Diego Supercomputer Center.
ADAPT An Approach to Digital Archiving and Preservation Technology Principal Investigator: Joseph JaJa Lead Programmers: Mike Smorul and Mike McGann Graduate.
Preservation and Long-term access through Networked Services Adam Farquhar, The British Library iPres2006 Cornell University, October 2006.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
Corporation For National Research Initiatives NSF SMETE Library Building the SMETE Library: Getting Started William Y. Arms.
Dogan Seber, PhD San Diego Supercomputer Center University of California, San Diego I. DLESE Library II. DISCOVER OUR EARTH Earth Science Resources for.
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
Data Management Practices: BCO-DMO’s Successes and Challenges Bob Groman BCO-DMO Woods Hole Oceanographic Institution NERACOOS/NeCODP Data Management Workshop.
1 The NSDL: A Case Study in Interoperability William Y. Arms Cornell University.
Multi-Institution Testbed for Scalable Digital Archiving NSF CISE/Library of Congress DIGARCH Award Stephen Miller Scripps Institution of Oceanography.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Project Planning Workshop Woods Hole July 11-13, 2005 Multi-Institution Testbed for Scalable Digital Archiving NSF CISE/Library of Congress DIGARCH Award.
Project Builder and MediaMatrix: Redefining Access in the Digital Age Dean Rehberger and Michael Fegan MERLOT August 7-10, 2006 New Orleans, LA.
1 The IODE Ocean Data Portal - current status and future Nikolai Mikhailov, Chair of IODE/JCOMM ETDMP National Oceanographic Data Centre, Russia Four Session.
Exploring the Applicability of Scientific Data Management Tools and Techniques on the Records Management Requirements for the National Archives and Records.
Digital Preservation: Lessons learned through national action Digital Preservation Interoperability Framework Workshop April 2010.
Ms. Irene Onyancha ISTD/Library & Information Management Services United Nations Economic Commission for Africa The Second Session of the Committee on.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Mind the Gap: Finding Data Across Decades and Disciplines with the SSDB Stephen P. Miller 1, P. Dru Clark 1, Jacob M. Perez 1, Aaron D. Sweeney 1, John.
Mind the Gap: Finding Data Across Decades and Disciplines with the SSDB Stephen P. Miller 1, P. Dru Clark 1, Jacob M. Perez 1, Aaron D. Sweeney 1, John.
Recommend SSDB FY06 Priorities Oct – Sep Provide access 2.Respond to reviews 3.Add new capabilities 4.Action items SSDB Advisory Board.
Microsoft Research Faculty Summit Natasa Milic-Frayling & Vijay Rajagopalan Microsoft Corporation.
Data Integration and Management A PDB Perspective.
Rolling Deck to Repository II: Getting Control of Provenance and Quality AGU Poster IN43A-1169 AGU Fall Meeting December 17, Stephen.
The Digital Library for Earth System Science: Contributing resources and collections GCCS Internship Orientation Holly Devaul 19 June 2003.
GPO’s Federal Digital System December 10, 2009 U.S. Government Printing Office.
WHOI and SIO (II): Next Steps Towards Multi-Institution Archiving of Shipboard and Deep Submergence Vehicle Data (IN51A-0306) The Woods Hole Oceanographic.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
June 20, 2007ESRI Intl. User Conference Dawn Wright - Oregon State University Val Cummins - Coastal & Marine Resources Centre, IRELAND Liz O’Dea - Coastal.
1 The NSDL Program Stephen Griffin National Science Foundation.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
Preservation Program Digital Preservation Program Digital Preservation Services: Extending tools to meet campus needs Patricia Cruse, Director, Digital.
Collecting History: Profiles in Science Alexa T. McCray National Library of Medicine Bethesda, MD Stanford University August 21, 1999.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Millman—Nov 04—1 An Update on Digital Libraries David Millman Director of Research & Development Academic Information Systems Columbia University
29 March 2004 Steven Worley, NSF/NCAR/SCD 1 Research Data Stewardship and Access Steven Worley, CISL/SCD Cyberinfrastructure meeting with Priscilla Nelson.
Institutional Repositories: the DSpace Experience Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
1 1 NOAA Office of Ocean Exploration End-to-End Data Management: A Success Story NOAA Tech Conference November 2005 Susan Gottfried National Coastal Data.
SIOExplorer Stephen Miller Scripps Institution of Oceanography USA International Data Exchange Workshop Building a Global Data Network for Studies of Earth.
National Archives and Records Administration1 Integrated Rules Ordered Data System (“IRODS”) Technology Research: Digital Preservation Technology in a.
SSDB Progress Report Site Survey Panel Meeting CIRE, Sapporo, Japan July 22, 2006 John Weatherford San Diego Supercomputer Center Subcontract to IODP-MI.
Managing live digital content with DuraSpace services Bill Branan PASIG Spring 2015.
The launching of an expedition has its own brand of excitement, with the sound of the main engines firing up, and the lifting of the gangway in a foreign.
Physical Oceanography Distributed Active Archive Center THUANG June 9-13, 20089th GHRSST-PP Science Team Meeting GHRSST GDAC and EOSDIS PO.DAAC.
Rolling Deck to Repository (R2R): How to Systematically Document Quality for the New Era of Data Re-Usability? AGU Poster IN21B-1048 AGU Fall Meeting December.
The Virtual Observatory and Ecological Informatics System (VOEIS): Using RESTful architecture and an extensible data model to provide a unique data management.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
SIOExplorer: Digital Library Projects R/V Alexander Agassiz November, 1907 UCSD Libraries Scripps Institution of Oceanography San Diego Supercomputer Center.
Data Stewardship Lifecycle A framework for data service professionals Protectors of data.
CONTENTdm A proven solution September A complete digital collection management software solution Stores, manages and provides access for all digital.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Policy-Based Data Management integrated Rule Oriented Data System
Joseph JaJa, Mike Smorul, and Sangchul Song
Institutional Repositories
Technical Issues in Sustainability
Presentation transcript:

Working prototype Multi-Institution Testbed for Scalable Digital Archiving Three institutions are working together to rescue at-risk media, establish interoperability, and provide community access to shipboard and deep submergence vehicle data. The scientific benefits extend beyond these three institutions, as the holdings contain the results of 1600 major research expeditions from dozens of institutions, worldwide. Preservation is also motivated by acquisition cost, at $1M to $1.5M per expedition. This DIGARCH project tests the extension of digital library architectures, establishment of controlled vocabularies, auto-harvesting of metadata, automation of ingest and validation of content. SIO and SDSC contribute technology from the SIOExplorer NSDL project, including data and metadata harvesting, federated digital libraries, and user interfaces, as well as a digital library of 647 SIO cruises with 92,000 digital objects. WHOI contributes GeoBrowser Technologies and GIS Server based applications, and a collection of digital, video, film and paper items from 5000 campaigns to explore the deep sea over 40 years. Stephen Miller (SIO), John Helly (SDSC), Bob Detrick (WHOI) Scripps Institution of Oceanography (SIO), San Diego Supercomputer Center (SDSC), Woods Hole Oceanographic Institution (WHOI) We thank the DIGARCH Program of the National Science Foundation and the Library of Congress for their support (NSF IIS ). SIOExplorer was largely developed as an NSDL Collections Track project (NSF DUE ). OverviewAt-risk data of historic significance Collection building tools Acknowledgements Access and display tools Building a human network Testbed combining WHOI and SIO resources 1.Modify information architecture to enable scalable metadata evolution across institutions. Make greater use of controlled vocabularies. 2.Extend access tools across federated collections. 3.Inventory at-risk media and stage selected content for prototype test. 4.Harvest metadata from data and distributed resources. 5.Publish metadata and data in digital library 6.Adapt video display tools for digital library use. 7.Adapt GIS server for digital library use. Related project Original film archives Alvin nuclear bomb search, 1966 Discovery of 350° C Black Smoker hot vents 20 years of digital tapes, in critical need of migration mtfCreator - design a project Create a metadata template file (mtf) Define digital library structure Design for arbitrary digital object (ado) Data, image, document Metadata blocks Collection, Canonical ADO, domain-specific Controlled vocabularies Dictionaries Allow scalable, flexible changes to project adoHarvest – manage collection building Ingest “ado” objects of all types Automatic recognition of data categories Prioritize selection from alternative resources Auto-harvest metadata from data Scalable for individual objects or mass migrations Graphically monitor status, collection-wide Manage collections at distributed institutions adoQC - quality control QC of data and metadata Used during harvest, and throughout lifecycle Evaluation and maintenance, collection-wide adoCreator - prepare digital library entry Arbitrary digital object (ado) creator Finalize metadata record, synchronized with ado Implement persistent filename Collection access tools Data, images, documents Jason2 ROV Virtual Control Van Alvin submersible Framegrabber DIGARCH is more than storage systems and metadata. This multi- institution testbed is developing a network of computer scientists, researchers, librarians, programmers and students with a wide range of expertise. SIO: Miller, Clark, Gee, Peckman, Symons, Thach SDSC: Helly, Sutton, Weatherford WHOI: Detrick, Chandler, Gaylord, Gegg, Goldsmith, Lemmond, Lerner, Maffei, Tivey, Walden WHOI/MBL Library: Norton, Raymond, Rioux WHOI cruise, Alvin, and Jason2 data are fed into GeoBrowser and GIS Server applications. Working with federated collections. Template-driven, designed for re- use with diverse projects. Metadata and data are harvested from SIO cruises, ingested into the digital library and then accessed with SIOExplorer GUIs. Next Generation IODP Site Survey Data Bank Support international community 1000 scientists, 40 nations 68 proposals online in Digital Library Expedition cost $10-15M Archival lifecycle data access Preliminary ideas Review panels Expedition planning and safety Publication Subcontract to IODP-MI from NSF OCE Sustainable effort 9-year contract Technology shared with DIGARCH Video display tools Integrated with metadata and other sensor streams Java search tool GIS Server Text-based search Prototype working across federated collections Shipboard DataGrabber