SEAD Virtual Archive: Building a Federation of Institutional Repositories for Long-Term Data Preservation in Sustainability Science Beth Plale, Indiana.

Slides:



Advertisements
Similar presentations
Data Publishing Service Indiana University Stacy Kowalczyk April 9, 2010.
Advertisements

The Dryad Data Repository Ryan Scherle 1, Hilmar Lapp 1, Amol Bapat 2, Sarah Carrier 2, Jane Greenberg 2, Peggy Schaeffer 1, Todd Vision 1,3, Hollie White.
S.J. Coles a*, M.B. Hursthouse a, R.A. Stephenson a, P. Cliff b, E. Lyon b, M. Patel b J. Downing c & P. Murray-Rust.
Opening the Research Data Lifecycle Workshop Capturing and Sharing Research Data Simon Coles School of Chemistry, University of Southampton, U.K.
An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
ORCID – Institutional Uses Minimizing contributor disambiguation costs Use-case: MIT Libraries support for OA initiative Need to determine Institute scholarly.
Finding a Software System to Support ETDs Susan Gibbons Digital Initiatives Librarian University of Rochester.
Archiving research data in the cloud or in a local repository Michele Kimpton, CEO DuraSpace CNI Dec 2014.
The Central Role of Data ‘Capturing and Sharing Chemistry Research Data’ Simon Coles School of Chemistry, University of Southampton, U.K.
Technical Framework Charl Roberts University of the Witwatersrand Source: Repositories Support Project (JISC)
Hydra Partners Meeting March 2012 Bill Branan DuraCloud Technical Lead.
Administration & Workflow
University of Southampton, U.K.
Robust Tools for Archiving and Preserving Digital Data Joseph JaJa, Mike Smorul, and Mike McGann Institute for Advanced Computer Studies Department of.
Durable Digital Repositories: The DSpace Project Bill Jordan University Libraries.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation Mike Smorul, Joseph JaJa, Yang Wang, and Fritz McCall.
Supporting Federally Funded Research Requirements with DSpace and SWORD 10 th International Conference on Open Repositories Hui Zhang, Michael Boock Oregon.
Cloud based linked data platform for Structural Engineering Experiment Xiaohui Zhang
DataCite: Making Data Citable Jan Brase (DataCite/TIB Hannover) Brigitte Hausstein (GESIS) Wolfgang Zenk-Möltgen (GESIS)
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
EZID (easy-eye-dee) is a service that makes it simple for digital object producers (researchers and others) to obtain and manage long-term identifiers.
Z EGU Integration of external metadata into the Earth System Grid Federation (ESGF) K. Berger 1, G. Levavasseur 2, M. Stockhause 1, and M. Lautenschlager.
CERN – IT Department CH-1211 Genève 23 Switzerland t CERN Open Source Collaborative tools: Digital Library Software Tim Smith CERN/IT.
DSpace. TM 2 Agenda  Introduction to DSpace  DSpace community  Institutional Repository  Easy to add/find content in DSpace  Building Online Communities.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
Software Sustainability Institute Dealing with software: the research data issues 26 August.
1 Data Description Registry Interoperability (DDRI) Working Group Dimitris Gavrilis, Amir Aryani.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Dermot Frost Digital Repository of Ireland Trinity College Dublin.
Topic Rathachai Chawuthai Information Management CSIM / AIT Review Draft/Issued document 0.1.
EPrints 10 Years of Digital Preservation. What is EPrints For?  EPrints offers a safe, open and useful place to store, share and manage material in the.
BUILDING ON COMMON GROUND: EXPLORING THE INTERSECTION OF ARCHIVES AND DATA CURATION Lizzy Rolando & Wendy Hagenmaier 6/3/2015IASSIST 2015.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
CONTENT DISCOVERY, SERVICES, AND SUSTAINED ACCESS Timothy Cole, William Mischo, Beth Sandore, Sarah Shreeves ~ University of Illinois Library
SEAD Virtual Archive :: A Thin Layer for Scientific Discovery and Long-Term Preservation Inna Kouper April #dlbbspring2013.
Session 3.  Now you know WHY to make policies and WHAT they should contain…  But HOW do you implement policies?  And then HOW do you implement a program.
TWC Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Observatory Community Xiaogang (Marshall) Ma, Yu Chen, Han Wang, Patrick West,
VIVO and Scholarly Repositories: Synergistic Opportunities.
Uganda Scholarly Digital Library (USDL) Makerere University’s Institutional Repository By Margaret Nakiganda URL:
Global Digital Format Registry Progress Andrea Goethals, Harvard University Library NDIIPP Digital Preservation Partners’ Meeting Arlington, VA July 9,
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
The Mint Mapping tool The MoRe aggregator Vassilis Tzouvaras, Dimitris Gavrilis National Technical University of Athens Digital Curation Unit - IMIS, Athena.
Content Challenges for Open Government Dale Waldt Sr. Analyst / Consultant
April 14, 2005MIT Libraries Visiting Committee Libraries Strategic Plan Theme III Work to shape the future MacKenzie Smith Associate Director for Technology.
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
DEEP BLUE University of Michigan Institutional Repository.
Why RDA? A domain repository perspective George Alter ICPSR University of Michigan.
ARIADNE is funded by the European Commission's Seventh Framework Programme Archiving and Repositories Holly Wright.
IUScholarWorks Repository Update Jim Halliday, Stacy Konkiel & Jennifer Laherty.
Managing live digital content with DuraSpace services Bill Branan PASIG Spring 2015.
Open Science (publishing) as-a-Service Paolo Manghi (OpenAIRE infrastructure) Institute of Information Science and Technologies Italian Research Council.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Discover ScholarSphere A repository service collaboration between the University Libraries and ITS.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Bringing visibility to food security data results: harvests of PRAGMA and RDA Quan (Gabriel) Zhou, Venice Juanillas Ramil Mauleon, Jason Haga, Inna Kouper,
RDA US Science workshop Arlington VA, Aug 2014 Cees de Laat with many slides from Ed Seidel/Rob Pennington.
Cloud based linked data platform for Structural Engineering Experiment
What is the National Data Service?
Data Management Agenda
ACS 2016 Moving research forward with persistent identifiers
Flexible Extensible Digital Object Repository Architecture
DIGITAL RESEARCH DATA MANAGEMENT
Flexible Extensible Digital Object Repository Architecture
Implementing an Institutional Repository: Part II
Jisc Research Data Shared Service (RDSS)
Dataverse for citing and sharing research data
Implementing an Institutional Repository: Part II
How to Implement an Institutional Repository: Part II
ArchivesSpace – Archivematica – DSpace Workflow Integration
Presentation transcript:

SEAD Virtual Archive: Building a Federation of Institutional Repositories for Long-Term Data Preservation in Sustainability Science Beth Plale, Indiana University, Bloomington, Indiana, USA Robert H. McDonald, Indiana University, Bloomington, Indiana, USA Kavitha Chandrasekar, Indiana University, Bloomington, Indiana, USA Inna Kouper, Indiana University, Bloomington, Indiana, USA Stacy Konkiel, Indiana University, Bloomington, Indiana, USA Margaret L. Hedstrom, University of Michigan, Ann Arbor, Michigan, USA Jim Myers, Rensselaer Polytechnic Institute, Troy, New York, USA Praveen Kumar, University of Illinois, Urbana, Illinois, USA Cooperative agreement #OCI IDCC 2013 – Amsterdam – Jan. 16, 20131

SEAD TEAMS Margaret Hedstrom-PI, Marietta Van Buhler, Karen Woollams, George Alter (ICPSR), Bryan Beecher (ICPSR) Beth Plale-Co-PI, Katy Börner, Robert H. McDonald, Robert Light, Kavitha Chandrasekar, Stacy Kowalczyk, Inna Kouper, Stacy Konkiel, Robert Ping, Ryan Cobine James Myers-Co-PI, Ram Prasanna Govind Krishnan, Lindsay Todd Praveen Kumar-Co-PI, Terry McLaren (NCSA), Rob Kooper (NCSA), Luigi Marini (NCSA) Michigan Indiana Rensselaear Illinois IDCC 2013 – Amsterdam – Jan. 16, 20132

Challenge: The Data Deluge 1. Scientific data ingestion must be quick and minimally intrusive on a scientist’s time. 2. Ingesting must be flexible enough to handle the varied kinds of data. sizes // formats // composition 3. Tools for advertising and serving data from an institutional repository need to be consistent with tools and processes of the scientific community. IDCC 2013 – Amsterdam – Jan. 16, 20133

Challenge: Long Tail Scientific Research Many research niches – customized methods & toolsets – localized storage Less consideration for long-term availability and data reuse IDCC 2013 – Amsterdam – Jan. 16, 20134

Requirements of Virtual Archive for Sustainability Science Must connect multiple IRs Must be minimally intrusive on a scientist’s time Must handle varied data: – multi-GB collection, – vastly heterogeneous collection of files, – small complex database of a thousand variables, or – set of files in formats that are unique to the subdiscipline Must be consistent with tools and processes of the community IDCC 2013 – Amsterdam – Jan. 16, 20135

SEAD Active Curation Repository (ACR) -- metadata harvest -- annotation -- web tools SEAD VIVO -- social networking -- links data sets and researchers SEAD Virtual Archive (SVA) -- manage sustainability science window to multiple IRs --OAIS model IU Scholarworks IR publish associate discover UIUC IDEALS IR UMich Deep Blue IR ingest IDCC 2013 – Amsterdam – Jan. 16, 20136

Active Curation Repository (ACR) -- metadata harvest -- annotation -- web tools SEAD VIVO -- social networking -- links data sets and researchers SEAD Virtual Archive (SVA) -- manage sustainability science window to multiple IRs --OAIS model SEAD Virtual Archive (SVA) Design Policy Decisions Progress to Date [Single view into data] [Easy deposit] IDCC 2013 – Amsterdam – Jan. 16, 20137

Preview Data Upload Data to VA Run Virus Checking File Charact- erization Mint DOI Deposit to IR (& cloud) Update DOI target Index Metadata Index Scientific Metadata Large Dataset Decision Version Data IR Match- maker Index Scientific Metadata Accept Repository Agreement SEAD Virtual Archive Workflow IDCC 2013 – Amsterdam – Jan. 16, 20138

VIVO IR Matchmaker Client IR Matchmaker Service IR Matchmaker Service Repository Agent IR Match- maker Query for data contributor metadata Return data contributor’s affiliation information VA Load Monitor Agent Query Match Get Match Query for IRs’ details Return all IRs’ details Query VA load Return VA load constraints Architecture: SEAD VA Matchmaker IDCC 2013 – Amsterdam – Jan. 16, 20139

Policy: Licensing Agreements IDCC 2013 – Amsterdam – Jan. 16,

Policy: Licensing Agreements IDCC 2013 – Amsterdam – Jan. 16,

Policy: Licensing Agreements Single-license solution Satisfy all repository requirements Mitigate rights on behalf of depositor Matchmaking solution Connect requirements of: End users Repositories SEAD Virtual Archive IDCC 2013 – Amsterdam – Jan. 16,

Policy: Permanent Identifiers Author IDs VIVO identifiers Dataset IDs Digital Object Identifiers (DOIs) IDCC 2013 – Amsterdam – Jan. 16,

Policy: Author IDs ORCID ResearcherID Scopus Author ID Pivot ID VIVO ID Used primarily at domain/institution al level Supports many researcher ID systems, including ORCID Global system Buy-in from and integration with major publishers and institutions IDCC 2013 – Amsterdam – Jan. 16,

Policy: Dataset IDs HandlesDOIs EZID integration into DSpace Metadata storage Widely used Foundation for DOIs Basis for DSpace PID IDCC 2013 – Amsterdam – Jan. 16,

Progress to Date Ingested all NCED data – Small-sized collection (overall < 150 Mb) – File organization for heterogeneous collection of related files with flat or hierarchical structure Tested deposit between the VA, UIUC IDEALS, and IUScholarWorks IDCC 2013 – Amsterdam – Jan. 16,

Future Work Address other use cases – Large size collections (overall > 1 Gb) – Relational database / interconnected variables – Unique formats (to project, discipline, community) Interoperability with other DataNets Support for API access Determine how prototype fits researcher workflows IDCC 2013 – Amsterdam – Jan. 16,

Thank you Download this presentation at Cooperative agreement #OCI IDCC 2013 – Amsterdam – Jan. 16,