Overview on Sustainable Digital Preservation and Access
eScience Seminar, Max Planck Society, June 20, 2008
Sayeed Choudhury, Johns Hopkins University

eScience Seminar – Max Planck Society – June 20, 2008

Presentation Outline
• Specific case study at Johns Hopkins University involving the Virtual Observatory
• Blue Ribbon Task Force on Sustainable Digital Preservation and Access
• Elements of DRAFT Task Force Report
• Report observations

The Virtual Observatory
• The Virtual Observatory enables new science by greatly enhancing access to data and computing resources.
• The VO makes it easy to locate, retrieve, and analyze data from archives and catalogs worldwide.
• The VO is about data discovery, access, and integration.
• The VO is NOT a huge centralized data repository.
• The VO provides standard protocols for obtaining data from distributed collections.
• The VO is national (US NVO) and international (IVOA).

Data Preservation Problem
• Research communities publish peer-reviewed journal papers that describe highly processed data.
• Long-term preservation and curation systems for digital journal content, including the digital data presented only graphically, are not currently in place.
• The research cannot be verified, and the results cannot be easily compared to other data in order to broaden impact.
• Public funds invested in scientific research do not have maximum return on investment.
• Essential legacy datasets may be lost.

[Diagram: data curation architecture. Three replicated Data Storage Appliances (each holding a metadata database, digital data objects, and ancillary information) are connected by replication services and VOSpace. Surrounding components: Publication & Editorial Process (data capture, metadata capture & validation, links, identifiers); Data Access (VO portals, journal portals, other after-market distributors); Registry; Logging; Library (curation, preservation).]

Open Archives Initiative – Object Reuse and Exchange

Blue Ribbon Task Force on Sustainable Digital Preservation and Access (BRTF-SDPA)
• Multi-disciplinary group with funding from the US National Science Foundation and the Andrew W. Mellon Foundation
• The UK Joint Information Systems Committee nominated two representatives
• In-kind or staff support from the Library of Congress, the Council on Library and Information Resources, the US National Archives and Records Administration, and NITRD
• Two-year program of activities that began in January 2008 with a kick-off meeting in Washington, DC

Task Force Participants

Blue Ribbon Task Force:
• Paul Ayris, University College London
• Fran Berman, SDSC/UCSD
• Bob Chadduck, NARA Liaison
• Sayeed Choudhury, Johns Hopkins University
• Elizabeth Cohen, AMPAS/Stanford
• Paul Courant, University of Michigan
• Lee Dirks, Microsoft
• Amy Friedlander, CLIR
• Chris Greer, NITRD Liaison
• Vijay Gurbaxani, UC Irvine
• Anita Jones, University of Virginia
• Ann Kerr, Consultant
• Brian Lavoie, OCLC
• Cliff Lynch, CNI
• Dan Rubinfeld, UC Berkeley
• Chris Rusbridge, DCC
• Roger Schonfeld, Ithaka
• Abby Smith, Consultant
• Anne Van Camp, Smithsonian

Sponsoring Agencies/Institutions:
• National Science Foundation
• Mellon Foundation
• Library of Congress
• National Archives and Records Administration
• CLIR
• NITRD
• JISC
• Member institutions

Specific Responsibilities:
• Fran Berman / co-Chair
• Amy Friedlander / First Report Editor
• Ann Kerr / Panel Rapporteur
• Brian Lavoie / co-Chair
• Susan Rathbun / Task Force Support
• Abby Smith / Second Report Editor
• Jan Zverina / Communications Lead
• Lucy Nowell / NSF Program Officer
• Don Waters / Mellon Program Officer
• Laura Campbell, Martha Anderson / LC representatives

Charge to the Task Force
1. To conduct a comprehensive analysis of previous and current efforts to develop and/or implement models for sustainable digital information preservation
2. To identify and evaluate best practice regarding sustainable digital preservation among existing collections, repositories, and analogous enterprises
3. To make specific recommendations for actions that will catalyze the development of sustainable resource strategies for the reliable preservation of digital information
4. To provide a research agenda to organize and motivate future work

Key Areas for Recommendations
• What are appropriate roles and responsibilities for institutions in the international, federal, academic, commercial, and non-profit/foundation sectors? What special capabilities does each of these sectors bring to the table, and what are their inherent limitations?
• What are the appropriate roles and responsibilities for data authors, data users, and community groups? What incentives exist or are needed to encourage and enable data authors to deposit data and metadata for preservation and reuse?
• What sustainability models exist, can be adapted, and/or should be investigated to support long-term, sustainable digital information preservation? What cost/benefit analysis models exist or should be developed to evaluate these institutions and their operational models? What incentives exist or are needed to encourage and enable institutions to preserve digital information over the long term?
• How can we characterize long-term digital preservation as an economic activity, and what models can we bring to bear to understand its characteristics and policy implications? What are the alternative strategies for organizing digital preservation capacity (e.g., centralized services, distributed local capacity, etc.), and what are the pros and cons of each?

Deliverables
• First Year Report (positive, “what is”):
  • Describe past and current models (case studies, etc.)
  • Identify points of convergence/divergence; “lessons learned”
  • What we know so far, and what our key knowledge gaps are
• Second Year Report (normative, “what should be”):
  • General cost framework: key cost categories of digital preservation
  • Set of economic models/“scenarios”: alternate ways of organizing digital preservation activities, within the context of the cost framework
  • Describe each model: features, pros, cons, trade-offs, etc.
  • List real-world conditions for which each model is best suited: “If your digital preservation context is X, we recommend you consider using model Y to organize your activities in a sustainable way.”
• TF Outreach:
  • Community web resource and bibliography
  • Articles designed to enlighten and broaden the community of stakeholders

DRAFT Initial Report
• Please keep in mind that the report is still in DRAFT format and the Task Force is still reviewing the document.
• Definition of Economic Sustainability: The set of business, social, technological, and policy mechanisms that 1) encourage the gathering of important information assets into digital preservation systems, and 2) support the indefinite persistence of the digital preservation systems, thus securing access to and use of the information assets into the long-term future.

Economic Sustainability (continued)
Economically sustainable digital preservation requires:
• Recognition of the benefits of preservation on the part of key decision-makers, as part of a process of selecting digital materials for long-term retention;
• Appropriate incentives to induce decision-makers to act in the public interest;
• Mechanisms to secure an ongoing allocation of resources, both within and across organizations, to digital preservation activities;
• Efficient use of limited preservation resources;
• Appropriate organization and governance of digital preservation activities.

Key Themes
• The definition of economic sustainability is meant to be detailed enough to scope the work, yet general enough to be applicable in most contexts.
• The report focuses on sustainable economic models for digital materials for which there is a clear public interest.
• Several concepts should be considered over the entire life cycle of content, including stakeholders, costs (both start-up and ongoing), value, incentives, and organizational frameworks.

Literature Review
• Focus on economic dimensions of digital preservation (not a complete review of all digital preservation literature)
• While there are other studies, reports, etc., seven major studies are examined in greater depth:
  1. Roquade Project
  2. Harvard Depository and OCLC, Inc. Digital Archive
  3. Sweden’s Riksarkivet/National Archives
  4. Digital Preservation Testbed, Nationaal Archief of the Netherlands
  5. Digital Motion Pictures (Science and Technology Council of the Academy of Motion Picture Arts and Sciences)
  6. LIFE Project
  7. Beagrie-Chruszcz-Lavoie (BCL) Cost Model

Report (DRAFT!) Observations
• Different studies are difficult to compare and contrast, given their diverse definitions of costs, units of measurement, scope, and accounting methods.
• The most recent studies, such as LIFE and the BCL cost model, attempt to account for the entire life cycle of costs and for intertemporal considerations (e.g., inflation/deflation, interest rates).
• It is important to acknowledge costs associated with human dimensions (e.g., management) and with the infrastructure elements of storage (e.g., power and cooling).
• “Format matters; scale matters”
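
To illustrate what "intertemporal considerations" can mean in a life-cycle cost estimate, the following is a minimal sketch of a standard present-value calculation. It is not the formula used by LIFE or the BCL cost model; the function name, parameters, and sample figures are all hypothetical and chosen only for illustration.

```python
def present_value(yearly_costs, discount_rate, inflation_rate=0.0):
    """Return the present value of a stream of yearly preservation costs.

    yearly_costs   -- costs in today's prices, one entry per year, year 0 first
                      (hypothetical input, not taken from any cited study)
    discount_rate  -- yearly interest/discount rate, e.g. 0.05 for 5 %
    inflation_rate -- yearly price inflation (negative for deflation)
    """
    total = 0.0
    for year, cost in enumerate(yearly_costs):
        # Inflate the cost to the price level of the year it is paid in.
        nominal_cost = cost * (1 + inflation_rate) ** year
        # Discount that future payment back to today.
        total += nominal_cost / (1 + discount_rate) ** year
    return total


if __name__ == "__main__":
    # Hypothetical example: a start-up cost followed by ongoing yearly costs.
    costs = [100.0, 20.0, 20.0, 20.0]
    print(present_value(costs, discount_rate=0.05, inflation_rate=0.02))
```

With a discount rate and inflation rate of zero, the function reduces to a plain sum of costs, which makes the effect of the two rates easy to see by comparison.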

Acknowledgements
• Robert Hanisch for Virtual Observatory slides
• US Institute of Museum and Library Services and Microsoft for funding related to Virtual Observatory data curation prototype development
• Fran Berman and Brian Lavoie for BRTF-SDPA slides
• Members of BRTF-SDPA for contributions to draft initial report
• Max Planck Society for invitation