Ronald C. Jantz Government & Social Sciences Data Librarian Scholarly Communication Center Rutgers University Libraries Delivering Unique Numeric Data.

Slides:



Advertisements
Similar presentations
MICS4 Survey Design Workshop Multiple Indicator Cluster Surveys Survey Design Workshop Data Archiving.
Advertisements

Strategic issues for digital projects... …or, what are we doing here?
Strategic issues for digital projects... …or, what are we doing here?
CLEARSPACE Digital Document Archiving system INTRODUCTION Digital Document Archiving is the process of capturing paper documents through scanning and.
Digital Content Solutions Digital content management technology has transformed the way to manage content and knowledge, in this knowledge era. Research.
CNRIS CNRIS 2.0 Challenges for a new generation of Research Information Systems.
Tom Sheridan IT Director Gas Technology Institute (GTI)
Building Digital Library using DSpace Dr. M.Krishnamurthy, Librarian Indian Statistical Institute 8 th Mile Mysore Road R.V.College Post Bangalore
1 Archiving Workflow between a Local Repository and the National Library Archive Experiences from the DiVA Project Eva Müller, Peter Hansson, Uwe Klosa,
R.Jantz, August 31, Two-day forum on PREMIS Preservation Metadata and the Trusted Digital Repositories August 31, September 1 National Library of.
May , IASSIST 2006 May Ann Arbor, MI Ronald C. Jantz Rutgers University Libraries RUtgers COmmunity REpository (RUcore) A FEDORA-based.
Digital Libraries, R. Jantz - Feb. 26, Digital Preservation - Outline Introduction - Definitions, Facts, Challenges Digital Archiving – A Life Cycle.
Current Thinking on Digital Preservation: Role of Metadata Oya Y. Rieger Coordinator, Library Office of Distributed Learning Cornell University Library.
Rutgers University Libraries What is RUcore? o An institutional repository, to preserve, manage and make accessible the research and publications of the.
1 Planning And Electronic Records Issues For Electronically Enhanced Courses Jeremy Rowe Nancy Tribbensee
© Anselm SpoerriInfo + Web Tech Course Information Technologies Info + Web Tech Course Anselm Spoerri PhD (MIT) Rutgers University
1 The Australian Partnership for Sustainable Repositories Margaret Henty Digital Futures Industry Briefing November 8, 2006.
NHPRC ELECTRONIC RECORDS RESEARCH FELLOWSHIP SYMPOSIUM Nov. 19, 2004 Rebecca Schulte University of Kansas Project Title: Testing Boundaries—An Exploration.
Introduction to Implementing an Institutional Repository Delivered to Technical Services Staff Dr. John Archer Library University of Regina September 21,
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
Digital Libraries - R. Jantz, Feb. 27, Topics in Digital Libraries Introduction – Perspectives on Management & Roles The Scholarly Communication.
Archiving the Web: the PANDORA archive at the National Library of Australia Preserving the Present for the Future Copenhagen, June 2001 Warwick Cathro,
Introduction and Conceptual Modeling
Improving access to digital resources: a mandate for order mandate: managing digital assets in tertiary education craig green,
Document Delivery Formats for the Web and Legal Digital Collections Kevin Reiss June 18 th, 2004 Law Library Rutgers-Newark School of Law.
Annick Le Follic Bibliothèque nationale de France Tallinn,
Digital Library Architecture and Technology
World Bank, Africa Region, Africa Household Survey Databank - The World Bank - Africa.
Collaboration and Content Customer solution case study The Yaroslavl region Government creates knowledge base of public authorities of the Yaroslavl region.
6/1/2001 Supplementing Aleph Reports Using The Crystal Reports Web Component Server Presented by Bob Gerrity Head.
Project web site: old.libqual.org LibQUAL+™ from a Technological Perspective: A Scalable Web-Survey Protocol across Libraries Spring 2003 CNI Task Force.
Copyright © cs-tutorial.com. Introduction to Web Development In 1990 and 1991,Tim Berners-Lee created the World Wide Web at the European Laboratory for.
Statewide Digitization and the FCLA Digital Archive Priscilla Caplan, Florida Center for Library Automation Statewide Digitization Planners Meeting OCLC,
1 Meeting on the Management of Statistical Information Systems (MSIS 2010) (Daejeon, Republic of Korea, April 2010) NIS ICT Strategy in the Production.
Changing the culture: Ethiopia’s commitment to dissemination and the multi-media approach By Yakob Mudesir Seid
WORKFLOWS AND OTHER CONSIDERATIONS FOR DIGITIZATION  Steve Bingo  Processing Archivist Washington State University Libraries  Alex Merrill  Assistant.
Annick Le Follic Bibliothèque nationale de France Tallinn,
Research Services Introduction to research data management - a humanities case study Slides provided by DaMaRO Project, University of Oxford.
Cataloguing Electronic resources Prepared by the Cataloguing Team at Charles Sturt University.
How to build your own Dark Archive (in your spare time) Priscilla Caplan FCLA.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
Investing in the Long-Term Viability of British Columbia’s Digital Collections A presentation to the Steering Committee of the B.C. Digitization Coalition.
11-15 April 2011 Mauritius Institute of Health S.S.Pillai
Scientific Data and Electronic Publishing Renze Brandsma, Head, Digital Production Centre University of Amsterdam Maarten Hoogerwerf, Project Manager,
Elements of a Data Management Plan: Roles and Responsibilities Ruth Duerr National Snow and Ice Data Center Version 1.0 Review Date.
E.Soundararajan R.Baskaran & M.Sai Baba Indira Gandhi Centre for Atomic Research, Kalpakkam.
Introduction to metadata
Tsinghua University Library Yang Zhao & Airong Jiang Tsinghua University Library, Beijing China 4 June, 2004 Electronic Thesis and Dissertation System.
UNIZULU INSTITUTIONAL REPOSITORY GATEWAY TO LOCAL CONTENT.
● A system of Internet servers that support specially formatted documents. The documents are formatted in a markup language called HTML. What is the World.
Building an Infrastructure for Digital Humanities: Issues and Considerations Peter Zhou 周欣平 University of California, Berkeley October 8, 2009.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
Millman—Nov 04—1 An Update on Digital Libraries David Millman Director of Research & Development Academic Information Systems Columbia University
Web Server.
DSpace - Digital Library Software
Dispatching Java agents to user for data extraction from third party web sites Alex Roque F.I.U. HPDRC.
1/16/2016I. Revels Digital Imaging Workshop 1 Selection Considerations For Digital Imaging Projects.
ANALYSIS PHASE OF BUSINESS SYSTEM DEVELOPMENT METHODOLOGY.
Library Online Resource Analysis (LORA) System Introduction Electronic information resources and databases have become an essential part of library collections.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Chang, Wen-Hsi Division Director National Archives Administration, 2011/3/18/16:15-17: TELDAP International Conference.
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
Open Exeter Project Team
Country Report: Innovation of Library Services at the National University of Laos through mobile Technologies. Chansy Phuangsouketh Director Central.
An Overview of Data-PASS Shared Catalog
Introduction to Implementing an Institutional Repository
DIGITAL LIBRARY.
Unit# 5: Internet and Worldwide Web
Executive Sponsor: Tom Church, Cabinet Secretary
Executive Sponsor: Tom Church, Cabinet Secretary
Presentation transcript:

Ronald C. Jantz Government & Social Sciences Data Librarian Scholarly Communication Center Rutgers University Libraries Delivering Unique Numeric Data on the Web (Projects, Platforms, and Preservation)

R. Jantz - May 18, Delivering Unique Numeric Data on the Web Introduction – Perspectives and Issues Digital Projects and the Scholarly Communication Center Projects, Platforms and Preservation

R. Jantz - May 18, Perspectives and Issues Re-usable platforms (either technology or process) can dramatically reduce development time and improve quality.  How do we establish and sustain re-usable platforms in an academic environment? Digital preservation  A scenario: A truck loaded with hazardous waste is headed toward a dump site. Will our descendants know where we have buried the waste? Unique projects: Those that have specific relevance to Rutgers University and New Jersey

R. Jantz - May 18, The Scholarly Communication Center (Rutgers University Libraries) Goals Allow scholars to apply state-of-the-art technology Teach and demonstrate the latest electronic tools Share the resources of the library  The SCC has given us an opportunity to experiment and innovate.

R. Jantz - May 18, Environment in the SCC A Windows2000/NT Network A Social Sciences Data Center (with 12 workstations) A digital preservation laboratory (under construction) – Large network mass storage (terabytes) – Scanners, including large format scanner (40 inch wide) & digital camera – Large format printer – Image compression software (e.g. djvu from AT&T & LizardTech) Staff – 2 resident librarians, 3 staff and a staff manager – On average, 10 part-time students Work areas for special projects

R. Jantz - May 18, SCC Project Goals Develop platforms that can be quickly learned by students and part-time employees. Encourage re-use to improve quality, reduce development time, and facilitate training. Establish project classes for reusable platforms: directories of people, reference databases, image archives, numeric data, online surveys. Define end-to-end processes for access and preservation

R. Jantz - May 18, A Sampling of SCC Projects Databases/Archives on the Web The Alcohol Studies Database (with the Center for Alcohol Studies) A reference database at: The New Jersey Environmental Digital Library (with NJ DEP) An image archive at: Medieval Early Modern Data Bank (with History Department) Numeric data at: Public Opinion Data (with Eagleton Institute) Numeric data at: For more – see “Digital Projects” at

R. Jantz - May 18, Eagleton Public Opinion Polls (Delivering numeric data on the web) Characteristics: Prototype at: Content: New Jersey public opinion ( ) Frequency: four polls per year Access: public domain Compiler: Eagleton Institute Owner: Eagleton/Star Ledger Archiver: RUL/Scholarly Communication Center Type: database on the Web Format: html, pdf, portable spss files, MS-Access, ColdFusion/SQL

HTML & script output Request to access Web Internet Internet Info. Server Web Server Cold Fusion Database Access SPSS Server Server (NT)Desktop WWW Browser Search/Browse Quest. Database Display/retrieve questionnaire Retrieve numeric data Display statistical results Questionnaire Database Archive Questionnaire Documents Numeric Data Files Eagleton Project Architecture Import Scripts

R. Jantz - May 18, The Challenges of Digital Preservation Lack of standards (or too many standards) Lack of documentation on production and use Cost and rapid obsolescence of technology Impermanence of the medium Content easily changed – legal issues Version control Need to guarantee integrity of digital information Migration of information (driven by external factors)

R. Jantz - May 18, Archiving Eagleton Poll Data In addition to daily and offsite backups, We are archiving essential data in the least device and software dependent format. Objective: to be able to regenerate the website in another hardware and operating system environment (perhaps in another technological epoch).

R. Jantz - May 18, Eagleton Polls – What is to be Archived? Preservation Format Ascii text Ascii text (data & syntax) Ascii text Archived Unit 1. Website 2. Questionnaires 3. Ref. Database 4. Numeric Data 5. Processes 6. Metadata Presentation Format HTML, coldfusion, sql Adobe PDF MS-Access Spss export Readme (ascii text) HTML

R. Jantz - May 18, Preservation Metatdata for Digital Collections* Collection – Eagleton Public Opinion Polls - Questionnaires 1. Persistent identifier: 2. Date of creation: 3. Structural type: ascii text 4. Technical infrastructure: 130 files in ascii text format, one file for each poll 5. File description 6. System requirements: 7. Installation requirements: 8. Storage information: 9. Access inhibitors: 10. Access facilitators: 11. Preservation action permission: 12. Validation: (information about validation mechanism) 13. Relationships (to other objects): 14. Quirks: (any characteristic that may cause loss in funtionality) 15. Archiving decision (work): 16. Decision reason (work): 17. Institution responsible for archiving decision: 18. Archiving decision (manifestation): 19. Decision reason (manifestation): * (from National Library of Australia: )

R. Jantz - May 18, Summary: What are we learning? To take full advantage of platform technology, we need to formalize re-use – processes, platform components and training: reference databases, numeric data, online surveys, digital archives, and directories. For numeric data, we should be able to quickly extend usage beyond the researcher to those who don’t normally have access to data. End-to-end process definition is critical, especially for successful long term preservation.