Overwhelmed by Large-scale Digitization Projects


Overwhelmed by Large-scale Digitization Projects. Xiaocan (Lucy) Wang, Digital Repository Librarian; Eric Holt, University Archivist. Cunningham Memorial Library, Indiana State University

Agenda: Project background; Implementation (equipment, software choices, process, ingestion); Workflow; Outcome; Lessons learned; Conclusion

Project Background Indiana State University

Project Background: ETD (electronic theses and dissertations); ETD Digital Initiative, 2010 and onward; Access

Project Background (cont.): RTD (retrospective theses and dissertations); Number: 3,802; Where: Archives + Library basement; Condition: most in usable condition, but…; Access

Project Background (cont.): Purposes: centralize ETD & RTD; improve access, search and retrieval; support teaching, learning and research; improve preservation

Project Background (cont.): Considerations: format; copyright; privacy

Equipment Bookdrive DIY

Disclosure: Not currently or previously an employee of the corporations whose products I discuss. I am not compensated for my comments or opinions. Older software version being used.

Capture New Book window

Capture in action

Batch entry

Irfanview

GIMP: Open source equivalent to Photoshop. Batch processing requires additional plugin. Supervisor unfamiliarity.

Photoshop: Can record action to perform batch processing. Graphical interface while setting up recorded action.

Changing DPI

Color Grayscale B/W

PDF Compression: All items being converted are compressed. Some formats compress better than others. Compression artifacts can also become visible.

Original image of page is visible. Searchable text layer is hidden.

First Review All pages present? All text legible? No shadows covering text? Page in focus? Essential color elements retained?
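The review questions above could be recorded per volume with something like the following minimal sketch. The check names mirror the slide; the function name and the idea of storing answers in a dictionary are assumptions for illustration, not the presenters' actual procedure.

```python
# Checklist items taken from the First Review slide.
FIRST_REVIEW_CHECKS = (
    "All pages present",
    "All text legible",
    "No shadows covering text",
    "Pages in focus",
    "Essential color elements retained",
)


def failed_checks(answers):
    """Return the checks that were not answered True.

    `answers` maps a check name to True/False; a missing answer
    is treated as a failure so nothing is skipped silently.
    """
    return [check for check in FIRST_REVIEW_CHECKS if answers.get(check) is not True]


# Example: a volume where one page was out of focus.
example_answers = {check: True for check in FIRST_REVIEW_CHECKS}
example_answers["Pages in focus"] = False
print(failed_checks(example_answers))
```

A volume would move on to PDF/A conversion only when the returned list is empty.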

PDF/A: Copy saved to Archives server. Only accessible to staff.

Final Review and Cleanup: Review metadata; correct if necessary; approve and publish; remove original camera images, processed images, and extra copies of PDF
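The cleanup step could be scripted along these lines. This is a sketch only: the folder names `camera` and `processed`, and the assumption that extra PDF copies sit directly in the volume's working folder, are hypothetical and not taken from the presentation. It defaults to a dry run so nothing is deleted until the list has been checked.

```python
from pathlib import Path
import shutil


def cleanup_volume(volume_dir, dry_run=True):
    """Return the paths that would be (or were) removed for one volume.

    Hypothetical layout: <volume_dir>/camera (original camera images),
    <volume_dir>/processed (processed images), and working *.pdf copies
    directly inside <volume_dir>.
    """
    volume_dir = Path(volume_dir)
    targets = [volume_dir / "camera", volume_dir / "processed"]
    targets += sorted(volume_dir.glob("*.pdf"))

    removed = []
    for path in targets:
        if not path.exists():
            continue
        removed.append(path)
        if dry_run:
            continue  # only report, do not delete
        if path.is_dir():
            shutil.rmtree(path)
        else:
            path.unlink()
    return removed
```

Calling `cleanup_volume(folder)` lists what would go; `cleanup_volume(folder, dry_run=False)` deletes it. It should only be run after the PDF/A copy is confirmed on the Archives server and the item is published.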

Workflow Imaging original theses or dissertations

Workflow (cont.) Processing image files

Workflow (cont.) Converting to PDF/A

Workflow (cont.) Publishing on ISU IR

Outcomes: Volumes finished: 848; Average volume size: 96 pages; Average student time: 1.3 hours; Average supervisor time: 5-10 minutes; Average file size: 5.5 MB; Total disk space: 4.6 GB; Approximate cost: $15-18
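The reported averages can be cross-checked with a little arithmetic. The sketch below uses only the figures from the slides; the projection to all 3,802 retrospective volumes assumes the averages hold for the remaining volumes, which the presentation does not claim.

```python
# Figures reported on the Outcomes slide.
volumes_finished = 848
avg_file_size_mb = 5.5
avg_student_hours = 1.3

# Collection size from the project background slide.
total_rtd_volumes = 3802

# Storage for the finished volumes (decimal units: 1 GB = 1000 MB).
finished_gb = volumes_finished * avg_file_size_mb / 1000

# Student time spent on the finished volumes.
student_hours = volumes_finished * avg_student_hours

# Projection for the whole collection (assumes the averages stay the same).
projected_gb = total_rtd_volumes * avg_file_size_mb / 1000
projected_hours = total_rtd_volumes * avg_student_hours

print(f"Finished: {finished_gb:.2f} GB, {student_hours:.0f} student hours")
print(f"Projected: {projected_gb:.1f} GB, {projected_hours:.0f} student hours")
```

The finished-volume estimate comes out near 4.7 GB in decimal units, which is roughly consistent with the reported 4.6 GB total.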

Worth It? Centralize. Improve access via: digital repository; search engines; digital repository registries; WorldCat

Worth It? (cont.) Support teaching, learning and research. Improve preservation strategies: multiple digital copies; backup; bitstream preservation; distributed preservation network via MetaArchive Cooperative

Lessons learned: Control quality (monochrome and grayscale); supervise students; add MARC 856 field; secure continued funds
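MARC field 856 (Electronic Location and Access) is where a catalog record links to the digitized copy. The sketch below formats one such field as a mnemonic-style text line. The indicator values (first indicator 4 for HTTP, second indicator 0 for the resource itself), the helper name, the note text, and the example URL are all illustrative assumptions; the presentation does not say how the field was populated.

```python
def make_856(url, public_note=None):
    """Build a mnemonic-style MARC 856 line for an online copy.

    Indicators "40" mean: access method HTTP, link is to the resource
    itself. Subfield $u holds the URI and $z an optional public note.
    """
    if not url:
        raise ValueError("url is required for subfield $u")
    line = f"=856  40$u{url}"
    if public_note:
        line += f"$z{public_note}"
    return line


# Hypothetical repository URL, for illustration only.
print(make_856("http://example.org/handle/12345", "Full text in repository"))
```

Adding the field to the existing catalog record lets users who find the print thesis in the catalog or WorldCat reach the digital copy directly.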

Conclusion: Complex. Various issues: in-house vs. outsourcing; funding; technical standards; quality control; format selection; metadata; delivery; preservation; rights management; workflow development

Contact info Xiaocan (Lucy) Wang Xiaocan.wang@indstate.edu Eric Holt Eric.Holt@indstate.edu