“Think. Learn. Succeed.” Ver 1.2 Methods for Knowledge Management & Digital Preservation The Theory and Practice of Digital History Carl A. Young, M.A.

Presentation transcript:

“Think. Learn. Succeed.” Ver 1.2
Methods for Knowledge Management & Digital Preservation
The Theory and Practice of Digital History
Carl A. Young, M.A. in waiting
1 December 2009

Project Overview
Challenge:
Resource- and skill-constrained historians and archivists require efficient methods for capturing, analyzing, and sharing original artifacts.
Methodology:
- Multi-phase project
- Develop a low-cost process for digitally archiving documents
- Store them in a standards-based data storage platform
- Set the conditions to scale with future phases
- Create a collaborative, accessible, online digital repository
Major Phases:
- Phase I: Prototyping
- Phase II: Capture
- Phase III: Web Access
- Phase IV: Initial Expansion
- Phase V: Infinite Expansion

Phase I: Prototype
Completed in November 2009, this phase established a usable, affordable methodology for project development by prototyping the capture and conversion of an original artifact for testing and exploration purposes.

Phase I: Prototype (cont.)
Demonstration
- Original: digital camera, .JPG file format, 2 MB
- Treatment w/ Photoshop: .TIFF, 29 MB
- Adobe conversion: .pdf, 278 KB
Time elapsed:
- Photo: <1 min
- Treatment: ~3 min
- Conversion: <1 min


Phase I: Prototype (cont.)
[Figure: Process Flowchart, with legend]

Phase II: Capture
Completed in November 2009, this phase performed and documented a low-budget workflow for document capture, artifact preservation, and conversion to a distributable format. A historic text is extracted from the original document, archived, and presented to the user in both the original capture format (.jpg or .tiff) and the distributable formats (.pdf and .xml). The phase also evaluated optical character recognition (OCR) and transcription requirements.

Phase II: Capture (cont.)
Image Treatment
[Figure: flowchart of the image treatment steps: Select Area, Image Adjustments (Curves "Digitization"), Filter Blur (Smart Blur, Surface Blur, Lens Blur), Select Color Range, Modify Expand, Cut, File New, Paste, Flatten, Clean up as needed, Save As .TIFF]
Notes on the chart:
- The second Surface Blur pass (Radius 100, Threshold 25) is applied only if needed.
- Recommend saving the File New settings (Width 1600, Resolution 300, CM RGB 16bit) as a preset.

Phase II: Capture (cont.)
Image Treatment
Select Area
Image
- Adjustments
- Curves "Digitization": Channel RGB, Output 203, Input 160
Filter
- Blur
- Smart Blur: Radius 100, Threshold 100, Quality High, Mode Normal
- Surface Blur: Radius 100, Threshold 25
- Surface Blur: Radius 100, Threshold 25
- Lens Blur: Shape Octagon, Radius 5, Blade Curve 50, Rotation, Brightness 10, Threshold 75, Noise 3, Distro Uniform
Select
- Color Range: Shadows, No Invert
- Modify: Expand 2
Cut
New File: Width 1600, Height, Resolution 300, CM RGB 16bit, CP Pix - Sq
Paste
Flatten
Delete
Clean up as needed
Save As .TIFF
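
The Curves step in the list above moves input level 160 to output level 203 on the RGB channel. As a rough illustration of what that does to pixel values, here is a minimal Python sketch. It assumes a piecewise-linear curve through (0, 0), (160, 203), and (255, 255); Photoshop fits a smooth curve through its control points, so this only approximates the "Digitization" preset. The function names are hypothetical and are not part of the presented workflow.

```python
def build_curve_lut(control_in=160, control_out=203):
    """Return a 256-entry lookup table for a one-point tone curve.

    Assumption: straight segments from (0, 0) to the control point and
    from the control point to (255, 255).
    """
    lut = []
    for level in range(256):
        if level <= control_in:
            mapped = level * control_out / control_in
        else:
            mapped = control_out + (level - control_in) * (255 - control_out) / (255 - control_in)
        lut.append(round(mapped))
    return lut


def apply_curve(pixels, lut):
    """Apply the same lookup table to every channel of every RGB pixel."""
    return [tuple(lut[channel] for channel in pixel) for pixel in pixels]


if __name__ == "__main__":
    lut = build_curve_lut()
    # A mid-gray paper tone is lifted toward white; pure black and white stay put.
    print(apply_curve([(0, 0, 0), (160, 160, 160), (255, 255, 255)], lut))
```

Because the control point sits above the diagonal, the curve brightens midtones, which is consistent with the goal of pushing aged paper toward white before the shadows (the ink) are selected.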

Phase II: Capture (cont.)
OCR and Transcription Demo
Time elapsed:
- OCR: <1 min
- Transcription: ~5 min

[Figure panels: OCR, Transcription]

Phase II: Capture (cont.)
TEI Demo
Time elapsed:
- Preliminary Data: ~45 min
- Page: ~5 min
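
The slides do not show the TEI markup itself, so the following is only a sketch of what the one-time preliminary data (the header) and the per-page encoding might look like. It uses Python's standard xml.etree.ElementTree and the TEI P5 namespace. The choice of elements (a minimal teiHeader, then one pb and one p per page), the function name, and the file names are assumptions for illustration, not the author's encoding scheme.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
ET.register_namespace("", TEI_NS)  # serialize TEI as the default namespace


def tei(tag):
    """Qualify a tag name with the TEI namespace."""
    return f"{{{TEI_NS}}}{tag}"


def build_tei(title, source_note, pages):
    """Build a minimal TEI document.

    pages is a list of (image_file, transcription) pairs, one per page.
    """
    root = ET.Element(tei("TEI"))

    # Preliminary data: written once per document.
    header = ET.SubElement(root, tei("teiHeader"))
    file_desc = ET.SubElement(header, tei("fileDesc"))
    title_stmt = ET.SubElement(file_desc, tei("titleStmt"))
    ET.SubElement(title_stmt, tei("title")).text = title
    publication = ET.SubElement(file_desc, tei("publicationStmt"))
    ET.SubElement(publication, tei("p")).text = "Unpublished working transcription."
    source = ET.SubElement(file_desc, tei("sourceDesc"))
    ET.SubElement(source, tei("p")).text = source_note

    # Per-page data: a page break pointing at the capture, then the text.
    text = ET.SubElement(root, tei("text"))
    body = ET.SubElement(text, tei("body"))
    for number, (image_file, transcription) in enumerate(pages, start=1):
        ET.SubElement(body, tei("pb"), {"n": str(number), "facs": image_file})
        ET.SubElement(body, tei("p")).text = transcription
    return root


if __name__ == "__main__":
    document = build_tei(
        "Militiaman's Guide",
        "Transcribed from digital captures of the printed original.",
        [("page001.tiff", "First page text."), ("page002.tiff", "Second page text.")],
    )
    print(ET.tostring(document, encoding="unicode"))
```

Generating the header from a template like this is one way the ~45 minutes of preliminary work could be reduced for later documents, leaving only the per-page transcription as manual effort.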

Phase II: Capture (cont.)
[Figure: Methodology Flow Chart, with legend]

Phase II: Capture (cont.)
Labor Estimates
Case: Militiaman’s Guide
- 155 pages total, type text, fair condition
- 40 hours (optimal) / 5 GB
Per Page Estimates
- Photography: ~30 sec, 2.5 MB (5 Mpxl)
- .tiff Conversion: ~3 min, 23 MB
- .pdf Conversion: ~1 min, 300 KB
- OCR: ~45 sec
- Error Correction/Transcription: ~5 min
- TEI: ~5 min (~45 min overhead)
Case Estimates
- Photography: ~1:15, ~400 MB
- .tiff Conversion: ~7:45, 3.5 GB
- .pdf Conversion: ~2:30, 50 MB
- OCR: ~2 hours
- Error Correction/Transcription: ~13 hrs
- TEI: ~14 hrs
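
The case estimates follow from multiplying the per-page figures by the 155 pages, with the TEI overhead added once. A small Python sketch of that arithmetic is below. The dictionary keys and function names are hypothetical, 300 KB is treated as 0.3 MB, and the per-page values are the ones on the slide.

```python
PAGES = 155

# Per-page labor in seconds, from the slide.
PER_PAGE_SECONDS = {
    "photography": 30,
    "tiff_conversion": 3 * 60,
    "pdf_conversion": 60,
    "ocr": 45,
    "transcription": 5 * 60,
    "tei": 5 * 60,
}
TEI_OVERHEAD_SECONDS = 45 * 60  # one-time preliminary data

# Per-page storage in megabytes, from the slide (300 KB taken as 0.3 MB).
PER_PAGE_MB = {"photo": 2.5, "tiff": 23.0, "pdf": 0.3}


def labor_hours(pages=PAGES):
    """Return estimated hours per step for a document of the given length."""
    hours = {step: seconds * pages / 3600 for step, seconds in PER_PAGE_SECONDS.items()}
    hours["tei"] += TEI_OVERHEAD_SECONDS / 3600
    return hours


def storage_mb(pages=PAGES):
    """Return estimated storage per file type, in megabytes."""
    return {kind: size * pages for kind, size in PER_PAGE_MB.items()}


if __name__ == "__main__":
    hours = labor_hours()
    for step, value in hours.items():
        print(f"{step}: {value:.2f} h")
    print(f"total labor: {sum(hours.values()):.1f} h")
    print(f"total storage: {sum(storage_mb().values()):.0f} MB")
```

With these inputs the total comes to just over 40 hours, matching the slide's optimal figure, and to roughly 4,000 MB of storage, somewhat under the 5 GB the slide quotes. The individual steps land close to the rounded case estimates (for example, 7.75 hours for the .tiff conversion).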

Equipment Baseline
Consumer-grade HP 5 Mpxl digital camera ($125)
Slightly above consumer-grade PC ($1100)
- 4 GB RAM
- 1 GB VRAM
- 500 GB SATA HD
- Dual screens
Consumer software ($600)
- Adobe Creative Suite 3

Lessons Learned
- Use a tripod/mount
- Use consistent lighting
- Safely flatten pages as much as possible
- Use a mounting frame
- Use the highest resolution available
- OCR is NOT reliable
- Need an efficient method for TEI
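
The slides do not say how OCR reliability was judged. One simple way to put a number on it for a sample page is to compare the raw OCR text with the corrected transcription. The sketch below uses Python's standard difflib for this; the function names and the 0.98 threshold are assumptions for illustration only.

```python
import difflib


def ocr_similarity(ocr_text, transcription):
    """Return a 0.0 to 1.0 similarity ratio between OCR output and the
    hand-corrected transcription (1.0 means identical)."""
    return difflib.SequenceMatcher(None, ocr_text, transcription).ratio()


def needs_manual_correction(ocr_text, transcription, threshold=0.98):
    """Flag a sample page whose OCR output falls below the chosen ratio.

    The threshold is a hypothetical cut-off, not a figure from the project.
    """
    return ocr_similarity(ocr_text, transcription) < threshold


if __name__ == "__main__":
    ocr = "The Mi1itiaman's Gu1de"
    corrected = "The Militiaman's Guide"
    print(f"similarity: {ocr_similarity(ocr, corrected):.3f}")
    print("needs correction:", needs_manual_correction(ocr, corrected))
```

Since the comparison needs a corrected transcription, it is only useful on a handful of sample pages, for instance to decide whether a change in lighting or resolution improved OCR enough to shorten the ~5 minutes of correction per page.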

Phase III: Web-Access
This phase is the subject of this grant funding request. A team of professional developers will construct a suitable multi-media database for storage and access of original artifact captures, distributable .pdf versions, and XML-based data and metadata derived from the original. The team will also develop a working prototype web site to access the data. Fundamental to this phase will be data archiving and disaster recovery for the data. Successful conclusion of this phase will yield a working version 1.0 available for release and continued development.
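
The database design is left to the Phase III development team, so the following is only a sketch of the idea. It shows a small catalog that links each artifact to its capture, .pdf, and .xml files and records a checksum for each file so that a restored backup can be verified. It uses Python's standard sqlite3 and hashlib modules. The table names, column names, and functions are all hypothetical.

```python
import hashlib
import sqlite3

SCHEMA = """
CREATE TABLE artifact (
    id    INTEGER PRIMARY KEY,
    title TEXT NOT NULL
);
CREATE TABLE stored_file (
    id          INTEGER PRIMARY KEY,
    artifact_id INTEGER NOT NULL REFERENCES artifact(id),
    page        INTEGER,
    role        TEXT NOT NULL CHECK (role IN ('capture', 'pdf', 'xml')),
    path        TEXT NOT NULL UNIQUE,
    sha256      TEXT NOT NULL
);
"""


def open_repository(db_path=":memory:"):
    """Create the catalog tables in a new SQLite database."""
    conn = sqlite3.connect(db_path)
    conn.executescript(SCHEMA)
    return conn


def add_artifact(conn, title):
    """Register an artifact and return its id."""
    return conn.execute("INSERT INTO artifact (title) VALUES (?)", (title,)).lastrowid


def add_file(conn, artifact_id, page, role, path, content):
    """Catalog one file and store the SHA-256 digest of its bytes."""
    digest = hashlib.sha256(content).hexdigest()
    conn.execute(
        "INSERT INTO stored_file (artifact_id, page, role, path, sha256) VALUES (?, ?, ?, ?, ?)",
        (artifact_id, page, role, path, digest),
    )
    return digest


def verify_file(conn, path, content):
    """Return True if the bytes match the digest recorded for this path."""
    row = conn.execute("SELECT sha256 FROM stored_file WHERE path = ?", (path,)).fetchone()
    return row is not None and row[0] == hashlib.sha256(content).hexdigest()


if __name__ == "__main__":
    conn = open_repository()
    guide = add_artifact(conn, "Militiaman's Guide")
    add_file(conn, guide, 1, "capture", "guide/page001.tiff", b"example bytes")
    print(verify_file(conn, "guide/page001.tiff", b"example bytes"))
```

Keeping the large image files on disk and only their paths and digests in the database is one common arrangement; a disaster recovery test can then restore the files from backup and run verify_file over every row.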

Phase III: Web-Access (cont.)
[Figure: Flow Chart]

Phase III: Web-Access (cont.)
Work Breakdown Structure
- Database Development
- Prototype Evaluation
- Prototype Web Development
- Alpha Test & Mod
- Beta Test & Mod
- RC1 Test & Mod
- v1.0
- Documentation
- Disaster Recovery Testing
Estimated Cost: $52,000

Phase III: Web-Access (cont.)
[Figure: Project Gantt Chart]

Phase IV: Initial Expansion
Beyond the scope of this grant request, this phase seeks to develop partnerships and data shares across multiple institutions with similar projects in development or production. The level of participation directly influences the scale of this phase. It is anticipated that the minimal costs will be shared across participating institutions.

Phase IV: Initial Expansion (cont.)
Work Breakdown Structure
- Conduct Lifecycle Management Review
- Documentation
- Disaster Recovery Testing
- Publish Methodology
- Find Partners
- Large Scale Capture
- Leverage v1.0
- Update Code and Processes
Estimated Cost: $8,000

Phase V: Infinite Expansion
Optionally, and depending on the success of the earlier phases, this phase will greatly expand collaborative efforts, potentially making this capability available to amateur and resource-constrained archivists and historians. It would provide a standards-based methodology and data capture technique, plus a collaborative platform for sharing the data once stored. This aspect of the final phase will be limited only by technology maintenance and scalability costs.

Phase V: Infinite Expansion (cont.)
Work Breakdown Structure
- Publish Updated Methodology
- Publish Membership Schema
- Open Data Models
- Leverage Current Version
- Conduct Lifecycle Management Review
- Documentation
- Disaster Recovery Testing
- Release New Version(s)
Estimated Cost: $82,000

Summary
Project Summary
5-Phase Approach
“How-To”
- Digitization
- TEI
- Manage the project
Sets the stage
- Broad/ambitious goals and plan
- Manageable pieces
Grant Request / Funding Summary
Phase III support:
- $51,
- Prototype Validation
- Database Development
- Web Development
- Hosting
- Disaster Recovery
Phase IV and V templates
- Future expansion as desired
- Flexible Planning

QUESTIONS

CONCLUSION

Dead Guy Quote
Man had always assumed that he was more intelligent than dolphins because he had achieved so much... the wheel, New York, wars, and so on, whilst all the dolphins had ever done was muck about in the water having a good time. But conversely the dolphins believed themselves to be more intelligent than man for precisely the same reasons.
- Douglas Adams

BACKUP

Phase I: Prototype (cont.)
Work Breakdown Structure
- Image Capture
- Image Preservation
- Image Manipulation
- Database Development
- TEI Process Development
- Data Development
- Static Web-Page Prototyping
- Documentation
- Disaster Recovery Testing
Estimated Cost: $5,000

Phase I: Prototype (cont.)
[Figure: Gantt Chart]

Phase II: Capture (cont.)
Work Breakdown Structure
- Image Capture
- TEI
- Prototype Database Input
- Documentation
- Disaster Recovery Testing
Estimated Cost: $2,000

Phase II: Capture (cont.)
[Figure: Gantt Chart]