1 herbert van de sompel CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel

Presentation transcript:

1 CS 502 Computing Methods for Digital Libraries
  Cornell University, Computer Science
  Herbert Van de Sompel
  Lecture 6: Populating Digital Libraries

2 KWF (Kahn-Wilensky Framework): populating DLs
  - originator makes a digital object, which consists of data and key-metadata (handle)
  - digital object goes into a repository
  [Diagram labels: Originator, digital object, Repository, client]

3 Populating DLs
  originator makes a digital object
    - born digital / convert to digital
    - digital media formats
    - document model: structure of digital object (later)
    - naming digital objects (identifiers) (later)
  digital object goes into a repository
    - technological/organizational issues
    - central/decentralized submission
    - central/decentralized storage
    - submission direct by author / via organization
    - quality control
    - terms and conditions (copyright, …)

4 Populating DLs
  The way in which the issues are addressed has a fundamental impact on:
    - economics of the DL: there is no free lunch
    - success of a DL with the target group
        - arXiv physics: TeX, central submission
        - arXiv CS: does not fly
  (originator makes a digital object; digital object goes into a repository)

5 Populating DLs
  The way in which the issues are addressed has a fundamental impact on:
    - searchability/retrievability of digital objects (DOs)
        - decentralized submission & storage => distributed searching?
        - DO identifiers: URL 404
    - archiving of DOs
        - choice of media formats, DO model, central/decentralized organization
  (originator makes a digital object; digital object goes into a repository)

6 originator makes a digital object

7 Convergence of media
  - Evolution of digital representation of media: Text => Images => Audio => Video
  - Processing software/hardware: initially high-end, later desktop
  - Evolution of formats to represent the media
  - Different formats can serve different purposes
  - Compression / destructive (lossy) compression

8 Evolution of representation of characters
  - basic ASCII - 7 bit
      ftp://dkuug.dk/i18n/WG15-collection/charmaps/ANSI_X
  - EBCDIC - 8 bit
      innovations.com/boo/asciiebcdic.html
  - language-specific ASCII extensions
  - ASCII/ISO - 8 bit
      so_table.html
  - UNICODE - 16 bit (currently 49,194 characters)
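
The step from 7-bit ASCII through 8-bit codes to 16-bit Unicode can be made concrete with a minimal Python sketch. The codec names are choices made here for illustration and do not come from the slide: `cp037` stands in for one EBCDIC code page, `latin-1` for an 8-bit ISO extension of ASCII, and `utf-16-be` for a 16-bit Unicode encoding.

```python
# Sketch: how the same character is stored under the encoding families
# named on the slide. Codec choices are assumptions for illustration:
#   ascii      -> basic 7-bit ASCII
#   cp037      -> one EBCDIC code page (8 bit)
#   latin-1    -> ISO 8859-1, an 8-bit extension of ASCII
#   utf-16-be  -> a 16-bit Unicode encoding (big-endian, no byte-order mark)

def encoded_hex(text: str, codec: str) -> str:
    """Return the bytes of `text` in `codec` as a hex string."""
    return text.encode(codec).hex()


def fits_in_ascii(text: str) -> bool:
    """True if every character has a code point below 128 (7 bits)."""
    return all(ord(ch) < 128 for ch in text)


if __name__ == "__main__":
    for codec in ("ascii", "cp037", "latin-1", "utf-16-be"):
        print(f"{codec:10s} A -> {encoded_hex('A', codec)}")
    # U+00E9 (e with acute accent) is outside 7-bit ASCII
    # but fits in a single byte of ISO 8859-1.
    e_acute = "\u00e9"
    print(fits_in_ascii(e_acute), encoded_hex(e_acute, "latin-1"))
```

The same letter "A" gets a different byte value in ASCII and EBCDIC, which is why a digital object must record which character code it uses.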

9 Evolution of representation of text
  2 families: based on looks or based on content
    - all kinds of word processor formats (starting mid 80's)
    - rtf (cross-word-processor format)
    - doc: MS Word 6 will not read MS Word 1 (Lesk, p. 194)
    - ps
    - TeX
    - SGML
    - XML
    - HTML

10 Different formats / different purposes
  Text:   original: doc, TeX, wp; archival: SGML, RTF; presentation: ps, pdf, HTML
  Images: original: eps; archival: TIFF, PICT; presentation: JPEG, png, GIF
  Audio:  original: AIFF, wav; archival: AIFF; presentation: mp3, RealAudio, wav
  Video:  original: DV; archival: digital BETACAM; presentation: RealVideo, QuickTime

11 Born digital / Become digital
  [Diagram labels: analog domain, digital domain; analog record, digital recording, digitization; born analog, born digital record]

12 Born digital
  - Text: text typed into PC
  - Images: image created from scratch in Photoshop
  - Audio: computer generated audio files (C-Sound, Max DSP), software synths writing to disk, …
  - Video: special effects in movies, Toy Story, …

13 Converting into digital
  Text/Images:
    - Keying
    - Speech-to-Text
    - Scanning (lecture Anne Kenney): from paper to image
        quality: dpi, …
    - OCR-ing: from image to text
        quality: hardware/software; heuristics; learning
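
The effect of the dpi setting on a scan can be estimated with simple arithmetic. The sketch below is an illustration under stated assumptions, not part of the lecture: the page size (8.5 x 11 inches), the resolutions, and the bit depths are hypothetical example values, and the result is the uncompressed size.

```python
# Sketch: uncompressed size of a scanned page as a function of dpi and
# bit depth. Page dimensions and settings are hypothetical examples.

def scan_size_bytes(width_in: float, height_in: float,
                    dpi: int, bits_per_pixel: int) -> int:
    """Uncompressed size in bytes of a page scanned at `dpi`."""
    width_px = round(width_in * dpi)
    height_px = round(height_in * dpi)
    total_bits = width_px * height_px * bits_per_pixel
    return (total_bits + 7) // 8  # round up to whole bytes


if __name__ == "__main__":
    # 24-bit colour at 300 dpi versus 1-bit (bitonal) at 600 dpi.
    print(scan_size_bytes(8.5, 11, 300, 24))
    print(scan_size_bytes(8.5, 11, 600, 1))
```

Doubling the dpi quadruples the pixel count, so scanning quality trades off directly against storage.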

14 Converting into digital
  Audio:
    - Sampling (DSP cards)
        quality: sample rate (frequency: 44 kHz), bits/sample (dynamic range: 16 bit), mono/stereo
        software tools for noise reduction, removal of clicks, …
    - Text to Speech
        from text to phonemes
        from phonemes to audio file (MBROLA)
  Video:
    - Capturing (video boards)
        quality: fps, window size, …
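
The quality parameters listed above translate directly into data volume. The following sketch works through the arithmetic for uncompressed audio and video. The concrete values are assumptions chosen for illustration: 44,100 Hz is used as the CD-style rate behind the slide's "44 kHz", and the 320 x 240 window at 15 fps is a hypothetical capture setting.

```python
import math

# Sketch: data rates implied by sampling and capture settings.
# All concrete numbers in the demo below are illustrative assumptions.

def pcm_bytes_per_second(sample_rate_hz: int, bits_per_sample: int,
                         channels: int) -> int:
    """Uncompressed (PCM) audio data rate in bytes per second."""
    return sample_rate_hz * bits_per_sample * channels // 8


def dynamic_range_db(bits_per_sample: int) -> float:
    """Theoretical dynamic range of linear PCM: 20 * log10(2 ** bits)."""
    return 20 * math.log10(2 ** bits_per_sample)


def video_bytes_per_second(width_px: int, height_px: int,
                           bits_per_pixel: int, fps: int) -> int:
    """Uncompressed video data rate in bytes per second."""
    return width_px * height_px * bits_per_pixel * fps // 8


if __name__ == "__main__":
    print(pcm_bytes_per_second(44_100, 16, 2))    # stereo, 16 bit
    print(round(dynamic_range_db(16), 1))         # in dB
    print(video_bytes_per_second(320, 240, 24, 15))
```

Each extra bit per sample adds about 6 dB of dynamic range, and stereo doubles the data rate of mono.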

15 Converting to digital
  Rules of thumb:
    - Create the digital master copy in the highest quality (although: see Kenney!)
    - Archive the master in a format that includes some guarantees regarding longevity
    - Definitely do not compress the master in a lossy manner
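
Why a master should not be compressed lossily can be shown with a toy comparison. This is a minimal sketch, not a real audio or image codec: `zlib` stands in for any lossless scheme, and the "lossy" function is a hypothetical stand-in that simply discards the low-order bits of each 8-bit sample.

```python
import zlib

# Sketch: a lossless round trip returns the master bit for bit,
# while a lossy step discards information that cannot be recovered.

def lossless_roundtrip(data: bytes) -> bytes:
    """Compress and decompress with zlib (lossless)."""
    return zlib.decompress(zlib.compress(data))


def lossy_roundtrip(samples: bytes, keep_bits: int = 4) -> bytes:
    """Toy lossy scheme: keep only the top `keep_bits` of each byte."""
    mask = (0xFF << (8 - keep_bits)) & 0xFF
    return bytes(b & mask for b in samples)


if __name__ == "__main__":
    master = bytes(range(256))  # stand-in for 8-bit sample data
    print(lossless_roundtrip(master) == master)  # master is preserved
    print(lossy_roundtrip(master) == master)     # detail has been lost
```

Presentation copies can always be derived from a lossless master, but a lossy master cannot be restored to the original.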

16 digital object goes into a repository

17 KWF: populating DLs
  [Diagram labels: Originator, digital object, user; submission model, storage model, publication model, retrieval model]

18 preprint archives (repositories)

19 Readings
  - Lesk, M. Books into Bytes. In: Scientific American, March.
  - Van de Sompel, H., Krichel, T., Nelson, M. & others. The UPS Prototype: An experimental End-user service across E-Print archives. In: D-Lib Magazine. ups/02vandesompel-ups.html