Download presentation
Presentation is loading. Please wait.
Published byIris McBride Modified over 9 years ago
1
Collecting History: Profiles in Science Alexa T. McCray National Library of Medicine Bethesda, MD mccray@nlm.nih.gov Stanford University August 21, 1999
2
Design and Development of a Digital Library System u Early experiences in digital conversion u Development of Profiles in Science system u Critical role of metadata in system design – Framework for collection management – Foundation for Web delivery – Standards for resource description
3
Lessons learned from an early Digital Library Project u Digital conversion work begun in 1992 u Some 1,500 historical documents (about 40,000 pages) – mimeographs – oddly sized documents – interview transcripts u some audio segments – photographs and other memorabilia
4
Processing the early Collection u Created index (metadata) records – templates varied by document type – included topical index terms – compensated for poor OCR quality u Created digital master copy of the documents (TIFF) – subsequently derived first GIF and then PDF
5
Profiles in Science u Profiles in Science Web Site launched in September 1998 u Archival collections of eminent biomedical scientists donated to the NLM – Include text, audio, still images, video – Include books, journal volumes, pamphlets, diaries, letters, manuscripts, photographs – Metadata for each item in the collection
6
Profiles in Science u Educational and research applications – Scholars of the history of medicine and the biological sciences – Students gain an appreciation of the methods and success of science u Allows anyone to look “behind the scenes” at how science is done
7
Profiles Collections u Two collections currently available – Oswald T. Avery, Joshua Lederberg u Careful attention to copyright and intellectual property concerns u Electronic “exhibit” – Initial set of digitized items for exhibit u Papers continue to be digitized – Full paper collections available for scholarly use at NLM
8
Wide Range of Document Types
9
Collection -specific Categories of Information
10
Contextualizing the Content
11
Tiff and Pdf Documents
12
Zoom Pdf for Detail
13
Tiff & Jpeg Documents
14
Tiff & Jpeg Photographs
15
High Resolution Tiff
16
Streaming Video
17
Search across Full Data Set
18
Experiment: Profiled Scientist as Interactive User u Digital documents available before release to the public u Online annotation capability u Annotations complement original document – Give additional detail, set document in context, add keywords
19
Sample Annotation
20
Design of the Profiles in Science System u A single underlying system that is designed to handle the entire life-cycle of a large-scale digital conversion project u Principles – modularity – adherence to standards – extensibility u Metadata forms core of system
21
System Architecture
22
Metadata-driven Document Conversion u Interpret metadata in its broadest sense – data about data u Use metadata to drive the entire system u The metadata record is the basic unit in the system, managing the – digitization process – display and organization of the data – network-based resource discovery
23
Metadata: Framework for Collection Management u Metadata entry system manages all aspects of digitization process – Unique identifiers bind digital master files, Web-derivatives, and metadata records – Enforces quality control (pull-down menus, validation, error messages) – Reports that manage workflow – Security measures
24
Metadata: Display and Organization of the Data u Series of programs generate HTML from metadata RDBMS – Include consistency checking u Programs generate alternative views – alphabetical, chronological, resource type, content area u Filtering mechanisms for access management
25
Metadata: Networked-based Resource Discovery u Dublin Core elements derived from metadata entry system – simplicity – semantic interoperability – international consensus – modularity
26
Sample Metadata Record on Web Site
27
Digital Conversion Projects u Conversion projects involve extensive human and computational resources u Therefore, it is important to design systems that – Are extensible – Automate processes whenever possible – Adhere to standards – Ensure the persistence of the data
28
Profiles in Science http://profiles.nlm.nih.gov/
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.