Download presentation
Presentation is loading. Please wait.
Published byWendy Davidson Modified over 9 years ago
1
Practical Metadata Kathryn Lybarger
2
<METADATA>
3
What is metadata?
4
“data about data”
5
Types of Metadata Descriptive Metadata Descriptive Metadata Structural Metadata Structural Metadata Administrative Metadata Administrative Metadata Preservation Metadata Preservation Metadata Rights and Access Metadata Rights and Access Metadata Technical Metadata Technical Metadata
6
Examples This space intentionally left blank.
7
Examples DescriptiveStructuralAdministrative PRIVATE PUBLIC A - K L - Z PERSONAL BUSINESS
8
What does metadata look like?
9
May be same format as data Header added by Project Gutenberg Ebook submitted
10
Metadata may have different format WAV audio file Text metadata
11
Not all metadata is text “Okay, it's September the 21, 1987, I'm in Frankfurt Kentucky in the home of Clarence Gunther, who was a World War II veteran, served in the Navy, entering service on March 6, 1940, and separated March 8, 1946 as a boats and mate first class. He was at Pearl Harbor…”
12
Not all metadata is verbal
13
Metadata may have no structure "This book I gave to Mary Baxter. After her death, I gave it to Mrs. Spruill. After her death, to Kate Wilson. She never read it, so on a visit to her, I took back for my own reading."
14
Metadata may have some structure Word processors allow “document properties” Word processors allow “document properties” Anything can go in these fields Anything can go in these fields
15
File names are metadata cont-vocab.doc Some indication of content File type
16
Metadata may have rich structure Example: MARC record Example: MARC record Requires expertise to read and create Requires expertise to read and create Allows very detailed searching Allows very detailed searching
17
XML: eXtensible Markup Language Many rich metadata formats are encoded as XML Many rich metadata formats are encoded as XML A schema or DTD specifies rules which a document must follow A schema or DTD specifies rules which a document must follow Examples: XHTML, EAD, TEI, NDNP Examples: XHTML, EAD, TEI, NDNP
18
XML: Example Numerical Linear Algebra Numerical Linear Algebra Lloyd N. Trefethen Lloyd N. Trefethen David Bau, III David Bau, III <publisher>SIAM</publisher></book>
19
XML: Advantages “self-describing” “self-describing” Validation catches many errors Validation catches many errors XML tools may be used for any XML language XML tools may be used for any XML language searching searching transformation transformation communication communication
20
When is metadata created? Who creates metadata?
22
Where is metadata?
23
Metadata may be inside the data Physical: Physical: Title page Table of contents Index Digital: Digital: Header information
24
Binary data Binary data Header information in an image file XML metadata XML metadata
25
Metadata can be near the data Title and author on the spine of a book Title and author on the spine of a book Associated.txt file with a.wav file Associated.txt file with a.wav file Alternate data streams (Windows) Alternate data streams (Windows)
26
Metadata can be gathered elsewhere Card catalog Card catalog Index Index Search engine Search engine
27
Metadata can be multiple places Bee S-50 Earlington, KY 98’ 1892 negative microfilm catalog box lid
28
How is metadata different from normal data? No clear distinction! No clear distinction! Metadata is also data Metadata is also data Metadata can have metadata Metadata can have metadata
29
Meta-metadata?
30
How much metadata? Too little metadata? Too little metadata? Different objects may have the same metadata Different objects may have the same metadata Too much metadata? Too much metadata? You may never get started You may never get started Collection may take too long Collection may take too long Collection may be inconsistent / incomplete Collection may be inconsistent / incomplete
31
What is good metadata?
32
Metadata should be accessible Easy to find Easy to find Readable Readable Physical: legible, permanent Physical: legible, permanent Digital: standard, non-proprietary format Digital: standard, non-proprietary format
33
Metadata should be meaningful Relationship to data should be clear Relationship to data should be clear Digital Digital Encoded content should be parse-able Encoded content should be parse-able XML should be well-formed, valid XML should be well-formed, valid
34
Metadata should be accurate Adds to recall and precision in searching Adds to recall and precision in searching Not all metadata is apparent from looking at the data itself Not all metadata is apparent from looking at the data itself False metadata may lead to false conclusions about the data! False metadata may lead to false conclusions about the data!
35
False metadata: Example Apparent from file: 4800 x 6800 pixels Metadata: 400dpi Conclusion: Image: 12in x 17in Paper: 11in x 16in
36
False metadata: Example Apparent from file: 4800 x 6800 pixels Metadata: 200dpi Conclusion: Image: 24in x 34in Paper: 22in x 32in
37
OCR: Optical character recognition An automated process of turning images of letters into (searchable) text An automated process of turning images of letters into (searchable) text Very common metadata for images of books/newspapers Very common metadata for images of books/newspapers Often uncorrected, somewhat inaccurate Often uncorrected, somewhat inaccurate
38
Uncorrected OCR: Example THREE DS TRIUMPH Zacaweista Easily Qutsprints T S Jordan and Morsun Stadium Purse Feature of Wash ¬ ington Park ProgramWeather and Track Conditions Ideal HOMEWOOD Ill June 2 Zacaweista the good son of High Time in the Three Ds
39
Uncorrected OCR: Example ucrendifi-chdlcogrdpboilli, cuiusmahm pr£chr& iUiusdrtis imprcffos rie inuentorcsfucre, trddidifftm, utedcretur, exempkrcocilijTriburicrt
40
Why metadata?
41
Metadata can be used to identify data Title / author on the spine of a book Title / author on the spine of a book File names File names Labels Labels
42
Metadata can be used to interpret data Instructions for tax forms Instructions for tax forms Language / character encoding Language / character encoding XML DTD or schema XML DTD or schema
43
Metadata can be used to search data Card catalog Card catalog Index Index Search engine Search engine
44
Metadata can be used to manage data Call numbers Call numbers “Burn by” dates on file boxes “Burn by” dates on file boxes Rights and access Rights and access
45
Metadata can be used to communicate about data Finding aids Finding aids Abstracts Abstracts OAI OAI
46
Who uses metadata?
47
Internet users use metadata
48
Librarians use metadata Reference Reference Cataloging Cataloging Collection Development Collection Development
49
Children use metadata
50
What is metadata?
51
You already understand metadata!
52
</METADATA>
53
Any questions about metadata?
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.