Presentation is loading. Please wait.

Presentation is loading. Please wait.

Practical Metadata Kathryn Lybarger. <METADATA> What is metadata?

Similar presentations


Presentation on theme: "Practical Metadata Kathryn Lybarger. <METADATA> What is metadata?"— Presentation transcript:

1 Practical Metadata Kathryn Lybarger

2 <METADATA>

3 What is metadata?

4 “data about data”

5 Types of Metadata Descriptive Metadata Descriptive Metadata Structural Metadata Structural Metadata Administrative Metadata Administrative Metadata Preservation Metadata Preservation Metadata Rights and Access Metadata Rights and Access Metadata Technical Metadata Technical Metadata

6 Examples This space intentionally left blank.

7 Examples DescriptiveStructuralAdministrative PRIVATE PUBLIC A - K L - Z PERSONAL BUSINESS

8 What does metadata look like?

9 May be same format as data Header added by Project Gutenberg Ebook submitted

10 Metadata may have different format WAV audio file Text metadata

11 Not all metadata is text “Okay, it's September the 21, 1987, I'm in Frankfurt Kentucky in the home of Clarence Gunther, who was a World War II veteran, served in the Navy, entering service on March 6, 1940, and separated March 8, 1946 as a boats and mate first class. He was at Pearl Harbor…”

12 Not all metadata is verbal

13 Metadata may have no structure "This book I gave to Mary Baxter. After her death, I gave it to Mrs. Spruill. After her death, to Kate Wilson. She never read it, so on a visit to her, I took back for my own reading."

14 Metadata may have some structure Word processors allow “document properties” Word processors allow “document properties” Anything can go in these fields Anything can go in these fields

15 File names are metadata cont-vocab.doc Some indication of content File type

16 Metadata may have rich structure Example: MARC record Example: MARC record Requires expertise to read and create Requires expertise to read and create Allows very detailed searching Allows very detailed searching

17 XML: eXtensible Markup Language Many rich metadata formats are encoded as XML Many rich metadata formats are encoded as XML A schema or DTD specifies rules which a document must follow A schema or DTD specifies rules which a document must follow Examples: XHTML, EAD, TEI, NDNP Examples: XHTML, EAD, TEI, NDNP

18 XML: Example Numerical Linear Algebra Numerical Linear Algebra Lloyd N. Trefethen Lloyd N. Trefethen David Bau, III David Bau, III <publisher>SIAM</publisher></book>

19 XML: Advantages “self-describing” “self-describing” Validation catches many errors Validation catches many errors XML tools may be used for any XML language XML tools may be used for any XML language searching searching transformation transformation communication communication

20 When is metadata created? Who creates metadata?

21

22 Where is metadata?

23 Metadata may be inside the data Physical: Physical: Title page Table of contents Index Digital: Digital: Header information

24 Binary data  Binary data  Header information in an image file XML metadata  XML metadata 

25 Metadata can be near the data Title and author on the spine of a book Title and author on the spine of a book Associated.txt file with a.wav file Associated.txt file with a.wav file Alternate data streams (Windows) Alternate data streams (Windows)

26 Metadata can be gathered elsewhere Card catalog Card catalog Index Index Search engine Search engine

27 Metadata can be multiple places Bee S-50 Earlington, KY 98’ 1892 negative microfilm catalog box lid

28 How is metadata different from normal data? No clear distinction! No clear distinction! Metadata is also data Metadata is also data Metadata can have metadata Metadata can have metadata

29 Meta-metadata?

30 How much metadata? Too little metadata? Too little metadata? Different objects may have the same metadata Different objects may have the same metadata Too much metadata? Too much metadata? You may never get started You may never get started Collection may take too long Collection may take too long Collection may be inconsistent / incomplete Collection may be inconsistent / incomplete

31 What is good metadata?

32 Metadata should be accessible Easy to find Easy to find Readable Readable Physical: legible, permanent Physical: legible, permanent Digital: standard, non-proprietary format Digital: standard, non-proprietary format

33 Metadata should be meaningful Relationship to data should be clear Relationship to data should be clear Digital Digital Encoded content should be parse-able Encoded content should be parse-able XML should be well-formed, valid XML should be well-formed, valid

34 Metadata should be accurate Adds to recall and precision in searching Adds to recall and precision in searching Not all metadata is apparent from looking at the data itself Not all metadata is apparent from looking at the data itself False metadata may lead to false conclusions about the data! False metadata may lead to false conclusions about the data!

35 False metadata: Example Apparent from file: 4800 x 6800 pixels Metadata: 400dpi Conclusion: Image: 12in x 17in Paper: 11in x 16in

36 False metadata: Example Apparent from file: 4800 x 6800 pixels Metadata: 200dpi Conclusion: Image: 24in x 34in Paper: 22in x 32in

37 OCR: Optical character recognition An automated process of turning images of letters into (searchable) text An automated process of turning images of letters into (searchable) text Very common metadata for images of books/newspapers Very common metadata for images of books/newspapers Often uncorrected, somewhat inaccurate Often uncorrected, somewhat inaccurate

38 Uncorrected OCR: Example THREE DS TRIUMPH Zacaweista Easily Qutsprints T S Jordan and Morsun Stadium Purse Feature of Wash ¬ ington Park ProgramWeather and Track Conditions Ideal HOMEWOOD Ill June 2 Zacaweista the good son of High Time in the Three Ds

39 Uncorrected OCR: Example ucrendifi-chdlcogrdpboilli, cuiusmahm pr£chr& iUiusdrtis imprcffos rie inuentorcsfucre, trddidifftm, utedcretur, exempkrcocilijTriburicrt

40 Why metadata?

41 Metadata can be used to identify data Title / author on the spine of a book Title / author on the spine of a book File names File names Labels Labels

42 Metadata can be used to interpret data Instructions for tax forms Instructions for tax forms Language / character encoding Language / character encoding XML DTD or schema XML DTD or schema

43 Metadata can be used to search data Card catalog Card catalog Index Index Search engine Search engine

44 Metadata can be used to manage data Call numbers Call numbers “Burn by” dates on file boxes “Burn by” dates on file boxes Rights and access Rights and access

45 Metadata can be used to communicate about data Finding aids Finding aids Abstracts Abstracts OAI OAI

46 Who uses metadata?

47 Internet users use metadata

48 Librarians use metadata Reference Reference Cataloging Cataloging Collection Development Collection Development

49 Children use metadata

50 What is metadata?

51 You already understand metadata!

52 </METADATA>

53 Any questions about metadata?


Download ppt "Practical Metadata Kathryn Lybarger. <METADATA> What is metadata?"

Similar presentations


Ads by Google