10/26/2000Information Organization and Retrieval Metadata and Description University of California, Berkeley School of Information Management and Systems.

Slides:



Advertisements
Similar presentations
Ali Alshowaish. dc.coverage element articulates limitations in the scope of the resource, typically along the following lines: geographical, temporal,
Advertisements

Metadata and Search at Boeing Julie Martin Library & Learning Center Services
Geographic Information Systems and Science SECOND EDITION Paul A. Longley, Michael F. Goodchild, David J. Maguire, David W. Rhind © 2005 John Wiley and.
SLIDE 1IS 257 – Fall 2007 Codes and Rules for Description: History 2 University of California, Berkeley School of Information IS 245: Organization.
Bibliographic Records, Data Structures and Databases (Cont.)
© Tefko Saracevic, Rutgers University1 metadata considerations for digital libraries.
8/28/97Information Organization and Retrieval Metadata and Data Structures University of California, Berkeley School of Information Management and Systems.
SLIDE 1IS 257 – Fall 2009 Organization of Information in Collections: Introduction University of California, Berkeley School of Information.
Using Metadata in CONTENTdm Diana Brooking and Allen Maberry Metadata Implementation Group, Univ. of Washington Crossing Organizational Boundaries Oct.
10/23/2001Information Organization and Retrieval Information Structures and Metadata University of California, Berkeley School of Information Management.
1 CS 502: Computing Methods for Digital Libraries Lecture 17 Descriptive Metadata: Dublin Core.
10/24/2000Information Organization and Retrieval Information Structures and Metadata University of California, Berkeley School of Information Management.
OLC Spring Chapter Conferences Metadata, Schmetadata … Tell Me Why I Should Care? OLC Spring Chapter Conferences, 2004 Margaret.
The Library Cataloging Tradition
SLIDE 1IS Fall 2002 Lecture 05: Metadata: Introduction Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30.
SLIDE 1IS 245 – Spring 2009 Codes and Rules for Description: History University of California, Berkeley School of Information IS 245: Organization.
SLIDE 1IS FALL 2003 Lecture 06: Metadata Introduction Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30.
SLIDE 1IS 257 – Fall 2007 Codes and Rules for Description: History University of California, Berkeley School of Information IS 245: Organization.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Guest Lecture LIS 656, Spring 2011 Kathryn Lybarger.
GFIS-Africa Editorial tutorial – prepared by Anne Handley February 2003 (modified by Eero Mikkola July 2004)Anne Handley Aims To teach the skills needed.
1 Open-source platform for accessible content management Museo & Web CMS.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
The Global Marketplace for Forest Information. Why should we create metadata? Users Information providers.
Cataloguing Electronic resources Prepared by the Cataloguing Team at Charles Sturt University.
CONSER RDA Bridge Training [date] Presenters : [names] 1.
The Library Cataloging Tradition Marty Kurth CS 431 February 9, 2005 [slides stolen from Diane Hillmann]
1 CS/INFO 430 Information Retrieval Lecture 20 Metadata 2.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Metadata: Essential Standards for Management of Digital Libraries ALI Digital Library Workshop Linda Cantara, Metadata Librarian Indiana University, Bloomington.
Content and Computer Platforms Week 3. Today’s goals Obtaining, describing, indexing content –XML –Metadata Preparing for the installation of Dspace –Computers.
SLIDE 1IS 257 – Fall 2007 Introduction to Description and AACR II University of California, Berkeley School of Information IS 245: Organization.
1 CS 430: Information Discovery Lecture 7 Descriptive Metadata 3 Dublin Core Automatic Generation of Catalog Records.
Developing Databases and Selecting an Appropriate Library System.
Implementation scenarios, encoding structures and display Rob Walls Director Database Services Libraries Australia.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
LIS654 lecture 5 DC metadata and omeka tables Thomas Krichel
Modularization and Interoperability: Dublin Core and the Warwick Framework Sandra D. Payette Digital Library Research Group Cornell University November.
Metadata and Documentation Iain Wallace Performing Arts Data Service.
Introduction to metadata
AACR2 Pt. 1, Monographic Description LIS Session 2.
Evidence from Metadata INST 734 Doug Oard Module 8.
Alternative Architecture for Information in Digital Libraries Onno W. Purbo
The physical parts of a computer are called hardware.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Description of Bibliographic Items. Review Encoding = Markup. The library cataloging “markup” language is MARC. Unlike HTML, MARC tags have meaning (i.e.,
Cataloging Unique Collections with RDA and Non-MARC Standards Melanie Wacker Metadata Coordinator Columbia University Libraries Jan. 21, 2012 ALA Midwinter.
1 CS 430: Information Discovery Lecture 5 Descriptive Metadata 1 Libraries Catalogs Dublin Core.
Functional Requirements for Bibliographic Records The Changing Face of Cataloging William E. Moen Texas Center for Digital Knowledge School of Library.
AACR 2 –Rules for Descriptive Cataloguing
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Metadata Applications Marcia Lei Zeng NSDL All Project Meeting October, 2003.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Subject Description LIS 571 The Organization and Control of Recorded Information.
8/28/97Information Organization and Retrieval Introduction University of California, Berkeley School of Information Management and Systems SIMS 245: Organization.
Current initiatives in developing library linked data Gordon Dunsire Presented at the Cataloguing and Indexing Group Scotland seminar “Linked data and.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
The ___ is a global network of computer networks Internet.
An information retrieval system may include 3 categories of information:  Factual  Bibliographical  Institutional  Exchange and sharing of these categories.
Queensland University of Technology Faculty of Information Technology Michael Middleton 1 CRICOS No J Bibliographic description.
Attributes and Values Describing Entities. Metadata At the most basic level, metadata is just another term for description, or information about an entity.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.
Information organization Week 2 Lecture notes INF 380E: Perspectives on Information Spring 2015 Karen Wickett UT School of Information.
Information organization Week 2 Lecture notes INF 380E: Perspectives on Information Spring 2015 Karen Wickett UT School of Information.
Theory, Tools, History: A Brief Introduction August 17, 2016.
Cataloging Unique Collections with RDA and Non-MARC Standards
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Attributes and Values Describing Entities.
Attributes and Values Describing Entities.
Presentation transcript:

10/26/2000Information Organization and Retrieval Metadata and Description University of California, Berkeley School of Information Management and Systems SIMS 202: Information Organization and Retrieval

10/26/2000Information Organization and Retrieval Review Organization of Information Information Life Cycle

10/26/2000Information Organization and Retrieval Course Schedule Organization –Overview –Metadata and Markup –Controlled Vocabularies, Classification, Thesauri –Information Design Thesaurus Design Database Design Retrieval –The Search Process –Content Analysis Tokenization, Zipf’s Law, Lexical Associations –IR Implementation –Term weighting and document ranking Vector space model Probabilistic model –User Interfaces Overviews, query specification, providing context, relevance feedback

10/26/2000Information Organization and Retrieval Why Organize Information? The main reason –So that you can find things more effectively I.e., Effective retrieval is predicated on some sort of organization applied to information resources Historically there have been many institutions and tools devoted to information organization –Libraries –Museums –Archives –Indexes and catalogs, dictionaries, Phone books, etc.

10/26/2000Information Organization and Retrieval Information Life Cycle Creation UtilizationSearching Active Inactive Semi-Active Retention/ Mining Disposition Discard Using Creating Authoring Modifying Organizing Indexing Storing Retrieval Distribution Networking Accessing Filtering

10/26/2000Information Organization and Retrieval Key issues in this course How to describe information resources or information-bearing objects in ways so that they may be effectively used by those who need to use them. –Organizing How to find the appropriate information resources or information-bearing objects for someone’s (or your own) needs. –Retrieving

10/26/2000Information Organization and Retrieval Key Issues Creation UtilizationSearching Active Inactive Semi-Active Retention/ Mining Disposition Discard Using Creating Authoring Modifying Organizing Indexing Storing Retrieval Distribution Networking Accessing Filtering

10/26/2000Information Organization and Retrieval Structure of an IR System Interest profiles & Queries Documents & data Rules of the game = Rules for subject indexing + Thesaurus (which consists of Lead-In Vocabulary and Indexing Language Storage Line Potentially Relevant Documents Comparison/ Matching Store1: Profiles/ Search requests Store2: Document representations Indexing (Descriptive and Subject) Formulating query in terms of descriptors Storage of profiles Storage of Documents Information Storage and Retrieval System

10/26/2000Information Organization and Retrieval Metadata Metadata is: – “data about data” (term usage database systems) –Information about Information –Structures and Languages for the Description of Information Resources and their elements (components or features) –“Metadata is information on the organization of the data, the various data domains, and the relationship between them” (Baeza-Yates p. 142)

10/26/2000Information Organization and Retrieval Types of Metadata Element names. Element description. Element representation. Element coding. Element semantics. Element classification.

10/26/2000Information Organization and Retrieval Today Bibliographic Metadata (traditional Library cataloging) Other Metadata systems Dublin Core

10/26/2000Information Organization and Retrieval How can you describe an information-bearing object?

10/26/2000Information Organization and Retrieval Bibliographic Information Describes documents What is a document (revisited)? Choice of descriptive elements and content of those elements typically governed by a set of rules: –AACR II Elements coded in standard ways for transmission. –MARC

10/26/2000Information Organization and Retrieval Goals of Descriptive Cataloging 1.To enable a person to find a document of which –the author, or –the title, or –the subject is known 2.To show what a library has –by a given author –on a given subject (and related subjects) –in a given kind (or form) of literature. 3.To assist in the choice of a document –as to its edition (bibliographically) –as to its character (literary or topical) Charles A. Cutter, 1876

10/26/2000Information Organization and Retrieval Rules for Descriptive Cataloging ISBD AACR AACR II

10/26/2000Information Organization and Retrieval AACRII Sources of Information ISBD areas Choice of Access Points

10/26/2000Information Organization and Retrieval Sources of Information Each different type of material has a preferred location for deriving information about it. –Books and printed material Title page –Cartographic Materials (Maps, globes, etc) The map itself, or containers, stands, etc. –Sound recordings Disc label, cassette label, etc.

10/26/2000Information Organization and Retrieval ISBD Areas Title and Statement of Responsibility Edition Material or type of publication specification Publication, Distribution (etc.) Physical Description Series Notes Standard Numbers

10/26/2000Information Organization and Retrieval ISBD Punctuation Title Proper (GMD) = Parallel title : other title info / First statement of responsibility ; others. -- Edition information. -- Material. -- Place of Publication : Publisher Name, Date. -- Material designation and extent ; Dimensions of item. -- (Title of Series / Statement of responsibility). -- Notes. -- Standard numbers: terms of availability (qualifications).

10/26/2000Information Organization and Retrieval Bibliographic Record Introduction to cataloging and classification / Bohdan S. Wynar. -- 8th ed. / Arlene G. Taylor. -- Englewood, Colo. : Libraries Unlimited, (Library science text series).

10/26/2000Information Organization and Retrieval Choice of Access Points Title(s) (Always main title) Main Entry?? Added Entries Series Titles Identifying Numbers

10/26/2000Information Organization and Retrieval More Metadata Systems The following are a sample of metadata systems for a variety of special types of data/documents/objects.

10/26/2000Information Organization and Retrieval Type of Metadata systems and standards Naming and ID systems Bibliographic description –Texts Music Images and objects Numeric Data Geospatial Data Collections

10/26/2000Information Organization and Retrieval Naming and ID Systems URLs (Uniform Resource Locators) –URIs (Uniform Resource Indentifiers) URNs (Uniform Resource Names ) URCs (Uniform Resource Characteristics) Kahn/Wilensky Handles SICI (Serial Item and Content Identifiers) ISBN ISSN

10/26/2000Information Organization and Retrieval Bibliographic Description MARC (Machine Readable Cataloging) DUBLIN CORE –Warwick Framework for Dublin Core Metadata GILS (Government Information Locator Service) RFC 1807 (Format for Bibliographic Records) RDF (Resource Description Format)

10/26/2000Information Organization and Retrieval More Bibliographic Descriptors TEI Headers (Text Encoding initiative) BibTex PICS (Platform for Internet Content Selection) SOIF (Summary Object Interchange Format)

10/26/2000Information Organization and Retrieval Music Standard Music Description Language (SMDL)

10/26/2000Information Organization and Retrieval Numeric Data ICPSR Data Documentation Initiative (SGML DTD development) Standard for Survey Design and Statistical Methodology Metadata (SDSM)

10/26/2000Information Organization and Retrieval Images and Objects Categories for the Description of Works of Art (Getty Art Institute) Consortium for the Computer Interchange of Museum Information (CIMI) RLG REACH Element Set (for Shared Description of Museum Objects) VRA Core Categories (Visual Resources Association)

10/26/2000Information Organization and Retrieval Geospatial Data Content Standards for Digital Geospatial Metadata FGDC (Federal Geographic Data Committee) ASTM Section D Draft Specification Content Specification for Digital Geospatial Metadata. (American Society for Testing and Materials (ASTM).

10/26/2000Information Organization and Retrieval Collection Level Descriptors EAD (Encoded Archival Description) Z39.50 Profile for Access to Digital Collections RSLP Collection Description (Research Support Libraries Programme)

10/26/2000Information Organization and Retrieval Dublin Core Simple metadata for describing internet resources. For “Document-Like Objects” 15 Elements.

10/26/2000Information Organization and Retrieval Dublin Core Elements Title Creator Subject Description Publisher Other Contributors Date Resource Type Format Resource Identifier Source Language Relation Coverage Rights Management

10/26/2000Information Organization and Retrieval Title Label: TITLE The name given to the resource by the CREATOR or PUBLISHER.

10/26/2000Information Organization and Retrieval Author or Creator Label: CREATOR The person(s) or organization(s) primarily responsible for the intellectual content of the resource. For example, authors in the case of written documents, artists, photographers, or illustrators in the case of visual resources.

10/26/2000Information Organization and Retrieval Subject and Keywords Label: SUBJECT The topic of the resource, or keywords or phrases that describe the subject or content of the resource. The intent of the specification of this element is to promote the use of controlled vocabularies and keywords. This element might well include scheme-qualified classification data (for example, Library of Congress Classification Numbers or Dewey Decimal numbers) or scheme-qualified controlled vocabularies (such as MEdical Subject Headings or Art and Architecture Thesaurus descriptors) as well.

10/26/2000Information Organization and Retrieval Description Label: DESCRIPTION A textual description of the content of the resource, including abstracts in the case of document-like objects or content descriptions in the case of visual resources. Future metadata collections might well include computational content description (spectral analysis of a visual resource, for example) that may not be embeddable in current network systems. In such a case this field might contain a link to such a description rather than the description itself.

10/26/2000Information Organization and Retrieval Publisher Label: PUBLISHER The entity responsible for making the resource available in its present form, such as a publisher, a university department, or a corporate entity. The intent of specifying this field is to identify the entity that provides access to the resource.

10/26/2000Information Organization and Retrieval Other Contributors Label: CONTRIBUTORS Person(s) or organization(s) in addition to those specified in the CREATOR element who have made significant intellectual contributions to the resource but whose contribution is secondary to the individuals or entities specified in the CREATOR element (for example, editors, transcribers, illustrators, and convenors).

10/26/2000Information Organization and Retrieval Date Label: DATE The date the resource was made available in its present form. The recommended best practice is an 8 digit number in the form YYYYMMDD as defined by ANSI X In this scheme, the date element for the day this is written would be , or December 3, Many other schema are possible, but if used, they should be identified in an unambiguous manner.

10/26/2000Information Organization and Retrieval Resource Type Label: TYPE The category of the resource, such as home page, novel, poem, working paper, preprint, technical report, essay, dictionary. It is expected that RESOURCE TYPE will be chosen from an enumerated list of types. One preliminary set of such types can be found at the following URL (now out of date):

10/26/2000Information Organization and Retrieval Format Label: FORMAT The data representation of the resource, such as text/html, ASCII, Postscript file, executable application, or JPEG image. The intent of specifying this element is to provide information necessary to allow people or machines to make decisions about the usability of the encoded data (what hardware and software might be required to display or execute it, for example). As with RESOURCE TYPE, FORMAT will be assigned from enumerated lists such as registered Internet Media Types (MIME types). In principal, formats can include physical media such as books, serials, or other non-electronic media.

10/26/2000Information Organization and Retrieval Resource Identifier Label: IDENTIFIER String or number used to uniquely identify the resource. Examples for networked resources include URLs and URNs (when implemented). Other globally-unique identifiers,such as International Standard Book Numbers (ISBN) or other formal names would also be candidates for this element.

10/26/2000Information Organization and Retrieval Source Label: SOURCE The work, either print or electronic, from which this resource is derived, if applicable. For example, an html encoding of a Shakespearean sonnet might identify the paper version of the sonnet from which the electronic version was transcribed.

10/26/2000Information Organization and Retrieval Language Label: LANGUAGE Language(s) of the intellectual content of the resource. Where practical, the content of this field should coincide with the Z39.53 three character codes for written languages. See:

10/26/2000Information Organization and Retrieval Relation Label: RELATION Relationship to other resources. The intent of specifying this element is to provide a means to express relationships among resources that have formal relationships to others, but exist as discrete resources themselves. For example, images in a document, chapters in a book, or items in a collection. A formal specification of RELATION is currently under development. Users and developers should understand that use of this element should be currently considered experimental.

10/26/2000Information Organization and Retrieval Coverage Label: COVERAGE The spatial locations and temporal duration characteristic of the resource. Formal specification of COVERAGE is currently under development. Users and developers should understand that use of this element should be currently considered experimental.

10/26/2000Information Organization and Retrieval Rights Management Label: RIGHTS The content of this element is intended to be a link (a URL or other suitable URI as appropriate) to a copyright notice, a rights-management statement, or perhaps a server that would provide such information in a dynamic way. The intent of specifying this field is to allow providers a means to associate terms and conditions or copyright statements with a resource or collection of resources. No assumptions should be made by users if such a field is empty or not present.

10/26/2000Information Organization and Retrieval The Same Item in Different Metadata Systems ISBD Dublin Core RFC 1807 TEI Header MARC Record

10/26/2000Information Organization and Retrieval ISBD Punctuation Title Proper (GMD) = Parallel title : other title info / First statement of responsibility ; others. -- Edition information. -- Material. -- Place of Publication : Publisher Name, Date. -- Material designation and extent ; Dimensions of item. -- (Title of Series / Statement of responsibility). -- Notes. -- Standard numbers: terms of availability (qualifications).

10/26/2000Information Organization and Retrieval Bibliographic Record Introduction to cataloging and classification / Bohdan S. Wynar. -- 8th ed. / Arlene G. Taylor. -- Englewood, Colo. : Libraries Unlimited, (Library science text series).

10/26/2000Information Organization and Retrieval Dublin Core TITLE: Introduction to cataloging and classification CREATOR: Taylor, Arlene G. OTHER CONTRIBUTOR: Wynar, Bohdan S. DATE: 1992 FORMAT: BOOK LANGUAGE: ENG PAGES: 633 PUBLISHER: Libraries Unlimited SUBJECT: Cataloging. SUBJECT: subject cataloging. SUBJECT: Classification -- Books DESCRIPTION: Textbook on cataloging and classification RESOURCE TYPE: text.monograph RESOURCE IDENTIFIER: (ISBN)

10/26/2000Information Organization and Retrieval RFC 1807 BIB-VERSION:: CS-TR-v2.1 ID:: UCB// ENTRY:: September 9, 1997 TYPE:: BOOK TITLE:: Introduction to cataloging and classification AUTHOR:: Wynar, Bohdan S. AUTHOR:: Taylor, Arlene G. DATE:: 1992 PAGES:: 633 COPYRIGHT:: Libraries Unlimited, 1992 SERIES:: Library Science Text Series END:: UCB//123456

10/26/2000Information Organization and Retrieval Minimal TEI Header Introduction to cataloging and classification Bohdan S. Wynar 8th edition by Arlene G. Taylor Libraries Unlimited Introduction to cataloging and classification / Bohdan S. Wynar. -- 8th ed. / Arlene G. Taylor. -- Englewood, Colo. : Libraries Unlimited, 1992.

10/26/2000Information Organization and Retrieval MARC Record (display) ID:DCLC B RTYP:c ST:p FRN: MS:c EL: AD: CC:9110 BLT:am DCF:a CSC: MOD: SNR: ATC: UD: CP:cou L:eng INT: GPC: BIO: FIC:0 CON:b PC:s PD:1992/ REP: CPI:0 FSI:0 ILC:a II:1 MMD: OR: POL: DM: RR: COL: EML: GEN: BSE: (cloth) (paper) 040 DLC$cDLC$dDLC Z693$b.W $ Wynar, Bohdan S Introduction to cataloging and classification /$cBohdan S. Wynar th ed. /$bArlene G. Taylor. 260 Englewood, Colo. :$bLibraries Unlimited,$c xvii, 633 p. :$bill. ;$c24 cm Library science text series 504 Includes bibliographical references (p ) and index Cataloging Subject cataloging Classification$xBooks Anglo-American cataloguing rules Taylor, Arlene G.,$d1941-

10/26/2000Information Organization and Retrieval Metadata Resources Check the Links section from the class home page Best site is the “Digital Library: Metadata Resources” page from IFLA at site is the “Digital Library: Metadata Resources” page from IFLA at