Metadata hussein suleman uct cs honours 2005



Data and Metadata
• Data refers to digital objects that contain useful information for information seekers.
• Metadata refers to descriptions of objects, digital or physical.
• Many DLs manipulate metadata records, which contain pointers to the actual data.
• The definition is fuzzy, as metadata contains useful information as well and in some cases could contain all the data, e.g., metadata describing a person.

An Example of Metadata
• Metadata
  name: chalk
  owner: hussein
  colour: white
  size: 2.5
  description: used to write on board
  location: honours lecture room
  source: Waltons Stationers
• Object: [image not reproduced]

Another Metadata Example
• Metadata
  colour: white
  title: RG123
  owner: UCT
  lifetime: 2 months
  size: 1
  identifier: RG123
  description: white powdery stick
• Object: [image not reproduced]

Metadata Comparisons
• Metadata
  name: chalk
  owner: hussein
  colour: white
  size: 2.5
  description: used to write on board
  location: honours lecture room
  source: Waltons Stationers
• Metadata
  colour: white
  title: RG123
  owner: UCT
  lifetime: 2 months
  size: 1
  identifier: RG123
  description: white powdery stick
• What problems can occur?
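One way to explore the question above is to compare the two records field by field. The sketch below is only an illustration: the dictionaries copy the two chalk records from the slides, and `compare_records` is a hypothetical helper, not part of any library.

```python
# The two descriptions of the same kind of object, copied from the slides.
record_a = {
    "name": "chalk", "owner": "hussein", "colour": "white", "size": "2.5",
    "description": "used to write on board",
    "location": "honours lecture room", "source": "Waltons Stationers",
}
record_b = {
    "colour": "white", "title": "RG123", "owner": "UCT", "lifetime": "2 months",
    "size": "1", "identifier": "RG123", "description": "white powdery stick",
}


def compare_records(a, b):
    """Report fields found in only one record and shared fields whose values differ."""
    shared = sorted(set(a) & set(b))
    return {
        "only_in_a": sorted(set(a) - set(b)),
        "only_in_b": sorted(set(b) - set(a)),
        "agree": [field for field in shared if a[field] == b[field]],
        "conflict": [field for field in shared if a[field] != b[field]],
    }


report = compare_records(record_a, record_b)
for category, fields in report.items():
    print(category, fields)
```

The records agree only on `colour`. Fields such as `name` and `title` may or may not mean the same thing, and `size` has no stated unit in either record, so a program cannot tell whether 2.5 and 1 are comparable.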

Types of Metadata
• Descriptive: title, author, type, format, …
• Structural: part, subpart, relation, child, …
• Administrative: location, identifier, submitter, …
• Preservation: resolution, capture device, watermark, …
• Provenance: source archive, previous version, source format, …

Creating Metadata
• Follow metadata guidelines.
• Use terms from controlled vocabularies.
• Avoid duplication of information across fields.
• Use accepted standards for common elements.
  - e.g., ISO 8601 for dates: 2005-03-03 instead of 03/03/05
• Use XML-based encoding according to a standardised Schema/DTD.
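Two of these guidelines can be sketched in a few lines of Python. The vocabulary `ALLOWED_TYPES` and the assumption that incoming dates are written day/month/two-digit-year are both invented for illustration; a real project would take them from its own guidelines.

```python
from datetime import datetime

# Hypothetical controlled vocabulary for a "type" field (an assumption, not a standard list).
ALLOWED_TYPES = {"image", "text", "sound"}


def to_iso8601(date_text, input_format="%d/%m/%y"):
    """Convert a date written in a known local format to ISO 8601 (YYYY-MM-DD)."""
    return datetime.strptime(date_text, input_format).date().isoformat()


def check_type(value):
    """Accept a value only if it appears in the controlled vocabulary."""
    if value not in ALLOWED_TYPES:
        raise ValueError(f"{value!r} is not in the controlled vocabulary")
    return value


print(to_iso8601("03/03/05"))
print(check_type("image"))
```

The conversion only works because the input format is stated explicitly. A string like 03/04/05 is ambiguous on its own, which is the reason for preferring ISO 8601 at the point of capture.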

Metadata Standards
• To promote interoperability among systems, DLs support popular metadata standards to describe objects (both semantically and syntactically).
  - Dublin Core: 15 simple elements to describe anything.
  - MARC: Comprehensive system devised to describe items in a (physical) library.
  - RFC1807: Computer science publications format.
  - IMS Metadata Specification: Courseware object description.
  - VRA-Core: Multimedia (especially image) description.
  - EAD: Library finding aids to locate archived items.
• Why didn’t the CS folks use MARC?

Newer Metadata Standards
• METS: Descriptive, administrative and structural encoding for metadata of digital objects.
• MODS: Richer than DC, subset of MARC21.
• MPEG21-DIDL: Structural descriptions of complex multimedia objects.

Dublin Core
• Dublin Core is one of the most popular and simplest metadata formats.
• 15 elements with recommended semantics.
• All elements are optional and repeatable.

  Title        Creator    Subject
  Description  Publisher  Contributor
  Date         Type       Format
  Identifier   Source     Language
  Relation     Coverage   Rights
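The "optional and repeatable" rule suggests a simple data structure: a mapping from element name to a list of values. The class below is a minimal sketch under that reading; `DCRecord` and its methods are hypothetical names, not an existing API.

```python
# The 15 elements of the Dublin Core Metadata Element Set, in lower case.
DC_ELEMENTS = (
    "title", "creator", "subject", "description", "publisher", "contributor",
    "date", "type", "format", "identifier", "source", "language",
    "relation", "coverage", "rights",
)


class DCRecord:
    """A Dublin Core record in which every element is optional and may repeat."""

    def __init__(self):
        self.values = {}  # element name -> list of values, in insertion order

    def add(self, element, value):
        """Append a value to an element, rejecting names outside the 15 elements."""
        if element not in DC_ELEMENTS:
            raise KeyError(f"{element!r} is not one of the 15 DC elements")
        self.values.setdefault(element, []).append(value)

    def get(self, element):
        """Return all values for an element; an absent element yields an empty list."""
        return self.values.get(element, [])


record = DCRecord()
record.add("creator", "First Author")
record.add("creator", "Second Author")
print(record.get("creator"), record.get("title"))
```

Because no element is mandatory, an empty record is still valid, and code that reads a record has to cope with zero, one, or many values per element.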

DC in HTML
• Source: [example not reproduced]
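The slide's own example did not survive in this transcript. As a stand-in, the sketch below follows the usual DCMI convention for embedding Dublin Core in a page's head: a `link` element that declares the `DC` prefix, then one `meta` element per value. The function name `dc_meta_tags` and the sample values are assumptions for illustration.

```python
from html import escape

# Namespace URI of the Dublin Core Metadata Element Set, version 1.1.
DC_NAMESPACE = "http://purl.org/dc/elements/1.1/"


def dc_meta_tags(pairs):
    """Render (element, value) pairs as HTML tags suitable for a page's <head>."""
    lines = [f'<link rel="schema.DC" href="{DC_NAMESPACE}">']
    for element, value in pairs:
        # Escape the value so quotes and ampersands cannot break the attribute.
        lines.append(f'<meta name="DC.{element}" content="{escape(value, quote=True)}">')
    return "\n".join(lines)


print(dc_meta_tags([("title", "Visit to UCT"), ("format", "image/jpeg")]))
```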

DC Metadata in XML
  02uct1
  Hussein Suleman
  Visit to UCT
  the view that greets you as you emerge from the tunnel under the freeway - WOW - and, no, the mountain isnt that close - it just looks that way in 2-D
  Hussein Suleman
  image
  image/jpeg

DC Metadata in Valid Qualified XML
  02uct1
  Hussein Suleman
  Visit to UCT
  the view that greets you as you emerge from the tunnel under the freeway - WOW - and, no, the mountain isnt that close - it just looks that way in 2-D
  Hussein Suleman
  image
  image/jpeg
  en-us
  unrestricted
• Why is there a separate namespace for the root element?
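The element tags of this record were lost in the transcript, so the following is a sketch rather than a reconstruction of the slide. It builds a namespace-qualified record with Python's standard `xml.etree.ElementTree`, using the OAI `oai_dc` container namespace for the root and the Dublin Core namespace for the children. The pairing of each value with an element name is an assumption, as is the helper name `to_oai_dc`.

```python
import xml.etree.ElementTree as ET

OAI_DC = "http://www.openarchives.org/OAI/2.0/oai_dc/"  # namespace of the container element
DC = "http://purl.org/dc/elements/1.1/"                  # namespace of the 15 DC elements

# Ask ElementTree to use readable prefixes instead of ns0, ns1.
ET.register_namespace("oaidc", OAI_DC)
ET.register_namespace("dc", DC)


def to_oai_dc(pairs):
    """Serialise (element, value) pairs as a DC record inside an oai_dc container."""
    root = ET.Element(f"{{{OAI_DC}}}dc")
    for element, value in pairs:
        ET.SubElement(root, f"{{{DC}}}{element}").text = value
    return ET.tostring(root, encoding="unicode")


# Assumed pairing of the slide's values with DC element names.
xml_text = to_oai_dc([
    ("creator", "Hussein Suleman"),
    ("type", "image"),
    ("format", "image/jpeg"),
    ("language", "en-us"),
    ("rights", "unrestricted"),
])
print(xml_text)
```

The container element is not itself one of the 15 Dublin Core elements, which is one reason it lives in a different namespace from its children.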

What Metadata Format?
• Every project/DL has its own metadata/data requirements, therefore most use a proprietary format.
• For maximum interoperability:
  - Map metadata to most descriptive format for use by close collaborators.
  - Map metadata to DC for use by all and sundry.
• How do we “map” metadata formats?
  - Do we actually store data in XML?
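At its simplest, mapping is a lookup table (often called a crosswalk) from local field names to target element names. The sketch below applies one to a flat record; the table `LOCAL_TO_DC` is hypothetical and borrows its local field names from the chalk example earlier in the deck.

```python
# Hypothetical crosswalk from a local schema to Dublin Core element names.
LOCAL_TO_DC = {
    "name": "title",
    "description": "description",
    "source": "source",
}


def crosswalk(record, mapping):
    """Map a flat record to DC; return the mapped record and the fields with no target."""
    mapped, dropped = {}, []
    for field, value in record.items():
        target = mapping.get(field)
        if target is None:
            dropped.append(field)  # no DC equivalent in this crosswalk
        else:
            mapped.setdefault(target, []).append(value)
    return mapped, dropped


mapped, dropped = crosswalk(
    {"name": "chalk", "colour": "white", "source": "Waltons Stationers"},
    LOCAL_TO_DC,
)
print(mapped, dropped)
```

Returning the dropped fields makes the cost of mapping to a simpler format visible: anything without a target element is lost, which is why a richer format is preferred between close collaborators.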

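Metadata Transformation
• Use XML parser to parse data.
• Use SAX/DOM to extract individual elements and generate new format.
• Example (to convert UCT to DC). The output element names were stripped from this transcript, so the dc: tags below are assumed, as is the use of the XML::DOM module:

    use strict;
    use warnings;
    use XML::DOM;

    # Parse the source record and get its root element.
    my $parser = XML::DOM::Parser->new;
    my $document = $parser->parsefile('uct.xml')->getDocumentElement;

    # Copy every title and author into the matching DC element (assumed names).
    foreach my $title ($document->getElementsByTagName('title')) {
        print "<dc:title>" . $title->getFirstChild->getData . "</dc:title>\n";
    }
    foreach my $author ($document->getElementsByTagName('author')) {
        print "<dc:creator>" . $author->getFirstChild->getData . "</dc:creator>\n";
    }

    # Constant value emitted for every record (assumed to be the publisher).
    print "<dc:publisher>UCT</dc:publisher>\n";

    # Use each version number as an identifier (assumed name).
    foreach my $version ($document->getElementsByTagName('version')) {
        foreach my $number ($version->getElementsByTagName('number')) {
            print "<dc:identifier>" . $number->getFirstChild->getData . "</dc:identifier>\n";
        }
    }

• Come on, there must be an easier way!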

Metadata Transformation (XSLT) 1/2
    <stylesheet version='1.0' xmlns=' xmlns:oaidc=' xmlns:dc=' xmlns:xsi=' xmlns:uct=' >
    <!-- UCT to DC transformation Hussein Suleman v1.0 : 24 July >
    UCT

Metadata Transformation (XSLT) 2/2
    <oaidc:dc xsi:schemaLocation="
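Most of the stylesheet is missing from this transcript, so it cannot be reproduced here. The idea behind it, describing the transformation as a set of rules rather than as parsing code, can still be sketched. The example below is not XSLT: it is a rule table applied with Python's standard `xml.etree.ElementTree`. The rules, the constant publisher element, and the namespace-free sample input are all assumptions.

```python
import xml.etree.ElementTree as ET

OAI_DC = "http://www.openarchives.org/OAI/2.0/oai_dc/"
DC = "http://purl.org/dc/elements/1.1/"

# Each rule pairs a path in the source record with the DC element to emit (assumed mapping).
RULES = [
    ("title", "title"),
    ("author", "creator"),
    ("version/number", "identifier"),
]


def transform(source_xml):
    """Apply the rule table to a source record and return an oai_dc element tree."""
    source = ET.fromstring(source_xml)
    target = ET.Element(f"{{{OAI_DC}}}dc")
    for path, dc_name in RULES:
        for node in source.findall(path):
            ET.SubElement(target, f"{{{DC}}}{dc_name}").text = node.text
    # Constant value added to every record (assumed to be the publisher).
    ET.SubElement(target, f"{{{DC}}}publisher").text = "UCT"
    return target


# Hypothetical input; the real UCT format and its namespace are not in the transcript.
SAMPLE = """<uct>
  <title>Visit to UCT</title>
  <author>Hussein Suleman</author>
  <version><number>1</number></version>
</uct>"""

result = transform(SAMPLE)
print(ET.tostring(result, encoding="unicode"))
```

Adding a field to the transformation means adding one line to `RULES`, which is the same convenience a stylesheet template offers over hand-written DOM traversal.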

Automatic Metadata Extraction
• Create metadata automatically from a digital object.
• Embedded Metadata, e.g., MP3 tags
• Heuristic Techniques, e.g., the first string that looks like a date is the date of publication
• Machine Learning, e.g., neural networks
• Dictionary Techniques, e.g., if it looks like a name, it could be an author
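The date heuristic above is easy to sketch with a regular expression. The pattern below recognises only two date styles, chosen for illustration, and `guess_publication_date` is a hypothetical helper; a real extractor would need many more patterns and some way to reject false matches.

```python
import re

# Matches ISO-style dates (2005-03-03) or "3 March 2005"-style dates.
DATE_PATTERN = re.compile(
    r"\b(\d{4}-\d{2}-\d{2}"
    r"|\d{1,2} (?:January|February|March|April|May|June|July|August"
    r"|September|October|November|December) \d{4})\b"
)


def guess_publication_date(text):
    """Return the first substring that looks like a date, or None if there is none."""
    match = DATE_PATTERN.search(text)
    return match.group(1) if match else None


print(guess_publication_date("Lecture notes, 24 July 2005. Revised 2005-08-01."))
```

The heuristic returns the first match whether or not it is the publication date, so a date quoted in a title or a citation would be picked up just as readily.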

References
• Dublin Core Metadata Initiative (2004). Dublin Core Metadata Element Set, Version 1.1: Reference Description.
• IMS Global Learning Consortium, Inc. (2001). IMS Learning Resource Meta-Data Information Model, Version Final Specification.
• Lasher, R. and D. Cohen (1995). A Format for Bibliographic Records. Network Working Group, RFC1807.
• Library of Congress (2002). Encoded Archival Description (EAD), Official EAD Version 2002 Web Site.
• Library of Congress (2005). MARC Standards.
• Library of Congress (2005). Metadata Encoding and Transmission Standard.
• Library of Congress (2005). Metadata Object Description Schema.
• Visual Resources Association Data Standards Committee (2002). VRA Core Categories, Version 3.0.
• XML Cover Pages (2005). MPEG-21 Part 2: Digital Item Declaration Language (DIDL).