A centre of expertise in digital information management
First steps in metadata
Ann Chapman
Policy and Advice team, UKOLN

What is metadata?
Structured data about something
Encountered every day
– bus & rail timetables
– phone directories
– Internet shopping sites (e.g. Amazon)
– ingredient lists on food items
– calendars (public holidays, religious festivals)
– event programmes (e.g. seminar, workshop)

More about metadata
Structured data about resources
– Library catalogues
– Abstracting and indexing services
– Archival finding aids
– Museum documentation
– Collection description
– Community information
Carriers
– Formats (e.g. MARC)
– Markup languages (e.g. HTML, SGML, XML)

Markup languages
SGML = Standard Generalised Markup Language - controls document formatting for publication
XML = Extensible Markup Language - next-generation SGML (a simplified subset)
HTML = HyperText Markup Language - an SGML application, controls display of web pages
All use tags (usually paired) to structure text into elements, e.g. headings, paragraphs, lists
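
The idea of paired tags structuring text into elements can be illustrated with a minimal Python sketch. The sample document and its element names (`document`, `heading`, `paragraph`, `list`, `item`) are hypothetical and chosen only for illustration; they are not taken from any real schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document: element names are illustrative only.
DOCUMENT = """
<document>
  <heading>First steps in metadata</heading>
  <paragraph>Metadata is structured data about something.</paragraph>
  <list>
    <item>bus timetables</item>
    <item>phone directories</item>
  </list>
</document>
"""


def outline(xml_text):
    """Return (element name, text) pairs for every element that carries text."""
    root = ET.fromstring(xml_text.strip())
    pairs = []
    for element in root.iter():
        text = (element.text or "").strip()
        if text:  # skip container elements that hold only whitespace
            pairs.append((element.tag, text))
    return pairs


if __name__ == "__main__":
    for name, text in outline(DOCUMENT):
        print(f"{name}: {text}")
```

Each opening tag such as `<heading>` is matched by a closing `</heading>`, and the parser recovers the structure from those pairs alone.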

Overview
MARC
ONIX
Dublin Core & application profiles
RSLP Collection Description
MARC 21 Community Information
Other metadata types

MARC Formats
MAchine Readable Catalogue records
– Library of Congress, 1960s
– Now in widespread use in many countries
– Catalogue once, use record many times
– Holdings can be attached
– 1960s: books, serials, maps, music scores
– 2006: any physical or digital resource

MARC - structure
Structured format and carrier
Numeric and alpha tags
Fixed fields
– Leader, ,
Variable fields
– 100, 110, 111, 245, 260, etc.

MARC - elements
1XX Main entry
2XX Title, Statement of Responsibility, edition, publication
3XX Physical description
4XX Series information
5XX Notes
6XX Subject access
7XX Added entries (alternative titles, multiple authors, etc.)
8XX Added entries for series
9XX References and local use fields
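
The block structure listed on this slide lends itself to a simple lookup. The following is a minimal Python sketch, assuming only the 1XX to 9XX groupings given above; the function name `describe_tag` is hypothetical, and tags outside those blocks are reported as not listed rather than guessed at.

```python
# Block meanings copied from the slide; keys are the first character of the tag.
MARC_BLOCKS = {
    "1": "Main entry",
    "2": "Title, statement of responsibility, edition, publication",
    "3": "Physical description",
    "4": "Series information",
    "5": "Notes",
    "6": "Subject access",
    "7": "Added entries (alternative titles, multiple authors, etc.)",
    "8": "Added entries for series",
    "9": "References and local use fields",
}


def describe_tag(tag):
    """Return the slide's description of the block a three-character tag belongs to."""
    if len(tag) != 3:
        raise ValueError(f"expected a three-character tag, got {tag!r}")
    # Blocks the slide does not cover fall through to a neutral message.
    return MARC_BLOCKS.get(tag[0], "Block not listed on the slide")


if __name__ == "__main__":
    for tag in ("100", "245", "650"):
        print(tag, "->", describe_tag(tag))
```

For example, `describe_tag("650")` returns the 6XX description, "Subject access".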

MARC 21 record
021 $a $s $c
$a Nederveen Pieterse, Jan P.
245 $a Development theory: $b deconstruction.
260 $a London: $b Sage, $c
$a xii, 195p. $c 25cm $e cased
440 $a Theory, culture and society
650 $a Economic development
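
A field written in the display form used on this slide (a tag followed by `$`-prefixed subfields) can be split apart with a few lines of Python. This is a minimal sketch, assuming that textual display form only; it does not read the binary MARC exchange format, and the function name `parse_field` is hypothetical.

```python
def parse_field(line, delimiter="$"):
    """Split a displayed MARC field into its tag and a list of (code, value) subfields."""
    tag, _, rest = line.strip().partition(" ")
    subfields = []
    # Everything before the first delimiter is empty, so skip element 0.
    for chunk in rest.split(delimiter)[1:]:
        if not chunk:
            continue  # ignore doubled delimiters
        code, value = chunk[:1], chunk[1:].strip()
        subfields.append((code, value))
    return tag, subfields


if __name__ == "__main__":
    tag, subfields = parse_field("245 $a Development theory: $b deconstruction.")
    print(tag)
    for code, value in subfields:
        print(f"  ${code} {value}")
```

Keeping subfields as an ordered list rather than a dictionary preserves repeated subfield codes, which MARC fields may contain.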

ONIX Formats
Primary use
– Publishers to Internet booksellers
– Rich product information
3 formats for product information metadata
– Books, Serials, Licensing Terms
ONIX for Books in use:
– First version 1999
– Current version release 2.0 (2001)
Carrier – XML
Elements – XML reference name and tag

ONIX - elements
Message header
Product record
– identifiers, author, title, edition, language, subject, audience, descriptions, publisher, dates
– territorial rights, dimensions, suppliers, availability, promotions
Main series and sub-series records

ONIX for Books - record
Alice in Wonderland
Author
Carroll, Lewis
Collins
2000

Dublin Core - structure
Simple resource discovery
DCMES – Dublin Core Metadata Element Set
HTML is the most common carrier
Comprises 15 elements with
– Element qualifiers
– Element encoding schemes
– Optional/mandatory elements
Application profiles

Dublin Core - elements
Title
Creator
Subject
Description
Publisher
Contributor
Date
Resource Type
Format
Resource identifier
Source
Language
Relation
Coverage
Rights

Dublin Core - record
Alice in Wonderland
Lewis Carroll
Fiction
Project Gutenberg
2000
ASCII file via FTP
http://promo.net/pg/…
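
Since HTML is named as the most common carrier for Dublin Core, a record like this one can be sketched as HTML `meta` elements using the widely used `DC.` name prefix. The Python below is a minimal sketch under stated assumptions: the pairing of each value with an element name is inferred from the order of the values and is not confirmed by the slide, the truncated identifier URL is left out, and the function name `dc_meta_tags` is hypothetical.

```python
from html import escape

# Assumed pairing of the slide's values with Dublin Core element names.
RECORD = {
    "title": "Alice in Wonderland",
    "creator": "Lewis Carroll",
    "subject": "Fiction",
    "publisher": "Project Gutenberg",
    "date": "2000",
    "format": "ASCII file via FTP",
}


def dc_meta_tags(record):
    """Render a dict of Dublin Core elements as HTML meta tags, one per line."""
    # The link element declares which schema the "DC." prefix refers to.
    lines = ['<link rel="schema.DC" href="http://purl.org/dc/elements/1.1/">']
    for element, value in record.items():
        content = escape(value, quote=True)  # protect quotes and angle brackets
        lines.append(f'<meta name="DC.{element}" content="{content}">')
    return "\n".join(lines)


if __name__ == "__main__":
    print(dc_meta_tags(RECORD))
```

The output is intended to sit inside the `head` of a web page, where harvesters and search tools can read it without affecting what the page displays.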

RSLP Collection Description
Schema developed May 2000 for RSLP programme
MS Access database for RSLP – summer 2001
Web-based implementations: Revealweb, Cornucopia, Backstage, PADDI, MASC25, SCONE, Cecilia, RASCAL
Based on same model: SCONE
General attributes
Subject
Dates
Associated agents
External relationships

Coll. Desc. - elements
General: title, identifier, description, strength, physical characteristics, language, type, access control, accrual status, legal status, custodial history, note, location
Subject: concept, object, name, place, time
Dates: accumulation, contents
Agents: creator, owner
Relationships: sub & super-collections, catalogues and descriptions, associated collections and publications

Coll. Desc. - record
Title: Pitman Collection
Strength: Shorthand – national significance
Phys.Char.: printed texts and manuscripts
Lang: English, Spanish, Esperanto, ….
Access: Written request to the Librarian, University of Bath
Accrual: passive, deposit
Location: The Library, University of Bath, Bath
Subject: shorthand, Sir Isaac Pitman, phonetic alphabets
Owner: Pitman Publishing Co.
Catalogue: University of Bath Library OPAC

MARC 21 Community Information
Same principles as MARC 21 Bibliographic
Leader
– Individual / organization / program / event / other
Fixed fields – , fixed fields
– 007 disability facilities
– 008 special aspects
Variable fields

M 21 Comm. Inf. – elements
1XX Name
2XX Title and Address
3XX Physical description
4XX Series (for events)
5XX Notes
6XX Subject access
7XX Added entries
8XX Other variable fields

M 21 Comm. Inf. – record
110 $a CILIP
245 $a CILIP HQ
247 $a LA HQ $f 19?? –
$a Ridgmount St, London WC1E 7AE $k $m $r 9am to 6pm
311 $a Ewart Room $d seats 50 $g £100 per day
312 $a Overhead projector $f £10 per day
581 $a Library + Information Update
856 $a

Other metadata formats
IEEE LOM – learning object metadata
EAD – Encoded Archival Description
Theatre Information Group DTD – performance data

Metadata – fit for purpose
MARC 21 Bibliographic – libraries
ONIX – book trade and libraries
Dublin Core – Internet
EAD – archives
Collection description – archives, libraries, museums
M21 Community Information – primarily libraries

Contact details
Ann Chapman
UKOLN
University of Bath, Bath BA2 7AY