CLIR/DLF primer
Metadata: practice and practice
Lorcan Dempsey, VP Research and Chief Strategist
CLIR/DLF. Managing Digital Assets: A Primer for Library and Information Technology Administrators
Charleston, SC, February 4-6, 2005
Overview: Part 1, Part 2, Part 3, Part 4
Some themes
- Consolidation: fragmentation gets in the way
- Industrialization: much of our metadata creation is a cottage industry; current approaches will not scale
- Cost: value
- Intellectual and machine: need to work harder to programmatically create metadata
- Institutions and service: move from projects to service
Example: Metasearch/portal
Metadata is everywhere ;-)
A ‘portal’ turned inside out …
Layers: Common services, Content services, Application services, Presentation services
I need a few references
- Authentication
- Directory: user profile
- Query broker
- Directory: service/collection description
- Content: results list
I’d like to get this book.
- Request broker
- Directory: ILL policy
- Directory: service/collection description
- Content: circ/ILL system
I need this article too.
- Request broker
- OpenURL resolver
- Directory: local knowledge base
- Nearly there … Directory: service/collection description
- Content: article
Layers: Common services, Content services, Application services, Presentation services
- Authentication
- Directory: user profile
- Query broker
- Directory: service/collection description
- Reference db
- Request broker
- Directory: ILL policy
- Circ/ILL system
- OpenURL resolver
- Directory: local knowledge base
- Article db
Metadata for multiple entities is required to support operations. This picture could be extended in multiple ways.
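The article request in the portal picture passes through an OpenURL resolver. As a rough sketch of what that hand-off could look like, the Python below encodes a citation as an OpenURL in the Z39.88-2004 key/encoded-value style. The resolver address, the function name and the citation values are all made up for illustration; a real resolver would consult its local knowledge base to decide where to send the user.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Hypothetical resolver address; each institution runs its own.
RESOLVER_BASE = "https://resolver.example.edu/openurl"


def build_openurl(citation, base=RESOLVER_BASE):
    """Encode a journal-article citation as an OpenURL query string."""
    params = {
        "url_ver": "Z39.88-2004",
        "rft_val_fmt": "info:ofi/fmt:kev:mtx:journal",
        "rft.genre": "article",
    }
    # Prefix each citation field with "rft." (the referent being requested).
    for key, value in citation.items():
        params["rft." + key] = value
    return base + "?" + urlencode(params)


# Illustrative citation only; not a real article.
citation = {
    "atitle": "An example article",
    "jtitle": "Journal of Examples",
    "issn": "1234-5679",
    "volume": "12",
    "spage": "34",
    "date": "2004",
}
url = build_openurl(citation)

# Decode the query again to show that the metadata survives the round trip.
query = parse_qs(urlsplit(url).query)
print(url)
```

The point of the sketch is that the request broker only needs to pass along metadata about the article; where the article actually lives is decided later, by the resolver.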
Metadata as intelligence …
- Know what resources are available
- Know how to play a resource
- Know provenance of a resource
- Know what use policy governs a resource
- Know how to ingest a resource
- Know how to interact with a resource
- Know how to compose/decompose resources
- …
… allows people and machines to work smarter … Metadata?
Metadata? Schematized … statements about … resources
Something about resources
Resources: everything that moves
Multiple types of information
- Objects
- Collections
- Services
- People
- Organizations
- Places
- Terms
- Formats
- Rights
- Business terms
- License
… and will support multiple operations
- Discovery to delivery
- Digital asset management
- Publishing interfaces: intersections between user information spaces and library information spaces
Different classes of metadata, increasingly a part of complex object models
Classes: Descriptive, Structural, Technical, Administrative, Rights, Preservation, Tracking, Provenance
Complex object models: SCORM/Content package, METS, MPEG 21, …
Community?
Communities: Cultural heritage, Media industry, Web/Internet, Library, Instructional technology, E-gov, Research communities
Schemes in use:
- EAD, MARC AMC, …
- MARC, MODS, DC, RSLP, …
- Onix, …
- XML, RDF, OWL, …
- CSDGM, DDI, NBII, IVOA, …
- EGMS, AGLS, GILS, …
- GEM, DC-ED, IEEE-LOM, SCORM, …
- MPEG, JPEG, TIAA-CREF, …
So …
- More than discovery …
- More than information objects …
- More than library …
Something about schematized
Simple descriptive metadata!!
- Information model: FRBR, INDECS, CIDOC, …
- ‘Element set’: MARC21, DC, VRA Core, MODS, Onix, …
- Values/content: cataloging rules, controlled vocabs., …
- Encoding: XML, ISO2709, …
- Application profile
OAI
OAI-based mediation
[Figure: OAI Server #1, OAI Server #2, OAI Server #3; OAI harvester; merged resource; web browser]
OAI
- A way of ‘publishing’ processable metadata on the network
- A way of synchronizing databases
- And … the same for resources themselves?
- A nice building block for other services
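To make the harvesting idea concrete, here is a minimal Python sketch of the two things a harvester does under the OAI protocol for metadata harvesting (OAI-PMH): ask a server for records with a ListRecords request, and pull identifiers and Dublin Core titles out of the response. The repository address, the function names and the embedded sample response are assumptions made for illustration; a real harvester would fetch the response over HTTP and keep following resumption tokens until none is returned.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"
DC_NS = "{http://purl.org/dc/elements/1.1/}"


def list_records_url(base_url, metadata_prefix="oai_dc", token=None):
    """Build a ListRecords request; a resumption token replaces other arguments."""
    if token:
        params = {"verb": "ListRecords", "resumptionToken": token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def parse_records(xml_text):
    """Return (records, resumption_token) from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iter(OAI_NS + "record"):
        identifier = rec.findtext(OAI_NS + "header/" + OAI_NS + "identifier")
        titles = [t.text for t in rec.iter(DC_NS + "title")]
        records.append({"identifier": identifier, "titles": titles})
    token = root.findtext(".//" + OAI_NS + "resumptionToken")
    return records, token or None


# Hand-written sample response standing in for a real server's reply.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:repo1.example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Sample record one</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

records, token = parse_records(SAMPLE)
print(list_records_url("https://repo1.example.org/oai"))
print(records, token)
```

Running the same two steps against several servers and pooling the results is one way to build a merged resource of the kind described for OAI-based mediation.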
An example
The following pages show some experimental services where OAI is used to ‘publish’ metadata. There is a wiki interface to metadata stores managed under OAI.
Interoperability
- Recombinant potential
- Economic and service issues
- Cost
Interoperability is a factor at all these levels, for example:
- Encoding
- Element set
- Content/values
Examples: Z39.2/MARC/AACR, DC, OAI
Importance of agreements
- DC profile
- Vocabularies
This gives a context for discussing …
Traditional library practice
- Strive for consistency at all three levels in the ISO 2709/MARC/AACR model
- Institutionalised in standards, OCLC/RLG/LC, committees, …
Dublin Core
- Consistency of element set
- A small number of encodings
- Content/values subject to separate agreement
OAI
- A transport for resources
- No control over the transported resources
So …
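One way to picture interoperability at the element-set level is a crosswalk. The Python sketch below maps a handful of MARC fields to Dublin Core elements using commonly cited correspondences (100 to creator, 245 to title, 260 $b to publisher, 650 to subject). The record layout, the function name and the sample record are assumptions for illustration, and the mapping is far from complete.

```python
# Simplified MARC-to-Dublin-Core crosswalk: (tag, subfield) -> DC element.
CROSSWALK = {
    ("100", "a"): "creator",
    ("245", "a"): "title",
    ("260", "b"): "publisher",
    ("650", "a"): "subject",
}


def marc_to_dc(fields):
    """Map a list of (tag, subfield, value) triples to a DC-style dict."""
    dc = {}
    for tag, subfield, value in fields:
        element = CROSSWALK.get((tag, subfield))
        if element is None:
            continue  # no agreed mapping: the statement is dropped
        dc.setdefault(element, []).append(value)
    return dc


# Illustrative record, not taken from a real catalog.
record = [
    ("100", "a", "Doe, Jane"),
    ("245", "a", "An example title"),
    ("650", "a", "Metadata"),
    ("300", "a", "200 p."),
]
dc_record = marc_to_dc(record)
print(dc_record)
```

The values pass through unchanged: agreeing on the element set says nothing about whether two sources use the same name forms or subject vocabulary, which is why content/values need a separate agreement.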
Something about collections
Collections grid
[Figure: grid plotting Stewardship against Uniqueness, each from low to high]
- Books, Journals, Newspapers, Gov. docs, CD, DVD, Maps, Scores
- Special collections: Archives, Rare books, Local history materials, Archives & Manuscripts, Theses & dissertations
- Freely-accessible web resources
- Research and learning materials: ePrints/tech reports, Learning objects, Courseware, E-portfolios, Research data, Untransferred records
Collections grid
[Figure: Stewardship/Uniqueness grid annotated with E-learning, E-research, Publishing, Cultural heritage disclosure, Reformatting, Digital asset management, Amazoogle, D2D]
Some observations
[Figure: Stewardship/Uniqueness grid with Bought materials, Licensed materials, Special collections/archives, and Research and learning materials]
Metadata
- Cost/value
- Routine?
- Consolidated?
Making data work
Reading in the dark
Some thoughts …
- Fragmentation
  - Fragmentation reduces gravitational pull
  - Fragmentation increases cost
- Consolidation
  - Services
  - Processing
  - Mobilize collective capacity
- Routinization/industrialization
  - Programmatic extraction of metadata from digital resources
- Agreement
- Plural disclosure
  - Want to make stuff available in lots of ways
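As a small illustration of programmatic extraction of metadata from digital resources, the Python sketch below pulls the title and any named meta tags out of an HTML page using only the standard library. The class name and the sample page are invented for this example; real extraction would need to cope with many more formats and with much messier input.

```python
from html.parser import HTMLParser


class MetaExtractor(HTMLParser):
    """Collect the <title> text and any <meta name=... content=...> pairs."""

    def __init__(self):
        super().__init__()
        self.metadata = {}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            name = attrs.get("name")
            content = attrs.get("content")
            if name and content is not None:
                # Names are lower-cased so "DC.creator" and "dc.creator" match.
                self.metadata[name.lower()] = content

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.metadata["title"] = self.metadata.get("title", "") + data.strip()


# Hand-written sample page, for illustration only.
SAMPLE_PAGE = """<html><head>
<title>Annual report</title>
<meta name="DC.creator" content="Example University Library">
<meta name="keywords" content="reports, library">
</head><body><p>Text</p></body></html>"""

extractor = MetaExtractor()
extractor.feed(SAMPLE_PAGE)
extracted = extractor.metadata
print(extracted)
```

Nothing here needs a cataloger for each item, which is the sense in which this kind of processing could be routinized.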
Thank you! Lorcan Dempsey