Metadata Week 4 LBSC 671 Creating Information Infrastructures.

Slides:



Advertisements
Similar presentations
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
Advertisements

Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
RDA: A New Standard Supporting Resource Discovery Presentation given at the CLA conference session The Future of Resource Discovery: Promoting Resource.
Mark Evans, Tessella Digital Preservation Boot Camp – PASIG meeting, Washington DC, 22 nd May 2013 PREMIS Practical Strategies For Preservation Metadata.
Evidence from Metadata LBSC 796/CMSC 828o Session 6 – March 1, 2004 Douglas W. Oard.
3. Technical and administrative metadata standards Metadata Standards and Applications.
Evidence from Metadata LBSC 796/INFM 718R Session 9: November 5, 2007 Douglas W. Oard.
Introducing Symposia : “ The digital repository that thinks like a librarian”
1 CS 502: Computing Methods for Digital Libraries Lecture 17 Descriptive Metadata: Dublin Core.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Description Week 5 LBSC 671 Creating Information Infrastructures.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
By Carrie Moran. To examine the Metadata Object Description Schema (MODS) metadata scheme to determine its utility based on structure, interoperability.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
Metadata Standards and Applications 5. Applying Metadata Standards: Application Profiles.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
Metadata Week 4 LBSC 671 Creating Information Infrastructures.
Metadata Considerations Implementing Administrative and Descriptive Metadata for your digital images 1.
Evidence from Metadata LBSC 796/INFM 718R Session 9: April 6, 2011 Douglas W. Oard.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Metadata: Essential Standards for Management of Digital Libraries ALI Digital Library Workshop Linda Cantara, Metadata Librarian Indiana University, Bloomington.
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.
PREMIS Rathachai Chawuthai Information Management CSIM / AIT.
Introduction to Omeka. What is Omeka? - An Open Source web publishing platform - Used by libraries, archives, museums, and scholars through a set of commonly.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
Resource Description and Access Deirdre Kiorgaard Australian Committee on Cataloguing Representative to the Joint Steering Committee for the Development.
Metadata and Documentation Iain Wallace Performing Arts Data Service.
Metadata Bridget Jones Information Architecture I February 23, 2009.
Resource Description and Access Deirdre Kiorgaard ACOC Seminar, September 2007.
Introduction to metadata
Evidence from Metadata INST 734 Doug Oard Module 8.
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
The physical parts of a computer are called hardware.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Intellectual Works and their Manifestations Representation of Information Objects IR Systems & Information objects Spring January, 2006 Bharat.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Metadata : an overview XML and Educational Metadata, SBU, London, 10 July 2001 Pete Johnston UKOLN, University of Bath Bath, BA2 7AY UKOLN is supported.
5. Applying metadata standards: Application profiles Metadata Standards and Applications Workshop.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Evidence from Metadata INST 734 Doug Oard Module 8.
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
Current initiatives in developing library linked data Gordon Dunsire Presented at the Cataloguing and Indexing Group Scotland seminar “Linked data and.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American.
Metadata & Repositories Jackie Knowles RSP Support Officer.
Attributes and Values Describing Entities. Metadata At the most basic level, metadata is just another term for description, or information about an entity.
Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.
Information organization Week 2 Lecture notes INF 380E: Perspectives on Information Spring 2015 Karen Wickett UT School of Information.
Slides Template for Module 3 Contextual details needed to make data meaningful to others CC BY-NC.
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Introduction to Metadata
PREMIS Tools and Services
Metadata in Digital Preservation: Setting the Scene
Attributes and Values Describing Entities.
FRBR and FRAD as Implemented in RDA
Presentation transcript:

Metadata Week 4 LBSC 671 Creating Information Infrastructures

Muddiest Points Memory madness –Hard drives, DVD’s, solid state “disks,” tape, … Digitization –Images, audio, video, compression, file names, … Where LOCKSS fits in all this Digital preservation vs. digitization for preservation

Tonight Finishing Preservation Metadata

Preserving Behavior Word processors –Formatting, track changes, undo deleted text Spreadsheets –Formulas, visualizations Databases –Queries, forms, derived values Computer-Assisted Design (CAD) –Display, modification, manufacturing Software –Simulation, games, embedded systems, …

Behavior Preservation Strategies Format migration –For example, convert Word Perfect to PDF Emulation –Allows running old software on newer systems

Apollo Guidance Computer Emulation

An Integrated Strategy Delay decay of organic materials –But balance costs and benefits Balance quality and scale –Preservation: rescue at-risk collections –Access: Quantity has a quality all its own Design in diversity –Technologies, risk exposure, institutions Adequately resource the process

Tonight Finishing Preservation  Metadata

Two Ways of Searching Write the document using terms to convey meaning Author Content-Based Query-Document Matching Document Terms Query Terms Construct query from terms that may appear in documents Free-Text Searcher Retrieval Status Value Construct query from available concept descriptors Controlled Vocabulary Searcher Choose appropriate concept descriptors Indexer Metadata-Based Query-Document Matching Query Descriptors Document Descriptors

Functional Requirements for Bibliographic Records (FRBR) Midsummer Night’s Dream August 23 Performance 2005 Free for All Seat 23G

FRBR Bibliographic User Tasks Find it –Search (“to find”) –Recognize (“to identify”) –Choose (“to select”) Serve it –Location (“to obtain”)

FRBR Entity Types Subject-Only Entities –(abstract) Concepts –(tangible) Objects –(any kind of) Places –Events Subject or Responsibility Entities –Persons –“Corporate” Bodies (~any kind of organization) –Families (technically, only in FRAD) Product Entities –Works, Expressions, Manifestations, Items

Functional Requirements for Authority Data IFLA, 2013

FRBR in OCLC’s FictionFinder

Dublin Core Goals: –Easily understood, implemented and used –Broadly applicable to many applications Approach: –Intersect several standards (e.g., MARC) –Suggest only “best practices” for element content Implementation: –Initially 15 optional and repeatable “elements” Refined using a growing set of “qualifiers” –Now extended to 22 elements

Dublin Core Elements (version 1.1) Content Title Subject [LCSH, MeSH, …] Description Type Coverage [spatial, temporal, …] Related resource Rights Instantiation Date [Created, Modified, Copyright, …] Format Language Identifier [URI, Citation, …] Responsibility Creator Contributor Source Publisher

Resource Description Framework XML schema for describing resources Can integrate multiple metadata standards –Dublin Core, P3P, PICS, vCARD, … Dublin Core provides a XML “namespace” –DC Elements are XML “properties DC Refinements are RDF “subproperties” –Values are XML “content”

Dublin Core in RDF XML <rdf:RDF xmlns:rdf=" xmlns:dc=" <rdf:Description rdf:about=" Rose Bush A Guide to Growing Roses Describes process for planting and nurturing different kinds of rose bushes

Aspects of Metadata Framework –Functional Requirements for Bibliographic Records (FRBR) Schema (“Data Fields and Structure”) –Dublin Core Guidelines (“Data Content and Values”) –Resource Description and Access (RDA) –Library of Congress Subject Headings (LCSH) Representation (abstract “Data Format”) –Resource Description Framework (RDF) Serialization (“Data Format”) –RDF in eXtensible Markup Language (RDF/XML) Adapted from Elings and Waibel, First Monday, (12)3, 2007

Some Types of “Metadata” Descriptive –Content, creation process, relationships Technical –Format, system requirements Administrative –Acquisition, authentication, access rights Preservation –Media migration Usage –Display, derivative works Adapted from Introduction to Metadata, Getty Information Institute (2000) Not in Taylor & Joudrey

Metadata Encoding and Transmission Standard (METS) Descriptive metadata (e.g., subject, author) Administrative metadata (e.g., rights, provenance) Technical metadata (e.g., resolution, color space) Behavior (which program can render this?) Structural map (e.g., page order) –Structural links (e.g., Web site navigation links) Files (the raw data) Root (meta-metadata)

Aspects of Metadata What kinds of objects can we describe? –MARC, Dublin Core, FRBR, … How can we convey it? –MODS, RDF, OAI-PMH, METS What can we say? –LCSH, MeSH, PREMIS, … What can we do with it? –Discovery, description, reasoning

FRBR Bibliographic User Tasks Find it –Search (“to find”) –Recognize (“to identify”) –Choose (“to select”) Serve it –Location (“to obtain”)

Broader View of Metadata Uses Have it –Preservation (e.g., PREMIS) –Validation –Disposition Find it –Search/Recognize/Choose –Browse (“Navigation”) Serve it –Persistent location –Structure –Surrogates Use it –Context –Rights management –User behavior capture –Reasoning (“Semantic Web”)

Metadata Sources Automated –Capture –Extraction –Classification Manual –Professional –Community –Personal

Metadata Capture: Exchangeable Image Format (EXIF) Time Location Camera manufacturer and model Camera orientation Exposure information (shutter speed, f stop) Thumbnail versions –Altering the image may not change the thumbnail!

Inconsistent Metadata

Metadata Capture: Message metadata –Times Sent Resent Received –Route –In-reply-to –Attachment file type System metadata –Folder

Metadata Capture: Windows File System (NTFS) Time file created (or copied) –Most recent one; optionally “journaled” Time file content changed (or made changeable) –Most recent one; optionally “journaled” Time file renamed (or moved) –Most recent one Time file metadata created or changed –Most recent one Time file accessed (content or metadata) –Most recent one; optionally disabled

Metadata Capture: Microsoft Word Author Title Dates (may not agree with file system) –Created –Modified –Accessed –Printed –Each tracked change

Minimum Scope SegmentObjectClass View Listen Select Print Bookmark Save Purchase Delete Subscribe Copy / paste Quote Forward Reply Link Cite Mark up Tag Publish Organize Behavior Category Examine Retain Reference Annotate Create Type Edit Metadata Capture: User Behavior

Exploiting Behavioral Metadata

Metadata Extraction: Named Entity “Tagging” Machine learning techniques can find: –Location –Extent –Type Two types of features are useful –Orthography e.g., Paired or non-initial capitalization –Trigger words e.g., Mr., Professor, said, …

Metadata Sources Automated –Capture –Extraction –Classification Manual –Professional –Community –Personal

Community Metadata: “Folksonomies”

van Ahn and Dabbish, CHI 2004 Community Metadata: Games With a Purpose

Community Metadata: Crowdsourcing

Sources of File Type Metadata Capture: –MyDocument.xls –Attachment MIME type Extraction –“Magic bytes” Classification –Machine learning on byte sequences Manual –Mechanical Turk

Metadata Challenges Balancing cost and benefit Accommodating dynamic factors –Content –Location Reuse for unanticipated purposes Remaining interpretable in the far future

Open Archives Initiative- Protocol for Metadata Harvesting (OAI-PMH)

Linked Open Data

Web Ontology Language (OWL) astronaut Astronaut astronaute <rdfs:subClassOf rdf:resource="

“Semantic Web” Search

Putting It All Together Adapted from Elings and Waibel, First Monday, (12)3, 2007

Before You Go! On a sheet of paper (no names), answer the following question: What was the muddiest point in today’s class?