Metadata Standards & Applications 7. Approaches to Models of Metadata Creation, Storage, and Retrieval.

Presentation transcript:

Metadata Standards & Applications 7. Approaches to Models of Metadata Creation, Storage, and Retrieval

Goals for Session
• Understand the differences between traditional vs. digital library …
  – Metadata creation
  – Storage options for metadata and content
  – Retrieval and discovery issues

Creating Metadata Records
• The “Library Model”
  – Trained catalogers, one-at-a-time metadata records
• The “Submission Model”
  – Authors create metadata when submitting resources
• The “Automated Model”
  – Automated tools create metadata for resources
• Combination approaches

The Library Model
• Records created “by hand,” one at a time
• Shared documentation and content standards (AACR2, etc.)
• Efficiencies achieved by sharing information on commonly held resources
• Not easily extended past the “Granularity Assumptions” in current practice

The Submission Model
• Based on author or user generated metadata
• Can be wildly inconsistent
  – Submitters generally untrained
  – May be expert in one area, clueless in others
• Often requires editing support for usability
• Inexpensive, but not satisfactory as an only option

The Automated Model
• Based largely on text analysis; doesn’t usually extend well to non-text or low-text
• Requires development of appropriate evaluation and editing processes to support even minimal quality standards
• Still largely research; few large, successful production examples ... yet
• One simple automated tool to try:
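
To make the "text analysis" point concrete, here is a minimal sketch of automated metadata drafting. It is not the tool the slide refers to; the function name `draft_metadata`, the stopword list, and the sample text are all assumptions made for illustration. It takes the first non-empty line as a title guess and the most frequent content words as subject keywords, and it flags every record for human review, in line with the evaluation and editing step the slide calls for.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list (an assumption for this sketch).
STOPWORDS = {"the", "a", "an", "and", "of", "to", "in", "for", "is", "are", "on", "with", "by"}


def draft_metadata(text, max_keywords=3):
    """Draft a rough metadata record from plain text using word frequencies."""
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    title = lines[0] if lines else ""  # naive guess: first non-empty line
    words = re.findall(r"[a-z]+", text.lower())
    # Count words that are not stopwords and are longer than three letters.
    counts = Counter(w for w in words if w not in STOPWORDS and len(w) > 3)
    keywords = [word for word, _ in counts.most_common(max_keywords)]
    # Every automated record is marked for human evaluation and editing.
    return {"title": title, "subject": keywords, "needs_review": True}


sample = """Nematode Parasites of Insects
Nematode species attack insects in soil. Soil moisture affects nematode survival."""

record = draft_metadata(sample)
empty_record = draft_metadata("")  # e.g. an image with no extractable text
print(record)
print(empty_record)
```

The empty input shows why this approach does not extend to non-text or low-text resources: with no words to count, the draft record has no title and no subjects.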

“Like any other data management processes (such as data normalization or the control of information quality), the creation of metadata requires an investment of resources. However, the relationship between investment in metadata creation and the resulting level of resource discoverability is not linear. The more elements from a metadata set that are implemented, the greater the investment of resources that is required. In addition, the more data elements used, the greater the chances for error and divergence among record creators and implementations.”
-- Norm Friesen, CanCore Guidelines: Introduction

Combination Approaches
• Combination Machine and Human created Metadata
  – Ex.: INFOMINE (
  – Check out their tool: (
• Combination metadata and content indexing
  – Ex.: NSDL (

Content “Storage” and Retrieval Models
• ‘Storage models’ in this context relate to the relationship between the metadata and content (not the systems that purport to ‘store’ content for various uses)
• This relationship affects how access to the information is accomplished, and how the metadata either helps or hinders the process (or is irrelevant to it)

Common ‘Storage Models’
• Content with metadata
• Metadata only
• Service only

Content with Metadata
• Examples:
  – HTML pages with embedded ‘meta’ tags
  – Most content management systems (though they may store only technical or structural metadata)
  – Text Encoding Initiative (TEI), a full-text markup language (as an example of an application, see the Comic Book Markup Language at
• Often proves difficult to scale
• Not optimized to manage metadata well over time
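
As an illustration of the first example, the sketch below pulls embedded ‘meta’ tags out of an HTML page using Python's standard `html.parser` module. The sample page, the `DC.`-prefixed element names, and the class name `MetaTagExtractor` are assumptions for this sketch and do not come from the presentation.

```python
from html.parser import HTMLParser


class MetaTagExtractor(HTMLParser):
    """Collect name/content pairs from <meta> tags embedded in a page."""

    def __init__(self):
        super().__init__()
        self.metadata = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = dict(attrs)
        name = attributes.get("name")
        content = attributes.get("content")
        if name and content is not None:
            # A name may repeat (e.g. several creators), so keep a list.
            self.metadata.setdefault(name, []).append(content)


# Hypothetical page: the metadata travels inside the content itself.
PAGE = """<html><head>
<title>Sample page</title>
<meta name="DC.title" content="Sample page">
<meta name="DC.creator" content="Example, Author">
<meta name="keywords" content="metadata, cataloging">
</head><body><p>Content lives here.</p></body></html>"""

extractor = MetaTagExtractor()
extractor.feed(PAGE)
embedded = extractor.metadata
print(embedded)
```

Because the metadata lives in each page, correcting it means touching every page that carries it, which is one reason this model is hard to scale and to maintain over time.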

Metadata only
• Library catalogs
  – Web-based catalogs often provide some services for digital content
• Electronic Resource Management (ERM) Systems
  – Provide metadata records for title level only
  – Usually intended to manage licensing and access to article level information
• Metadata aggregations (a.k.a. ‘Digital Libraries’ or ‘Portals’ linking to other people’s content)
  – Using APIs or OAI-PMH for harvest and redistribution
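
The OAI-PMH harvesting mentioned above can be sketched as follows. The code builds a `ListRecords` request URL and parses a small hand-written response in the `oai_dc` format. The base URL `https://example.org/oai`, the record identifier, and the sample XML are assumptions, and no network call is made. A real harvester would also need to handle resumption tokens, deleted records, and errors.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Namespaces defined by OAI-PMH 2.0 and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc"):
    """Build a ListRecords request for a repository's OAI-PMH base URL."""
    query = urlencode({"verb": "ListRecords", "metadataPrefix": metadata_prefix})
    return base_url + "?" + query


def parse_records(xml_text):
    """Extract identifier and titles from each harvested record."""
    root = ET.fromstring(xml_text)
    records = []
    for record in root.iterfind(".//oai:record", NS):
        records.append({
            "identifier": record.findtext("oai:header/oai:identifier", namespaces=NS),
            "titles": [el.text for el in record.iterfind(".//dc:title", NS)],
        })
    return records


# Hand-written sample response (an assumption, trimmed to the essentials).
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Sample harvested record</dc:title>
          <dc:creator>Example, Author</dc:creator>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

request_url = list_records_url("https://example.org/oai")
harvested = parse_records(SAMPLE_RESPONSE)
print(request_url)
print(harvested)
```

Note that the aggregator ends up holding only the metadata; the content stays with the original provider, which is what makes this a "metadata only" model.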

Service only
• Often supported partially or fully by metadata
• Google, Yahoo (and others)
  – Sometimes provide both search services and distributed search software
  – Using metadata from libraries as part of their large-scale digitization projects
• Electronic journals (article level)
  – Linked using ‘link resolvers’ or available independently from websites
  – Have metadata behind their services but don’t generally distribute it separately
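
The slide does not say how link resolvers receive their input. In common practice they take an OpenURL, which packs citation metadata into a query string. The sketch below assumes the OpenURL 1.0 (Z39.88-2004) key/encoded-value format for journal articles. The resolver address `https://resolver.example.org/openurl` and the citation values are hypothetical.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def article_openurl(resolver_base, citation):
    """Encode article-level citation metadata as an OpenURL query string."""
    params = {
        "url_ver": "Z39.88-2004",                       # OpenURL 1.0 version tag
        "rft_val_fmt": "info:ofi/fmt:kev:mtx:journal",  # journal metadata format
    }
    for field, value in citation.items():
        params["rft." + field] = value                  # "rft" = the referent (cited item)
    return resolver_base + "?" + urlencode(params)


# Hypothetical citation; field names follow the journal format's keys.
citation = {
    "atitle": "A sample article",
    "jtitle": "Journal of Examples",
    "volume": "12",
    "spage": "34",
    "date": "2006",
}

link = article_openurl("https://resolver.example.org/openurl", citation)
decoded = parse_qs(urlsplit(link).query)
print(link)
```

The metadata is used on the fly to route the reader to a copy of the article and is never handed over as a record, which fits the "service only" pattern described above.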

Common Retrieval Models
• Library catalogs
• Web-based (“Amazoogle”)
• Portals and federations

“Old” Library Catalogs
• Based on a ‘Granularity Consensus’ increasingly mysterious to users
• Include expectations of uniformity of information content and presentation
• Designed to optimize recall and precision
• Addition of relevance ranking and keyword searching by vendor systems of limited value (the only ‘text’ used is the metadata itself, not the content)
• Retrieval options limited by LMS vendor ignorance of library data
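
Recall and precision have standard definitions. Precision is the share of retrieved items that are relevant, and recall is the share of relevant items that were retrieved. The sketch below computes both for two made-up result sets. The record identifiers and the two "searches" are assumptions chosen to show the trade-off, not measurements of any real system.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one search result set."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = retrieved & relevant
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical collection: four records are truly relevant to the query.
relevant = {"rec1", "rec2", "rec3", "rec4"}

# A narrow controlled-heading search: everything found is relevant, but half is missed.
heading_search = ["rec1", "rec2"]

# A broad keyword search: finds everything relevant, plus as much noise.
keyword_search = ["rec1", "rec2", "rec3", "rec4", "rec5", "rec6", "rec7", "rec8"]

heading_scores = precision_recall(heading_search, relevant)
keyword_scores = precision_recall(keyword_search, relevant)
print("heading search (precision, recall):", heading_scores)
print("keyword search (precision, recall):", keyword_scores)
```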

“New” Library Catalogs
• ENDECA
  – North Carolina State University Libraries, in 2006, was one of the first to experiment with new catalog technologies using legacy metadata
• eXtensible Catalog Project
  – University of Rochester is attempting to provide a FRBR-ized catalog and integrated access to previously “silo-ed” data managed by libraries

Web-based
• The “Amazoogle” model:
  – Lorcan Dempsey: “Amazon, Google, eBay: massive computational and data platforms which exercise strong gravitational web attraction.”
  – Based primarily on full-text searching and link- or usage-based relevance ranking (lots of recall, little precision)
  – Some efforts to combine catalog and Amazoogle searches (ex.: collaborations with WorldCat)

Portals and Federations
• Portals: defined content boundaries
  – Some content also available elsewhere
  – ex.: Specific library portals, subject portals like Portals to the World (ex.
• Federations: protected content and services
  – Often specialized services based on specifically purposed metadata (ex.: BEN

Information Discovery & Retrieval
• Z39.50
  – Basis for most federated search applications in current library software
• SRU (Search/Retrieve via URL)
  – Seen as a replacement for Z39.50
  – To learn more about it see:
• Federated search (Metasearch)
  – Simultaneous search of multiple data sources
  – Not much uptake, seen as only as robust as its weakest link
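
Since SRU carries the whole search in a URL, a request can be sketched with nothing more than query-string encoding. The example below builds the same `searchRetrieve` request for two hypothetical endpoints, which is the first step of a federated search. The endpoint addresses, the `dc.title` index name, and the choice of version `1.2` are assumptions, because the indexes and versions actually supported vary by server. No request is sent.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def sru_search_url(base_url, cql_query, maximum_records=10):
    """Build an SRU searchRetrieve URL carrying a CQL query."""
    params = {
        "operation": "searchRetrieve",
        "version": "1.2",
        "query": cql_query,                  # CQL is the query language SRU uses
        "maximumRecords": str(maximum_records),
    }
    return base_url + "?" + urlencode(params)


# Hypothetical SRU endpoints of two catalogs taking part in a metasearch.
ENDPOINTS = [
    "https://catalog-a.example.org/sru",
    "https://catalog-b.example.org/sru",
]

cql = 'dc.title = "metadata"'
request_urls = [sru_search_url(endpoint, cql) for endpoint in ENDPOINTS]
for url in request_urls:
    print(url)

first_query = parse_qs(urlsplit(request_urls[0]).query)
```

A metasearch tool then has to wait for, merge, and de-duplicate the responses from every target, so a single slow or failing endpoint degrades the whole search. That is the "weakest link" problem noted above.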

Newer Possibilities
• RDF data is increasingly queried using options like the SPARQL Protocol and RDF Query Language (SPARQL)
  – Currently a W3C Recommendation
• Approaches using graphs, ontologies, topic maps, etc. seen as more attractive as Semantic Web technologies become more robust
  – These are based more on “statements” than “records” …
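
The difference between "statements" and "records" can be shown without any RDF library. In the sketch below each statement is a (subject, predicate, object) tuple, and a small matcher treats `None` as a wildcard in the way a SPARQL variable would be. The identifiers under the `ex:` prefix and the data are hypothetical, and the matcher is a toy stand-in for a real SPARQL engine.

```python
# Each statement stands alone; a "record" is just the statements sharing a subject.
TRIPLES = [
    ("ex:book1", "dc:title", "Metadata Basics"),
    ("ex:book1", "dc:creator", "ex:person1"),
    ("ex:person1", "foaf:name", "Example Author"),
    ("ex:book2", "dc:title", "Cataloging the Web"),
    ("ex:book2", "dc:creator", "ex:person1"),
]


def match(triples, subject=None, predicate=None, obj=None):
    """Return statements matching a pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (subject is None or t[0] == subject)
        and (predicate is None or t[1] == predicate)
        and (obj is None or t[2] == obj)
    ]


# Roughly the pattern a SPARQL engine would evaluate for:
#   SELECT ?work WHERE { ?work dc:creator ex:person1 }
works = [s for s, _, _ in match(TRIPLES, predicate="dc:creator", obj="ex:person1")]

# Follow the graph one more step: titles of those works.
titles = [o for work in works for _, _, o in match(TRIPLES, subject=work, predicate="dc:title")]
print(works)
print(titles)
```

Because the creator is an identifier and not a text string, the statement about the person's name is made once and shared by every work that points to it.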

Data Management Challenges for Libraries
• Moving from text to URIs for controlled values
  – Including personal and organization names as well as controlled concepts and topics
• Developing useful and efficient normalization and “smartening up” processes
• Ensuring that their changes are visible to downstream services
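
One way to picture the move from text to URIs is a normalization step followed by a lookup against an authority list. The sketch below is an assumption-laden illustration: the authority table, the `https://example.org/names/...` URIs, and the normalization rules are invented for the example. A real workflow would draw on an actual name authority service and far more careful matching.

```python
import unicodedata

# Hypothetical authority list: normalized heading -> URI.
AUTHORITY = {
    "twain, mark": "https://example.org/names/n0001",
    "university of rochester": "https://example.org/names/n0002",
}


def normalize(label):
    """Reduce superficial variation: Unicode form, case, spacing, trailing period."""
    text = unicodedata.normalize("NFKC", label).casefold()
    text = " ".join(text.split())
    return text.rstrip(".")


def smarten(record):
    """Replace creator strings with URIs where matched; list the rest for review."""
    resolved, unresolved = [], []
    for name in record.get("creator", []):
        uri = AUTHORITY.get(normalize(name))
        if uri is not None:
            resolved.append(uri)
        else:
            unresolved.append(name)
    return {**record, "creator": resolved, "creator_unmatched": unresolved}


record = {
    "title": "Sample record",
    "creator": ["Twain,  Mark.", "UNIVERSITY OF ROCHESTER", "Unknown, Person"],
}
smartened = smarten(record)
print(smartened)
```

Keeping the unmatched strings in a separate field, instead of discarding them, leaves a trail for editors and lets downstream services see which values are still uncontrolled text.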

Can You Tell?
• Can you tell what’s going on behind these sites?
• How are they organized?
• What creation and storage models are used?
• Plant and Insect Parasitic Nematodes:
• Internet Movie Database: