DCMI Workshop on Metadata and Search Vendor Panel Presentation Bradley P. Allen

Slides:



Advertisements
Similar presentations
Can I Use It, and If so, How? Christian Lieske SAP AG – MultiLingual Technology Discussion of Consortium Proposal for OLIF2 File Header.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
DC2001, Tokyo DCMI Registry : Background and demonstration DC2001 Tokyo October 2001 Rachel Heery, UKOLN, University of Bath Harry Wagner, OCLC
Copyright, UCL LEADERS: Linking EAD to Electronically Retrievable Sources Developing a Generic Toolkit: Architecture and technology issues ALLC/ACH Conference.
A centre of expertise in digital information management Approaches To The Validation Of Dublin Core Metadata Embedded In (X)HTML Documents Background The.
Native XML Database or RDBMS. Data or Document orientation If you are primarily storing documents, then a Native XML Database may be the best option.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
The Semantic Web and Digital Libraries Eric Miller, W3C DC 2004 / SILF 2004 Shanghai Library, Shanghai, China
Z39.50 and the Web ZIG July 2000 Poul Henrik Jørgensen, Danish Bibliographic Centre,
RDF Tutorial.
The Documentum Team Lance Callaway, Brooke Durbin, Perry Koob, Lorie McMillin, Jennifer Song Missouri University of Science and Technology Rolla, Missouri.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. The Web Services Modeling Toolkit Mick Kerrigan.
The CERIF-2000 Implementation. Andrei S. Lopatenko CERIF Implementation Guidelines Andrei Lopatenko Vienna University of Technology
Ontology Notes are from:
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
A Methodology for Developing a Taxonomy – A Subject Oriented Approach
RDF Kitty Turner. Current Situation there is hardly any metadata on the Web search engine sites do the equivalent of going through a library, reading.
UPortal: A framework for the Personalization of Library Services John Fereira: Programmer/Analyst Cornell University Mann Library.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
AgriDrupal - a “suite of solutions” for agricultural information management and dissemination, built on the Drupal CMS; - the community of practice around.
IBM User Technology March 2004 | Dynamic Navigation in DITA © 2004 IBM Corporation Dynamic Navigation in DITA Erik Hennum and Robert Anderson.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Educause October 29, 2001 A GEM of a Resource: The Gateway to Educational Materials Copyright Nancy Virgil Morgan, This work is the intellectual.
Malaysian Grid for Learning October DC 2004, Shanghai, China. © 2004 MIMOS Berhad. All Rights Reserved Metadata Management System DC2004: International.
Data Exchange Tools (DExT) DExT PROJECTAN OPEN EXCHANGE FORMAT FOR DATA enables long-term preservation and re-use of metadata,
Formalizing and Querying Heterogeneous Documents with Tables Krishnaprasad Thirunarayan and Trivikram Immaneni Department of Computer Science and Engineering.
1 © Netskills Quality Internet Training, University of Newcastle Metadata Explained © Netskills, Quality Internet Training.
DMSO Technical Exchange 3 Oct 03 1 Web Services Supporting Simulation to Global Information Grid Mark Pullen George Mason University with support from.
The role of metadata schema registries XML and Educational Metadata, SBU, London, 10 July 2001 Pete Johnston UKOLN, University of Bath Bath, BA2 7AY UKOLN.
Practical RDF Chapter 1. RDF: An Introduction
XML Overview. Chapter 8 © 2011 Pearson Education 2 Extensible Markup Language (XML) A text-based markup language (like HTML) A text-based markup language.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
Introduction to MDA (Model Driven Architecture) CYT.
Information Retrieval and Knowledge Organisation Knut Hinkelmann.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Interfacing Registry Systems December 2000.
WEB BASED DATA TRANSFORMATION USING XML, JAVA Group members: Darius Balarashti & Matt Smith.
A bad case of content reuse Validator Website to Validate License Violations Validator – Only requires the URI of the site to check for a license violation.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
ICDL 2004 Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University.
2nd Concertation Day 18 February 2000 The Charity Centre RSLP Collection Description.
Semantic Technologies and Application to Climate Data M. Benno Blumenthal IRI/Columbia University CDW /04-01.
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
Tallinn, 13 December 2005 Syndication Adriana Baciu Finsiel Romania.
The future of the Web: Semantic Web 9/30/2004 Xiangming Mu.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Metadata : an overview XML and Educational Metadata, SBU, London, 10 July 2001 Pete Johnston UKOLN, University of Bath Bath, BA2 7AY UKOLN is supported.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
1 © Xchanging 2010 no part of this document may be circulated, quoted or reproduced without prior written approval of Xchanging. MOSS Training – UI customization.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Semantic Web unleashes your data! The Semantic Web will transform the use of content. Semantic Web – is an extension of the current web. Semantic Web.
Sitecore. Compelling Web Experiences Page 1www.sitecore.net Patrick Schweizer Director of Sales Enablement 2013.
XML and Distributed Applications By Quddus Chong Presentation for CS551 – Fall 2001.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
XML Related Technologies
Repository Software - Standards
Seamark Navigator Project Rincon PRD UI Concepts ENTERPRISE RESTYLED
Cataloging the Internet
Introduction of Week 11 Return assignment 9-1 Collect assignment 10-1
Attributes and Values Describing Entities.
Presentation transcript:

DCMI Workshop on Metadata and Search Vendor Panel Presentation Bradley P. Allen

Copyright © 2003 Siderean Software LLC. All rights reserved. Overview Our perspective is that of a Semantic Web application vendor Our belief is that faceted search will be the first killer application of the Semantic Web Our goal is to show how this is possible and what the benefits are But first, some general statements…

Copyright © 2003 Siderean Software LLC. All rights reserved. Tools that leverage Dublin Core Do supportable tools exist that take advantage of Dublin Core and other metadata standards to enhance search results? Yes, our work is a case in point Also relevant: Weblog CMS RSS aggregators Other RDF applications

Copyright © 2003 Siderean Software LLC. All rights reserved. What's missing? What do people need to be able to do to actually use metadata effectively on their intranets? Start using whats out there Data in relational tables CMS-generated metadata A lot of metadata is lying around unexploited

Copyright © 2003 Siderean Software LLC. All rights reserved. Are Dublin Core guidelines sufficient? What additional specifications are needed? None: DC is an excellent minimal vocabulary that has achieved broad acceptance What we need are best practices, e.g.: Encouraging resource values over literal values for DC attributes as good style dc:subject using controlled vocabularies dc:creator using authority records dc:date using temporal hierarchies Implementing DCMI validation services

Copyright © 2003 Siderean Software LLC. All rights reserved. Is XML the primary coding language? Is it being used for Dublin Core and other metadata applications? Yes, for all the right reasons Open standards Leverage of existing tools What other encoding methods are being used? RDF/N3 for some RDF-based applications

Copyright © 2003 Siderean Software LLC. All rights reserved. Our application: Seamark A navigation engine built on three key ideas Metadata represented in Resource Description Framework (RDF) is aggregated from existing enterprise content and data Faceted metadata retrieval turns the RDF into a navigation web service Web services make navigation applications easy to install and integrate with existing Web applications

Copyright © 2003 Siderean Software LLC. All rights reserved. Faceted search and RDF: why? Enabling more effective retrieval is a major goal for the Semantic Web RDF is a superb foundation for faceted search RDF as an open standard for metadata exchange RDF Schema as a framework for defining facets The Semantic Web will enable faceted search to become pervasive Widespread sharing and reuse of ontologies, vocabularies and DC instance data becomes possible The blogosphere as an existence proof View Source for the Semantic Web

Copyright © 2003 Siderean Software LLC. All rights reserved. Seamark, Dublin Core, and CVs Enables Dublin Core Using RDF encodings of DC Handles controlled vocabularies Using emerging RDF-based standards like TIF(S) Supports building and maintaining controlled vocabularies Concepts and terms represented as resources and encoded in RDF in the same way as other content Therefore the same tools apply

Copyright © 2003 Siderean Software LLC. All rights reserved. Seamarks search interface Use of flat or hierarchical controlled vocabularies Transparency and customizability of results ranking Parametric search with customizable pull-down menus

Copyright © 2003 Siderean Software LLC. All rights reserved. Lookups into large CVs in Seamark Use of standard vocabularies represented in RDF (e.g. LCs Thesaurus of Graphical Materials Faceted search over controlled vocabulary terms Syndication of CVs, instance data and ontologies for sharing

Copyright © 2003 Siderean Software LLC. All rights reserved. Query processing in Seamark Based on XML for Retrieval By Reformulation (XRBR) A query language that Provides support for query reformulation and refinement while minimizing roundtrips Supports a stateless protocol for faceted metadata retrieval with SOAP as a transport mechanism Handles very large result sets gracefully Think of XRBR as an application profile in the digital library sense Specifies a view over heterogeneous metadata schemas with hints as to its interpretation and display

Copyright © 2003 Siderean Software LLC. All rights reserved. Query processing in Seamark Disambiguation Suggestions provide this implicitly Query expansion and concept mapping RDF models plus XRBR structure queries provide a general mechanism for this Entity extraction XSLT extensions at import augments raw metadata with additional extracted attributes Natural language processing Direct manipulation now; QA to come

Copyright © 2003 Siderean Software LLC. All rights reserved. Searching across collections Metadata aggregation using RDF provides a general platform for federated search We can directly leverage emerging SW approaches to: Thesaurus mapping tif:concept-equivalence Schema mapping rdfs:subPropertyOf

Copyright © 2003 Siderean Software LLC. All rights reserved. Setup and maintenance Installation and configuration for Windows, Linux and Mac OS X Administration Simple web-based administration interface for aggregating feeds and specifying initial queries Training 135 page tutorial Extensive on-line API documentation Courses One-day on-site introduction

Copyright © 2003 Siderean Software LLC. All rights reserved. Setup and maintenance Shelley Powers, Practical RDF, O'Reilly & Associates, 2003:... the application is easily installed and configured, and comes with considerable documentation What I was most impressed with about the product, though, was how quickly and easily it integrated my RDF/XML data … into a sophisticated query engine with little or no effort.

Copyright © 2003 Siderean Software LLC. All rights reserved. Seamarks administration interface Users can specify URLs serving RDF to load into a given model … then load them manually or on a schedule basis Alternatively, queries can be executed against an SQL database XSLT stylesheets transform XML documents and SQL result sets into RDF Aggregated models can be dumped to RDF

Copyright © 2003 Siderean Software LLC. All rights reserved. Sites using Seamark

Copyright © 2003 Siderean Software LLC. All rights reserved.