1 EnviroInfo 2006, 05/09/06 Graz Automatic Concept Space Generation in Support of Resource Discovery in Spatial Data Infrastructures Paul Smits, Anders.

Slides:



Advertisements
Similar presentations
Delta Confidential 1 5/29 – 6/6, 2001 SAP R/3 V4.6c PP Module Order Change Management(OCM)
Advertisements

You have been given a mission and a code. Use the code to complete the mission and you will save the world from obliteration…
Advanced Piloting Cruise Plot.
Feichter_DPG-SYKL03_Bild-01. Feichter_DPG-SYKL03_Bild-02.
Current design issues for digital archives Robert Munro (presented by David Nathan) Endangered Languages Archive (ELAR), School of Oriental and African.
Chapter 1 The Study of Body Function Image PowerPoint
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
Workshop TOWARDS A EUROPEAN QUALIFICATIONS FRAMEWORK FOR LIFELONG LEARNING Relevance, Feasibility and Implications for SEE A Wider European Area of Education.
Chapter 1 Image Slides Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
…to Ontology Repositories Mathieu dAquin Knowledge Media Institute, The Open University From…
UNITED NATIONS Shipment Details Report – January 2006.
SDI Business Phases and derived INSPIRE Horizontal Services Relates to INSPIRE DT Network Services, DT Sharing Relates to OGC GeoDRM WG, Price & Order.
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination.
18 Copyright © 2005, Oracle. All rights reserved. Distributing Modular Applications: Introduction to Web Services.
DRIVER Long Term Preservation for Enhanced Publications in the DRIVER Infrastructure 1 WePreserve Workshop, October 2008 Dale Peters, Scientific Technical.
Module N° 7 – Introduction to SMS
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
My Alphabet Book abcdefghijklm nopqrstuvwxyz.
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Addition Facts
Year 6 mental test 5 second questions
|epcc| NeSC Workshop Open Issues in Grid Scheduling Ali Anjomshoaa EPCC, University of Edinburgh Tuesday, 21 October 2003 Overview of a Grid Scheduling.
Copyright 2006 Digital Enterprise Research Institute. All rights reserved. MarcOnt Initiative Tools for collaborative ontology development.
1/ 26 AGROVOC and the OWL Web Ontology Language: the Agriculture Ontology Service - Concept Server OWL model NKOS workshop Alicante,
UKOLN, University of Bath
Pure Silver Reusing and Repurposing Bibliographic Data in a Current Research Information System and Institutional Repository 15 September.
BT Wholesale October Creating your own telephone network WHOLESALE CALLS LINE ASSOCIATED.
1 Competitive Privacy: Secure Analysis on Integrated Sequence Data Raymond Chi-Wing Wong 1, Eric Lo 2 The Hong Kong University of Science and Technology.
Configuration management
Software change management
DOROTHY Design Of customeR dRiven shOes and multi-siTe factorY Product and Production Configuration Method (PPCM) ICE 2009 IMS Workshops Dorothy Parallel.
Yavapai College Self Service Banner Training. Agenda Definition of Key Concepts Log Into Finance Self Service Budget Query Overview Budget Query Procedures.
ABC Technology Project
Collections and services in the information environment JISC Collection/Service Description Workshop, London, 11 July 2002 Pete Johnston UKOLN, University.
1 Undirected Breadth First Search F A BCG DE H 2 F A BCG DE H Queue: A get Undiscovered Fringe Finished Active 0 distance from A visit(A)
VOORBLAD.
15. Oktober Oktober Oktober 2012.
1 Breadth First Search s s Undiscovered Discovered Finished Queue: s Top of queue 2 1 Shortest path from s.
1 Evaluations in information retrieval. 2 Evaluations in information retrieval: summary The following gives an overview of approaches that are applied.
Heppenheim Producer-Archive Interface Specification Status of standardisation project Main characteristics, major changes, items pending.
BIOLOGY AUGUST 2013 OPENING ASSIGNMENTS. AUGUST 7, 2013  Question goes here!
© 2012 National Heart Foundation of Australia. Slide 2.
Understanding Generalist Practice, 5e, Kirst-Ashman/Hull
Chapter 5 Test Review Sections 5-1 through 5-4.
GG Consulting, LLC I-SUITE. Source: TEA SHARS Frequently asked questions 2.
Who are the Experts?Simon KampaSlide 1 Who are the Experts? Simon Kampa IAM Group University of Southampton
02-Oct-2008 European Forum for GeoStatistics 2008 in Bled Concept for an Integrated Web Solution / an Infrastructure for Geostatistics (Subproject 3)
Proposed update of Technical Guidance for INSPIRE Download services based on SOS Matthes Rieke, Dr. Albert Remke (m.rieke, 52°North.
Addition 1’s to 20.
25 seconds left…...
H to shape fully developed personality to shape fully developed personality for successful application in life for successful.
Januar MDMDFSSMDMDFSSS
Week 1.
We will resume in: 25 Minutes.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
PSSA Preparation.
Immunobiology: The Immune System in Health & Disease Sixth Edition
Immunobiology: The Immune System in Health & Disease Sixth Edition
McGraw-Hill©The McGraw-Hill Companies, Inc., 2001 Chapter 16 Integrated Services Digital Network (ISDN)
1 Distributed Agents for User-Friendly Access of Digital Libraries DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen,
From Model-based to Model-driven Design of User Interfaces.
1 EcoInformatics meeting, 17/01/06 Ispra INSPIRE - Infrastructure for Spatial Information in Europe Examples of research in support of SDI Paul Smits,
GEMET GEneral Multilingual Environmental Thesaurus leading the way to federated terminologies Stefan Jensen, Head of information services group with input.
Presentation transcript:

1 EnviroInfo 2006, 05/09/06 Graz Automatic Concept Space Generation in Support of Resource Discovery in Spatial Data Infrastructures Paul Smits, Anders Friis-Christensen European Commission, DG Joint Research Centre Institute for Environment and Sustainability Spatial Data Infrastructures Unit TP 262, Ispra (VA), Italy

2 EnviroInfo 2006, 05/09/06 Graz The mission of the JRC is to provide customer-driven scientific and technical support for the conception, development, implementation and monitoring of EU policies. As a service of the European Commission, the JRC functions as a reference centre of science and technology for the Union. Close to the policy-making process, it serves the common interest of the Member States, while being independent of special interests, whether private or national. JRCs Mission

3 EnviroInfo 2006, 05/09/06 Graz Outline Introduction Objectives of the study Approach Results Conclusions

4 EnviroInfo 2006, 05/09/06 Graz GI Policy GI standards Spatial Information Services Fundamental GI data sets Introduction – components of a European SDI

5 EnviroInfo 2006, 05/09/06 Graz Introduction Metadata and discovery services are key components of SDI Multilingualism important

6 EnviroInfo 2006, 05/09/06 Graz Introduction INSPIRE requirements metadata* spatial data sets and spatial data services* network services* –EU geo-portal access and rights of use for Community institutions and bodies** monitoring and reporting mechanisms** process and procedures * technical: under JRC responsibility ** legal/procedural: under Eurostat responsibility

7 EnviroInfo 2006, 05/09/06 Graz Introduction European interoperability framework for pan- European eGovernment servicesEuropean interoperability framework for pan- European eGovernment services Recommendations related to multilingualism, e.g.,Recommendations related to multilingualism, e.g., –For the Pan-European services provided via portals, the top-level EU portal interface should be fully multilingual, the second-level pages (introductory texts and the descriptions of links) should be offered in the official languages and the external links and related pages on the national websites should be available in at least one other language (for example English) in addition to the national language(s).

EcoInformatics meeting, 17/01/06 Ispra Introduction Issues on Multilingualism identified by the INSPIRE DT on Network Services –only mentioned in the context of the interoperability of spatial data sets and services for key attributes and corresponding multilingual thesauri –Granularity: should the list of available languages be a service feature or at the data set or even at the feature attribute level ? –Metadata/Data: should only metadata be multilingual or datasets as well ? –Attributes label versus Attribute value: Should only attributes label be multilingual or should the attribute values be as well multilingual?

EcoInformatics meeting, 17/01/06 Ispra Introduction

10 EnviroInfo 2006, 05/09/06 Graz Outline Introduction Objectives of the study Approach Results Conclusions

11 EnviroInfo 2006, 05/09/06 Graz Objective of the study Focus on discovery of resources Answer question: –Is, from a technical point of view, a common ontology or thesaurus desirable and feasible for multi-lingual resource discovery in a European Spatial Data Infrastructure?

12 EnviroInfo 2006, 05/09/06 Graz Outline Introduction Objectives of the study Approach Results Conclusions

13 EnviroInfo 2006, 05/09/06 Graz Approach Implement and extend work of H. Chen, et al., "A Parallel Computing Approach to Creating Engineering Concept Spaces for Semantic Retrieval: The Illinois Digital Library Initiative Project," IEEE Transactions on Pattern Analysis and Machine Intelligence vol. 18 pp , Integrate thesauri, vocabularies and gazetteers in resource discovery Experiments P. Smits, A. Friis-Christensen, Resource Discovery in a European Spatial Data Infrastructure. IEEE Transactions on Knowledge and Data Engineering (accepted for publication)

14 EnviroInfo 2006, 05/09/06 Graz Approach What is a Concept Space? Simply put: –An index of all concepts existing in a metadata repository –With numerical relationships defined between any two concepts –To be queried by associative retrieval

15 EnviroInfo 2006, 05/09/06 Graz Two-step approach –Creation of multi- lingual concept space –Associative retrieval based on a neural network H. Chen, B. Schatz, T. Ng, J. Martinez, A. Kirchhoff, C. Lin, A parallel computing approach to creating engineering concept spaces for semantic retrieval: the Illinois digital library initiative project. IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 18, No. 8, August 1996, pp Approach

16 EnviroInfo 2006, 05/09/06 Graz Approach Creation of the multi-lingual concept space –Collection of resource descriptors –Object filtering and indexing identify those concepts and terms that we already have in our human-created ontology which includes any thesauri and vocabulary to filter out any irrelevant terms like stop words in order to improve performance to store any remaining terms in the concept space

17 EnviroInfo 2006, 05/09/06 Graz Approach - Associative query Initialize the associative retrieval –The neural network is initialized at query time by assigning initial membership values to the units of the neural network = concepts in the Concept Space Terms in the concept space that match exactly a query term: 1 Partial matches get membership value < 1 Terms that do not match the query: 0

18 EnviroInfo 2006, 05/09/06 Graz Approach - Associative query Initialize the associative retrieval Query: soil Soil, bodem 1 Sub-surface information 0 0 Situation at t=0 Wij = 0 Wij = 0.7

19 EnviroInfo 2006, 05/09/06 Graz Approach - Associative query Iterate though the neural network Soil, bodem 1 Sub-surface information 0 0 Situation at t=0 Wij = 0 Wij = 0.7 Soil, bodem 1 Sub-surface information Situation at t=1 Wij = 0 Wij = 0.7

20 EnviroInfo 2006, 05/09/06 Graz Approach - Associative query Link membership values of concepts to resource descriptors Soil, bodem 1 Sub-surface information Situation at t=1 Wij = 0 Wij = 0.7 Membership > threshold? Use index to find resources that contain the concept Order found resources in order of relevance, based on membership values

21 EnviroInfo 2006, 05/09/06 Graz Outline Introduction Objectives of the study Approach Results Conclusions

22 EnviroInfo 2006, 05/09/06 Graz

23 EnviroInfo 2006, 05/09/06 Graz Results Creating the metadata repository

24 EnviroInfo 2006, 05/09/06 Graz Results

25 EnviroInfo 2006, 05/09/06 Graz Results

26 EnviroInfo 2006, 05/09/06 Graz Results Query computationally expensive queryRemark Time required for four iterations of neural network (600 MHz, 512 MB RAM) soil (eng)Query term found in the concept space (GEMET concept no. 7843) 16.1 s. infrastructuur (nld)Query term not literally defined in the concept space or ontology s.

27 EnviroInfo 2006, 05/09/06 Graz Outline Introduction Objectives of the study Approach Results Conclusions

28 EnviroInfo 2006, 05/09/06 Graz Conclusions from the study It will be impractical to rely only on one common ontology for resource discovery in a European SDI The approach of using human-created ontologies in combination with automatic concept space generation and associative retrieval is a powerful means to the discovery of geospatial resources. Proposed approach is useful and merits further investigation and development The importance of structured information, using metadata standards, is underlined by our study and is also a basic assumption of our work.