Using formal ontology for integrated spatial data mining Julie Sungsoon Hwang Department of Geography State University of New York at Buffalo ICCSA04 Perugia,

Slides:



Advertisements
Similar presentations
Web Mining.
Advertisements

1 Copyright 1998 by Dragos Manolescu and Joseph W. Yoder Building Frameworks With Patterns “An Active Object-Model For A Dynamic Web-Based Application”
SEBGIS 2005, Agia Napa, Cyprus, October 31 - November 4, 2005 MECOSIG Adapted to the Design of Distributed GIS F. Pasquasy, F. Laplanche, J-C. Sainte &
Using the Crosscutting Concepts As conceptual tools when meeting an unfamiliar problem or phenomenon.
A Paradigm for Space Science Informatics Kirk D. Borne George Mason University and QSS Group Inc., NASA-Goddard or
A Framework for Ontology-Based Knowledge Management System
Software Testing and Quality Assurance
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
The Data Mining Visual Environment Motivation Major problems with existing DM systems They are based on non-extensible frameworks. They provide a non-uniform.
Software Factory Assembling Applications with Models, Patterns, Frameworks and Tools Anna Liu Senior Architect Advisor Microsoft Australia.
1 Pendahuluan Pertemuan 9 Matakuliah: H0062/Teori Sistem Tahun: 2006.
Requirements Analysis Concepts & Principles
Engineering the Presentation Layer of Adaptable Web Information Systems Zoltán Fiala 1, Flavius Frasincar 2, Michael Hinz 1, Geert-Jan Houben 2, Peter.
Intelligent User Interfaces Research Group Directed by: Frank Shipman.
Annotating Documents for the Semantic Web Using Data-Extraction Ontologies Dissertation Proposal Yihong Ding.
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya F. Noy and Mark A. Musen.
Second Language Acquisition and Real World Applications Alessandro Benati (Director of CAROLE, University of Greenwich, UK) Making.
Developed by Reneta Barneva, SUNY Fredonia Component Level Design.
Course Instructor: Aisha Azeem
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
SYSTEM ANALYSIS AND DESIGN
FRE 2672 Urban Ontologies : the Towntology prototype towards case studies Chantal BERDIER (EDU), Catherine ROUSSEY (LIRIS)
Intelligent Systems Lecture 23 Introduction to Intelligent Data Analysis (IDA). Example of system for Data Analyzing based on neural networks.
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.
Formalizing and Querying Heterogeneous Documents with Tables Krishnaprasad Thirunarayan and Trivikram Immaneni Department of Computer Science and Engineering.
Chapter 6 System Engineering - Computer-based system - System engineering process - “Business process” engineering - Product engineering (Source: Pressman,
Slide 1 Wolfram Höpken RMSIG Reference Model Special Interest Group Second RMSIG Workshop Methodology and Process Wolfram Höpken.
Text CONSEG 09 Domain Knowledge assisted Requirements Evolution (K-RE)
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
Copyright © 2013 Curt Hill The Zachman Framework What is it all about?
Author: Lornet LD team Reuse freely – Just quote Desired Properties of a MOT Graphic Representation Formalism Simplicity and User Friendliness (win spec,
Funded by: European Commission – 6th Framework Project Reference: IST WP 2: Learning Web-service Domain Ontologies Miha Grčar Jožef Stefan.
Database System Concepts and Architecture
GLOSSARY COMPILATION Alex Kotov (akotov2) Hanna Zhong (hzhong) Hoa Nguyen (hnguyen4) Zhenyu Yang (zyang2)
A service-oriented middleware for building context-aware services Center for E-Business Technology Seoul National University Seoul, Korea Tao Gu, Hung.
Odyssey A Reuse Environment based on Domain Models Prepared By: Mahmud Gabareen Eliad Cohen.
R R R 1 Frameworks III Practical Issues. R R R 2 How to use Application Frameworks Application developed with Framework has 3 parts: –framework –concrete.
Design engineering Vilnius The goal of design engineering is to produce a model that exhibits: firmness – a program should not have bugs that inhibit.
Ontology Summit 2015 Track C Report-back Summit Synthesis Session 1, 19 Feb 2015.
Comp 15 - Usability & Human Factors Unit 8a - Approaches to Design This material was developed by Columbia University, funded by the Department of Health.
Extending Spatial Hot Spot Detection Techniques to Temporal Dimensions Sungsoon Hwang Department of Geography State University of New York at Buffalo DMGIS.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Chapter 8 Object Design Reuse and Patterns. Object Design Object design is the process of adding details to the requirements analysis and making implementation.
Media Arts and Technology Graduate Program UC Santa Barbara MAT 259 Visualizing Information Winter 2006George Legrady1 MAT 259 Visualizing Information.
A Context Model based on Ontological Languages: a Proposal for Information Visualization School of Informatics Castilla-La Mancha University Ramón Hervás.
Interoperability & Knowledge Sharing Advisor: Dr. Sudha Ram Dr. Jinsoo Park Kangsuk Kim (former MS Student) Yousub Hwang (Ph.D. Student)
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
A Goal Based Methodology for Developing Domain-Specific Ontological Frameworks Faezeh Ensan, Weichang Du Faculty of Computer Science, University of New.
THE SUPPORTING ROLE OF ONTOLOGY IN A SIMULATION SYSTEM FOR COUNTERMEASURE EVALUATION Nelia Lombard DPSS, CSIR.
Week III  Recap from Last Week Review Classes Review Domain Model for EU-Bid & EU-Lease Aggregation Example (Reservation) Attribute Properties.
Ontology Mapping in Pervasive Computing Environment C.Y. Kong, C.L. Wang, F.C.M. Lau The University of Hong Kong.
Tool for Ontology Paraphrasing, Querying and Visualization on the Semantic Web Project By Senthil Kumar K III MCA (SS)‏
Foundations of Information Systems in Business. System ® System  A system is an interrelated set of business procedures used within one business unit.
Personalized Recommendation of Related Content Based on Automatic Metadata Extraction Andreas Nauerz 1, Fedor Bakalov 2, Birgitta.
Virtual Information and Knowledge Environments Workshop on Knowledge Technologies within the 6th Framework Programme -- Luxembourg, May 2002 Dr.-Ing.
Landscape Animation Summary of an experimental approach to inform the use of camera dynamics in visualization.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Banaras Hindu University. A Course on Software Reuse by Design Patterns and Frameworks.
Cluster Analysis What is Cluster Analysis? Types of Data in Cluster Analysis A Categorization of Major Clustering Methods Partitioning Methods.
GUILLOU Frederic. Outline Introduction Motivations The basic recommendation system First phase : semantic similarities Second phase : communities Application.
An Ontology framework for Knowledge-Assisted Semantic Video Analysis and Annotation Centre for Research and Technology Hellas/ Informatics and Telematics.
Cluster Analysis This work is created by Dr. Anamika Bhargava, Ms. Pooja Kaul, Ms. Priti Bali and Ms. Rajnipriya Dhawan and licensed under a Creative Commons.
Data mining in web applications
CCNT Lab of Zhejiang University
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
CS 641 – Requirements Engineering
CS 641 – Requirements Engineering
Web Mining Department of Computer Science and Engg.
Building Ontologies with Protégé-2000
Presentation transcript:

Using formal ontology for integrated spatial data mining Julie Sungsoon Hwang Department of Geography State University of New York at Buffalo ICCSA04 Perugia, Italy May 14, 2004

Research purposes  Enlighten the role of formal ontology in KDD  Propose the conceptual framework for ontology-based spatial data mining  Case study: ontology-based spatial clustering algorithms

Problems in focus (cont.)  No single algorithm is best suited to all research purposes and application domains. The same algorithm can yield results inconsistent with fact without considering domain knowledge The same data may have to be analyzed in different ways depending on users’ goal

Problems in focus  Developing new algorithms Algorithm D Algorithm C Algorithm A Algorithm B Algorithm D’ DomainTask  Re-using existing algorithms Suited to domain and task How can algorithms be customized to varying domain and task?

Relation between data mining and ontology construction Knowledge Ontology Ontology Construction (Knowledge acquisition) Level of abstraction Data Information Data Mining (Knowledge discovery) Knowledge

Role of formal ontology in KDD  Provide the context in which the knowledge extracted from data is interpreted and evaluated  Guide algorithms such that they can be suitable for domain-specific and task-oriented concepts KDD Process Diagram

Using ontology for spatial data mining  Ontology formalizes how the knowledge is conceptualized, thereby making implicit meaning explicit  Data mining extracts a high-level knowledge from a low-level data, thereby enhancing the level of understanding DomainModelTaskModel OntologySpatial Data Mining Low-level data High-level knowledge

Domain-specific spatial data mining  Let’s compare two different domains: traffic accident versus retailers Domain of traffic accident Domain of retailers Is-a Spatial constraints EventPhysical object In road network Outside of road network Spatial data mining algorithms should take into account different conceptualization (domain-specific properties)

Task-oriented spatial data mining  Let’s compare two different tasks: detecting hotspots of traffic accident versus partitioning market areas based on the location of retail Detect hotspots of traffic accident Partition market areas to a retailer # of clusters k Level of details Spatial data mining algorithms should take into account different tasks and users’ need Depend on spatial distributn. Given (resource constraint) Varies with scale (depends on area of users’ interest) Doesn’t vary with scale

Ontology as an active component of information system e.g. medicine e.g. diagnosing e.g. space, time, matter, object, event Application Ontology Task Ontology Domain Ontology Top-level Ontology dependence subject From Guarino, 1998

Conceptual framework for ontology- based spatial data mining (OBSDM)

Component of OBSDM

OBSDM:: Input:: Metadata  Tag structure of XML can be utilized to inform domain ontology of the semantics of data

Component of OBSDM

OBSDM:: OBSDMM:: Domain Ont.  Terms within the “theme” tag in the metadata are used as a token to locate the appropriate domain ontology  Domain ontology specifies the definition, class, and properties Class example: Accident is a Subclass-Of Temporal- Thing Properties example: Road has a Geographic-Region as a Value-Type  Properties of class inherit from top-level ontology

Domain ontology := Traffic accident  Theory TRAFFIC-ACCIDENT-DOMAIN  As a spatial thing, Point(x)  On(x, y)  Roadway(y) Line(y)  In(y, z)  Geographic-Region(z)  As a temporal thing, Point(x)  At(x, y)  Time(y) Event(x) Occurrence(x)  Notification(x)  Response(x)  Arrival(x) Before(Occurrence(x), Notification(x))  As an intangible thing, Accident (x)  RelatedTo(x, y)  Vehicle(y)

Component of OBSDM

OBSDM:: Input:: User Interface  Users can specify a goal, level of detail, and geographic area of interest through UI

Component of OBSDM

OBSDM:: OBSDMM:: Task Ont.  The inputs specified by users in the user interface are translated into task ontology  Task ontology explicitly specify goal, methods, requirements, and constraint

Task ontology := Spatial clustering  Theory SPATIAL-CLUSTERING-TASK  Documentation: This theory defines a task ontology for the spatial clustering task. The spatial clustering task, which is a class of clustering task, is a problem of grouping similar spatial objects into classes.  Super classes: Clustering  Subclasses: Sub goal:  “Find hot spots”  “Group similar patterns”  “Partition into k-clusters” Requirement:  Assignment-Object Source: Spatial Objects Target: Clusters  Geographic-Scale  Detail-Level Constraint:  Spatial Objects  Operational Constraints

Component of OBSDM

OBSDM:: OBSDMM:: Alg. Builder OBSDM:: Output:: GVis tool  Algorithm builder puts together requirements for building the best algorithm suited to domain of data and users’ input (task).  Data content is filtered through domain ontology, and the users’ requirement is filtered through task ontology.  The geographic visualization tool displays results (pattern discovered)

Case study: ontology-based spatial clustering of traffic accidents OBS C Input: 353 features in Erie Setting Metadata Theme := Traffic Accident User interface Goal := “identify hot spots” LevelOfDetail := State PlaceName := New York Method Algorithm := SMTIN Constraint := Named-Roadway Output: 18 clusters in Erie County

Case study: Effect of scale (Task ontology)  OBSC clusters reflect spatial distribution specific to the scale of users’ interest Control AlgorithmOBSC Algorithm TASK LevelOfDetail := Null LevelOfDetail := Null PlaceName := Null PlaceName := NullDOMAIN Constraint := Roadway Constraint := RoadwayTASK LevelOfDetail := County LevelOfDetail := County PlaceName := New York PlaceName := New YorkDOMAIN Constraint := Roadway Constraint := Roadway Specifying area of interest doesn’t mask details

Case study: Effect of constraint (Domain ontology)  OBSC clusters identify the physical barrier due to concept implicit in domain Control AlgorithmOBSC Algorithm TASK LevelOfDetail := State LevelOfDetail := State PlaceName := New York PlaceName := New YorkDOMAIN Constraint := Null Constraint := NullTASK LevelOfDetail := State LevelOfDetail := State PlaceName := New York PlaceName := New YorkDOMAIN Constraint := Roadway Constraint := Roadway Separated by body of water

Case study: Benefit of using ontology in spatial clustering  Incorporating ontology in spatial clustering algorithms enhances the quality of spatial clustering results Task ontology makes clusters usable  Responsive to users’ view Domain ontology makes clusters natural  Dictated by concept implicit in domain

Conclusion (cont.)  Presents how ontology are incorporated in spatial data mining algorithms Semantic linkage between ontologies and algorithms through parameterization  Scale as a task-oriented property  Constraint as a domain-specific property

Conclusion  Ontology is examined as a means to customize algorithms to varying domain and task Ontology enables algorithms to reflect concepts implicit in domain, and adapt to users’ view Ontology provides the semantically plausible way to re- use existing algorithms  Ontology provides the systematic way of organizing various factors that dictate mechanisms underlying data mining process