1 Modeling and Language Support for the management of PBMS Manolis Terrovitis Panos Vassiliadis Spiros Skiadopoulos Elisa Bertino Barbara Catania Anna.

Slides:



Advertisements
Similar presentations
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
Advertisements

Representing and Querying Correlated Tuples in Probabilistic Databases
GridVine: Building Internet-Scale Semantic Overlay Networks By Lan Tian.
1 A Description Logic with Concrete Domains CS848 presentation Presenter: Yongjuan Zou.
1 Knowledge Representation We’ve discussed generic search techniques. Usually we start out with a generic technique and enhance it to take advantage of.
On Comparing the Expressing Power of Access Control Model Frameworks Workshop on Logical Foundations of an Adaptive Security Infrastructure (WOLFASI) A.
Integrating Bayesian Networks and Simpson’s Paradox in Data Mining Alex Freitas University of Kent Ken McGarry University of Sunderland.
Automatic Classification of Accounting Literature Nineteenth Annual Strategic and Emerging Technologies Workshop Vasundhara Chakraborty, Victoria Chiu,
Civil and Environmental Engineering Carnegie Mellon University Sensors & Knowledge Discovery (a.k.a. Data Mining) H. Scott Matthews April 14, 2003.
01 -1 Lecture 01 Artificial Intelligence Topics –Introduction –Knowledge representation –Knowledge reasoning –Machine learning –Applications.
Dynamic Ontologies on the Web Jeff Heflin, James Hendler.
Relational Data Mining in Finance Haonan Zhang CFWin /04/2003.
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
A First Attempt towards a Logical Model for the PBMS PANDA Meeting, Milano, 18 April 2002 National Technical University of Athens Patterns for Next-Generation.
Sensemaking and Ground Truth Ontology Development Chinua Umoja William M. Pottenger Jason Perry Christopher Janneck.
Spatial and Temporal Data Mining V. Megalooikonomou Introduction to Decision Trees ( based on notes by Jiawei Han and Micheline Kamber and on notes by.
PatManQL: A language to manipulate patterns and data in hierarchical catalogs Panagiotis Bouros, Theodore Dalamagas, Timos Sellis, Manolis Terrovitis Knowledge.
Architecture for Pattern- Base Management Systems Manolis TerrovitisPanos Vassiliadis National Technical Univ. of Athens, Dept. of Electrical and Computer.
Data Mining with Decision Trees Lutz Hamel Dept. of Computer Science and Statistics University of Rhode Island.
Queensland University of Technology An Ontology-based Mining Approach for User Search Intent Discovery Yan Shen, Yuefeng Li, Yue Xu, Renato Iannella, Abdulmohsen.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
Data Mining By Archana Ketkar.
02 -1 Lecture 02 Agent Technology Topics –Introduction –Agent Reasoning –Agent Learning –Ontology Engineering –User Modeling –Mobile Agents –Multi-Agent.
Data Mining – Intro.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
Business Intelligence
Query Processing Presented by Aung S. Win.
Overview of Distributed Data Mining Xiaoling Wang March 11, 2003.
1 © Goharian & Grossman 2003 Introduction to Data Mining (CS 422) Fall 2010.
Spatial Statistics and Spatial Knowledge Discovery First law of geography [Tobler]: Everything is related to everything, but nearby things are more related.
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
Context Tailoring the DBMS –To support particular applications Beyond alphanumerical data Beyond retrieve + process –To support particular hardware New.
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
Ming Fang 6/12/2009. Outlines  Classical logics  Introduction to DL  Syntax of DL  Semantics of DL  KR in DL  Reasoning in DL  Applications.
School of Computing FACULTY OF ENGINEERING Developing a methodology for building small scale domain ontologies: HISO case study Ilaria Corda PhD student.
Introduction to Web Mining Spring What is data mining? Data mining is extraction of useful patterns from data sources, e.g., databases, texts, web,
The Relational Model: Relational Calculus
LDK R Logics for Data and Knowledge Representation Modeling First version by Alessandro Agostini and Fausto Giunchiglia Second version by Fausto Giunchiglia.
1 A Theoretical Framework for Association Mining based on the Boolean Retrieval Model on the Boolean Retrieval Model Peter Bollmann-Sdorra.
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Multi-Relational Data Mining: An Introduction Joe Paulowskey.
Panel Discussion on Foundations of Data Mining at RSCTC2004 J. T. Yao University of Regina Web:
Date : 2013/03/18 Author : Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, Grant Weddell Source : CIKM’12 Speaker : Er-Gang Liu Advisor : Prof. Jia-Ling.
Introduction to Databases
Ranking objects based on relationships Computing Top-K over Aggregation Sigmod 2006 Kaushik Chakrabarti et al.
OLAP in DWH Ján Genči PDT. 2 Outline OLAP Definitions and Rules The term OLAP was introduced in a paper entitled “Providing On-Line Analytical.
DL Overview Second Pass Ming Fang 06/19/2009. Outlines  Description Languages  Knowledge Representation in DL  Logical Inference in DL.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Restrictions on Concept Lattices for Pattern Management Léonard Kwuida, Rokia Missaoui, Beligh Ben Amor, Lahcen Boumedjout, Jean Vaillancourt October 20,
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Friday, 14 November 2003 William.
PSYCHO: A Prototype System for Pattern Management Barbara Catania, Anna Maddalena, Maurizio Mazza DISI - University of Genoa, Italy VLDB ’05 – Trondheim.
Lecture 7: Foundations of Query Languages Tuesday, January 23, 2001.
Data Mining and Decision Support
Data Mining By Farzana Forhad CS 157B. Agenda Decision Tree and ID3 Rough Set Theory Clustering.
Presented by Kyumars Sheykh Esmaili Description Logics for Data Bases (DLHB,Chapter 16) Semantic Web Seminar.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
Differential Analysis on Deep Web Data Sources Tantan Liu, Fan Wang, Jiedan Zhu, Gagan Agrawal December.
1 Representing and Reasoning on XML Documents: A Description Logic Approach D. Calvanese, G. D. Giacomo, M. Lenzerini Presented by Daisy Yutao Guo University.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
Visualization in Process Mining
School of Computer Science & Engineering
A Black-Box Approach to Query Cardinality Estimation
Probabilistic Data Management
Architectural Support for Database Visualization
Introduction to Databases
CSc4730/6730 Scientific Visualization
Relational Algebra Chapter 4, Sections 4.1 – 4.2
PBKM: A Secure Knowledge Management Framework
CoXML: A Cooperative XML Query Answering System
Presentation transcript:

1 Modeling and Language Support for the management of PBMS Manolis Terrovitis Panos Vassiliadis Spiros Skiadopoulos Elisa Bertino Barbara Catania Anna Maddalena

2 Outline Introduction Modeling of data and patterns Query operators Summary and future work

3 Motivation Huge amounts of data are produced. Interesting knowledge has to be detected and extracted. Knowledge extraction techniques (i.e., Data Mining) are not sufficient:  Huge amounts of results (clusters, association tules, decision trees etc)  Arbitrary modeling of results

4 Motivation (con’t) We need to be able to manipulate the knowledge discovered! The basic requirements:  A generic and homogenous model for patterns.  Well defined query operators.  Efficient storage.

5 The Patterns and PBMS [Rizzi et. al. ER 2003] Patterns are compact and rich in semantics representations of raw data.  Clusters, association rules, decision trees e.t.c. Pattern Base Management System  Patterns are treated as first class citizens  Pattern-based queries  Approximate mapping between patterns and raw data

6 Contributions We formally define the logical foundations for pattern management We present a pattern specification language We introduce queries and query operators

7 Outline Introduction Modeling of data and patterns Query operators Summary and future work

8 PBMS architecture Pattern Space:  Pattern Types  Pattern Classes  Patterns Intermediate Results Data Space

9 The patterns Patterns hold information for:  the data source  the structure of the pattern  The relation between the structure and the source, in an approximate logical formula.

10 Pattern - Cluster Example Pid337 Structure[CENTER: [X: 21, Y: 1200], RAD: 12 ] DataEMP: {[Age, Salary]} Formula(t.Age - 21) 2 + (t.Salary ) 2 ≤ 12 2 where t  EMP

11 Pattern type - example NameDisk Structure Schema[CENTER: [X:real, Y: real], RAD: real ] Data SchemaREL: {[X: real, Y: real]} Formula Schema(t.X - CENTER.X) 2 + (t.Y - CENTER.Y ) 2 ≤RAD 2 where t  REL

12 The formula An intentional description of the pattern- data relation  pros: Efficiency, more intuitive results  cons: Accuracy

13 Intentional vs. Extensional

14 The formula (con’t) The formula is a predicate: fp(x,y) where x  Source,y  Structure Expressiveness.  Functions and predicates Safety.  Range restriction. Queries employing the formula are n-depth domain independent.

15 Outline Introduction Modeling of data and patterns Query operators Summary and future work

16 Query Operators Query operator classes:  Database operators  Pattern Base operators  Crossover database operators  Crossover pattern base operators

17 Crossover Operators PID data structure formula Pattern Space Data Space Exact Approximation Exact evaluation, via the intermediate mappings Approximate evaluation, via the formula

18 Crossover Operators Database  Drill-Through: Which data are represented by these patterns?  Data-Covering: Which data from this dataset can be represented by this pattern? Pattern Base  Pattern-Covering: Which of these patterns represent this dataset?

19 Query Example Drill-through( { p | p intersects q})

20 Outline Introduction Modeling of data and patterns Query Operators Summary and future work

21 Summary Formal specification of basic PBMS concepts Investigation on the representation of the pattern-data relation Formal definition of query operators

22 Future Work Query language Generic similarity measures Efficient implementation of intermediate mappings Statistical measures for the patterns.