A conceptual Data Model for Trajectory Data Mining

Slides:



Advertisements
Similar presentations
Office of SA to CNS GeoIntelligence Introduction Data Mining vs Image Mining Image Mining - Issues and Challenges CBIR Image Mining Process Ontology.
Advertisements

Chapter 5: Introduction to Information Retrieval
Ch2 Data Preprocessing part3 Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
1 Copyright Jiawei Han; modified by Charles Ling for CS411a/538a Data Mining and Data Warehousing  Introduction  Data warehousing and OLAP for data mining.
Mining Frequent Spatio-temporal Sequential Patterns
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
A First Attempt towards a Logical Model for the PBMS PANDA Meeting, Milano, 18 April 2002 National Technical University of Athens Patterns for Next-Generation.
Week 9 Data Mining System (Knowledge Data Discovery)
Spatio-Temporal Databases. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases …..
Advanced Topics COMP163: Database Management Systems University of the Pacific December 9, 2008.
Tools for semantic trajectory data mining. A importância de considerar a semântica T1 T2 T3 T4 T1 T2 T3 T4 H H H Hotel R R R Restaurant C C C Cinema Padrão.
Data Mining – Intro.
Advanced Database Applications Database Indexing and Data Mining CS591-G1 -- Fall 2001 George Kollios Boston University.
Major Tasks in Data Preprocessing(Ref Chap 3) By Prof. Muhammad Amir Alam.
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
Integrating Multi-Media with Geographical Information in the BORG Architecture R. George Department of Computer Science Clark Atlanta University Atlanta,
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
GeoPKDD Geographic Privacy-aware Knowledge Discovery and Delivery Kick-off meeting Pisa, March 14, 2005.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
GEOPKDD - Meeting Venezia 17 Oct 051 Privacy-preserving data warehousing for spatio- temporal data Maria L. Damiani, Università Milano (I)
Concepts and Terminology Introduction to Database.
Chapter 1 Introduction to Data Mining
Web Usage Mining for Semantic Web Personalization جینی شیره شعاعی زهرا.
1 Knowledge Discovery Transparencies prepared by Ho Tu Bao [JAIST] ITCS 6162.
1 A Conceptual Framework of Data Mining Y.Y. Yao Department of Computer Science, University of Regina Regina, Sask., Canada S4S 0A2
Friends and Locations Recommendation with the use of LBSN By EKUNDAYO OLUFEMI ADEOLA
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Expert Systems with Applications 34 (2008) 459–468 Multi-level fuzzy mining with multiple minimum supports Yeong-Chyi Lee, Tzung-Pei Hong, Tien-Chin Wang.
Introduction of Data Mining and Association Rules cs157 Spring 2009 Instructor: Dr. Sin-Min Lee Student: Dongyi Jia.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
Association Rule Mining Data Mining and Knowledge Discovery Prof. Carolina Ruiz and Weiyang Lin Department of Computer Science Worcester Polytechnic Institute.
INFERRING HUMAN ACTIVITY FROM GPS TRACKS Sun Simiao.
Semantic Aspects in Spatial Data Mining Vania Bogorny.
Data Mining BY JEMINI ISLAM. Data Mining Outline: What is data mining? Why use data mining? How does data mining work The process of data mining Tools.
Towards Semantic Trajectory Outlier Detection Artur Ribeiro de Aquino 1 Luis Otavio Alvares 1 Chiara Renso 2 Vania Bogorny Dep. de Matemática e Estatística.
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Friday, 14 November 2003 William.
Intelligent DataBase System Lab, NCKU, Taiwan Josh Jia-Ching Ying 1, Wang-Chien Lee 2, Tz-Chiao Weng 1 and Vincent S. Tseng 1 1 Department of Computer.
Introduction to Data Mining by Yen-Hsien Lee Department of Information Management College of Management National Sun Yat-Sen University March 4, 2003.
1 The T4SQL Temporal Query Language Presented by 黃泰豐 2007/12/26.
Jun-Ki Min KUT.  Data & Information ◦ Data: facts or values obtained by observation or measurement ◦ Information: interpretation or relationship to help.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 28 Data Mining Concepts.
Differential Analysis on Deep Web Data Sources Tantan Liu, Fan Wang, Jiedan Zhu, Gagan Agrawal December.
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
Part III – Trajectory Data Mining Vania Bogorny Universidade Federal de Santa Catarina, Brazil Department of Informatics and Statistics
Mining Association Rules in Large Database This work is created by Dr. Anamika Bhargava, Ms. Pooja Kaul, Ms. Priti Bali and Ms. Rajnipriya Dhawan and licensed.
Managing Massive Trajectories on the Cloud
Queensland University of Technology
Knowledge Representation Techniques
Data Mining – Intro.
By Arijit Chatterjee Dr
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
DATA MINING © Prentice Hall.
Data and Applications Security Developments and Directions
Physical Database Design and Performance
A Methodology for Finding Bad Data
Data Mining.
MIS 451 Building Business Intelligence Systems
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Mining Spatio-Temporal Reachable Regions over Massive Trajectory Data
Data Mining: Concepts and Techniques Course Outline
Association Rule Mining
Data Mining Association Rules: Advanced Concepts and Algorithms
Spatio-Temporal Databases
Data Warehousing and Data Mining
another noSql customization for the HDB++ archiving system
Evaluation of Relational Operations: Other Techniques
Data Pre-processing Lecture Notes for Chapter 2
Data Preprocessing Copyright, 1996 © Dale Carnegie & Associates, Inc.
Presentation transcript:

A conceptual Data Model for Trajectory Data Mining Universidade Federal de Santa Catarina, Florianopolis, Brazil Informatics and Statistics Department A conceptual Data Model for Trajectory Data Mining * Prof. Vania Bogorny (INE/UFSC - Brazil) vania@inf.ufsc.br Prof. Carlos Alberto Heuser (II/UFRGS - Brazil) Prof. Luis Otavio Alvares (II/UFRGS-Brazil) 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

GIScience 2010 – A conceptual data model for trajectory data mining Outline Motivation Objective Basic concepts Proposed Model Evaluation Conclusion 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

Introduction and Motivation On the one side (database technology.......) Since its origin, database design has the purpose of modeling data for operational purposes only Database designers don't think about data mining during the conceptual database design Data mining (DM) and knowledge discovery (KDD) from databases has become very popular in the last years in many fields and several application domains Dozens of papers with new data mining algorithms have been published in the last decade, but very little has been done for the automatic data preprocessing for (DM), which is the most time consuming step. Statistically it takes between 60 and 80% of the whole discovery process 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania 3 3

Introduction and Motivation On the other side (artificial intelligence.......) Data mining (DM) or knowledge discovery (KDD) from databases has become very popular in the last years in many fields and several application domains Dozens of new data mining algorithms have been proposed in the last decade, but very little has been done for the automatic data preprocessing, which is the most time consuming step Data mining (DM) and knowledge discovery (KDD) from databases has become very popular in the last years in many fields and several application domains Dozens of papers with new data mining algorithms have been published in the last decade, but very little has been done for the automatic data preprocessing for (DM), which is the most time consuming step. Statistically it takes between 60 and 80% of the whole discovery process 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania 4 4

Introduction and Motivation DATABASE Modelling (Normalization) DATA MINING (Disnormalization) One single file 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

Introduction and Motivation Another problem for data mining: data have to be preprocessed and transformed into different granularities Examples: Louvre Museum  Museum  TuristicPlace Instance + type type 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

Introduction and Motivation These problems increase when dealing with trajectories of moving objects, which is the focus of this paper This is specially true for trajectories of moving objects, which is a new kind of spatio-temporal data generated by mobile devices, that in the last few years have become very popular in daily life. 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

GIScience 2010 – A conceptual data model for trajectory data mining Objective We propose a conceptual framework for trajectory database modeling that supports data mining 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

GIScience 2010 – A conceptual data model for trajectory data mining Basic Concepts 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania 9

GIScience 2010 – A conceptual data model for trajectory data mining Trajectories are new kind of spatio-temporal data Trajectories have attracted intensive research in both databases and data mining communities This is specially true for trajectories of moving objects, which is a new kind of spatio-temporal data generated by mobile devices, that in the last few years have become very popular in daily life. 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

GIScience 2010 – A conceptual data model for trajectory data mining Trajectory Raw Data Trajectory Data are: Spatio-temporal data Represented by a set of points located in space and time Form: (tid, x,y,t), where tid is the trajectory identifier, (x,y) represent the spatial location at time t Tid position (x,y) time (t) 1 48.890018 2.246100 08:25 1 48.890018 2.246100 08:26 ... ... ... 1 48.890020 2.246102 08:40 1 48.888880 2.248208 08:41 1 48.885732 2.255031 08:42 ... ... ... 1 48.858434 2.336105 09:04 1 48.853611 2.349190 09:05 2 ... ... 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

The Model of Stops and Moves (Spaccapietra 2008) Important parts of trajectories Where the moving object has stayed for a minimal amount of time Stops are application dependent Tourism application Hotels, touristic places, airport, … Traffic Management Application Traffic lights, roundabouts, big events… MOVES Are the parts that are not stops 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

Semantic Trajectories A semantic trajectory is a set of stops and moves Stops have by a place, a start time and an end time Moves are characterized by two consecutive stops 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

STOPS at Multiple-Granularities Stop at Ibis Hotel from 6:04PM to 7:42PM, september 16, 2010 time space IbisHotel or Hotel or Accommodation Afternoon or Thursday or 6:00PM – 8:00PM or RUSH-HOUR 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

ITEMS - the building blocks for semantic pattern discovery An item is generated either from a stop or a move An item is a set of complex information (space + time), that can be defined in many formats/types and at different granularities 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

Building an ITEM for Data Mining Formats/types for an item: NameOnly: is the name of the stop/move STOPS: name of the spatial feature instance IbisHotel MOVES: name of the two stops which define the move ZurichAirport – IbisHotel NameStart: is the name of the stop/move + start time IbisHotel [morning] --stop LouvreMuseum [weekend] --stop IbisHotel-ZurichAirport [10:00AM-11:00AM] --move 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

Building an ITEM for Data Mining NameEnd: name of a stop/move + end time IbisHotel[morning] stop IbisHotel-ZurichAirport[10:00AM-11:00AM]  move NameStartEnd: name of a stop/move + start time + end time IbisHotel[08:00AM-11:00AM][1:00pm-6:00pm]  stop LouvreMuseum[morning][afternoon]  stop ZurichAirport– IbisHotel [10:00AM-11:00PM] [10:00AM-6:00PM] 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

Semantic Trajectory Patterns Frequent Patterns Sequential Patterns and Association Rules 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania 18

Trajectory Frequent Pattern Is a set of items that occur a minimal number of times (support s) Examples: {LouvreMuseum [08:00-10:00]} (s=0.1) {Airport [morning], hotel [morning]} (s=0.2) {Airport-Hotel, Hotel-Museum} (s=0.15) 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

Trajectory Sequential Pattern Is an ordered list of items that occur a minimal number of times (support s) Examples: <Airport[morning], Hotel[morning], Museum[afternoon] > (s=0.15) <Airport-Hotel, Hotel-Museum> (s= 0.1) 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

Trajectory Association Rule Is a rule where the items occur a minimal number of times (support s) and with a minimal confidence (c) Example Airport[morning], Hotel[morning]  Museum[afternoon] (s=0.1) (c=0.5) 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

GIScience 2010 – A conceptual data model for trajectory data mining The Proposed Model 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania 22

GIScience 2010 – A conceptual data model for trajectory data mining The Proposed Model We extend the model of stops and Moves proposed by Spaccapietra with new attributes and methods Add new classes and relationships, with attributes and methods to automatic data preprocessing and multiple-level mining 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

The Conceptual Data Model of Stops and Moves (Spaccapietra 2008) 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

GIScience 2010 – A conceptual data model for trajectory data mining Proposed OO Model Compute and Store the patterns Data Pre-processing The data to be mined will be stops or moves. Therefore, we build the item over stops and moves ITEM has the objective to aggregate and disaggregate data for mining Time in stop and move is a timestamp (specific time), while in BESITEM and MOVEITEM, time is na aggregate time Spaccapietra´s Model 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania 25 25

GIScience 2010 – A conceptual data model for trajectory data mining Proposed OO Model Stops and Moves are extended with new attributes (specific time, e.g. 07:10 – 08:05 ) and methods to instatiate stops and moves The data to be mined will be stops or moves. Therefore, we build the item over stops and moves ITEM has the objective to aggregate and disaggregate data for mining Time in stop and move is a timestamp (specific time), while in BESITEM and MOVEITEM, time is na aggregate time Concept Hierarchy for the spatial feature type (e.g.: AccomodationPlace  Hotel) 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania 26 26

Proposed Model OO Model Generic class to represent the 3 kinds of patterns Attributes: support, listOfItems Methods: countSupport(), sequentialPattern() Attributes: support, confidence, antecedent (set of items) and consequent (set of items) Methods: countSupport(), associatePattern(), and computeConfidence() Attributes : startT, endT (generic time, e.g. Morning) Methods: getGenericSpatialFeature() – retrieves the hierarchy level timeG() – generalizes time spaceG() – generalizes space based on the hierarchy buildItem() – creates generalized ITEM Frequent Patterns: Attributes: support, setOfItems Methods: countSupport(), frequentPattern() The data to be mined will be stops or moves. Therefore, we build the item over stops and moves ITEM has the objective to aggregate and disaggregate data for mining Time in stop and move is a timestamp (specific time), while in BESITEM and MOVEITEM, time is na aggregate time 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania 27 27

Example of an Instantiated Model 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania 28

Schema of Stops and Moves STOP (Tid integer, Sid integer, SFTname string, SFid integer, startT timestamp, endT timestamp) Ex.: stop (1,1,Hotel, 3, 10AM, 11AM) MOVE (Tid integer, Mid integer, SFT1name string, SF1id integer, SFT2name string, SF2id integer, startT timestamp, endT timestamp, the_move geometry) 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

GIScience 2010 – A conceptual data model for trajectory data mining Schema of the Patterns FrequentPattern/ SequentialPattern (Pid integer, pattern itemSetType, support real) itemSetType (SFT1name string, SF1id integer, SFT2name string, SF2id integer, startT string, endT string) AssociatePattern (Pid integer, antecedent itemSetType, consequent itemSetType, support real, confidence real) Nested relation 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

Instantiating and Querying Patterns To instantiate the patterns we can use the ST-DMQL proposed in (Bogorny 2009) 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania 31

Instantiating Stops and Moves SELECT generateS (method, candidateStops, buffer) FROM trajectory SELECT generateM (method, candidateStops, buffer) IB-SMOT CB-SMOT DB-SMOT ...... 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania 32

Instatiating Sequential Patterns Q1 (tourism application): Which are the sequences of moves that occur most frequently in the morning and in the evening? SELECT sequentialPattern (itemType = NameEnd, timeG = [8:00-12:00 AS morning, 18:00-23:00 AS evening], spaceG = instance, minsup=0.03) FROM move Method in the ST-DMQL Ans: {IbisHotel - NotreDame[morning], EiffelTower – IbisHotel [evening]} (s=0.04) 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

Example of Pattern Queries Q: How many moves of sequential patterns cross Pont Neuf bridge? SELECT count(m.*) FROM sequentialPattern s, bridge b, move m WHERE s.pattern.SFT1name=m.SFT1name AND s.pattern.SF1id=m.SF1id AND s.pattern.SFT2name=m.SFT2name AND s.pattern.SF2id=m.SF2id AND b.name='Pont Neuf' AND intersects (m.the_geom,b.the_geom) 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania 34

GIScience 2010 – A conceptual data model for trajectory data mining Conclusions Data pre-processing is the most time consuming step for DM and KDD To think about data mining during the conceptual design of a database can significantly reduce these steps 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania 35

GIScience 2010 – A conceptual data model for trajectory data mining Conclusions The proposed model: Reduces the pre-processing tasks Supports mining at multiple granularity levels Automatically prepares the data for data mining * Stores the patterns for futures queries Multiple-granularities data patterns Queries 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania 36

Geometric Patterns X Semantic Patterns (Bogorny 2009) TP Touristic Place R Restaurant CC CC T3 H Hotel T2 T3 T2 T1 T4 T1 T4 Semantic trajectory Pattern Hotel to Restaurant, passing by CC (b) go to Cinema, passing by CC 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania 37

GIScience 2010 – A conceptual data model for trajectory data mining Thank You! 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania 38

More examples for generating stops SELECT generateS (CB-SMOT, [Hotel,60,TouristicPlace,15,ShoppingCenter,30], 5) FROM trajectory t, district d WHERE d.name='Bela Vista' and intersects (t.movingpoint.geometry, d.geometry) 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania 39

GIScience 2010 – A conceptual data model for trajectory data mining Querying Rules Suppose that the user is interested in association patterns which have weekend as the time dimension in the antecedent of the rule SELECT * FROM associatePattern WHERE antecedent.startT='weekend' or antecedent.endT='weekend' 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

Basic Concepts: Support 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

Basic Concepts: Semantic Trajectory Patterns Example Sequential Patterns: Let T be a set of semantic trajectories. A sequence of items X={x1; x2; ... ; xn}, ordered in time, is a trajectory sequential pattern with respect to T and minSup if s(X) >= minSup. Example: Work [morning], ShoppingCenter [afternoon], Gym [afternoon] (s=0.08%) 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

Basic Concepts: Semantic Trajectory Patterns Example Home [night], Work [afternoon]  Gym [afternoon] (s=0.10%) (c=0.50) 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

Basic Concepts: Semantic Trajectory Patterns Example ReligiousPlace [weekend], Restaurant [weekend] (s=0. 07) 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

GIScience 2010 – A conceptual data model for trajectory data mining Related Works Mining Trajectory Samples – Extract Geometric Patterns Mining Semantic Trajectories or Trajectory pre-processing for mining Attempts to reduce the gap between databases and data mining Laube 2002, 2005 Giannotti 2007 Lee 2007 Cao 2006, 2007 Li 2010 Alvares 2007 Zhou 2007 Palma 2008 Bogorny 2009 Manso 2010 Data mining query languages, but not for trajectories (Wang 2003, Malerba 2004, Han 1995) b 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania

Example of Frequent Pattern Instantiation Q2: Which are the types of places most frequently visited by tourists on weekdays and weekends? SELECT frequentPattern (itemType =NameStart, timeG = WEEKEND-WEEKDAY, spaceG = [type, GenericHotel = 1], minsup = 0.15) FROM stop Ans: {4StarsHotel[weekend], Museum[weekend], Restaurant[weekend] } (s=0.16) Method in the ST-DMQL 2/22/2019 GIScience 2010 – A conceptual data model for trajectory data mining Vania Bogorny, Universidade Federal de Santa Catarina, Brazil, www.inf.ufsc.br/~vania