CENTRE Cellular Network’s Positioning Data Generator Fosca GiannottiKDD-Lab Andrea MazzoniKKD-Lab Puntoni SimoneKDD-Lab Chiara RensoKDD-Lab.

Slides:



Advertisements
Similar presentations
Mining User Similarity Based on Location History Yu Zheng, Quannan Li, Xing Xie Microsoft Research Asia.
Advertisements

On-Line Discovery of Hot Motion Paths D. Sacharidis 1, K. Patroumpas 1, M. Terrovitis 1, V. Kantere 1, M. Potamias 2, K. Mouratidis 3, T. Sellis 1 1 National.
By Venkata Sai Pulluri ( ) Narendra Muppavarapu ( )
Han-na Yang Trace Clustering in Process Mining M. Song, C.W. Gunther, and W.M.P. van der Aalst.
Human Mobility Modeling at Metropolitan Scales Sibren Isaacman, Richard Becker, Ramón Cáceres, Margaret Martonosi, James Rowland, Alexander Varshavsky,
Indexing and Range Queries in Spatio-Temporal Databases
Multi-Scale Analysis of Crime and Incident Patterns in Camden Dawn Williams Department of Civil, Environmental & Geomatic.
Patch to the Future: Unsupervised Visual Prediction
Generated Waypoint Efficiency: The efficiency considered here is defined as follows: As can be seen from the graph, for the obstruction radius values (200,
Constructing Popular Routes from Uncertain Trajectories Ling-Yin Wei 1, Yu Zheng 2, Wen-Chih Peng 1 1 National Chiao Tung University, Taiwan 2 Microsoft.
Seismo-Surfer a tool for collecting, querying, and mining seismic data Yannis Theodoridis University of Piraeus
Critical Analysis Presentation: T-Drive: Driving Directions based on Taxi Trajectories Authors of Paper: Jing Yuan, Yu Zheng, Chengyang Zhang, Weilei Xie,
Emulatore di Protocolli di Routing per reti Ad-hoc Alessandra Giovanardi DI – Università di Ferrara Pattern Project Area 3: Problematiche di instradamento.
Computational Geometry and Spatial Data Mining
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Probabilistic Cardinal Direction Queries On Spatio-Temporal Data Ganesh Viswanathan Midterm Project Report CIS 6930 Data Science: Large-Scale Advanced.
Yu Zheng, Lizhu Zhang, Xing Xie, Wei-Ying Ma Microsoft Research Asia
Spatio-Temporal Databases
Week 9 Data Mining System (Knowledge Data Discovery)
Spatio-Temporal Databases. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases …..
Blogosphere  What is blogosphere?  Why do we need to study Blog-space or Blogosphere?
The Data Mining Visual Environment Motivation Major problems with existing DM systems They are based on non-extensible frameworks. They provide a non-uniform.
Dieter Pfoser, LBS Workshop1 Issues in the Management of Moving Point Objects Dieter Pfoser Nykredit Center for Database Research Aalborg University, Denmark.
Mining Behavior Models Wenke Lee College of Computing Georgia Institute of Technology.
GeoPKDD Geographic Privacy-aware Knowledge Discovery and Delivery Kick-off meeting Pisa, March 14, 2005.
Dr. Awad Khalil Computer Science Department AUC
Data Mining Chun-Hung Chou
Privacy Preserving Data Mining on Moving Object Trajectories Győző Gidófalvi Geomatic ApS Center for Geoinformatik Xuegang Harry Huang Torben Bach Pedersen.
Mirco Nanni, Roberto Trasarti, Giulio Rossetti, Dino Pedreschi Efficient distributed computation of human mobility aggregates through user mobility profiles.
Time-focused density-based clustering of trajectories of moving objects Margherita D’Auria Mirco Nanni Dino Pedreschi.
Knowledge Discovery and Delivery Lab (ISTI-CNR & Univ. Pisa)‏ www-kdd.isti.cnr.it Anna Monreale Fabio Pinelli Roberto Trasarti Fosca Giannotti A. Monreale,
Trajectory Pattern Mining
CENTRE CEllular Network Trajectories Reconstruction Environment F. Giannotti, A. Mazzoni, S. Puntoni, C. Renso KDDLab, Pisa.
Spazio ZeroUno Cell Broadcast Forum Plenary, Milan, May 28-29th 2002 Andrea Ghirardini Business Development Spazio ZeroUno An efficient and effective method.
Trajectory Pattern Mining Fosca Giannotti, Mirco Nanni, Dino Pedreschi, Fabio Pinelli KDD Lab (ISTI-CNR & Univ. Pisa) Presented by: Qiming Zou.
Web Usage Mining for Semantic Web Personalization جینی شیره شعاعی زهرا.
Modeling Storing and Mining Moving Object Databases Proceedings of the International Database Engineering and Applications Symposium (IDEAS’04) Sotiris.
Wireless Communications: System Design Dr. Mustafa Shakir.
Spatio-Temporal Case-Based Reasoning for Behavioral Selection Maxim Likhachev and Ronald Arkin Mobile Robot Laboratory Georgia Tech.
Top right corner for field-mark, customer or partner logotypes. See Best practice for example. Slide title 40 pt Slide subtitle 24 pt Text 24 pt Bullets.
Spatio-temporal Pattern Queries M. Hadjieleftheriou G. Kollios P. Bakalov V. J. Tsotras.
INFERRING HUMAN ACTIVITY FROM GPS TRACKS Sun Simiao.
Everyday Mapping of Traffic Conditions - An Urban Planning Tool Laboratory of Geodesy Aristotle University of Thessaloniki, Department of Civil Engineering.
Cmpe 589 Spring 2006 Lecture 2. Software Engineering Definition –A strategy for producing high quality software.
Mining Weather Data for Decision Support Roy George Army High Performance Computing Research Center Clark Atlanta University Atlanta, GA
August 30, 2004STDBM 2004 at Toronto Extracting Mobility Statistics from Indexed Spatio-Temporal Datasets Yoshiharu Ishikawa Yuichi Tsukamoto Hiroyuki.
A SEMINAR REPORT ON CELLULAR SYSTEM Introduction to cellular system The cellular concept was developed and introduce by the bell laboratories in the.
Enhancing service system design : An entity interaction pattern approach Author : Soe-Tsyr Daphne Yuan, Shiou-Tian Hsu Professor : Soe-Tsyr Daphne Yuan.
Intelligent DataBase System Lab, NCKU, Taiwan Josh Jia-Ching Ying 1, Wang-Chien Lee 2, Tz-Chiao Weng 1 and Vincent S. Tseng 1 1 Department of Computer.
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
Chapter 14 : Modeling Mobility Andreas Berl. 2 Motivation  Wireless network simulations often involve movements of entities  Examples  Users are roaming.
Location Privacy Protection for Location-based Services CS587x Lecture Department of Computer Science Iowa State University.
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
Efficient OLAP Operations in Spatial Data Warehouses Dimitris Papadias, Panos Kalnis, Jun Zhang and Yufei Tao Department of Computer Science Hong Kong.
Wireless Network Management SANDEEP. Network Management Network management is a service that employs a variety of tools, applications, and devices to.
An Energy-Efficient Approach for Real-Time Tracking of Moving Objects in Multi-Level Sensor Networks Vincent S. Tseng, Eric H. C. Lu, & Kawuu W. Lin Institute.
Data Stashing: Energy-Efficient Information Delivery to Mobile Sinks through Trajectory Prediction (IPSN 2010) HyungJune Lee, Martin Wicke, Branislav Kusy,
Mapping of Traffic Conditions at Downtown Thessaloniki with the Help of GPS Technology P. D. Savvaidis and K. Lakakis Aristotle University of Thessaloniki,
1 DIVYA K 1RN09IS016 RNSIT. 2 The main purpose in car-to-car networks is to improve communication performance. To demonstrate real scenarios with car-to-car.
Location-based Social Networks 6/11/20161 CENG 770.
Verein Konstantin Melnik Svetlana
Cluster Analysis What is Cluster Analysis? Types of Data in Cluster Analysis A Categorization of Major Clustering Methods Partitioning Methods.
Big Data Quality Challenges for the Internet of Things (IoT) Vassilis Christophides INRIA Paris (MUSE team)
Mobility Trajectory Mining Human Mobility Modeling at Metropolitan Scales Sibren Isaacman 2012 Mobisys Jie Feng 2016 THU FIBLab.
Profiling based unstructured process logs
Architecture Components
Spatio-temporal Pattern Queries
Object-Oriented Analysis
Data Warehousing and Data Mining
Presentation transcript:

CENTRE Cellular Network’s Positioning Data Generator Fosca GiannottiKDD-Lab Andrea MazzoniKKD-Lab Puntoni SimoneKDD-Lab Chiara RensoKDD-Lab

Why to generate data? Trouble in finding  Due to ITC Companies reticence  …and for legal and privacy reasons Need to have ad-hoc datasets  To improve algorithm development  To have a tools for validation and testing phases

CENTRE: CEllular Network Trajectory Reconstruction Environment : A positioning data (LOG) generation Environment aimed to Mobile technology Developed as tool of GeoPKDD projects

GSM technology

GeoPKDD: Geographic Privacy-Aware Knowledge Discovery & Delivery

The Idea To generate positional mobile data (LOG) by the simulation of the event deriving from:  Trajectories of hypothetical mobile network’s users that travel on territory  The resulting survey of this movements using synthetic ad-hoc GSM coverage (the set of BTSs) So we can analyze the set of LOGs and recontruct trajectories of mobile network’s users

Motivation With this model we want to reach:  More rigorous and realistic semantic of generating data.  Possibility to compare synthetic trajectories with reconstructed one.  Chance of validate mining and knowledge discovery algorithms results with synthetic trajectories.

CENTRE architecture

What CENTRE do… First of all we generate a sequence of spatio-temporal points represent a trajectory. We can customize:  Starting point  Velocity  Agility  Direction  Groups of behavior  Infrastructures, ect. Then we overlap a set of antennas represented by circles of their coverage areas:

LOG extraction Where: 1. Obj_ID is the identifier of observed object 2. BTS_ID is the identifier of antenna that made this survey 3. TimeStamp is the time of survey 4. D is a evaluation of distance from object to the center of BTS So LOG is represented by a tuple: ( Obj_ID, BTS_ID, TimeStamp, d) Result of extraction:  LOG at time tt2 (P2) {Cell1, BTS1, tt2, d12}  LOG at time tt3 (P3) {Cell1, BTS1, tt3, d13}, {Cell1, BTS2, tt3, d23}, {Cell1, BTS3, tt3, d33}  LOG at time tt4 (P4) {Cell1, BTS2, tt4, d24}

Dataset

Trajectories reconstruction Once LOG are produced and stored, we forget about synthetic trajectories and try to reconstruct these only from:  LOG collection  Synthetic coverage

Information types Reconstruction was performed considering all LOGs produced on a single temporal instant for a single trajectoty The number of LOGs with same time and same device identificator (id_cell) represent the number of simultaneous relevations 3 LOGs 1 LOG 2 LOGs

Recontruction method When we have:  Only one relevation: our point may be inside the entire antenna covered area, so we take antenna center as point positions  With two or more relevations: point may be only inside the intersection area of them, so we take centroid of this area as point position

Reconstructed trajectories dataset

And now …examples!

………

Now we work on… Make new extensions to main generation engine  In order to test and validate spatial KD algorithms with more efficiency and accuracy. Change old code (that was derived from GSTD code)  Introducing improvements on class structures  Introducing new data characterization specially on spatial and temporal aspects

Multiple generation engines The Idea is to develop extensions to main engine every time we need new features to test and validate KD algorithms. And use each time the best implementation on sinthetic trajectories production engine depending of type of data we need to obtain

Density based clustering We have seen that for best results with this algorithm is useful to have a simple method for:  create clusters and  identify relation between objects and clusters.

Attraction engine For this particular type of algorithm we are developing a new engine extension that use an attraction-like mechanism. Each objects chooses and tries to reach its next attraction area. When it reaches its destination area chooses another one, and so on…

Cluster construction A cluster if formed by a set of objects that are forced to pass through a sequence of areas.

…a simple example In this scenario we can see one object that every time chooses a region with a completely random order. Chosen a region, and a point on it, the object tries to reach this point. …and so on

Others improvements Formalization of some concepts (at code level):  Spatio-temporal data  Spatio-temporal object  Trajectory and a real measures in data values:  Positions are expressed in meters  Velocities are expressed in meters/seconds  Times are expressed in seconds

Conclusions Nowadays work is in progress, and we hope to test as soon as possible a Density Based Algorithm on this new generation engine Contextually we also work on a engine for testing Temporal and Sequential Frequent Pattern Algorithm And also to improve generator use, through simplification of number and form of parameters, graphical interface, ect.