Spatial Mining.

Slides:



Advertisements
Similar presentations
1 DATA STRUCTURES USED IN SPATIAL DATA MINING. 2 What is Spatial data ? broadly be defined as data which covers multidimensional points, lines, rectangles,
Advertisements

The A-tree: An Index Structure for High-dimensional Spaces Using Relative Approximation Yasushi Sakurai (NTT Cyber Space Laboratories) Masatoshi Yoshikawa.
Spatial Database Systems. Spatial Database Applications GIS applications (maps): Urban planning, route optimization, fire or pollution monitoring, utility.
7/03Spatial Data Mining G Dong (WSU) & H. Liu (ASU) 1 6. Spatial Mining Spatial Data and Structures Images Spatial Mining Algorithms.
Spatial Indexing SAMs. Spatial Indexing Point Access Methods can index only points. What about regions? Z-ordering and quadtrees Use the transformation.
Searching on Multi-Dimensional Data
1 Enviromatics Spatial database systems Spatial database systems Вонр. проф. д-р Александар Маркоски Технички факултет – Битола 2008 год.
Data Mining Techniques: Clustering
Spring 2003Data Mining by H. Liu, ASU1 6. Spatial Mining Spatial Data and Structures Images Spatial Mining Algorithms.
2-dimensional indexing structure
Geodatabases by Shawn J. Dorsch Spatial Databases Part 2.
Spring 2008MMDB--MIRS1 Multimedia Information Retrieval Systems.
Multiple-key indexes Index on one attribute provides pointer to an index on the other. If V is a value of the first attribute, then the index we reach.
Spatial Access Methods Chapter 26 of book Read only 26.1, 26.2, 26.6 Dr Eamonn Keogh Computer Science & Engineering Department University of California.
Spatial Indexing for NN retrieval
© Prentice Hall1 DATA MINING Introductory and Advanced Topics Part III Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist.
Spatio-Temporal Databases
Spatio-Temporal Databases. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases …..
Spatial Indexing SAMs.
Multi-dimensional Indexes
Spatial Information Systems (SIS) COMP Spatial access methods: Indexing.
Spatial Indexing I Point Access Methods.
Spatial Queries Nearest Neighbor Queries.
KNN, LVQ, SOM. Instance Based Learning K-Nearest Neighbor Algorithm (LVQ) Learning Vector Quantization (SOM) Self Organizing Maps.
R-Trees 2-dimensional indexing structure. R-trees 2-dimensional version of the B-tree: B-tree of maximum degree 8; degree between 3 and 8 Internal nodes.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Prepared by Abzamiyeva Laura Candidate of the department of KKGU named after Al-Farabi Kizilorda, Kazakstan 2012.
Indexing Spatial Data (Parts of Chapter 25+R-tree paper)
Fast Subsequence Matching in Time-Series Databases Christos Faloutsos M. Ranganathan Yannis Manolopoulos Department of Computer Science and ISR University.
R-Trees Extension of B+-trees.  Collection of d-dimensional rectangles.  A point in d-dimensions is a trivial rectangle.
Trees for spatial indexing
Data Mining Techniques
Spatial Data Management Chapter 28. Types of Spatial Data Point Data –Points in a multidimensional space E.g., Raster data such as satellite imagery,
Database management Systems, 3ed, R. Ramakrishnan and J. Gehrke1 Spatial Data Management Chapter 28.
Mutlidimensional Indices Instructor: Randal Burns Lecture for 29 November 2005 Computer Science Johns Hopkins University.
Indexing for Multidimensional Data An Introduction.
Database System Concepts ©Silberschatz, Korth and Sudarshan See for conditions on re-usewww.db-book.com 1 Indexing Spatial Data.
Multidimensional Indexes Applications: geographical databases, data cubes. Types of queries: –partial match (give only a subset of the dimensions) –range.
R-Tree. 2 Spatial Database (Ia) Consider: Given a city map, ‘index’ all university buildings in an efficient structure for quick topological search.
Nearest Neighbor Queries Chris Buzzerd, Dave Boerner, and Kevin Stewart.
Spatial Data Mining hari agung.
Spatial Query Processing Spatial DBs do not have a set of operators that are considered to be basic elements in a query evaluation. Spatial DBs handle.
Spatial DBMS Spatial Database Management Systems.
Challenges in Mining Large Image Datasets Jelena Tešić, B.S. Manjunath University of California, Santa Barbara
Bin Yao (Slides made available by Feifei Li) R-tree: Indexing Structure for Data in Multi- dimensional Space.
Levels of Image Data Representation 4.2. Traditional Image Data Structures 4.3. Hierarchical Data Structures Chapter 4 – Data structures for.
Spatial Database 2/5/2011 Reference – Ramakrishna Gerhke and Silbershatz.
Spatial Indexing Techniques Introduction to Spatial Computing CSE 5ISC Some slides adapted from Spatial Databases: A Tour by Shashi Shekhar Prentice Hall.
Spatial and Geographic Databases. Spatial databases store information related to spatial locations, and support efficient storage, indexing and querying.
Trajectory Data Mining Dr. Yu Zheng Lead Researcher, Microsoft Research Chair Professor at Shanghai Jiao Tong University Editor-in-Chief of ACM Trans.
© Prentice Hall1 ADVANCED TOPICS IN DATA MINING CSE 8331 Spring 2008 Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist.
Spatial Indexing Techniques Introduction to Spatial Computing CSE 5ISC Some slides adapted from Spatial Databases: A Tour by Shashi Shekhar Prentice Hall.
© Prentice Hall1 DATA MINING Introductory and Advanced Topics Part III Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist.
Data Mining By Farzana Forhad CS 157B. Agenda Decision Tree and ID3 Rough Set Theory Clustering.
File Processing : Multi-dimensional Index 2015, Spring Pusan National University Ki-Joune Li.
What is GIS? “A powerful set of tools for collecting, storing, retrieving, transforming and displaying spatial data”
Multi-dimensional Range Query Processing on the GPU Beomseok Nam Date Intensive Computing Lab School of Electrical and Computer Engineering Ulsan National.
CLUSTERING GRID-BASED METHODS Elsayed Hemayed Data Mining Course.
© Prentice Hall1 DATA MINING Introductory and Advanced Topics Part III.
Multidimensional Access Structures COMP3017 Advanced Databases Dr Nicholas Gibbins –
Indexing Multidimensional Data
Multidimensional Access Structures
Spatial Indexing I Point Access Methods.
Nearest-Neighbor Classifiers
Research Areas Christoph F. Eick
Multidimensional Indexes
Multidimensional Search Structures
Donghui Zhang, Tian Xia Northeastern University
Data Mining CSCI 307, Spring 2019 Lecture 11
Presentation transcript:

Spatial Mining

Introduction Spatial Mining is a specialised domain of data mining whose goal is to find implicit knowledge in spatial data. Spatial data may be viewed as objects with some location in physical space. Spatial data also contain non-spatial attributes that may be either dependent or independent on location.

Spatial Data Also known as geospatial data or geographic information it is the data or information that identifies the geographic location of features and boundaries on Earth, such as natural or constructed features, oceans, and more. Spatial data is usually stored as coordinates and topology, and is data that can be mapped.

Spatial Data Spatial data has location or geo-referenced features. Some of these features are: Address, latitude/longitude (explicit) Location-based partitions in databases (implicit)

Spatial Object Contains both spatial and nonspatial attributes. Must have a location type attributes: Latitude/longitude Zip code Street address May retrieve object using either (or both) spatial or nonspatial attributes.

Applications and Problems Geographic information systems (GIS) store information related to geographic locations on Earth Weather, community infrastructure needs, disaster management, and hazardous waste Homeland security issues such as prediction of unexpected events and planning of evacuation. Remote sensing and image classification. Biomedical applications include medical imaging and illness diagnosis.

Spatial Queries Spatial selection may involve specialized selection comparison operations: Near North, South, East, West Contained in Overlap/intersect Region (Range) Query – find objects that intersect a given region. Nearest Neighbor Query – find object close to identified object. Distance Scan – find object within a certain distance of an identified object where distance is made increasingly larger.

Spatial Data Structures Data structures designed specifically to store or index spatial data These are:- Minimum bounding rectangles (MBR) Different tree structures Quad tree R-Tree kd-Tree Image databases

MBR Minimum Bounding Rectangle Smallest rectangle that completely contains the object

MBR Examples

Quad Tree Hierarchical decomposition of the space into quadrants (MBRs) Each level in the tree represents the object as the set of quadrants which contain any portion of the object. Each level is a more exact representation of the object. The number of levels is determined by the degree of accuracy desired. © Prentice Hall

Tree Structures Quad Tree: every four quadrants in one layer forms a parent quadrant in an upper layer An example Dunham Pp224-226

R-Tree As with Quad Tree the region is divided into successively smaller rectangles (MBRs). Rectangles need not be of the same size or number at each level. Rectangles may actually overlap. Lowest level cell has only one object. Tree maintenance algorithms similar to those for B-trees.

R-Tree Indexing MBRs in a tree An R-tree of order m has at most m entries in one node An example (order of 3) R8 R8 R1 R2 R3 R6 R5 R4 R7 R6 R7 R1 R2 R3 R4 R5

R-Tree Example

K-D Tree Designed for multi-attribute data, not necessarily spatial Variation of binary search tree Each level is used to index one of the dimensions of the spatial object. Lowest level cell has only one object Divisions not based on MBRs but successive divisions of the dimension range.

k-D Tree Example

kd-Tree Indexing multi-dimensional data, one dimension for a level in a tree An example

Common Tasks dealing with Spatial Data Data focusing Spatial queries Identifying interesting parts in spatial data Progress refinement can be applied in a tree structure Feature extraction Extracting important/relevant features for an application Classification or others Using training data to create classifiers Many mining algorithms can be used Classification, clustering, associations

Spatial Mining Tasks Spatial classification Spatial clustering Spatial association rules

Spatial Classification Use spatial information at different (coarse/fine) levels (different indexing trees) for data focusing Determine relevant spatial or non-spatial features Perform normal supervised learning algorithms e.g., Decision trees,

Spatial Clustering Use tree structures to index spatial data DBSCAN: R-tree CLIQUE: Grid or Quad tree Clustering with spatial constraints (obstacles  need to adjust notion of distance)

Spatial Association Rules Spatial objects are of major interest, not transactions A  B A, B can be either spatial or non-spatial (3 combinations) What is the fourth combination? Pp 234-235