Spatial Data Mining, G. Dong (WSU) & H. Liu (ASU), 7/03

6. Spatial Mining
Spatial Data and Structures
Images
Spatial Mining Algorithms


Definitions
Spatial data is about instances located in a physical space
Spatial data has location or geo-referenced features
Some of these features are:
  – Address, latitude/longitude (explicit)
  – Location-based partitions in databases (implicit)

Applications and Problems
Geographic information systems (GIS) store information related to geographic locations on Earth
  – Weather, community infrastructure needs, disaster management, and hazardous waste
Homeland security issues such as prediction of unexpected events and planning of evacuation
Remote sensing and image classification
Biomedical applications include medical imaging and illness diagnosis

Use of Spatial Data
Map overlay – merging disparate data
  – Different views of the same area: (Level 1) streets, power lines, phone lines, sewer lines; (Level 2) actual elevations, building locations, and rivers
Spatial selection – find all houses near WSU
Spatial join – nearest for points, intersection for areas
Other basic spatial operations
  – Region/range query for objects intersecting a region
  – Nearest neighbor query for objects closest to a given place
  – Distance scan asking for objects within a certain radius
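The three basic operations listed above can be sketched with plain linear scans. This is only an illustration: the `houses` data and all function names are hypothetical, objects are reduced to points, and a real system would answer these queries through a spatial index rather than by scanning every object.

```python
import math

# Hypothetical objects: name -> (x, y) point location.
houses = {"h1": (1.0, 1.0), "h2": (4.0, 5.0), "h3": (9.0, 2.0), "h4": (2.0, 3.0)}

def dist(a, b):
    """Euclidean distance between two 2-D points."""
    return math.hypot(a[0] - b[0], a[1] - b[1])

def range_query(points, xmin, ymin, xmax, ymax):
    """Region/range query: names of points inside an axis-aligned region."""
    return sorted(n for n, (x, y) in points.items()
                  if xmin <= x <= xmax and ymin <= y <= ymax)

def nearest_neighbor(points, q):
    """Nearest neighbor query: name of the point closest to q."""
    return min(points, key=lambda n: dist(points[n], q))

def distance_scan(points, q, radius):
    """Distance scan: names of points within `radius` of q."""
    return sorted(n for n, p in points.items() if dist(p, q) <= radius)

print(range_query(houses, 0, 0, 3, 4))         # ['h1', 'h4']
print(nearest_neighbor(houses, (3.0, 3.5)))    # h4
print(distance_scan(houses, (3.0, 3.5), 2.0))  # ['h2', 'h4']
```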

Spatial Data Structures
Minimum bounding rectangles (MBR)
Different tree structures
  – Quad tree
  – R-Tree
  – kd-Tree
Image databases

MBR
Representing a spatial object by the smallest axis-aligned rectangle [(x1,y1), (x2,y2)] that encloses it, or by several such rectangles
[Figure: a bounding rectangle with corners labeled (x1,y1) and (x2,y2)]
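As a minimal sketch of the idea, the MBR of an object given as a list of vertices is just the minimum and maximum of its coordinates, and two MBRs can be tested for overlap with four comparisons. The shapes and function names below are hypothetical. Overlapping MBRs only indicate that the objects may intersect, so a test like this is typically used as a cheap filter before an exact geometric check.

```python
def mbr(points):
    """Return the minimum bounding rectangle ((x1, y1), (x2, y2)) of a point list."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return (min(xs), min(ys)), (max(xs), max(ys))

def mbrs_intersect(a, b):
    """True if two MBRs overlap (touching edges count as overlap)."""
    (ax1, ay1), (ax2, ay2) = a
    (bx1, by1), (bx2, by2) = b
    return ax1 <= bx2 and bx1 <= ax2 and ay1 <= by2 and by1 <= ay2

# Hypothetical objects described by their vertices.
triangle = [(2, 1), (6, 3), (4, 7)]
lake = [(5, 5), (9, 6), (8, 9)]
road = [(10, 0), (12, 1)]

print(mbr(triangle))                             # ((2, 1), (6, 7))
print(mbrs_intersect(mbr(triangle), mbr(lake)))  # True
print(mbrs_intersect(mbr(triangle), mbr(road)))  # False
```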

Tree Structures
Quad Tree: every four quadrants in one layer form a parent quadrant in the layer above
  – An example
[Figure: an example quad tree]
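The layering can be sketched with a small point quad tree in which a full quadrant splits into four equal children. This is one common variant, not necessarily the one in the slide's figure; the class name, the `capacity` and `min_size` parameters, and the sample points are all assumptions made for illustration.

```python
class QuadTree:
    """Quad tree over the square [x, x+size) x [y, y+size)."""

    def __init__(self, x, y, size, capacity=1, min_size=1e-6):
        self.x, self.y, self.size = x, y, size
        self.capacity = capacity
        self.min_size = min_size  # stop splitting below this size (guards duplicates)
        self.points = []
        self.children = None      # four sub-quadrants once this node has split

    def contains(self, p):
        return (self.x <= p[0] < self.x + self.size and
                self.y <= p[1] < self.y + self.size)

    def insert(self, p):
        if not self.contains(p):
            return False
        if self.children is None:
            if len(self.points) < self.capacity or self.size <= self.min_size:
                self.points.append(p)
                return True
            self._split()
        return any(c.insert(p) for c in self.children)

    def _split(self):
        h = self.size / 2
        self.children = [QuadTree(self.x + dx, self.y + dy, h,
                                  self.capacity, self.min_size)
                         for dy in (0, h) for dx in (0, h)]
        old, self.points = self.points, []
        for q in old:  # push stored points down into the new quadrants
            any(c.insert(q) for c in self.children)

    def depth(self):
        if self.children is None:
            return 1
        return 1 + max(c.depth() for c in self.children)

    def query(self, xmin, ymin, xmax, ymax):
        """Points inside the window, skipping quadrants that cannot overlap it."""
        if (xmax < self.x or xmin >= self.x + self.size or
                ymax < self.y or ymin >= self.y + self.size):
            return []
        if self.children is None:
            return [p for p in self.points
                    if xmin <= p[0] <= xmax and ymin <= p[1] <= ymax]
        found = []
        for c in self.children:
            found.extend(c.query(xmin, ymin, xmax, ymax))
        return found

tree = QuadTree(0, 0, 8)
for point in [(1, 1), (6, 2), (5, 6), (7, 7)]:
    tree.insert(point)

print(tree.depth())                     # 3
print(sorted(tree.query(4, 0, 8, 8)))   # [(5, 6), (6, 2), (7, 7)]
```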

R-Tree
Indexing MBRs in a tree
  – An R-tree of order m has at most m entries in one node
  – An example (order of 3)
[Figure: rectangles R1 to R8 and the corresponding R-tree]
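A rough sketch of how an R-tree of order 3 answers a window query follows. The rectangles A to G are hypothetical and are not the R1 to R8 of the slide's figure. The tree is built by a simple sort-and-pack step, which is an assumption made to keep the example short; R-trees are usually built by repeated insertion with node splitting. The point to notice is that the search descends only into nodes whose MBR overlaps the query window.

```python
ORDER = 3  # at most 3 entries per node

def union(rects):
    """MBR (xmin, ymin, xmax, ymax) enclosing all given rectangles."""
    return (min(r[0] for r in rects), min(r[1] for r in rects),
            max(r[2] for r in rects), max(r[3] for r in rects))

def intersects(a, b):
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]

class Node:
    def __init__(self, entries, leaf):
        self.entries = entries  # leaf: (mbr, name); internal: (mbr, child Node)
        self.leaf = leaf
        self.mbr = union([e[0] for e in entries])

def bulk_load(objects):
    """Pack a non-empty list of (mbr, name) pairs into a tree, ORDER entries per node."""
    items = sorted(objects, key=lambda e: e[0][0])  # simplistic: sort by xmin
    level = [Node(items[i:i + ORDER], True) for i in range(0, len(items), ORDER)]
    while len(level) > 1:
        entries = [(n.mbr, n) for n in level]
        level = [Node(entries[i:i + ORDER], False)
                 for i in range(0, len(entries), ORDER)]
    return level[0]

def search(node, window, stats):
    """Names of objects whose MBR overlaps the window; counts visited nodes."""
    stats["nodes_visited"] += 1
    hits = []
    for rect, item in node.entries:
        if intersects(rect, window):
            if node.leaf:
                hits.append(item)
            else:
                hits.extend(search(item, window, stats))
    return hits

# Hypothetical object MBRs.
objects = [((0, 0, 2, 2), "A"), ((1, 3, 3, 5), "B"), ((2, 1, 4, 2), "C"),
           ((6, 6, 8, 8), "D"), ((7, 1, 9, 3), "E"), ((8, 5, 10, 7), "F"),
           ((12, 0, 13, 2), "G")]
root = bulk_load(objects)

stats = {"nodes_visited": 0}
print(search(root, (6.5, 0, 9.5, 4), stats))  # ['E']
print(stats["nodes_visited"])                 # 2 of the 4 nodes
```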

kd-Tree
Indexing multi-dimensional data, with one dimension used for splitting at each level of the tree
  – An example
[Figure: an example kd-tree]
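A compact sketch of the "one dimension per level" idea: the splitting axis cycles with depth, and each node stores the median point along that axis. The sample points and the function names are illustrative assumptions. The nearest-neighbor search visits the far side of a split only when the splitting plane is closer than the best point found so far.

```python
import math

def dist(a, b):
    return math.sqrt(sum((u - v) ** 2 for u, v in zip(a, b)))

def build(points, depth=0):
    """Build a kd-tree; level `depth` splits on axis depth mod k."""
    if not points:
        return None
    axis = depth % len(points[0])
    pts = sorted(points, key=lambda p: p[axis])
    mid = len(pts) // 2
    return {"point": pts[mid], "axis": axis,
            "left": build(pts[:mid], depth + 1),
            "right": build(pts[mid + 1:], depth + 1)}

def nearest(node, q, best=None):
    """Return the stored point closest to q."""
    if node is None:
        return best
    p, axis = node["point"], node["axis"]
    if best is None or dist(p, q) < dist(best, q):
        best = p
    diff = q[axis] - p[axis]
    near, far = ((node["left"], node["right"]) if diff < 0
                 else (node["right"], node["left"]))
    best = nearest(near, q, best)
    if abs(diff) < dist(best, q):  # the other side could still hold a closer point
        best = nearest(far, q, best)
    return best

points = [(2, 3), (5, 4), (9, 6), (4, 7), (8, 1), (7, 2)]
tree = build(points)

print(tree["point"])           # (7, 2): median on x at the root
print(nearest(tree, (9, 2)))   # (8, 1)
```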

Common Tasks Dealing with Spatial Data
Data focusing
  – Spatial queries
  – Identifying interesting parts in spatial data
  – Progressive refinement can be applied in a tree structure
Feature extraction
  – Extracting important/relevant features for an application
Classification or others
  – Using training data to create classifiers
  – Many mining algorithms can be used: classification, clustering, associations

Spatial Mining Tasks
Spatial classification
Spatial clustering
Spatial association rules

Spatial Classification
Use spatial information at different (coarse/fine) levels (different indexing trees) for data focusing
Determine relevant spatial or non-spatial features
Apply standard supervised learning algorithms
  – e.g., decision trees

Spatial Clustering
Use tree structures to index spatial data
  – DBSCAN: R-tree
  – CLIQUE: Grid or Quad tree
Clustering with spatial constraints (obstacles → need to adjust the notion of distance)
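To make the role of the index concrete, here is a minimal DBSCAN sketch. The `region_query` helper below is a linear scan, used only to keep the example self-contained; it is the step that an R-tree would accelerate. The points, `eps`, and `min_pts` are illustrative values. Handling obstacles would mean replacing the Euclidean distance inside `region_query` with a distance that accounts for them.

```python
import math

NOISE = -1

def region_query(points, i, eps):
    """Indices of all points within eps of points[i] (linear scan stand-in for an index)."""
    xi, yi = points[i]
    return [j for j, (x, y) in enumerate(points)
            if math.hypot(x - xi, y - yi) <= eps]

def dbscan(points, eps, min_pts):
    """Return one label per point: a cluster id (0, 1, ...) or NOISE."""
    labels = [None] * len(points)  # None means not yet visited
    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        neighbors = region_query(points, i, eps)
        if len(neighbors) < min_pts:
            labels[i] = NOISE      # may later become a border point
            continue
        cluster += 1               # points[i] is a core point: start a cluster
        labels[i] = cluster
        seeds = list(neighbors)
        k = 0
        while k < len(seeds):
            j = seeds[k]
            k += 1
            if labels[j] == NOISE:
                labels[j] = cluster  # border point reached from a core point
            if labels[j] is not None:
                continue
            labels[j] = cluster
            j_neighbors = region_query(points, j, eps)
            if len(j_neighbors) >= min_pts:  # j is also core: keep expanding
                seeds.extend(j_neighbors)
    return labels

points = [(1, 1), (1, 2), (2, 1), (2, 2),   # dense group
          (5, 5),                           # isolated point
          (8, 8), (8, 9), (9, 8)]           # second dense group
print(dbscan(points, eps=1.5, min_pts=3))   # [0, 0, 0, 0, -1, 1, 1, 1]
```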

Spatial Association Rules
Spatial objects are of major interest, not transactions
A → B
  – A, B can be either spatial or non-spatial (3 combinations)
  – What is the fourth combination?
Association rules can be found w.r.t. the 3 types
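As an illustration of a rule with a spatial antecedent and a non-spatial consequent, the sketch below evaluates the usual support and confidence measures for a rule of the form "close to a park → high price". The data, the 2.0 distance threshold, and all names are invented for the example; the spatial predicate is computed from coordinates, after which the rule is scored like any other association rule.

```python
import math

# Hypothetical data: park locations and houses with a non-spatial attribute.
parks = [(0.0, 0.0), (10.0, 10.0)]
houses = [
    {"loc": (1.0, 0.5), "price": "high"},
    {"loc": (0.5, 1.0), "price": "high"},
    {"loc": (9.0, 9.5), "price": "low"},
    {"loc": (5.0, 5.0), "price": "low"},
    {"loc": (4.0, 6.0), "price": "high"},
]

def close_to_park(house, max_dist=2.0):
    """Spatial predicate: is the house within max_dist of any park?"""
    x, y = house["loc"]
    return any(math.hypot(x - px, y - py) <= max_dist for px, py in parks)

def high_price(house):
    """Non-spatial predicate."""
    return house["price"] == "high"

def rule_stats(objects, antecedent, consequent):
    """Support and confidence of the rule antecedent → consequent."""
    matching_a = [o for o in objects if antecedent(o)]
    matching_both = [o for o in matching_a if consequent(o)]
    support = len(matching_both) / len(objects)
    confidence = len(matching_both) / len(matching_a) if matching_a else 0.0
    return support, confidence

support, confidence = rule_stats(houses, close_to_park, high_price)
print(f"support={support:.2f} confidence={confidence:.2f}")
# support=0.40 confidence=0.67
```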

Summary
Spatial data can contain both spatial and non-spatial features.
When spatial information becomes the dominant interest, spatial data mining should be applied.
Spatial data structures can facilitate spatial mining.
Standard data mining algorithms can be modified for spatial data mining, with a substantial amount of preprocessing to take spatial information into account.
