Spatial Data Mining hari agung.

Slides:



Advertisements
Similar presentations
1 DATA STRUCTURES USED IN SPATIAL DATA MINING. 2 What is Spatial data ? broadly be defined as data which covers multidimensional points, lines, rectangles,
Advertisements

Copyright Jiawei Han, modified by Charles Ling for CS411a
7/03Spatial Data Mining G Dong (WSU) & H. Liu (ASU) 1 6. Spatial Mining Spatial Data and Structures Images Spatial Mining Algorithms.
1 Chapter 2 The Digital World. 2 Digital Data Representation.
Spatial Data Mining-Applications Hemant Kumar Jerath,B.Tech. MS Project Student Mangalore University Advisors: Dr. B.K Mohan & Dr.(Mrs.).P. Venkatachalam.
Geographic Information Systems (GIS) Dr. Roy Cole Department of geography and Planning GVSU.
The Evolution of Spatial Outlier Detection Algorithms - An Analysis of Design CSci 8715 Spatial Databases Ryan Stello Kriti Mehra.
Spring 2003Data Mining by H. Liu, ASU1 6. Spatial Mining Spatial Data and Structures Images Spatial Mining Algorithms.
Spatial Mining.
Week 9 Data Mining System (Knowledge Data Discovery)
Extraction of high-level features from scientific data sets Eui-Hong (Sam) Han Department of Computer Science and Engineering University of Minnesota Research.
PROCESS IN DATA SYSTEMS PLANNING DATA INPUT DATA STORAGE DATA ANALYSIS DATA OUTPUT ACTIVITIES USER NEEDS.
GTECH 361 Lecture 02 Introduction to ArcGIS. Today’s Objectives explore a map and get information about map features preview geographic data and metadata.
A Unified Approach to Spatial Outliers Detection Chang-Tien Lu Spatial Database Lab Department of Computer Science University of Minnesota
Why Geography is important.
Data Mining – Intro.
Geographic Information System Geog 258: Maps and GIS February 17, 2006.
Spatial Data Mining Yang Yubin Joint Laboratory for Geoinformation Science The Chinese University of Hong Kong
KDD for Science Data Analysis Issues and Examples.
On Some Fundamental Geographical Concepts 176B Lecture 3.
GIS Introduction What is GIS?. Geographic Information Systems A database system in which the organizing principle is explicitly SPATIAL.
PHYSICAL GEOGRAPHY: CONCEPTS AND PERSPECTIVES.
Slope and Aspect Calculated from a grid of elevations (a digital elevation model) Slope and aspect are calculated at each point in the grid, by comparing.
T HE BASIC IDEAS OF GEOGRAPHY Core units: Key understandings Years F–4 Illustration 1: Pointers to understanding.
Bits, Bytes, KiloBytes, MegaBytes, GigaBytes & TeraBytes.
Communications Technology 2104 Mercedes Lahey. Bit 1. bit=From a shortening of the words “binary digit” 2. the basic unit of information for computers.
Communications technology Ali Kennedy.  Bit= from a shortening of the words “ bi nary digit”  The basic unit ofinformation for computers  1 or 0 are.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
Data Mining Techniques
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Data Mining Chun-Hung Chou
Spatial Data Models. What is a Data Model? What is a model? (Dictionary meaning) A set of plans (blueprint drawing) for a building A miniature representation.
Spatial Data Mining Hari Agung Departemen Ilmu Komputer FMIPA IPB
Spatial Database Souhad Daraghma.
Chapter 1: Introduction to Spatial Databases 1.1 Overview 1.2 Application domains 1.3 Compare a SDBMS with a GIS 1.4 Categories of Users 1.5 An example.
IST 210 Introduction to Spatial Databases. IST 210 Evolution of acronym “GIS” Fig 1.1 Geographic Information Systems (1980s) Geographic Information Science.
Introduction to Geographic Information Systems (GIS) Lesson 1.
8. Geographic Data Modeling. Outline Definitions Data models / modeling GIS data models – Topology.
Section 2 : The Geographer’s Craft
Center for Modeling & Simulation.  Introduction to GIS ◦ General Definitions ◦ Concept of space and time ◦ History ◦ Components ◦ Objectives / why use.
Spatial Data Mining Ashkan Zarnani Sadra Abedinzadeh Farzad Peyravi.
Geographic Information Systems Temporal GIS Lecture 8 Eng. Osama Dawoud.
1 CS599 Spatial & Temporal Database Spatial Data Mining: Progress and Challenges Survey Paper appeared in DMKD96 by Koperski, K., Adhikary, J. and Han,
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
1 Spatial Data Models and Structure. 2 Part 1: Basic Geographic Concepts Real world -> Digital Environment –GIS data represent a simplified view of physical.
Spatial Data Mining Satoru Hozumi CS 157B. Learning Objectives Understand the concept of Spatial Data Mining Understand the concept of Spatial Data Mining.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Wednesday, March 29, 2000.
Spatial DBMS Spatial Database Management Systems.
Christoph F. Eick Questions and Topics Review November 11, Discussion of Midterm Exam 2.Assume an association rule if smoke then cancer has a confidence.
Networking for Home and Small Businesses –.  Explain the binary representation of data.
So, what’s the “point” to all of this?….
Concepts of Geographic Thinking
© Vipin Kumar IIT Mumbai Case Study 2: Dipoles Teleconnections are recurring long distance patterns of climate anomalies. Typically, teleconnections.
Mr. Idrissa Y. H. Assistant Lecturer, Geography & Environment Department of Social Sciences School of Natural & Social Sciences State University of Zanzibar.
Grid-based Map Analysis Techniques and Modeling Workshop Part 1 – Maps as Data Part 2– Surface Modeling Part 3 – Spatial Data Mining Linking geographic.
Cluster Analysis What is Cluster Analysis? Types of Data in Cluster Analysis A Categorization of Major Clustering Methods Partitioning Methods.
Spatial statistics Lecture 3 2/4/2008. What are spatial statistics Not like traditional, a-spatial or non-spatial statistics But specific methods that.
Data Mining – Introduction (contd…) Compiled By: Umair Yaqub Lecturer Govt. Murray College Sialkot.
CLARANS: A Method for Clustering Objects for Spatial Data Mining IEEE Transactions on Knowledge and Data Enginerring, 2002 Raymond T. Ng et al. 22 MAR.
Cluster Analysis This work is created by Dr. Anamika Bhargava, Ms. Pooja Kaul, Ms. Priti Bali and Ms. Rajnipriya Dhawan and licensed under a Creative Commons.
Overview of Mining Spatial Data
Data Mining – Intro.
Introduction to Spatial Statistical Analysis
Self organizing networks
CSE572, CBS572: Data Mining by H. Liu
Topic 5: Cluster Analysis
The basic ideas of geography
UNIT 1 REVIEW GEOGRAPHY.
Unit 1: Thinking Geographically
Section 2 Physical geography is the study of the earth’s land and features. People who work in this field are called physical geographers. Climate is not.
Presentation transcript:

Spatial Data Mining hari agung

What is Spatial Data? The data related to objects that occupy space traffic, bird habitats, global climate, logistics, ... Object types: Points, Lines, Polygons,etc. Used in/for: GIS - Geographic Information Systems Meteorology Astronomy Environmental studies, etc.

Why do we need Data Mining? Large number of records(cases) (108-1012 bytes) One thousand (103) bytes = 1 kilobyte (KB) One million (106) bytes = 1 megabyte (MB) One billion (109) bytes = 1 gigabyte (GB) One trillion (1012) bytes = 1 terabyte (TB) High dimensional data (variables) 10-104 attributes Only a small portion, typically 5% to 10%, of the collected data is ever analyzed We are drowning in data, but starving for knowledge!

Spatial Data Mining Spatial Patterns Primary Tasks Spatial outliers Location prediction Associations, co-locations Hotspots, Clustering, trends, … Primary Tasks Mining Spatial Association Rules Spatial Classification and Prediction Spatial Data Clustering Analysis Spatial Outlier Analysis

Spatial Classification Use spatial information at different (coarse/fine) levels (different indexing trees) for data focusing Determine relevant spatial or non-spatial features Perform normal supervised learning algorithms e.g., Decision trees,

Spatial Clustering Use tree structures to index spatial data DBSCAN: R-tree CLIQUE: Grid or Quad tree Clustering with spatial constraints (obstacles  need to adjust notion of distance)

Spatial Association Rules Spatial objects are of major interest, not transactions A  B A, B can be either spatial or non-spatial (3 combinations) What is the fourth combination? Association rules can be found w.r.t. the 3 types Pp 234-235

Spatial Data Mining Results Understanding spatial data, discovering relationships between spatial and nonspatial data, construction of spatial knowledge bases, etc. In various forms The description of the general weather patterns in a set of geographic regions is a spatial characteristic rule. The comparison of two weather patterns in two geographic regions is a spatial discriminant rule. A rule like “most cities in Canada are close to the Canada-US border” is a spatial association rule near(x,coast) ^ southeast(x, USA) ) hurricane(x), (70%) Others: spatial clusters,…

Basic Concepts (1) Spatial data mining follows along the same functions in data mining, with the end objective to find patterns in geography, meteorology, etc. The main difference (Spatial autocorrelation) the neighbors of a spatial object may have an influence on it and therefore have to be considered as well Spatial attributes Topological adjacency or inclusion information Geometric position (longitude/latitude), area, perimeter, boundary polygon

Basic Concepts (2) Spatial neighborhood Topological relation “intersect”, “overlap”, “disjoint”, … distance relation “close_to”, “far_away”,… direction/orientation relation “left_of”, “west_of”,… Global model might be inconsistent with regional models Global Model Local Model

Applications NASA Earth Observing System (EOS): Earth science data National Inst. of Justice: crime mapping Census Bureau, Dept. of Commerce: census data Dept. of Transportation (DOT): traffic data National Inst. of Health(NIH): cancer clusters

Example: What Kind of Houses Are Highly Valued Example: What Kind of Houses Are Highly Valued?—Associative Classification

Data SOM Application for DataMining Downscaling Weather Forecasts ERA-15 using a T106L31 model (from 1978 to 1994) with 1.125◦ resolution Terabytes Comprises data from approx. 20 variables (such as temperature,humidity, pressure, etc.) at 30 pressure levels of a 360x360 nodes grid 6 SOM Application for DataMining Downscaling Weather Forecasts Adaptive Competitive Learning Sub-grid details scape from numerical models

Dept. of Applied Mathematics Universidad de Cantabria Santander, Spain

And now discussion