Artificial Neural Network Applications on Remotely Sensed Imagery Kaushik Das, Qin Ding, William Perrizo North Dakota State University

Slides:



Advertisements
Similar presentations
Major Operations of Digital Image Processing (DIP) Image Quality Assessment Radiometric Correction Geometric Correction Image Classification Introduction.
Advertisements

Resolution Resolving power Measuring of the ability of a sensor to distinguish between signals that are spatially near or spectrally similar.
Raster Data in ArcSDE 8.2 Why Put Images in a Database? What are Basic Raster Concepts? How Raster data stored in Database?
You have just been given an aerial photograph that is not registered to real world coordinates. How do you display the aerial with other data layers that.
1 Data & Database Development. 2 Data File Bit Byte Field Record File Database Entity Attribute Key field Key file management concepts include:
Digital Data Introduction to Remote Sensing Instructor: Dr. Cheng-Chien LiuCheng-Chien Liu Department of Earth Sciences National Cheng Kung University.
Introduction to ArcView ArcView_module_2 May 12, 10:40 AM.
Raster and Vector 2 Major GIS Data Models. Raster and Vector 2 Major GIS Data Models.
Intro. To GIS Lecture 4 Data: data storage, creation & editing
Spatial data models (types)
Performance Improvement for Bayesian Classification on Spatial Data with P-Trees Amal S. Perera Masum H. Serazi William Perrizo Dept. of Computer Science.
GIS is composed of layers Layers –land/water –roads –urban areas –pollution levels Data can be represented by VECTORS, or Data can be represented by RASTERS.
Vertical Set Square Distance: A Fast and Scalable Technique to Compute Total Variation in Large Datasets Taufik Abidin, Amal Perera, Masum Serazi, William.
GIS 1110 Designing Geodatabases. Representation Q. How will we model our real world data? A. Typically: Features Continuous Surfaces and Imagery Map Graphics.
Geo-referenced Information Processing System. ISPRS Geoprocessing Technologies to collect and treat spatial information for a specific goal. Geoprocessing.
Raster Data Chapter 7. Introduction  Vector – discrete  Raster – continuous  Continuous –precipitation –elevation –soil erosion  Regular grid cell.
Faculty of Applied Engineering and Urban Planning Civil Engineering Department Geographic Information Systems Vector and Raster Data Models Lecture 3 Week.
MULTI-LAYERED SOFTWARE SYSTEM FRAMEWORK FOR DISTRIBUTED DATA MINING
Remotely Sensed Data EMP 580 Fall 2015 Dr. Jim Graham Materials from Sara Hanna.
Clustering Analysis of Spatial Data Using Peano Count Trees Qiang Ding William Perrizo Department of Computer Science North Dakota State University, USA.
Bit Sequential (bSQ) Data Model and Peano Count Trees (P-trees) Department of Computer Science North Dakota State University, USA (the bSQ and P-tree technology.
Partitioning – A Uniform Model for Data Mining Anne Denton, Qin Ding, William Jockheck, Qiang Ding and William Perrizo.
Introduction to ArcView NPS Introduction to GIS: Lecture 2 Based on NINC, ESRI and Other Sources.
Chapter 3 Digital Representation of Geographic Data.
How do we represent the world in a GIS database?
Raster Data Model.
CHAPTER 4 RASTER DATA MODEL 4.1 Elements of the Raster Data Model
Remote Sensing. Vulnerability is the degree to which a system is susceptible to, or unable to cope with, adverse effects of climate change, including.
Ptree * -based Approach to Mining Gene Expression Data Fei Pan 1, Xin Hu 2, William Perrizo 1 1. Dept. Computer Science, 2. Dept. Pharmaceutical Science,
Lecture 3 The Digital Image – Part I - Single Channel Data 12 September
Satellite Imagery and Remote Sensing DeeDee Whitaker SW Guilford High EES & Chemistry
Raster data models Rasters can be different types of tesselations SquaresTrianglesHexagons Regular tesselations.
Association Rule Mining on Remotely Sensed Imagery Using Peano-trees (P-trees) Qin Ding, Qiang Ding, and William Perrizo Computer Science Department North.
Data Mining and Data Warehousing Many-to-Many Relationships Applications William Perrizo Dept of Computer Science North Dakota State Univ.
Accuracy of Land Cover Products Why is it important and what does it all mean Note: The figures and tables in this presentation were derived from work.
Map of the Great Divide Basin, Wyoming, created using a neural network and used to find likely fossil beds See:
GIS Data Structures How do we represent the world in a GIS database?
K-Nearest Neighbor Classification on Spatial Data Streams Using P-trees Maleq Khan, Qin Ding, William Perrizo; NDSU.
Efficient OLAP Operations for Spatial Data Using P-Trees Baoying Wang, Fei Pan, Dongmei Ren, Yue Cui, Qiang Ding William Perrizo North Dakota State University.
Converting raster images to XML and SVG The potential of XML - encoded images and SVG image files in Geomatics.
CFR 250/590 Introduction to GIS, Autumn 1999 View Basics © Phil Hurvitz, intro.ppt 1 Overview Getting data into ArcView Displaying themes Theme.
TEMPLATE DESIGN © Predicate-Tree based Pretty Good Protection of Data William Perrizo, Arjun G. Roy Department of Computer.
Clustering using Wavelets and Meta-Ptrees Anne Denton, Fang Zhang.
P-Tree Implementation Anne Denton. So far: Logical Definition C.f. Dr. Perrizo’s slides Logical definition Defines node information Representation of.
The role of a data model in GIS. Levels of GIS data model abstraction.
Peano Count Trees and Association Rule Mining for Gene Expression Profiling using DNA Microarray Data Dr. William Perrizo, Willy Valdivia, Dr. Edward Deckard,
Accelerating Multilevel Secure Database Queries using P-Tree Technology Imad Rahal and Dr. William Perrizo Computer Science Department North Dakota State.
Data Models, Pixels, and Satellite Bands. Understand the differences between raster and vector data. What are digital numbers (DNs) and what do they.
Window based software for Neuro-Fuzzy Classification of Remotely Sensed Image (Stand along application and extension for ArcGIS) Xiaogang Yang POEC 6387.
What is GIS? “A powerful set of tools for collecting, storing, retrieving, transforming and displaying spatial data”
NOTE, THIS PPT LARGELY SWIPED FROM
Landsat Satellite Data. 1 LSOS (1-ha) 9 Intensive Study Areas (1km x 1km) 3 Meso-cell Study Areas (25km x 25km) 1 Small Regional Study Area (1.5 o x 2.5.
P Left half of rt half ? false  Left half pure1? false  Whole is pure1? false  0 5. Rt half of right half? true  1.
Multimedia Data Mining using P-trees* William Perrizo,William Jockheck, Amal Perera, Dongmei Ren, Weihua Wu, Yi Zhang Computer Science Department North.
Graphics and image data representation
Data Mining Motivation: “Necessity is the Mother of Invention”
Decision Tree Classification of Spatial Data Streams Using Peano Count Trees Qiang Ding Qin Ding * William Perrizo Department of Computer Science.
Digital Data Format and Storage
This week’s earth observatory: false colour image
Yue (Jenny) Cui and William Perrizo North Dakota State University
Spatial Data Models Raster uses individual cells in a matrix, or grid, format to represent real world entities Vector uses coordinates to store the shape.
Dr. William Perrizo North Dakota State University
3. Vertical Data LECTURE 2 Section 3.
Specifics on Digitizing and More tips on a Layout
Storage Structure and Efficient File Access
A Spatial Data and Sensor Network Application:
Spatial Databases - Representation
Spatial Databases - Representation
The P-tree Structure and its Algebra Qin Ding Maleq Khan Amalendu Roy
Integrating Query Processing and Data Mining in Relational DBMSs
Presentation transcript:

Artificial Neural Network Applications on Remotely Sensed Imagery Kaushik Das, Qin Ding, William Perrizo North Dakota State University

Remotely Sensed Image (RSI) data Satellite image / aerial photography –Landsat scenes covers 180 by 180 kilometers. One scene for every place on earth every 18 days Nearly a petabyte of data. Valuable for precision agriculture. –A aerial photograph may cover a particular field (e.g., 800 by 800 meters). Non-satellite imagery –Soil moisture –Nitrate concentration –Yield maps

Spatial Data Pixel – a point in a space Band – feature attribute of the pixels Value – we will assume all are one byte (0~255) Images have different numbers of bands –TM4/5: 7 bands (B, G, R, NIR, MIR, TIR, MIR2) –TM7: 8 bands (B, G, R, NIR, MIR, TIR, MIR2, PC) –TIFF: 3 bands (B, G, R) –Ground data: individual bands (Yield, Moisture, Nitrate level, Temperature, elevation…)

Spatial dataset example TIFF imageYield Map Spatial dataset can be viewed as collection of pixels, each having a value for each feature attribute For example, the spatial dataset above has 320 rows and 320 columns of pixels (102,400 pixels) and 4 feature attributes (B,G,R,Y). The (B,G,R) feature bands are in the TIFF image and the Y feature is color coded in the Yield Map.

Spatial Data Formats Existing formats –BSQ (Band Sequential) –BIL (Band Interleaved by Line) –BIP (Band Interleaved by Pixel) New format –bSQ (bit Sequential)

Spatial Data Formats (Cont.) BAND ( ) ( ) ( ) ( ) BAND ( ) ( ) ( ) ( ) BSQ format (2 files) Band 1: Band 2:

Spatial Data Formats (Cont.) BAND ( ) ( ) ( ) ( ) BAND ( ) ( ) ( ) ( ) BSQ format (2 files) Band 1: Band 2: BIL format (1 file)

Spatial Data Formats (Cont.) BAND ( ) ( ) ( ) ( ) BAND ( ) ( ) ( ) ( ) BSQ format (2 files) Band 1: Band 2: BIL format (1 file) BIP format (1 file)

Spatial Data Formats (Cont.) BAND ( ) ( ) ( ) ( ) BAND ( ) ( ) ( ) ( ) BSQ format (2 files) Band 1: Band 2: BIL format (1 file) BIP format (1 file) bSQ format (16 files) B11 B12 B13 B14 B15 B16 B17 B18 B21 B22 B23 B24 B25 B26 B27 B

bSQ Format Split each band into eight separate files, one for each bit position. Reasons for using bSQ format –Different bits contribute to the value differently. –bSQ format facilitates the representation of a precision hierarchy (from 1 bit up to 8 bit precision). –bSQ format facilitates the creation of an efficient data mining-ready data structure, Peano-Count-tree (Ptree).

The “tabular” formats (inverted list) BSQ and bSQ are “tabular” formats –BSQ consist of a separate table for each feature band –bSQ consist of a separate table for each bit of each band One can view it this way: –The data set is initially one table, R(K 1,..,K k, A 1, A 2, …, A n ) where K 1,..,K k are structure attributes and each A i is a feature attribute. The structure attributes of a 2-D spatial dataset are the X and Y coordinates of the pixels (rows). The feature attributes are the bands, B,G,R, NIR, … In BSQ we separate each feature into a separate file and suppress the structure attributes altogether (under the assumption that the pixels are always arranged in raster order. In bSQ we separate each bit of each feature into a separate file (same raster order assumption)

Peano Count Tree (P-tree) P-tree represents spatial bSQ data bit-by-bit in a recursive quadrant-by-quadrant arrangement. An P-tree is a lossless representation of the original data. A P-tree is a compressed structure. A P-tree is “count pre-computed”.

An example of Ptree Peano or Z-ordering Pure (Pure-1/Pure-0) quadrant Root Count  Level  Fan-out  QID (Quadrant ID)

Ptree features Peano or Z-ordering Pure (Pure-1/Pure-0) quadrant Root Count  Level  Fan-out  QID (Quadrant ID) ( 7, 1 ) ( 111, 001 )  Level-0  Level-3  Level-2  Level-1

Basic, Value and Tuple Ptrees Value Ptrees (i.e., P 1, 001 = P 11 ’ AND P 12 ’ AND P 13 ) Tuple Ptrees (i.e., P 001, 010, 111 = P 1, 001 AND P 2, 010 AND P 3, 111 ) AND Basic Ptrees (i.e., P 11, P 12, …, P 18, P 21, …, P 28, …, P 71, …, P 78 )

Self-organizing Map (SOM) application SOM – a special class of Artificial Neural Networks Competitive learning – only one winner neuron per group SOM can gives an intuitive two-dimensional map of a spatial data set in P-tree format.

Goal Use SOM to cluster yield attribute into high, medium and low yield regions. Create pointers from cluster points to the corresponding areas of an aerial photo. Derive association rules from the SOM map

System Architecture Client server architecture Using CORBA as the backbone

System Screen Layout

High Yield Medium Yield Low Yield Generated SOM from the image 29NW072894

Advantages of using CORBA We can add more servers easily. CORBA is a standard. More services are provided. –Security –Dynamic method invocation –Multi-threaded service Makes code efficient and clean. CORBA + XML constitute a rudimentary “single server view of the network as discussed by Dr. Mochida.

Conclusion Considered new data structures for data mining and a clustering application. Use wavelet for data preprocessing. Generate SOM and cluster the yield map into high, medium and low yield regions.