1 Modeling Evolution in Spatial Datasets Paul Amalaman 2/17/2012 Dr Eick Christoph Nouhad Rizk Zechun Cao Sujing Wang Data Mining and Machine Learning.

Slides:



Advertisements
Similar presentations
Chapter 8 Geocomputation Part A:
Advertisements

Discriminative and generative methods for bags of features
AIR POLLUTION. ATMOSPHERIC CHEMICAL TRANSPORT MODELS Why models? incomplete information (knowledge) spatial inference = prediction temporal inference.
Machine Learning Neural Networks
New Geometric Methods of Mixture Models for Interactive Visualization PIs: Jia Li, Xiaolong (Luke) Zhang, Bruce Lindsay Department of Statistics College.
SSP Re-hosting System Development: CLBM Overview and Module Recognition SSP Team Department of ECE Stevens Institute of Technology Presented by Hongbing.
Transitioning to HPC: Experiences from the Atmospheric Sciences Dr. Joe Galewsky Department of Earth and Planetary Sciences University of New Mexico
Meta Learning and Active Learning: Meta Learning and Active Learning: Collaborative Knowledge Discovery in Distributed Systems Dr Yonghong Peng Department.
Classification and Prediction: Regression Analysis
Nawaf M Albadia Introduction. Components. Behavior & Characteristics. Classes & Rules. Grid Dimensions. Evolving Cellular Automata using Genetic.
Machine Learning in Simulation-Based Analysis 1 Li-C. Wang, Malgorzata Marek-Sadowska University of California, Santa Barbara.
Data Mining Techniques
University of Toronto 8/30/20151 Data Mining The Art and Science of Obtaining Knowledge from Data Dr. Saed Sayad.
Data Mining Chun-Hung Chou
Issues with Data Mining
Last Words COSC Big Data (frameworks and environments to analyze big datasets) has become a hot topic; it is a mixture of data analysis, data mining,
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Introduction to Data Mining Group Members: Karim C. El-Khazen Pascal Suria Lin Gui Philsou Lee Xiaoting Niu.
Indiana GIS Conference, March 7-8, URBAN GROWTH MODELING USING MULTI-TEMPORAL IMAGES AND CELLULAR AUTOMATA – A CASE STUDY OF INDIANAPOLIS SHARAF.
The normalized cell error for cell at time n |V(H c(1,1) )|=25 |E(H c(1,1) )|=40 |V(H c(2,1) )|=21 |E(H c(2,1) )|=24 |V(H c(3,1) )|=9 |E(H c(3,1) )|=8.
Name: Sujing Wang Advisor: Dr. Christoph F. Eick
Chapter 9 Neural Network.
Improved Gene Expression Programming to Solve the Inverse Problem for Ordinary Differential Equations Kangshun Li Professor, Ph.D Professor, Ph.D College.
Outline 1-D regression Least-squares Regression Non-iterative Least-squares Regression Basis Functions Overfitting Validation 2.
AI Week 14 Machine Learning: Introduction to Data Mining Lee McCluskey, room 3/10
Department of Computer Science 2015 Research Areas and Projects 1.Data Mining and Machine Learning Group (UH-DMML) Its research is focusing on: 1.Spatial.
INTERACTIVE ANALYSIS OF COMPUTER CRIMES PRESENTED FOR CS-689 ON 10/12/2000 BY NAGAKALYANA ESKALA.
INTERNATIONAL INSTITUTE FOR GEO-INFORMATION SCIENCE AND EARTH OBSERVATION Transition Rule Elicitation Methods for Urban Cellular Automata Models Junfeng.
Simulating Human Agropastoral Activities Using Hybrid Agent- Landscape Modeling M. Barton School of Human Evolution and Social Change College of Liberal.
ESIP Federation Air Quality Cluster Partner Agencies.
10/28/2014 Xiangshang Li, Yunsoo Choi, Beata Czader Earth and Atmospheric Sciences University of Houston The impact of the observational meteorological.
Cellular Automata based Edge Detection. Cellular Automata Definition A discrete mathematical system characterized by local interaction and an inherently.
Cellular Automata Machine For Pattern Recognition Pradipta Maji 1 Niloy Ganguly 2 Sourav Saha 1 Anup K Roy 1 P Pal Chaudhuri 1 1 Department of Computer.
An Investigation of Commercial Data Mining Presented by Emily Davis Supervisor: John Ebden.
Office of Research and Development National Exposure Research Laboratory, Atmospheric Modeling and Analysis Division Office of Research and Development.
AUTOMATIC TARGET RECOGNITION AND DATA FUSION March 9 th, 2004 Bala Lakshminarayanan.
Patch Based Prediction Techniques University of Houston By: Paul AMALAMAN From: UH-DMML Lab Director: Dr. Eick.
U.S. EPA and WIST Rob Gilliam *NOAA/**U.S. EPA
An Agent Epidemic Model Toward a general model. Objectives n An epidemic is any attribute that is passed from one person to others in society è disease,
Chenglin Xie1, Bo Huang1, Christophe Claramunt2 and
Department of Computer Science 1 Data Mining / KDD Let us find something interesting! Definition := “KDD is the non-trivial process of identifying valid,
CISC Machine Learning for Solving Systems Problems Microarchitecture Design Space Exploration Lecture 4 John Cavazos Dept of Computer & Information.
Support to scientific research on seasonal-to-decadal climate and air quality modelling Pierre-Antoine Bretonnière Francesco Benincasa IC3-BSC - Spain.
Data Mining Concepts and Techniques Course Presentation by Ali A. Ali Department of Information Technology Institute of Graduate Studies and Research Alexandria.
Downscaling Global Climate Model Forecasts by Using Neural Networks Mark Bailey, Becca Latto, Dr. Nabin Malakar, Dr. Barry Gross, Pedro Placido The City.
Spatial statistics What is spatial statistics?  Refers to a very broad collection of methods and techniques of visualization, exploration and analysis.
Demand Management and Forecasting Chapter 11 Portions Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
7. Air Quality Modeling Laboratory: individual processes Field: system observations Numerical Models: Enable description of complex, interacting, often.
of Temperature in the San Francisco Bay Area
What Else is Important in AI we Did not Cover?
On Routine Evolution of Complex Cellular Automata
PARALLEL COMPUTING.
What is Correlation Analysis?
CSE 4705 Artificial Intelligence
Introductory Seminar on Research: Fall 2017
Intelligent Information System Lab
ALZHEIMER DISEASE PREDICTION USING DATA MINING TECHNIQUES P.SUGANYA (RESEARCH SCHOLAR) DEPARTMENT OF COMPUTER SCIENCE TIRUPPUR KUMARAN COLLEGE FOR WOMEN.
of Temperature in the San Francisco Bay Area
Geog 192 – Urban GIS Applications
Research Focus Objectives: The Data Analysis and Intelligent Systems (DAIS) Lab  aims at the development of data analysis, data mining, GIS and artificial.
Geospatial Technology in Climate Change
Data Analysis and Intelligent Systems Lab
Shashi Shekhar Weili Wu Sanjay Chawla Ranga Raju Vatsavai
Spatial Data Mining Definition: Spatial data mining is the process of discovering interesting patterns from large spatial datasets; it organizes by location.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Somi Jacob and Christian Bach
Graph-based Security and Privacy Analytics via Collective Classification with Joint Weight Learning and Propagation Binghui Wang, Jinyuan Jia, and Neil.
Scientific Computing Lab
Hydrology Modeling in Alaska: Modeling Overview
Presentation transcript:

1 Modeling Evolution in Spatial Datasets Paul Amalaman 2/17/2012 Dr Eick Christoph Nouhad Rizk Zechun Cao Sujing Wang Data Mining and Machine Learning Lab Team Members Anirup Dutta Swati Goyal Tarikul Islam Paul Amalaman

I- Background II-Research Goals III-Case Study IV-Summary 2

Machine Learning Techniques are mostly used where modeling implicit trends is possible (Regression) stable patterns exist in dataset (Classification) Simulation Systems are used when a model is hard to establish there is a great degree of randomness in the attribute values there are a lot of interactions between objects when attributes have to be predicted recursively over many steps Example Applications of Simulation Systems: Traffic Modeling, Weather Forecasting, Social Networks, Urban Modeling 3 I-Background

I-Background continued(3) Spatial Simulation Systems Cellular Automata (CA) (Cell centered approach) Continuous Agent Space Or Multi Agent System (MAS) (Agent centered approach) ABM 4

Concept of neighborhood  Moore Neighborhood  Von Newman neighborhood Moore Neighborhood Von Newman Neighborhood 5 D(x-1,y-1)D(x-1,y)D(x+1,y-1) D(x-1,y)P(x,y)D(x+1,y) D(x-1,y+1) D(x+1,y+1) D(x-1,y) P(x,y)D(x+1,y) D(x-1,y+1) I-Background continued(3) Modeling with Cellular Automata

I-Background continued(4) Modeling with Cellular Automata Cellular Automata provides the programmer a cell-centered programming style where the set of cells represents computing units that are regularly organized good efficiency with parallel architecture 6

II-Research Goals Using Data Mining and Machine Learning Techniques to Enhance Simulation Systems New approach= Machine Learning Techniques + Spatial Simulation Systems Goal1: Grid-based Models for Progression in Spatial Datasets Goal2: Development of Cluster-based Bias Removal Methods 7

8 ? y i,j,t+1 = f ij (x 1,1,1,t,…, x 1,n,n,t,…, x m,1,1,t,…, x m,n,n,t, y 1,1,t,…,y,n,n,t ) II-Research Goal continued (1) Goal1:Grid-based Models for Progression in Spatial Datasets t t +1 X1(t) X2(t). Xn(t) Y(t) X1(t+Δt)=? X2(t+Δt)=?. Xn(t+Δt)=? Y(t+Δt)=? Given that at t we know all the attribute values including the output variable Y, can we predict all attribute values at t+1? Challenges: 1. Many target variables to predict; different variables have to be predicted at different location 2. Target variables are not independent of each other (e.g. some are auto-correlated) 3. Models has to be used over multiple steps

EPA prediction models are meteorological and chemical transport models. Those models are derived from solving differential equations. Over time, the model bias grows larger 9 II-Research Goal continued (2) Goal2:Development of Cluster-based Bias Removal Methods Model Output + bias b(x) Input x Whether pattern recognition Model Output Correction (bias removal) Input x Output h(b(x), group(x)) Bias removal based on whether pattern recognition Our model, model h learn group(x), and b(x) and make better prediction b(x) group(x)

III-Case Study Improving Ozone Forecasting For Houston- Galveston Area Goal1: Development of a Grid-based Prediction Framework Goal2: Development of Cluster-based Bias Removal Methods In Collaboration with UH-IMAQS Institute for Multidimensional Air Quality Studies (UH Department of Earth and Atmospheric Science) -Dr Rappenglueck, Bernhard -Dr Li, Xiangshang 10

III-Case Study Continued(1) Ozone Prediction Goal 1:Improving Prediction for Spatial Progression Given what happened at t, can we predict what happens at t+Δ, t+2Δ,..? 11

Goal 2- Improving forecast Accuracy 12 III-Case Study Continued(2) Ozone Prediction

III-Case Study Continued(2) Status of Dissertation Methods to collect ozone data and to capture it in a relational database have been developed. The necessary knowledge for simulation- based prediction systems in general, and ozone prediction in particular has been obtained Started work on different modeling approaches for grid-based prediction 13

IV-SUMMARY 14

Thank you! 15