Location Prediction and Spatial Data Mining (S. Shekhar)

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Design of Experiments Questions Network Inference Working Group October 8, 2008.
Brief introduction on Logistic Regression
CPSC 502, Lecture 15Slide 1 Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 15 Nov, 1, 2011 Slide credit: C. Conati, S.
Spatial Dependency Modeling Using Spatial Auto-Regression Mete Celik 1,3, Baris M. Kazar 4, Shashi Shekhar 1,3, Daniel Boley 1, David J. Lilja 1,2 1 CSE.
Data Mining Classification: Alternative Techniques
Ensemble Methods An ensemble method constructs a set of base classifiers from the training data Ensemble or Classifier Combination Predict class label.
CSCI 347 / CS 4206: Data Mining Module 07: Implementations Topic 03: Linear Models.
Intelligent Environments1 Computer Science and Engineering University of Texas at Arlington.
Software Quality Ranking: Bringing Order to Software Modules in Testing Fei Xing Michael R. Lyu Ping Guo.
Multivariate Methods Pattern Recognition and Hypothesis Testing.
Virtual Computing Environment for Future Combat Systems.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
A PARALLEL FORMULATION OF THE SPATIAL AUTO-REGRESSION MODEL FOR MINING LARGE GEO-SPATIAL DATASETS HPDM 2004 Workshop at SIAM Data Mining Conference Barış.
Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS,
Spatial Data Mining: Three Case Studies For additional details Shashi Shekhar, University of Minnesota Presented.
Aprendizagem baseada em instâncias (K vizinhos mais próximos)
Arizona State University DMML Kernel Methods – Gaussian Processes Presented by Shankar Bhargav.
Modeling Gene Interactions in Disease CS 686 Bioinformatics.
Data Mining – Intro.
LLNL-PRES This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.
1 Project Ideas. 2 Algorithmic Evaluations/Comparisons  Compare variants of (nested) policy rollout using different bandit algorithms  Compare some.
Midterm Review. 1-Intro Data Mining vs. Statistics –Predictive v. experimental; hypotheses vs data-driven Different types of data Data Mining pitfalls.
Last Words COSC Big Data (frameworks and environments to analyze big datasets) has become a hot topic; it is a mixture of data analysis, data mining,
Army High Performance Computing Research Center Prof. Shashi Shekhar Computational Sciences & Engineering for Defense Technology Applications Enabling.
Active Learning for Class Imbalance Problem
by B. Zadrozny and C. Elkan
COMP3503 Intro to Inductive Modeling
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
DATA MINING LECTURE 10 Classification k-nearest neighbor classifier Naïve Bayes Logistic Regression Support Vector Machines.
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
Automated Detection and Classification Models SAR Automatic Target Recognition Proposal J.Bell, Y. Petillot.
Today Ensemble Methods. Recap of the course. Classifier Fusion
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Statistical Inference (By Michael Jordon) l Bayesian perspective –conditional perspective—inferences.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Last Words DM 1. Mining Data Steams / Incremental Data Mining / Mining sensor data (e.g. modify a decision tree assuming that new examples arrive continuously,
Module networks Sushmita Roy BMI/CS 576 Nov 18 th & 20th, 2014.
Statistical Methods for Data Analysis Introduction to the course Luca Lista INFN Napoli.
Page  1 LAND COVER GEOSTATISTICAL CLASSIFICATION FOR REMOTE SENSING  Kęstutis Dučinskas, Lijana Stabingiene and Giedrius Stabingis  Department of Statistics,
Slides for “Data Mining” by I. H. Witten and E. Frank.
Data Mining and Decision Support
NTU & MSRA Ming-Feng Tsai
Supervised Machine Learning: Classification Techniques Chaleece Sandberg Chris Bradley Kyle Walsh.
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
Measure of System Effectiveness Missile Defense System By Alfred Terris UNCL:ASSIFIED1.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Shape2Pose: Human Centric Shape Analysis CMPT888 Vladimir G. Kim Siddhartha Chaudhuri Leonidas Guibas Thomas Funkhouser Stanford University Princeton University.
Data Mining: Concepts and Techniques1 Prediction Prediction vs. classification Classification predicts categorical class label Prediction predicts continuous-valued.
Linear Discriminant Functions Chapter 5 (Duda et al.) CS479/679 Pattern Recognition Dr. George Bebis.
Data Mining – Intro.
Chapter 7. Classification and Prediction
Introduction to Marketing Research
Prepared by: Mahmoud Rafeek Al-Farra
Classification of Remotely Sensed Data
Ben Saylor1, Anagha Kulkarni1, Neo Martinez2, Ilmi Yoon1
Automatic Picking of First Arrivals
Spatial statistics Topic 4 2/2/2007.
(Geo) Informatics across Disciplines!
Shashi Shekhar Weili Wu Sanjay Chawla Ranga Raju Vatsavai
COSC 4335: Other Classification Techniques
Classification and Prediction
Spatial Data Analysis: Intro to Spatial Statistical Concepts
Machine Learning: Lecture 6
Machine Learning: UNIT-3 CHAPTER-1
A protocol for data exploration to avoid common statistical problems
Spatial Data Mining: Three Case Studies
Dan Roth Department of Computer Science
Presentation transcript:

Location Prediction and Spatial Data Mining (S. Shekhar) Specific Project in 2001-2002 Evaluation of location prediction techniques Towards high performance parallel implementation AHPCRC Relevance – Projectile Target Interaction Portfolio Increase lethality of weapons such as guided missiles Location prediction for map matching to check correctness of missile trajectory To identify unanticipated obstacle Towards possible rerouting Army Relevance in general Predicting global hot spots (FORMID) Army land management endangered species vs. training and war games Search for local trends in massive simulation data Critical infra-structure defense (threat assessment) Inferring enemy tactics (e.g. flank attack) from blobology Locating enemy (e.g. sniper in a haystack, sensor networks) Locating friends to avoid friendly fire

Location Prediction Problem Definition: Past Approaches: Given: 1. Spatial Framework 2. Explanatory functions: 3. A dependent function: 4. A family of function mappings: Find: A function Objective: maximize classification accuracy Constraints: Spatial Autocorrelation in dependent function Past Approaches: Non-spatial: logistic regression, decision trees, Bayesian Assume independent distribution for learning samples Auto-correlation => poor prediction performance Spatial: Spatial auto-regression (SAR), Markov random field Bayesian classifier (MRF) No literature comparing the two! Learning algorithms for SAR are slow (took 3 hours for 5000 data points)! Nest locations Distance to open water Vegetation durability Water depth

Accomplishments Formal Results Experimental results SAR - parametric statistics, provides confidence measures in model MRF from non-parametric statistics SAR : MRF-BC :: linear regression : Bayesian Classifier Rewrite SAR as y = (QX)  + Q, where Q = (I- W)-1 SAR has linear class boundaries in transformed space (QX, y) MRF-BC can represent non-linear class boundaries Experimental results MRF-BC can provide better classification accuracies than SAR But solution procedure is very slow Details in Recent paper in IEEE Transactions on Multimedia