Modeling spatially-correlated sensor network data Apoorva Jindal, Konstantinos Psounis Department of Electrical Engineering-Systems University of Southern.

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Spatial point patterns and Geostatistics an introduction
Bayesian Belief Propagation
Empirical Analysis and Statistical Modeling of Errors in Satellite Precipitation Sensors Yudong Tian, Ling Tang, Robert Adler, and Xin Lin University of.
Exploiting Sparse Markov and Covariance Structure in Multiresolution Models Presenter: Zhe Chen ECE / CMR Tennessee Technological University October 22,
Objectives (BPS chapter 24)
STAT 497 APPLIED TIME SERIES ANALYSIS
A Generic Framework for Handling Uncertain Data with Local Correlations Xiang Lian and Lei Chen Department of Computer Science and Engineering The Hong.
Non-Linear Statistical Static Timing Analysis for Non-Gaussian Variation Sources Lerong Cheng 1, Jinjun Xiong 2, and Prof. Lei He 1 1 EE Department, UCLA.
Deterministic Solutions Geostatistical Solutions
Statistical Inference Chapter 12/13. COMP 5340/6340 Statistical Inference2 Statistical Inference Given a sample of observations from a population, the.
1 Unsupervised Learning With Non-ignorable Missing Data Machine Learning Group Talk University of Toronto Monday Oct 4, 2004 Ben Marlin Sam Roweis Rich.
1 A Dynamic Clustering and Scheduling Approach to Energy Saving in Data Collection from Wireless Sensor Networks Chong Liu, Kui Wu and Jian Pei Computer.
The Impact of Spatial Correlation on Routing with Compression in WSN Sundeep Pattem, Bhaskar Krishnamachri, Ramesh Govindan University of Southern California.
Biostatistics Frank H. Osborne, Ph. D. Professor.
Probability Grid: A Location Estimation Scheme for Wireless Sensor Networks Presented by cychen Date : 3/7 In Secon (Sensor and Ad Hoc Communications and.
Department of Computer Engineering Koc University, Istanbul, Turkey
Experimental Evaluation
Computer Science Characterizing and Exploiting Reference Locality in Data Stream Applications Feifei Li, Ching Chang, George Kollios, Azer Bestavros Computer.
Choosing an Accurate Network Model using Domain Analysis Almudena Konrad, Mills College Ben Y. Zhao, UC Santa Barbara Anthony Joseph, UC Berkeley The First.
Chapter 1: Introduction to Statistics. Learning Outcomes Know key statistical terms 1 Know key measurement terms 2 Know key research terms 3 Know the.
Testing Hypotheses.
Detecting Distance-Based Outliers in Streams of Data Fabrizio Angiulli and Fabio Fassetti DEIS, Universit `a della Calabria CIKM 07.
Data Selection In Ad-Hoc Wireless Sensor Networks Olawoye Oyeyele 11/24/2003.
Chapter 8 Introduction to Hypothesis Testing
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Analyzing Reliability and Validity in Outcomes Assessment (Part 1) Robert W. Lingard and Deborah K. van Alphen California State University, Northridge.
 1  Outline  stages and topics in simulation  generation of random variates.
Understanding Statistics
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
The Effects of Ranging Noise on Multihop Localization: An Empirical Study from UC Berkeley Abon.
1 A Bayesian Method for Guessing the Extreme Values in a Data Set Mingxi Wu, Chris Jermaine University of Florida September 2007.
Extraction of Fetal Electrocardiogram Using Adaptive Neuro-Fuzzy Inference Systems Khaled Assaleh, Senior Member,IEEE M97G0224 黃阡.
1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to.
The Examination of Residuals. Examination of Residuals The fitting of models to data is done using an iterative approach. The first step is to fit a simple.
Geographic Information Science
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
A comparison of the ability of artificial neural network and polynomial fitting was carried out in order to model the horizontal deformation field. It.
Doc.: IEEE /1011r0 Submission September 2009 Alexander Maltsev, IntelSlide 1 Verification of Polarization Impact Model by Experimental Data Date:
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Experimental Design Experimental Designs An Overview.
Spatial Analysis & Geostatistics Methods of Interpolation Linear interpolation using an equation to compute z at any point on a triangle.
A Passive Approach to Sensor Network Localization Rahul Biswas and Sebastian Thrun International Conference on Intelligent Robots and Systems 2004 Presented.
Geo479/579: Geostatistics Ch4. Spatial Description.
Review of Random Process Theory CWR 6536 Stochastic Subsurface Hydrology.
Problems with the Durbin-Watson test
Data Modeling Patrice Koehl Department of Biological Sciences National University of Singapore
Information Geometry and Model Reduction Sorin Mitran 1 1 Department of Mathematics, University of North Carolina, Chapel Hill, NC, USA Reconstruction.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
Statistics What is the probability that 7 heads will be observed in 10 tosses of a fair coin? This is a ________ problem. Have probabilities on a fundamental.
Evaluation of gene-expression clustering via mutual information distance measure Ido Priness, Oded Maimon and Irad Ben-Gal BMC Bioinformatics, 2007.
Lab for Remote Sensing Hydrology and Spatial Modeling Dept of Bioenvironmental Systems Engineering National Taiwan University 1/45 GEOSTATISTICS INTRODUCTION.
Statistics Module Statistics Statistics are a powerful tool for finding patterns in data and inferring important connections between events in.
Stochastic Hydrology Random Field Simulation Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
Geostatistics GLY 560: GIS for Earth Scientists. 2/22/2016UB Geology GLY560: GIS Introduction Premise: One cannot obtain error-free estimates of unknowns.
EASE: An Energy-Efficient In-Network Storage Scheme for Object Tracking in Sensor Networks Jianliang Xu Department of Computer Science Hong Kong Baptist.
Intro to Research Methods
Statistical Models for Automatic Speech Recognition
12 Inferential Analysis.
Statistics in Applied Science and Technology
Simple Linear Regression - Introduction
Stochastic Hydrology Random Field Simulation
7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to record.
Feifei Li, Ching Chang, George Kollios, Azer Bestavros
12 Inferential Analysis.
Analyzing Reliability and Validity in Outcomes Assessment
7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to record.
7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to record.
Basic descriptions of physical data
Presentation transcript:

Modeling spatially-correlated sensor network data Apoorva Jindal, Konstantinos Psounis Department of Electrical Engineering-Systems University of Southern California SECON 2004

Outline Introduction Statistical analysis of experimental data The model Model verification and validation Tools to generate large synthetic traces Conclusion

Introduction The sensors in sensor networks will be densely deployed and detect common phenomena. It is expected that a high degree of spatial correlation will exist in the sensor networks data.

Introduction However since very few real systems have been deployed, there is hardly any experimental data available to test the proposed algorithms. No effort has been made to propose a model which captures the spatial correlation in sensor networks.

Introduction We propose a mathematical model to capture the spatial correlation in sensor network data. We present a method to generate large synthetic traces from a small experimental trace while preserving the correlation pattern, and a method to generate synthetic traces exhibiting arbitrary correlation patterns

Statistical analysis of experimental data A. data set description (1)S-Pol Radar Data Set The resampled S-Pol radar data, provided by NCAR, records the intensity of reflectivity of atmosphere in dBZ. (2)Precipitation Data Set This data set consists of the daily rainfall precipitation for the Pacific Northwest from

Statistical analysis of experimental data b. Statistic used to Measure Correlation in Data Given a two dimensional stationary process X(x,y), the autocorrelation function is defined as Another statistic often to characterize spatial correlation in data is the variogram defined as

Statistical analysis of experimental data b. Statistic used to Measure Correlation in Data For isotropic random process, the variogram depends only on the distance d=d 1 +d 2 between two nodes. For a set of samples x(x i,y j ), i=1,2,…,γ(d) can be estimated as follows

Statistical analysis of experimental data c. analysis of data using Variograms

The model

The parameters of the model are h, the α i ’s, β, f Y (y), f Z (z). For mathematical convenience, we define the three random variable :

We can find the probability density function f X (x) as follows :

In stationarity, have the same distribution. Using the above and equation(6) the characteristic function of f X (x) can be written as: Characteristic function

Without loss of generality, we will assume that Z is a normal random variable with (0,σ Z )

Since X and Z are independent the characteristic function of f A can be written as Hence, Equation(7) reduce to

For mathematical convenience, we define a new random variable L having characteristic function given by

The model A. Parameters of the Model an Correlation

The model B. Inferring Model Parameters Infer f X (x) from its empirical distribution. Inferring σ z,α i ’s and β is more involved. Using Equation(3) leads to the following:

The model B. Inferring Model Parameters Equating for 1 ≦ i ≦ h+1 gives h+1 equations. These equations along with the equation form a system of h+2 equations. After solving the above system, we can obtain σ z,α i ’s,β and f Y (y) through Determining h. To start from an overestimated h and lower its value until all the α i ’s are positive.

Model verification and validation A. verification (1)S-Pol Radar data set

Model verification and validation A. verification

(2)Precipitation data set: The data inferred for the trace are h =1, α 1 =0.72, β=0.28 and σ Z =2.61

Model verification and validation A. verification

Model verification and validation B. Model Validation DIMENSIONS[6] proposes wavelet based multi- resolution summarization and drill down querying. Spatial Correlation based Collaborative Medium Access Control (CMAC) [10] The evaluation metric used is the query error which is defined as

Model verification and validation B. Model Validation

Tools to generate large synthetic traces The tools are freely available at generateLargeTraceFromSmall generateSyntheticTraces

Conclusion We have proposed a model to capture the spatial correlation, which can generate synthetic traces. We also described a mathematical procedure to extract the parameters of the model from a real data set. We verified and validated the model. Final, we have created two freely available tools to enable researchers to generate data.