Adaptive Cleaning for RFID Data Streams. RFID: Radio Frequency IDentification.

Slides:



Advertisements
Similar presentations
Sampling: Final and Initial Sample Size Determination
Advertisements

Outline input analysis input analyzer of ARENA parameter estimation
Presented for: CPS Lab-ASU By: Ramtin Kermani
Statistical model for count data Speaker : Tzu-Chun Lo Advisor : Yao-Ting Huang.
Adaptive Cleaning for RFID Data Streams Shawn Jeffery Minos Garofalakis Michael Franklin UC Berkeley Intel Research Berkeley UC Berkeley Presented by:
Probabilistic Aggregation in Distributed Networks Ling Huang, Ben Zhao, Anthony Joseph and John Kubiatowicz {hling, ravenben, adj,
Change Detection in Dynamic Environments Mark Steyvers Scott Brown UC Irvine This work is supported by a grant from the US Air Force Office of Scientific.
LARGE SAMPLE TESTS ON PROPORTIONS
CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Statistical inference (Sec. )
CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Statistical inference.
Declarative Support for Sensor Data Cleaning Shawn Jeffery Gustavo Alonso Michael Franklin Wei Hong Jennifer Widom UC Berkeley ETH Zurich UC Berkeley Arch.
The Enforcer Laura Celentano Glenn Ramsey Michael Szalkowski.
Abstractions for Shared Sensor Networks DMSN September 2006 Michael J. Franklin.
Probabilistic Databases Amol Deshpande, University of Maryland.
Statistical Inference Lab Three. Bernoulli to Normal Through Binomial One flip Fair coin Heads Tails Random Variable: k, # of heads p=0.5 1-p=0.5 For.
Adaptive Cleaning for RFID Data Streams VLDB /12/06 Shawn Jeffery Minos Garofalakis Michael Franklin UC Berkeley Intel Research Berkeley UC Berkeley.
1 (Student’s) T Distribution. 2 Z vs. T Many applications involve making conclusions about an unknown mean . Because a second unknown, , is present,
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
On the Accuracy of Modal Parameters Identified from Exponentially Windowed, Noise Contaminated Impulse Responses for a System with a Large Range of Decay.
ETM 607 – Random Number and Random Variates
Body size distribution of European Collembola Lecture 9 Moments of distributions.
1 A Local and Remote Radio Frequency Identification Learning Environment Andrew Shields & David Butcher Wireless and Mobility Research Group, Institute.
Laws of Logic and Rules of Evidence Larry Knop Hamilton College.
Radio Frequency Identification (RFID) Be Safe Security Solutions.
Statistics for Engineer Week II and Week III: Random Variables and Probability Distribution.
40S Applied Math Mr. Knight – Killarney School Slide 1 Unit: Statistics Lesson: ST-5 The Binomial Distribution The Binomial Distribution Learning Outcome.
Binomial Distributions Calculating the Probability of Success.
1 Bernoulli trial and binomial distribution Bernoulli trialBinomial distribution x (# H) 01 P(x)P(x)P(x)P(x)(1 – p)p ?
Physical Layer Informed Adaptive Video Streaming Over LTE Xiufeng Xie, Xinyu Zhang Unviersity of Winscosin-Madison Swarun KumarLi Erran Li MIT Bell Labs.
Mapping and Localization with RFID Technology Matthai Philipose, Kenneth P Fishkin, Dieter Fox, Dirk Hahnel, Wolfram Burgard Presenter: Aniket Shah.
Slide 1 © 2002 McGraw-Hill Australia, PPTs t/a Introductory Mathematics & Statistics for Business 4e by John S. Croucher 1 n Learning Objectives –Identify.
Testing Models on Simulated Data Presented at the Casualty Loss Reserve Seminar September 19, 2008 Glenn Meyers, FCAS, PhD ISO Innovative Analytics.
Chapter 10 – Sampling Distributions Math 22 Introductory Statistics.
Bernoulli Trials Two Possible Outcomes –Success, with probability p –Failure, with probability q = 1  p Trials are independent.
Math 22 Introductory Statistics Chapter 8 - The Binomial Probability Distribution.
HY436: Mobile Computing and Wireless Networks Data sanitization Tutorial: November 7, 2005 Elias Raftopoulos Ploumidis Manolis Prof. Maria Papadopouli.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.
Wobbles, humps and sudden jumps1 Transitions in time: what to look for and how to describe them …
40S Applied Math Mr. Knight – Killarney School Slide 1 Unit: Statistics Lesson: ST-5 The Binomial Distribution The Binomial Distribution Learning Outcome.
Statistics 3502/6304 Prof. Eric A. Suess Chapter 4.
Statistics Chapter 6 / 7 Review. Random Variables and Their Probability Distributions Discrete random variables – can take on only a countable or finite.
Introduction to RFID Technology By: Vahid Bagher Poor 1.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Yanlei Diao, University of Massachusetts Amherst Capturing Data Uncertainty in High- Volume Stream Processing Yanlei Diao, Boduo Li, Anna Liu, Liping Peng,
1 Math 4030 – 10b Inferences Concerning Proportions.
Chapter 3 Statistical Models or Quality Control Improvement.
Hypothesis Testing for the Mean:
From the population to the sample The sampling distribution FETP India.
LIGO-G Z r statistics for time-domain cross correlation on burst candidate events Laura Cadonati LIGO-MIT LSC collaboration meeting, LLO march.
The Normal Distribution AS Mathematics Statistics 1 Module.
3.1 Statistical Distributions. Random Variable Observation = Variable Outcome = Random Variable Examples: – Weight/Size of animals – Animal surveys: detection.
Bias Management in Time Changing Data Streams We assume data is generated randomly according to a stationary distribution. Data comes in the form of streams.
Differential Methylation Analysis
Lecture Slides Essentials of Statistics 5th Edition
Discrete Probability Distributions
Effective Data-Race Detection for the Kernel
Chapter Six Normal Curves and Sampling Probability Distributions
Baselining PMU Data to Find Patterns and Anomalies
Adaptive Cleaning for RFID Data Streams
Real-Time RFID Localization Using RSS
Section 4 Testing Two Variances.
GATES: A Grid-Based Middleware for Processing Distributed Data Streams
Essential Statistics Two-Sample Problems - Two-sample t procedures -
If the question asks: “Find the probability if...”
Probability.
Learning from Data Streams
Statistical Models or Quality Control Improvement
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Presentation transcript:

Adaptive Cleaning for RFID Data Streams

RFID: Radio Frequency IDentification

RFID data is dirty A simple experiment: 2 RFID-enabled shelves 10 static tags 5 mobile tags

RFID Data Cleaning Time Raw readings Smoothed output RFID data has many dropped readings Typically, use a smoothing filter to interpolate SELECT distinct tag_id FROM RFID_stream [RANGE ‘5 sec’] GROUP BY tag_id SELECT distinct tag_id FROM RFID_stream [RANGE ‘5 sec’] GROUP BY tag_id Smoothing Filter

Smoothing filter Middleware Clean RFID Completeness Tag dynamics Read all tags in range

RFID Data Cleaning Time Raw readings Smoothed output RFID data has many dropped readings Typically, use a smoothing filter to interpolate SELECT distinct tag_id FROM RFID_stream [RANGE ‘5 sec’] GROUP BY tag_id SELECT distinct tag_id FROM RFID_stream [RANGE ‘5 sec’] GROUP BY tag_id But, how to set the size of the window? But, how to set the size of the window? Smoothing Filter

Window Size for RFID Smoothing Fido movingFido resting Small window Reality Raw readings Large window  Need to balance completeness vs. capturing tag movement

Truly Declarative Smoothing Problem: window size non-declarative  Application wants a clean stream of data  Window size is how to get it Solution: adapt the window size in response to data

RFID EpochTagIDReadRate Tag 1 Tag 2 Tag 3 Tag 4 Antenna & reader Tags E1E2E3E4E5E6E7E8E9E0 Read Cycle (Epoch) (For Alien readers) Tag List 1. Interrogation cycle 2. Epoch

Controlled condition real condition

SMURF Statistical Smoothing for Unreliable RFID Data Adapts window based on statistical properties Mechanisms for: Per-tag and multi-tag cleaning

Per-Tag Smoothing: Model and Background Epoch t, Tag population N t p i,t : Per epoch sampling prob. Response count of tag i per epoch (total interrogation cycle) EpochTagIDReadRate

Smoothing window size w i epoch Per epoch sampling prob: p i Number of successful observations of tag i  Binominal distribution B(wi,pi) Per-Tag Smoothing: Model and Background

Use a binomial sampling model Time (epochs) pipi 1 0 Smoothing Window w i Bernoulli trials p i avg SiSi (Read rate of tag i) E1E2E3E4E5E6E7E8E9E0 Set of epochs where tag i can be seen

We want to ensure that there are enough epochs in Wi such that tag i is observed (if it exists within the reader’s range)  Completeness Per-Tag Smoothing: Completeness

If the tag is there, read it with high probability  Want a large window pipi 1 0 Reading with a low p i Expand the window Time (epochs) E1E2E3E4E5E6E7E8E9E0

Per-Tag Smoothing: Completeness

Expected epochs needed to read With probability 1-  Desired window size for tag i

Per-Tag Smoothing: Transitions Detect transitions as statistically significant changes in the data pipi 1 0 Statistically significant difference Flag a transition and shrink the window The tag has likely left by this point Time (epochs) E1E2E3E4E5E6E7E8E9E0

Significant difference between mean observed sample size Si and expected size Find outlier (2  ) Number of successful epochs in a window SiSi Mean

Per-Tag Smoothing: Transitions # expected readings Is the difference “statistically significant”? # observed readings Statistically significantStatistically significant

Algorithm

SMURF in Action Fido movingFido resting SMURF  Experiments with real and simulated data show similar results

Normal sliding windowCompleteness Transition