Life Under Your Feet. Sensor Network Design Philosophies Use low cost components No access to line power –Deployed in remote locations Radio is the.

Slides:



Advertisements
Similar presentations
Intel Research Internet Coordinate Systems - 03/03/2004 Internet Coordinate Systems Marcelo Pias Intel Research Cambridge
Advertisements

Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Life Under Your Feet Johns Hopkins University Computer Science Earth and Planetary Sciences Physics and Astronomy
Fault-Tolerant Target Detection in Sensor Networks Min Ding +, Dechang Chen *, Andrew Thaeler +, and Xiuzhen Cheng + + Department of Computer Science,
Jayant Gupchup Graduate student, Johns Hopkins University Representative Slides.
Advanced Metrology Lab., Texas A&M University Collaborative data reduction for sensor power efficiency Aug. 25, 2008 by Chiwoo Park.
Benchmark database based on surrogate climate records Victor Venema.
Civil and Environmental Engineering Carnegie Mellon University Sensors & Knowledge Discovery (a.k.a. Data Mining) H. Scott Matthews April 14, 2003.
Probabilistic Aggregation in Distributed Networks Ling Huang, Ben Zhao, Anthony Joseph and John Kubiatowicz {hling, ravenben, adj,
Distributed Regression: an Efficient Framework for Modeling Sensor Network Data Carlos Guestrin Peter Bodik Romain Thibaux Mark Paskin Samuel Madden.
Microsoft E-Science Data Storage Model for Environmental Monitoring Wireless Sensor Networks Jayant Gupchup †, R.
UNIVERSITY OF SOUTH CAROLINA Department of Computer Science and Engineering On-line Alert Systems for Production Plants A Conflict Based Approach.
Bioinformatics Challenge  Learning in very high dimensions with very few samples  Acute leukemia dataset: 7129 # of gene vs. 72 samples  Colon cancer.
Fitting. Choose a parametric object/some objects to represent a set of tokens Most interesting case is when criterion is not local –can’t tell whether.
INTRODUCTION TO Machine Learning ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
1 Challenges in Power Systems State Estimation Lamine Mili Virginia Tech Alexandria Research Institute.
FMRI: Biological Basis and Experiment Design Lecture 26: Significance Review of GLM results Baseline trends Block designs; Fourier analysis (correlation)
Privacy Preservation for Data Streams Feifei Li, Boston University Joint work with: Jimeng Sun (CMU), Spiros Papadimitriou, George A. Mihaila and Ioana.
1 NOAA’s National Climatic Data Center April 2005 Climate Observation Program Blended SST Analysis Changes and Implications for the Buoy Network 1.Plans.
Benchmark database inhomogeneous data, surrogate data and synthetic data Victor Venema.
1 Activity and Motion Detection in Videos Longin Jan Latecki and Roland Miezianko, Temple University Dragoljub Pokrajac, Delaware State University Dover,
Data Selection In Ad-Hoc Wireless Sensor Networks Olawoye Oyeyele 11/24/2003.
Data Mining Lecture 2: data.
Prognostics of Aircraft Bleed Valves Using a SVM Classification Algorithm Renato de Pádua Moreira Cairo L. Nascimento Jr. Instituto Tecnológico de Aeronáutica.
Large Two-way Arrays Douglas M. Hawkins School of Statistics University of Minnesota
IN23A-1072: Life Under Your Feet: A Wireless Soil Ecology Sensor Network K. Szlavecz 1, A. Terzis 1, R. Musaloiu 1, A. Szalay 1, J. Gupchup 1, C.-J. Liang.
The Effects of Ranging Noise on Multihop Localization: An Empirical Study from UC Berkeley Abon.
Data Mining & Knowledge Discovery Lecture: 2 Dr. Mohammad Abu Yousuf IIT, JU.
Distortion Correction ECE 6276 Project Review Team 5: Basit Memon Foti Kacani Jason Haedt Jin Joo Lee Peter Karasev.
1 Robust Statistical Methods for Securing Wireless Localization in Sensor Networks (IPSN ’05) Zang Li, Wade Trappe Yanyong Zhang, Badri Nath Rutgers University.
Gap filling of eddy fluxes with artificial neural networks
Microsoft E-Science Data Storage Model for Environmental Monitoring Wireless Sensor Networks Jayant Gupchup †, R.
Digital Media Lab 1 Data Mining Applied To Fault Detection Shinho Jeong Jaewon Shim Hyunsoo Lee {cinooco, poohut,
RIDA: A Robust Information-Driven Data Compression Architecture for Irregular Wireless Sensor Networks Nirupama Bulusu (joint work with Thanh Dang, Wu-chi.
Spatio-Temporal Surface Vector Wind Retrieval Error Models Ralph F. Milliff NWRA/CoRA Lucrezia Ricciardulli Remote Sensing Systems Deborah K. Smith Remote.
Jeff Howbert Introduction to Machine Learning Winter Machine Learning Data.
What is Data? Attributes
Gap-filling and Fault-detection for the life under your feet dataset.
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Data Mining: Data Lecture Notes for Chapter 2 Introduction to Data Mining by Tan, Steinbach,
Automated QA/QC Technique for Climate Sensor Data EPSCoR Hawaii HGDR Scientific Data Management Portal Development Team.
Microsoft Research Faculty Summit Liqian Luo Networked Embedded Computing Microsoft Research.
Jayant Gupchup Phoenix, EWSN 2010 Phoenix: An Epidemic Approach to Time Reconstruction Jayant Gupchup †, Douglas Carlson †, Răzvan Musăloiu-E. †,*, Alex.
Leverage the data characteristics of applications and computing to reduce the communication cost in WSNs. Design advanced algorithms and mechanisms to.
BlinkDB: Queries with Bounded Errors and Bounded Response Times on Very Large Data ACM EuroSys 2013 (Best Paper Award)
An Investigation of Xen and PTLsim for Exploring Latency Constraints of Co-Processing Units Grant Jenks UCLA.
Analyzing wireless sensor network data under suppression and failure in transmission Alan E. Gelfand Institute of Statistics and Decision Sciences Duke.
Principal Component Analysis (PCA)
Model Based Event Detection in Sensor Networks Jayant Gupchup, Andreas Terzis, Randal Burns, Alex Szalay.
Data-Driven Processing in Sensor Networks Adam Silberstein, Rebecca Braynard, Gregory Filpus, Gavino Puggioni, Alan Gelfand, Kamesh Munagala, Jun Yang.
Locations. Soil Temperature Dataset Observations Data is – Correlated in time and space – Evolving over time (seasons) – Gappy (Due to failures) – Faulty.
3/13/2016Data Mining 1 Lecture 1-2 Data and Data Preparation Phayung Meesad, Ph.D. King Mongkut’s University of Technology North Bangkok (KMUTNB) Bangkok.
ArrayTrack : A Fine-Grained Indoor Location System Jie Xiong, Kyle Jamieson USENIX NSDI ‘ Jungmin Yoo *some slides.
Chapter 8: Maintenance and Software Evolution Ronald J. Leach Copyright Ronald J. Leach, 1997, 2009, 2014,
Martina Uray Heinz Mayer Joanneum Research Graz Institute of Digital Image Processing Horst Bischof Graz University of Technology Institute for Computer.
How Dirty is your Data : The Duality between detecting Events and Faults J. Gupchup A. Terzis R. Burns A. Szalay Department of Computer Science Johns Hopkins.
1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.
Automatically Collect Ground Control Points from Online Aerial Maps
ASEN 5070: Statistical Orbit Determination I Fall 2014
Lecture Notes for Chapter 2 Introduction to Data Mining
Line Fitting James Hayes.
Multiple Linear Regression
Privacy and Fault-Tolerance in Distributed Optimization Nitin Vaidya University of Illinois at Urbana-Champaign.
Fundamentals Data.
A New Approach to Track Multiple Vehicles With the Combination of Robust Detection and Two Classifiers Weidong Min , Mengdan Fan, Xiaoguang Guo, and Qing.
Satellite data Marco Puts
Feature Selection Methods
Palanivel Kodeswaran, Ravi Kokku, Sayandeep Sen, Mudhakar Srivatsa
INTRODUCTION TO Machine Learning
Data Pre-processing Lecture Notes for Chapter 2
Overview Motivation Objectives Case study Technical approach
Presentation transcript:

Life Under Your Feet

Sensor Network Design Philosophies Use low cost components No access to line power –Deployed in remote locations Radio is the biggest consumer of power –Minimize radio communication Transfer data in bulk (amortize radio costs) –Data Latency vs. Lifetime tradeoff

Deployments

All Data Online!

Cub hill locations (50)

Soil Temperature Dataset

Data Features –Correlated in time and space –Evolving over time (seasons) –Gappy (Due to hardware failures) –Faulty (Noise, Jumps in values)

Goals Leverage spatiotemporal correlations: Model-based fault detection Increase network lifetime –Tradeoff latency for reduced communication

Examples of Faults

Motivation Environmental Scientists want fault-free data and want to see visualizations They start looking at the actual data much later –Opportunity to postpone collecting all data ~ 5% of data is faulty

Basic Observations Hardware failures causes entire “days” of data to be missing –Temporal correlation alone is not enough If we look in the spatial domain, –For a given time, at least > 50% of the sensors are active

Initialization Good Period

Tamas’ Iterative and Robust gap-filling (MATLAB)

Sketch Estimate fixed common effect (daily median) Subtract location-specific deviation from common effect Model residuals using PCA –Basis defined in the spatial domain –Initialized using good data –Vectors are randomized and fed one by one –Gaps corrected using L2 minimization –Use reprojection error to remove outliers

Smooth / Original

Original and Ratio

Moving forward Better way to model these correlations? How do we pick a subset of locations that are informative: –2 phase operation –Periodically download from everyone –In between, use correlations to produce maps and movies –Extend network lifetime

Soil Moisture and Ratio

Moisture vs Ratio (zoom in)