Towards improving data presentation in the TripCheck system Rafael J. Fernández-Moctezuma

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Neural networks Introduction Fitting neural networks
Economics 20 - Prof. Anderson1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 7. Specification and Data Problems.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
8 - 1 Multivariate Linear Regression Chapter Multivariate Analysis Every program has three major elements that might affect cost: – Size » Weight,
Scatterplots, Association, and Correlation 60 min.
Uncertainty Representation. Gaussian Distribution variance Standard deviation.
Yard. Doç. Dr. Tarkan Erdik Regression analysis - Week 12 1.
x – independent variable (input)
1Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 7. Specification and Data Problems.
Decision Trees (2). Numerical attributes Tests in nodes are of the form f i > constant.
Optimal Adaptation for Statistical Classifiers Xiao Li.
Correlation A correlation exists between two variables when one of them is related to the other in some way. A scatterplot is a graph in which the paired.
Arizona State University DMML Kernel Methods – Gaussian Processes Presented by Shankar Bhargav.
October 7, 2010Neural Networks Lecture 10: Setting Backpropagation Parameters 1 Creating Data Representations On the other hand, sets of orthogonal vectors.
Dr. Mario MazzocchiResearch Methods & Data Analysis1 Correlation and regression analysis Week 8 Research Methods & Data Analysis.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
An Introduction to Support Vector Machines Martin Law.
1 Prediction of Software Reliability Using Neural Network and Fuzzy Logic Professor David Rine Seminar Notes.
Inference for regression - Simple linear regression
LINEAR REGRESSION Introduction Section 0 Lecture 1 Slide 1 Lecture 5 Slide 1 INTRODUCTION TO Modern Physics PHYX 2710 Fall 2004 Intermediate 3870 Fall.
Scatterplots, Association, and Correlation Copyright © 2010, 2007, 2004 Pearson Education, Inc.
Gaussian process modelling
David Corne, and Nick Taylor, Heriot-Watt University - These slides and related resources:
Latent (S)SVM and Cognitive Multiple People Tracker.
COMP3503 Intro to Inductive Modeling
STIFF: A Forecasting Framework for Spatio-Temporal Data Zhigang Li, Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist.
2.8Exploring Data: Quadratic Models Students will classify scatter plots. Students will use scatter plots and a graphing utility to find quadratic models.
Michigan REU Final Presentations, August 10, 2006Matt Jachowski 1 Multivariate Analysis, TMVA, and Artificial Neural Networks Matt Jachowski
Prognosis of Gear Health Using Gaussian Process Model Department of Adaptive systems, Institute of Information Theory and Automation, May 2011, Prague.
An Introduction to Support Vector Machines (M. Law)
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
BioSS reading group Adam Butler, 21 June 2006 Allen & Stott (2003) Estimating signal amplitudes in optimal fingerprinting, part I: theory. Climate dynamics,
Neural Network Implementation of Poker AI
BCS547 Neural Decoding. Population Code Tuning CurvesPattern of activity (r) Direction (deg) Activity
Unscented Kalman Filter 1. 2 Linearization via Unscented Transform EKF UKF.
Copyright © 2010 Pearson Education, Inc. Chapter 7 Scatterplots, Association, and Correlation.
Radial Basis Function ANN, an alternative to back propagation, uses clustering of examples in the training set.
Regression Analysis: Part 2 Inference Dummies / Interactions Multicollinearity / Heteroscedasticity Residual Analysis / Outliers.
fall ‘ 97Principles of MicroeconomicsSlide 1 This is a PowerPoint presentation on fundamental math tools that are useful in principles of economics. A.
CSC321: Lecture 7:Ways to prevent overfitting
Neural Networks Presented by M. Abbasi Course lecturer: Dr.Tohidkhah.
1 Statistics & R, TiP, 2011/12 Neural Networks  Technique for discrimination & regression problems  More mathematical theoretical foundation  Works.
From OLS to Generalized Regression Chong Ho Yu (I am regressing)
I n 2 0 y e a r s B y C a m e r o n My job P o k é m o n g a m e c r e a t o r. I a m g o i n g t o c o m b i n e d r a g o n i t e a n d m e g a n i.
NTU & MSRA Ming-Feng Tsai
Artificial Neural Networks (ANN). Artificial Neural Networks First proposed in 1940s as an attempt to simulate the human brain’s cognitive learning processes.
Pattern Recognition. What is Pattern Recognition? Pattern recognition is a sub-topic of machine learning. PR is the science that concerns the description.
LOAD FORECASTING. - ELECTRICAL LOAD FORECASTING IS THE ESTIMATION FOR FUTURE LOAD BY AN INDUSTRY OR UTILITY COMPANY - IT HAS MANY APPLICATIONS INCLUDING.
Neural networks (2) Reminder Avoiding overfitting Deep neural network Brief summary of supervised learning methods.
Linear Models Tony Dodd. 21 January 2008Mathematics for Data Modelling: Linear Models Overview Linear models. Parameter estimation. Linear in the parameters.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Bayes Rule Mutual Information Conditional.
32931 Technology Research Methods Autumn 2017 Quantitative Research Component Topic 4: Bivariate Analysis (Contingency Analysis and Regression Analysis)
Correlation & Regression
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Ch3: Model Building through Regression
Wave height prediction in the Apostle Islands
Unscented Kalman Filter
Data Mining Lecture 11.
Stat 112 Notes 4 Today: Review of p-values for one-sided tests
Linear Regression.
دانشگاه صنعتی امیرکبیر Instructor : Saeed Shiry
Descriptive Analysis and Presentation of Bivariate Data
Introduction to Predictive Modeling
Regression All about going back (regressing) from the points to the function from which they came. We are looking at linear regression.
Regression Forecasting and Model Building
Chapter 11: Stereopsis Stereopsis: Fusing the pictures taken by two cameras and exploiting the difference (or disparity) between them to obtain the depth.
Descriptive Statistics Univariate Data
Rgh
Regression and Correlation of Data
Presentation transcript:

Towards improving data presentation in the TripCheck system Rafael J. Fernández-Moctezuma

You’re about to leave… … and you’re addicted to TripCheck. So you go and check it out and see this:

What’s wrong with this picture?

Gray is not the new black!

Estimation may be better than no data on final products Data may not be displayed for various reasons –Sensor failure –Data quality May prefer to estimate the system state instead of displaying gray areas Not enough sensors – but may still be able to recover information. Must be careful with estimation – at least report a confidence factor.

System State Estimation Must carefully choose good sources of correlated data Every sensor station has its own estimator The PORTAL project does a great job at archiving data – this makes statistical regressors for state estimation a good alternative. May consider to rely on observed features from the past in addition to well-known transportation theory.

Regression Find a description of data in terms of a function Example: height (H) and weight (W) data transformed into a function F(H) = W.

Which functional family? May consider a linear family first… … which can easily be derived (Least squares). May also consider the expected value of a conditional Gaussian: A conditional Gaussian buys us statistics: The conditional mean is a linear regressor! Plus, estimating the joint is easy.

Which functional family? May also want to consider non-linear functions. A good first approach is an Artificial Neural Network

Experimental results Looked at rush hour (06:00 – 10:30) data from a “typical” Portland week, from US 26 E (Oct. 16 – Oct ) Found a segment that is typically shown gray (it is my commute, so I notice these things) Inputs: current measurements of speed at nearby stations Goal: come up with a good enough estimate to color the TripCheck map

Segment

Confusion matrices Milepost 73.62Milepost Linear ANN Prediction R Y G Observed R Y G Prediction R Y G Prediction R Y G Prediction R Y G Observed R Y G Observed R Y G Observed R Y G 80% 89%100%

Future work May still be able to recover system information with a nonlinear model from far away stations Still need to explore other segments and build a representative amount of model regressors (20% ?) to demonstrate effectiveness of the approach What keeps us from using this approach to estimate intermediate location states?

Future work May want to consider regressors with more inputs (shifted speeds, time, etc.) If nonlinear regressors are effective, we may want to use Gaussian Mixture Models (cheaper to train, statistically rich) Addressing quality in data presentation can be a sub-product of a more general problem: construct a framework for reliable system state estimation.