DEALING WITH UNCERTAINTY From the GEOPRIV motivational series.

Slides:



Advertisements
Similar presentations
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Advertisements

Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Theoretical Probability Distributions We have talked about the idea of frequency distributions as a way to see what is happening with our data. We have.
Copyright © 2010 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
Errors & Uncertainties Confidence Interval. Random – Statistical Error From:
Chapter 19 Confidence Intervals for Proportions.
FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions.
Chapter 19: Confidence Intervals for Proportions
1 Analysis of Variance This technique is designed to test the null hypothesis that three or more group means are equal.
Point estimation, interval estimation
Statistics: Data Analysis and Presentation Fr Clinic II.
User Interface Testing. Hall of Fame or Hall of Shame?  java.sun.com.
Sample Size Determination
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Introduction to Statistics: Chapter 8 Estimation.
Chapter Topics Confidence Interval Estimation for the Mean (s Known)
1 The Assumptions. 2 Fundamental Concepts of Statistics Measurement - any result from any procedure that assigns a value to an observable phenomenon.
Tracking with Linear Dynamic Models. Introduction Tracking is the problem of generating an inference about the motion of an object given a sequence of.
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Non-parametric statistics
Bootstrapping applied to t-tests
Hypothesis Testing. Distribution of Estimator To see the impact of the sample on estimates, try different samples Plot histogram of answers –Is it “normal”
Confidence Intervals and Hypothesis Testing
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Confidence Interval Estimation
ANOVA Greg C Elvers.
+ DO NOW What conditions do you need to check before constructing a confidence interval for the population proportion? (hint: there are three)
INFERENCE ABOUT MEANS Chapter 23. CLT!! If our data come from a simple random sample (SRS) and the sample size is sufficiently large, then we know the.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 9 Section 1 – Slide 1 of 39 Chapter 9 Section 1 The Logic in Constructing Confidence Intervals.
Ch 8 Estimating with Confidence. Today’s Objectives ✓ I can interpret a confidence level. ✓ I can interpret a confidence interval in context. ✓ I can.
Chap 8-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 8 Confidence Interval Estimation Business Statistics: A First Course.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
Inference for distributions: - Comparing two means IPS chapter 7.2 © 2006 W.H. Freeman and Company.
CHAPTER 18: Inference about a Population Mean
Department of Cognitive Science Michael J. Kalsher PSYC 4310 COGS 6310 MGMT 6969 © 2015, Michael Kalsher Unit 1B: Everything you wanted to know about basic.
Ch 8 Estimating with Confidence. Today’s Objectives ✓ I can interpret a confidence level. ✓ I can interpret a confidence interval in context. ✓ I can.
1 Statistical Distribution Fitting Dr. Jason Merrick.
Standard Error and Confidence Intervals Martin Bland Professor of Health Statistics University of York
Slide 1 © 2002 McGraw-Hill Australia, PPTs t/a Introductory Mathematics & Statistics for Business 4e by John S. Croucher 1 n Learning Objectives –Identify.
Section 8.3 Estimating a Population Mean. Section 8.3 Estimating a Population Mean After this section, you should be able to… CONSTRUCT and INTERPRET.
Copyright © 2009 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
1 Chapter 18 Sampling Distribution Models. 2 Suppose we had a barrel of jelly beans … this barrel has 75% red jelly beans and 25% blue jelly beans.
Hypotheses tests for means
Confidence Intervals for Proportions Chapter 8, Section 3 Statistical Methods II QM 3620.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
Confidence intervals and hypothesis testing Petter Mostad
Copyright © 2012 Pearson Education. All rights reserved © 2010 Pearson Education Copyright © 2012 Pearson Education. All rights reserved. Chapter.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 18 Sampling Distribution Models.
Chapter 5 Parameter estimation. What is sample inference? Distinguish between managerial & financial accounting. Understand how managers can use accounting.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Measures of variability: understanding the complexity of natural phenomena.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Review of Statistics.  Estimation of the Population Mean  Hypothesis Testing  Confidence Intervals  Comparing Means from Different Populations  Scatterplots.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. PPSS The situation in a statistical problem is that there is a population of interest, and a quantity or.
Chapter 13 Sampling distributions
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
Lecture 8: Measurement Errors 1. Objectives List some sources of measurement errors. Classify measurement errors into systematic and random errors. Study.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics: A First Course 5 th Edition.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models.
10.1 Estimating with Confidence Chapter 10 Introduction to Inference.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Sampling and Sampling Distributions. Sampling Distribution Basics Sample statistics (the mean and standard deviation are examples) vary from sample to.
Sampling Distribution Models
Market-Risk Measurement
What’s Reasonable About a Range?
Modelling Dr Andy Evans In this lecture we'll look at modelling.
Estimating with Confidence
CHAPTER 18: Inference about a Population Mean
Professor John Canny Fall 2004
Presentation transcript:

DEALING WITH UNCERTAINTY From the GEOPRIV motivational series

Drafts ◦ draft-thomson-geopriv-uncertainty-08 ◦ What uncertainty (and confidence) mean and are good for ◦ A bunch of shortcuts for dealing with uncertainty ◦ Intended status: Informational ◦ draft-thomson-geopriv-confidence-04 ◦ A small addition to PIDF-LO ◦ Intended status: Proposed Standard

UNCERTAINTY draft-thomson-geopriv-uncertainty-08

Statistics Refresher: Confidence Intervals ◦ It’s common to describe measurements of stochastic processes (i.e., random $#!^) as confidence intervals ◦ Graphs with the following are very common in scientific literature: The average/median is here 95% of the time, it’s between here…and here.

Terminology is surprisingly important ◦ Accuracy = ◦ fuzzy, feel good term ◦ qualitative, no numbers, use it when talking in the abstract ◦ Uncertainty = ◦ quantitative, concrete, supported with numbers ◦ useless without confidence ◦ Confidence = ◦ probabilistic measure for uncertainty ◦ quantitative, concrete, has numbers [0, 1) or [0, 100%) ◦ Combine uncertainty and confidence: ◦ 95% of the time (confidence), the value is between X and Y (uncertainty range)

Lies, damned lies, and… ◦ The error bars hide a lot of details ◦ The observed probability distribution is rarely perfectly normal ◦ Outliers can be irrelevant, or interesting, but they disappear ◦ Other interesting points like mean, median, variance, all go ◦ But that’s OK, because it’s hard to process more detailed information

RFC 5491 defines error bars in 3D (and 2D)

Still masking greater complexity Ellipse/ellipsoid are pretty good for capturing the product of least squares or Kalman filters Particle filters are much harder to capture …draft-hoene-geopriv-bli

Mo’ data, mo’ troubles ◦ Even the simplified information can be too much ◦ There are a bunch of things you can’t do safely/easily ◦ Most amount to the fact that you can’t invent information you don’t have ◦ E.g., can’t scale uncertainty without information loss ◦ Some applications require very little information ◦ A point ◦ Maybe a circle/sphere radius (so they can report “accuracy”) ◦ Is this location estimate “the same” as this other one ◦ So how do we get there? ◦ draft-thomson-geopriv-uncertainty contains a bunch of cheats

Cheats ◦ Convert to point: ◦ A simple method for calculating centroids of all the RFC 5491 shapes ◦ Not so easy for polygons, but a robust approximation method provided ◦ Get single number for uncertainty:; ◦ Two cheats: convert to circle ◦ Use point calculation and find furthest point ◦ Scale uncertainty based on probability distribution assumptions

Scaling ◦ Not always a good idea ◦ Relies on assumptions ◦ Big mistakes possible ◦ Scaling down is risky, scaling up is basically impossible ◦ Unless you have some extra information. Reported (95%) Scaled down (assumed to be ~68%, but closer to 5%) PDF

CONFIDENCE draft-thomson-geopriv-confidence-04

PIDF-LO assumes 95% confidence ◦ …and doesn’t allow for divergence from this number ◦ That’s a problem for implementations that are required to convert ◦ Many existing systems produce estimates at other values ◦ Conversion without sufficient knowledge requires assumptions ◦ Assumptions cause data loss and errors

Impossibilities ◦ Sometimes 95% is unattainable ◦ A >5% absolute error rate can happen ◦ Maybe you are operating from a source that it just that bad ◦ No alteration of the uncertainty value (other than to have it encompass the entire planet) can compensate for the errors ◦ e.g., Location determination based on a data set that is completely, irretrievably wrong 13.4% of the time ◦ Confidence cannot be 86.6% or higher

Scaling hint ◦ Help with scaling by having an optional hint on PDF shape ◦ Normal – scale up or down safely ◦ Rectangular – scale down safely ◦ Unknown – scale at own risk

Backwards compatibility ◦ None ◦ Intentionally – confidence changes everything ◦ …but it does more damage when you solve for backward compatibility ◦ Scaling = bad ◦ Alternative is no location information at all ◦ Only real solution is to admonish not to use confidence unless you are reasonably sure that the recipient will understand

THERE IS HOPE Adopt