Richard W. Hamming Learning to Learn The Art of Doing Science and Engineering Session 27: Unreliable Data Learning to Learn The Art of Doing Science and.

Slides:



Advertisements
Similar presentations
DATA & STATISTICS 101 Presented by Stu Nagourney NJDEP, OQA.
Advertisements

CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Estimation in Sampling
Chapter 8 Interval Estimation Population Mean:  Known Population Mean:  Known Population Mean:  Unknown Population Mean:  Unknown n Determining the.
Copyright © 2006 Pearson Addison-Wesley. All rights reserved. Lecture 4: Mathematical Tools for Econometrics Statistical Appendix (Chapter 3.1–3.2)
Statistics Interval Estimation.
Regression Analysis. Unscheduled Maintenance Issue: l 36 flight squadrons l Each experiences unscheduled maintenance actions (UMAs) l UMAs costs $1000.
1 BA 275 Quantitative Business Methods Statistical Inference: Confidence Interval Estimation Estimating the population mean  Margin of Error Sample Size.
Beginning the Research Design
Sample Size Determination
PSY 330 Lect 2 1 What is Science? Science – set of practices and methods to obtain ____________. Its primary aim is to find order and lawful relations.
Tracking using the Kalman Filter. Point Tracking Estimate the location of a given point along a sequence of images. (x 0,y 0 ) (x n,y n )
Probability & Statistics for Engineers & Scientists, by Walpole, Myers, Myers & Ye ~ Chapter 10 Notes Class notes for ISE 201 San Jose State University.
The Experimental Approach September 15, 2009Introduction to Cognitive Science Lecture 3: The Experimental Approach.
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
A new sampling method: stratified sampling
Sample Size Determination
Probability and Statistics in Engineering Philip Bedient, Ph.D.
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
Measurement & Experimental Design. Types of Variables Dependent variables – those variables we expect to change as the result of an experimental manipulation.
1 Product Reliability Chris Nabavi BSc SMIEEE © 2006 PCE Systems Ltd.
1. Homework #2 2. Inferential Statistics 3. Review for Exam.
Chapter 15 Forecasting Copyright © 2011 Pearson Addison-Wesley. All rights reserved. Slides by Niels-Hugo Blunch Washington and Lee University.
Chapter 24: Comparing Means.
Hypothesis Testing in Linear Regression Analysis
1 Sampling Distributions Presentation 2 Sampling Distribution of sample proportions Sampling Distribution of sample means.
Respected Professor Kihyeon Cho
Sampling : Error and bias. Sampling definitions  Sampling universe  Sampling frame  Sampling unit  Basic sampling unit or elementary unit  Sampling.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 24 Statistical Inference: Conclusion.
Determining Sample Size
Chapter 8 - Interval Estimation
Gaussian process modelling
Sociology 5811: Lecture 7: Samples, Populations, The Sampling Distribution Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
10.2 Estimating a Population Mean when σ is Unknown When we don’t know σ, we substitute the standard error of for its standard deviation. This is the standard.
Introduction Theoretical Perspectives Research.  Sampling : Identifying the appropriate population of people to be studied.  Random Sample : Each member.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
Topic 5 Statistical inference: point and interval estimate
Estimates and Sample Sizes Lecture – 7.4
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
Introduction to Sampling “If you don’t believe in sampling, the next time you have a blood test tell the doctor to take it all.”
THE SCIENTIFIC METHOD. What is Scientific Inquiry? SCIENCE  Science assumes the natural world is  Consistent  Predictable  Goals of science are 
Richard W. Hamming Learning to Learn The Art of Doing Science and Engineering Session 13: Information Theory ` Learning to Learn The Art of Doing Science.
1 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson GE 5 Tutorial 5.
Active Learning Lecture Slides For use with Classroom Response Systems Statistical Inference: Confidence Intervals.
Uncertainty & Error “Science is what we have learned about how to keep from fooling ourselves.” ― Richard P. FeynmanRichard P. Feynman.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 MARKETING RESEARCH Week 5 Session A IBMS Term 2,
Copyright  2003 by Dr. Gallimore, Wright State University Department of Biomedical, Industrial Engineering & Human Factors Engineering Human Factors Research.
Manijeh Keshtgary Chapter 13.  How to report the performance as a single number? Is specifying the mean the correct way?  How to report the variability.
Sociology 5811: Lecture 11: T-Tests for Difference in Means Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Section 10.1 Confidence Intervals
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
STA Lecture 171 STA 291 Lecture 17 Chap. 10 Estimation – Estimating the Population Proportion p –We are not predicting the next outcome (which is.
AP Statistics Chapter 24 Comparing Means.
Writing Multiple Choice Questions. Types Norm-referenced –Students are ranked according to the ability being measured by the test with the average passing.
M Machine Learning F# and Accord.net.
What is Science? - Ideas developed by scientists and the methods used to gain information about the idea. – A process of making observations and asking.
Inference: Probabilities and Distributions Feb , 2012.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
1 Chapter 8 Interval Estimation. 2 Chapter Outline  Population Mean: Known  Population Mean: Unknown  Population Proportion.
Stat 31, Section 1, Last Time Distribution of Sample Means –Expected Value  same –Variance  less, Law of Averages, I –Dist’n  Normal, Law of Averages,
Warsaw Summer School 2015, OSU Study Abroad Program Normal Distribution.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
Chapter 7: The Distribution of Sample Means
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Warsaw Summer School 2014, OSU Study Abroad Program Sampling Distribution.
EC 827 Module 2 Forecasting a Single Variable from its own History.
Introduction to Instrumentation Engineering
Regression Forecasting and Model Building
Simulation Berlin Chen
Presentation transcript:

Richard W. Hamming Learning to Learn The Art of Doing Science and Engineering Session 27: Unreliable Data Learning to Learn The Art of Doing Science and Engineering Session 27: Unreliable Data

Bad data The data you get is not as reliable as advertised Simulation data is normally assumed to be accurate, but we cannot depend on data the way we expect First, some examples of bad physical data… The data you get is not as reliable as advertised Simulation data is normally assumed to be accurate, but we cannot depend on data the way we expect First, some examples of bad physical data…

Life Testing Life testing – how reliable is equipment Example: testing reliability of vacuum tubes Wanted 20 year life – expected failure cost = $1MWanted 20 year life – expected failure cost = $1M Tubes available for testing 18 months before useTubes available for testing 18 months before use Why believe testing equipment is as reliable as what you are testing? Life testing – how reliable is equipment Example: testing reliability of vacuum tubes Wanted 20 year life – expected failure cost = $1MWanted 20 year life – expected failure cost = $1M Tubes available for testing 18 months before useTubes available for testing 18 months before use Why believe testing equipment is as reliable as what you are testing?

Life Testing (cont’) Tricks in Life Testing Raise temp 17 degrees C and item ages twice as fastRaise temp 17 degrees C and item ages twice as fast Increase voltage and stress resistance breakdownIncrease voltage and stress resistance breakdown Increase frequency and expose failures earlierIncrease frequency and expose failures earlier Hard to adequately test highly reliable equipment in a short time period Too anxious to get latest technology into the fieldToo anxious to get latest technology into the field Not enough time to test it right, but plenty of time to fixNot enough time to test it right, but plenty of time to fix Tricks in Life Testing Raise temp 17 degrees C and item ages twice as fastRaise temp 17 degrees C and item ages twice as fast Increase voltage and stress resistance breakdownIncrease voltage and stress resistance breakdown Increase frequency and expose failures earlierIncrease frequency and expose failures earlier Hard to adequately test highly reliable equipment in a short time period Too anxious to get latest technology into the fieldToo anxious to get latest technology into the field Not enough time to test it right, but plenty of time to fixNot enough time to test it right, but plenty of time to fix

Accurate Measurements Never assume people, equipment can’t make measurement mistakes Take apart test equipment and recalibrate it Never trust the manufacturer on advertised accuracyNever trust the manufacturer on advertised accuracy Just because a machine records data doesn’t make it correct Never process data before checking for errors first it wastes more time to redo workit wastes more time to redo work Never assume people, equipment can’t make measurement mistakes Take apart test equipment and recalibrate it Never trust the manufacturer on advertised accuracyNever trust the manufacturer on advertised accuracy Just because a machine records data doesn’t make it correct Never process data before checking for errors first it wastes more time to redo workit wastes more time to redo work

Test Studies Never trust a pilot program (i.e. a test study) to accurately predict the success of the main study

Basic Scientific Data

Hamming examines increased accuracy of National Bureau of Standards physical constants Assuming new constants are correct, averaged 5 sigma for previous estimated errorAssuming new constants are correct, averaged 5 sigma for previous estimated error Any number with a probable error is typically much smaller than it should be Scientists fine tune equipment to minimize variance, not errorScientists fine tune equipment to minimize variance, not error 90% of the time the next independent measurement will fall outside the previous 90% confidence limits Hamming examines increased accuracy of National Bureau of Standards physical constants Assuming new constants are correct, averaged 5 sigma for previous estimated errorAssuming new constants are correct, averaged 5 sigma for previous estimated error Any number with a probable error is typically much smaller than it should be Scientists fine tune equipment to minimize variance, not errorScientists fine tune equipment to minimize variance, not error 90% of the time the next independent measurement will fall outside the previous 90% confidence limits

Fitting Data to a Model There can be errors in both the model and the data Initially, the model is a guess and the data is badInitially, the model is a guess and the data is bad As the data & measurements improve, the model is rarely reexamined for errorAs the data & measurements improve, the model is rarely reexamined for error Thus, as measurement accuracy increases, the error in the model increasesThus, as measurement accuracy increases, the error in the model increases Estimates combined with reliable measurements are taken at the reliability of the best measurement “A chain is only as strong as its weakest link”“A chain is only as strong as its weakest link” There can be errors in both the model and the data Initially, the model is a guess and the data is badInitially, the model is a guess and the data is bad As the data & measurements improve, the model is rarely reexamined for errorAs the data & measurements improve, the model is rarely reexamined for error Thus, as measurement accuracy increases, the error in the model increasesThus, as measurement accuracy increases, the error in the model increases Estimates combined with reliable measurements are taken at the reliability of the best measurement “A chain is only as strong as its weakest link”“A chain is only as strong as its weakest link”

Economic Definitions An economic definition today is not necessarily what it was yesterday. e.g. gold flow, income, inventory figures, poverty ratee.g. gold flow, income, inventory figures, poverty rate Hard to draw conclusions from a time series when the definition of what is being measured constantly changes. Often a changing definition is used to satisfy the needs of a desired policy (e.g. poverty rate) or outcome (e.g. unstated inventory to avoid taxes). An economic definition today is not necessarily what it was yesterday. e.g. gold flow, income, inventory figures, poverty ratee.g. gold flow, income, inventory figures, poverty rate Hard to draw conclusions from a time series when the definition of what is being measured constantly changes. Often a changing definition is used to satisfy the needs of a desired policy (e.g. poverty rate) or outcome (e.g. unstated inventory to avoid taxes).

Economic Predictors and Indices … change very slowly & lag behind reality Poorly represents current state of affairsPoorly represents current state of affairs Example: manufacturing vs. processing informationExample: manufacturing vs. processing information Worthy argument: it’s better to have an index that is comparable and wrong, versus gaining the right index that is incomparable … change very slowly & lag behind reality Poorly represents current state of affairsPoorly represents current state of affairs Example: manufacturing vs. processing informationExample: manufacturing vs. processing information Worthy argument: it’s better to have an index that is comparable and wrong, versus gaining the right index that is incomparable

EconomistsEconomists Refuse to admit a data problem exists Economists & governments typically skew figures to enact a particular policy (e.g. economic aid, gold holdings)(e.g. economic aid, gold holdings) Refuse to admit a data problem exists Economists & governments typically skew figures to enact a particular policy (e.g. economic aid, gold holdings)(e.g. economic aid, gold holdings)

SurveysSurveys A very small (1% or.1%) and accurate survey is more accurate than a 100% survey with the same amount of money. Verbal surveys are very unreliable (biased based on wording of questions, order of questions, person who asks questions) Written surveys just as badWritten surveys just as bad Some questions just don’t get answered honestly e.g. infidelity ratese.g. infidelity rates A very small (1% or.1%) and accurate survey is more accurate than a 100% survey with the same amount of money. Verbal surveys are very unreliable (biased based on wording of questions, order of questions, person who asks questions) Written surveys just as badWritten surveys just as bad Some questions just don’t get answered honestly e.g. infidelity ratese.g. infidelity rates

Social statistics Average person doesn’t exist e.g. one breast, one testiclee.g. one breast, one testicle Can only average homogenous data social data is rarely homogeneoussocial data is rarely homogeneous Average person doesn’t exist e.g. one breast, one testiclee.g. one breast, one testicle Can only average homogenous data social data is rarely homogeneoussocial data is rarely homogeneous

Management use of data Surveys are answered quicker by those who think they will rate well than those who don’t Presence of management will change data, surveys, observation, etc.Presence of management will change data, surveys, observation, etc. Subordinates will skew data for the boss so as not to break bad news.Subordinates will skew data for the boss so as not to break bad news. Keep all this in mind as you move through the ranks, and realize that the data that reaches you is often unreliable Surveys are answered quicker by those who think they will rate well than those who don’t Presence of management will change data, surveys, observation, etc.Presence of management will change data, surveys, observation, etc. Subordinates will skew data for the boss so as not to break bad news.Subordinates will skew data for the boss so as not to break bad news. Keep all this in mind as you move through the ranks, and realize that the data that reaches you is often unreliable