Lecture 8: Measurement Errors 1. Objectives List some sources of measurement errors. Classify measurement errors into systematic and random errors. Study.

Slides:



Advertisements
Similar presentations
Errors in Chemical Analyses: Assessing the Quality of Results
Advertisements

Sampling Distributions (§ )
Physics 114: Lecture 7 Uncertainties in Measurement Dale E. Gary NJIT Physics Department.
Introduction to Statistics
Central Limit Theorem.
EXPERIMENTAL ERRORS AND DATA ANALYSIS
Quality Control Procedures put into place to monitor the performance of a laboratory test with regard to accuracy and precision.
Sampling Distributions
Evaluating Hypotheses
PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.
Chapter 7: Variation in repeated samples – Sampling distributions
Sampling Distributions
CHAPTER 6 Statistical Analysis of Experimental Data
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Probability and Statistics in Engineering Philip Bedient, Ph.D.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution Business Statistics: A First Course 5 th.
Continuous Probability Distribution  A continuous random variables (RV) has infinitely many possible outcomes  Probability is conveyed for a range of.
Standard error of estimate & Confidence interval.
Probability and the Sampling Distribution Quantitative Methods in HPELS 440:210.
Chapter 6 Random Error The Nature of Random Errors
Review of normal distribution. Exercise Solution.
Chapter 6 The Normal Probability Distribution
STAT02 - Descriptive statistics (cont.) 1 Descriptive statistics (cont.) Lecturer: Smilen Dimitrov Applied statistics for testing and evaluation – MED4.
Measurement Errors Class 5.
Statistics Chapter 9. Statistics Statistics, the collection, tabulation, analysis, interpretation, and presentation of numerical data, provide a viable.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
16-1 Copyright  2010 McGraw-Hill Australia Pty Ltd PowerPoint slides to accompany Croucher, Introductory Mathematics and Statistics, 5e Chapter 16 The.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
PARAMETRIC STATISTICAL INFERENCE
Theory of Probability Statistics for Business and Economics.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
The normal distribution Binomial distribution is discrete events, (infected, not infected) The normal distribution is a probability density function for.
LECTURER PROF.Dr. DEMIR BAYKA AUTOMOTIVE ENGINEERING LABORATORY I.
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Fundamentals of Data Analysis Lecture 3 Basics of statistics.
Lecture 2 Review Probabilities Probability Distributions Normal probability distributions Sampling distributions and estimation.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
LECTURE 3: ANALYSIS OF EXPERIMENTAL DATA
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
ME Mechanical and Thermal Systems Lab Fall 2011 Chapter 3: Assessing and Presenting Experimental Data Professor: Sam Kassegne, PhD, PE.
Confidence Interval Estimation For statistical inference in decision making:
PCB 3043L - General Ecology Data Analysis. PCB 3043L - General Ecology Data Analysis.
Module 1: Measurements & Error Analysis Measurement usually takes one of the following forms especially in industries: Physical dimension of an object.
Chapter 6 The Normal Distribution.  The Normal Distribution  The Standard Normal Distribution  Applications of Normal Distributions  Sampling Distributions.
Lean Six Sigma: Process Improvement Tools and Techniques Donna C. Summers © 2011 Pearson Higher Education, Upper Saddle River, NJ All Rights Reserved.
Chapter 13 Sampling distributions
Chapter 20 Statistical Considerations Lecture Slides The McGraw-Hill Companies © 2012.
From the population to the sample The sampling distribution FETP India.
ERT 207 Analytical Chemistry ERT 207 ANALYTICAL CHEMISTRY Dr. Saleha Shamsudin.
CHAPTER – 1 UNCERTAINTIES IN MEASUREMENTS. 1.3 PARENT AND SAMPLE DISTRIBUTIONS  If we make a measurement x i in of a quantity x, we expect our observation.
Warsaw Summer School 2015, OSU Study Abroad Program Normal Distribution.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
This represents the most probable value of the measured variable. The more readings you take, the more accurate result you will get.
Chapter 6: Random Errors in Chemical Analysis. 6A The nature of random errors Random, or indeterminate, errors can never be totally eliminated and are.
THE NORMAL DISTRIBUTION
Theoretical distributions: the Normal distribution.
SUR-2250 Error Theory.
Estimating the Value of a Parameter Using Confidence Intervals
Introduction to Instrumentation Engineering
Descriptive and inferential statistics. Confidence interval
Sampling Distributions (§ )
CHAPTER – 1.2 UNCERTAINTIES IN MEASUREMENTS.
Advanced Algebra Unit 1 Vocabulary
CHAPTER – 1.2 UNCERTAINTIES IN MEASUREMENTS.
Presentation transcript:

Lecture 8: Measurement Errors 1

Objectives List some sources of measurement errors. Classify measurement errors into systematic and random errors. Study several statistical tools to treat measurement data contaminated by random errors. 2

Error sources When a physical quantity is measured, the value obtained should not be expected to be exactly what the quantity actually is. With every measured quantity there is associated some error. These errors can arise from: (1) Instrument errors: can arise in the manufacturing of the instrument or due to using it under different conditions to which it was calibrated, e.g. at a different temperature. (2) Human errors wrong reading of an instrument. (3) Insertion errors i.e. the measuring device affects the measured quantity ( the loading effect) (4) Errors of unexplainable origin classified as random errors. 3

Error types Measurement errors can be classified into random or systematic. Systematic errors do not vary from one reading to another (include instrument, human, and insertion errors). Random errors, on the contrary, are caused by unpredictable variations in the measurement system. Random errors are usually observed as small perturbations of the measurement around the correct value, i.e. positive and negative errors occur in approximately equal numbers for a series of measurements made of the same constant quantity. Therefore, random errors can largely be eliminated by calculating the average or mean of a number of repeated measurements. 4

The mean (average) value The mean of a set of n readings {x 1, x 2, …, x n } is given by: The mean is a statistical term that helps in describing the central tendency (mid-point) of a set of readings. The more readings we take the more likely we can cancel out the random variations that occur between readings and obtain the true value of the measured variable. 5

Example 1 The length of a steel bar is measured and the following set of 5 measurements are recorded (in mm): {407, 403, 404, 403, 408}. The mean of the readings is 405 mm. Imagine that a 6 th reading of 441 mm was obtained. Then the set of readings will be {407, 403, 404, 403, 408, 441}. The mean of the 6 readings is 411 mm. We realize that the two values of the mean are quite different. The reason is that the 441 mm reading is much higher than the other readings. This reading is called an outlier. Outlier is too low or high reading compared to the rest of readings. Therefore, the mean is very sensitive to outliers. 6

The median The median is another statistical term which plays a role similar to the mean in describing the central tendency of a set of readings. However, the median is less sensitive to outliers. This is why a median is sometimes taken as a better measure of the mid point. To calculate the median, we do not sum the measurements, but we write them in an ascending order. 7

For a set of n measurements {x 1, x 2, …, x n } written down in ascending order, the median is the middle value: For example, for a set of 5 measurements x 1, x 2, …, x 5 arranged in order of magnitude, the median value is x 3. For an even number of measurements, the median value is midway between the two center values, e.g. for 6 measurements x 1, x 2, …, x 6, the median value is given by: 8 The median

Example 2 Back to Example 1, recall that mean of the set of 5 measurements, {407, 403, 404, 403, 408}, is 405 mm. To calculate the median, we reorder the readings as {403, 403, 404, 407, 408}. The median is 404 mm. With 6 readings, {407, 403, 404, 403, 408, 441}, the mean is 411 mm. The median is ( )/2 = mm. We can realize that the median does not change too much compared to the mean. 9

Standard deviation Consider the two sets of readings A and B shown. Both sets have the same average of 201. Which of these two measurement sets should we have more confidence in? 10 Set ASet B It can be seen that set B is more spread out (shows more random fluctuations) around its mean than set A. So, we should be more confident in set A!. A reading from Set A has a greater chance of being closer to the mean value than a reading from Set B.

The spread of readings is described by a quantity called the standard deviation, σ, which is calculated as: where d i is the deviation of the i th reading from the mean and n is the number of readings. For Set A, σ = 0.7 and for Set B, σ = 4.8. This indicates a greater spread of readings in Set B compared to Set A. 11 Standard deviation Set A Datadidi (d i ) ∑(d i ) 2 2 σ √(2/4) = 0.7

Graphical data analysis Graphical techniques are very useful in analyzing how random measurement errors are distributed. A simple way of doing this is the called histogram. To draw the histogram, bands of equal width across the range of measurement values are defined and the number of measurements within each band is counted. For example, the histogram of the following set of 23 readings: {407, 407, 404, 407, 404, 405, 407, 402, 406, 409, 408, 405, 406, 410, 406, 408, 406, 409, 406, 405, 409, 406, 407}, using bands 2mm wide, is shown next. 12

The histogram This histogram has the characteristic shape shown by truly random data, with symmetry about the mean value of the measurements. What happens to the histogram as the number of measurements increases? 13 There are 11 measurements in the range from to There are 5 measurements in the range from to 409.5

The histogram 14 As the number of measurements → ∞, the histogram becomes a smooth bell-shaped curve

If the area under the curve is normalized to unity, that is, then the curve is called the probability density function (pdf). This bell-shaped curve is also known as the Gaussian or the normal distribution. Probability distribution function (pdf) 15 The probability that a measurement lies between two values D 1 and D 2 equal the area under the curve between D 1 and D 2, as shown:

Cumulative distribution function (cdf) The cumulative distribution function (cdf) is defined as the probability of observing a value less than or equal to D 0, and is expressed as: 16 Thus, the cdf is the area under the curve to the left of a vertical line drawn through D 0.

Gaussian (normal) distribution 17

Standard Gaussian distribution If a Gaussian distribution has zero mean μ = 0 and a standard deviation σ = 1, it is called a standard Gaussian distribution. Any non-standard Gaussian distribution can be converted to a standard Gaussian distribution by the transformation 18

Standard Gaussian tables Standard Gaussian distribution tables are available to tabulate the cdf function, F(z), for various values of z given by: 19

Example 3 How many measurements in a data set subject to random errors lie outside the boundaries of +σ and -σ, around the mean i.e. how many measurements have a deviation greater than |σ|? 20 Solution The required number is represented by the sum of the two shaded areas in the Figure.

This area can be expressed as: We need to transform x to a standard normally distributed variable, z, using the transformation z = (x-μ)/σ. This gives: 21

22 That is 32% of the measurements lie outside the ±σ boundaries, while 68% of the measurements lie inside.

Gaussian distribution: a general rule 23 Deviation boundaries % of data points within boundary Probability of any particular data point being outside boundary ±σ±σ68.0%32.0% ±2σ95.4%4.6% ±3σ99.7%0.3%

Example 4 An integrated circuit chip contains 10 5 transistors. The transistors have a mean current gain of 20 and a standard deviation of 2. Assuming that the current gain is normally distributed, calculate the following: (a) the number of transistors with a current gain between 19.8 and 20.2 (b) the number of transistors with a current gain greater than

Solution (a)We want to find the probability of having transistors with current gain between 19.8 and 20.2, that is, Thus, 7960 ( x 10 5 ) transistors have a current gain in the range from 19.8 to (a)The number of transistors with gain > 17 is given by: Thus, 93.32%, i.e transistors have a gain >

Standard error of the mean (sme) Imagine that we have calculated the mean of a set of n data points. This yields an estimate x of the true mean μ. Usually, some error exists between x and μ. If several subsets, of size n, are taken from an infinite data population, then, by the central limit theorem, the means of the subsets will form a Gaussian distribution about the true mean. The standard deviation of that distribution is defined as the standard error of the mean, α, calculated as Clearly, α tends to zero as the number of measurements (n) in the data set tends to infinity. 26

Standard error of the mean (sme) The standard error of the mean can be used to attach a level of uncertainty to the estimated mean calculated using a finite set of measurements. We know that a range of ± two standard deviation (i.e., ±2α) encompasses 95.4% of the deviations of sample means around the true value. Thus we can say that the true mean lies in the interval x±2α with probability 95.4% 27

Example 5 Given the lengths of a set of 100 men. The average length is found to be 173 cm and a standard deviation of 10 cm. What conclusion can be drawn about the true population mean? The standard error of the mean is Hence, we may conclude that the true mean lies between 171~175cm (173±2 cm) with confidence 95.4%. 28