Lecture 2 Basic Experimental Statistics. What does this mean? From Snee, R.D. (1983) “Graphical Analysis of Process Variation Studies,” Journal of Quality.

Slides:



Advertisements
Similar presentations
DATA & STATISTICS 101 Presented by Stu Nagourney NJDEP, OQA.
Advertisements

THE CENTRAL LIMIT THEOREM
Mean, Proportion, CLT Bootstrap
Chapter 6 Sampling and Sampling Distributions
Sampling: Final and Initial Sample Size Determination
Unit 7 Section 6.1.
Sampling Distributions (§ )
Sampling Distributions. Review Random phenomenon Individual outcomes unpredictable Sample space all possible outcomes Probability of an outcome long-run.
Chapter 6 Introduction to Sampling Distributions
Chapter 7 Sampling and Sampling Distributions
Lecture 5 Outline – Tues., Jan. 27 Miscellanea from Lecture 4 Case Study Chapter 2.2 –Probability model for random sampling (see also chapter 1.4.1)
Quality Control Procedures put into place to monitor the performance of a laboratory test with regard to accuracy and precision.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Topic 2: Statistical Concepts and Market Returns
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 9: Hypothesis Tests for Means: One Sample.
CHAPTER 6 Statistical Analysis of Experimental Data
Part III: Inference Topic 6 Sampling and Sampling Distributions
Statistical inference Population - collection of all subjects or objects of interest (not necessarily people) Sample - subset of the population used to.
BCOR 1020 Business Statistics
Inference on averages Data are collected to learn about certain numerical characteristics of a process or phenomenon that in most cases are unknown. Example:
Standard error of estimate & Confidence interval.
Chapter 6 Sampling and Sampling Distributions
Probability and the Sampling Distribution Quantitative Methods in HPELS 440:210.
Review of normal distribution. Exercise Solution.
AM Recitation 2/10/11.
Overview Definition Hypothesis
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
 The situation in a statistical problem is that there is a population of interest, and a quantity or aspect of that population that is of interest. This.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 7 Sampling Distributions.
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
Jan 17,  Hypothesis, Null hypothesis Research question Null is the hypothesis of “no relationship”  Normal Distribution Bell curve Standard normal.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 6 Sampling Distributions.
AP Statistics Chapter 9 Notes.
Estimation of Statistical Parameters
Topic 5 Statistical inference: point and interval estimate
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Random Sampling, Point Estimation and Maximum Likelihood.
Continuous Probability Distributions Continuous random variable –Values from interval of numbers –Absence of gaps Continuous probability distribution –Distribution.
PARAMETRIC STATISTICAL INFERENCE
Chapter 11 – 1 Lecture 7 How certain are we? Sampling and the normal distribution.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5.
The Practice of Statistics Third Edition Chapter 10: Estimating with Confidence Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 7 Sampling Distributions.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Estimation This is our introduction to the field of inferential statistics. We already know why we want to study samples instead of entire populations,
Fundamentals of Data Analysis Lecture 3 Basics of statistics.
Lecture 2 Review Probabilities Probability Distributions Normal probability distributions Sampling distributions and estimation.
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
LECTURE 3: ANALYSIS OF EXPERIMENTAL DATA
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a measure of the population. This value is typically unknown. (µ, σ, and now.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
1 Sampling Distribution of Arithmetic Mean Dr. T. T. Kachwala.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. PPSS The situation in a statistical problem is that there is a population of interest, and a quantity or.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5. Sampling Distributions and Estimators What we want to do is find out the sampling distribution of a statistic.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Lecture 5 Introduction to Sampling Distributions.
1 Probability and Statistics Confidence Intervals.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
Lecture 8: Measurement Errors 1. Objectives List some sources of measurement errors. Classify measurement errors into systematic and random errors. Study.
Green Belt – SIX SIGMA OPERATIONAL Central Limit Theorem.
Chapter 7: The Distribution of Sample Means
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
© 2001 Prentice-Hall, Inc.Chap 8-1 BA 201 Lecture 12 Confidence Interval Estimation.
Chapter 6 Sampling and Sampling Distributions
ESTIMATION.
Sampling Distributions
Chapter 10: Estimating with Confidence
Sampling Distribution of a Sample Proportion
Advanced Algebra Unit 1 Vocabulary
Presentation transcript:

Lecture 2 Basic Experimental Statistics

What does this mean? From Snee, R.D. (1983) “Graphical Analysis of Process Variation Studies,” Journal of Quality Technology, 15, 76-88

Are the specimen different? Sp #5 Sp #3 Sp #1 Sp #2 Sp #6

Are the operators different? Operator #3 Operator #2 Operator #1

Statistics in Experimentation Project Planning Phase – What is to be measured? – How large is the likely variation? – What are the influential factors? Experimental Design Phase – Control known sources of variation – Estimate the size of uncontrolled variation – Investigate suitable models Statistical Analysis Phase – Next few weeks…

Population All possible items or units that determine an outcome of a well-defined experiment are collectively called a “population”. Examples: – All 30-ohm resistors produced by a certain manufacturer during a fixed time period. – All measurements of the fracture strength of one- inch-thick underwater welds on a steel alloy base plate that is 200ft deep in salt-water.

Process A repeatable series of actions that result in an observable characteristic or measurement. Examples: – The effect of aspirin on blood pressure.

Sample A group of observations taken from a population or a process. We usually take a “convenience sample” (i.e., easy to obtain), but these can be of dubious value because they may not be representative of the variation in the population!! Goal: random sample

Other definitions Variable : A property or characteristic on which information is obtained in an experiment. Distribution: a tabular or graphical description of the values of a variable using some measure of how frequently they occur in a population, process, or sample. (think: histogram)

Example Population or a Sample?

Example Convenience Sample or Random Sample?

Example Is this a Normal Distribution?

Beware the Normal Distribution aka, Gaussian Distribution Approximately normal distributions occur in many situations, as explained by the central limit theorem. When there is reason to suspect the presence of a large number of small effects acting additively and independently, it is reasonable to assume that observations will be normal.central limit theorem Good example: laser light intensity

Beware the Normal Distribution Bad example: Black–Scholes modelBlack–Scholes model Changes in the logarithm of exchange rates, price indices, and stock market indices; these variables behave like compound interest, not like simple interest, and so are multiplicative; While the Black–Scholes model assumes normality, in reality these variables exhibit heavy tails, as seen in stock market crashes (see The Black Swan)heavy tailsstock market crashes

Beware the Normal Distribution Concrete example: S&P 500, daily change:  = % ± 0.9% (from W. Egan) Based on this, you conclude: –68% of the time, changes will be less than 0.9% – 95% of the time, changes will be less than 1.8% – 99.7% of the time, changes will be less than 2.7% – % of the time, changes will be less than 3.6% (1 in every 17,000 trading days or once every 70 years) – A 4.5% change should almost never happen, but it has happened multiple times in your lives!

Sample Mean or Average Sample median: – M=x (q) if n is odd, where q=(n+1)/2. – M=[x (q) + x (q+1) ]/2 if n is even where q=n/2 Do not use text for equations (like I just did) in reports! It looks like crap!

Sample Standard Deviation In your reports, all values should be reported as: or INCLUDE UNITS!!!!

Sample Standard Error Some people like to use standard errors because the smaller values seem to imply better results. You need to be careful with such people!

Thanks to Wikipedia…. standard error is an estimate of how close to the population mean your sample mean is likely to be – it behaves like a confidence interval. standard deviation is the degree to which individuals within the sample differ from the sample mean (“Spread”). Standard error should decrease with larger sample sizes, as the estimate of the population mean improves. Standard deviation will be unaffected by sample size. Std. Err. vs. Std. Dev.

Percent Error Note: The order of terms in the numerator can vary. Percent Difference Note: There is no universally accepted choice for the value that should go in the denominator. If one value is believed to be closer to the true value, that should be in the denominator.

Which to use? We almost always need to give an indication of how accurate our experimental measurements are. Which measure should we use? When should we use  ? When should we use % error or % diff?