Introductory Statistics By Peter Woolf University of Michigan Michigan Chemical Process Dynamics and Controls Open Textbook version.

Slides:



Advertisements
Similar presentations
Control Charts for Variables
Advertisements

1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Quality Assurance (Quality Control)
1 DSCI 3123 Statistical Process Control Take periodic samples from a process Plot the sample points on a control chart Determine if the process is within.
1 Manufacturing Process A sequence of activities that is intended to achieve a result (Juran). Quality of Manufacturing Process depends on Entry Criteria.
BPT2423 – STATISTICAL PROCESS CONTROL
Estimation in Sampling
Statistics: Purpose, Approach, Method. The Basic Approach The basic principle behind the use of statistical tests of significance can be stated as: Compare.
Statistical Issues in Research Planning and Evaluation
1 Normal Probability Distributions. 2 Review relative frequency histogram 1/10 2/10 4/10 2/10 1/10 Values of a variable, say test scores In.
Agenda Review homework Lecture/discussion Week 10 assignment
Chapter 8 Estimating Single Population Parameters
Chapter 6 Introduction to Continuous Probability Distributions
Comparing Distributions I: DIMAC and Fishers Exact By Peter Woolf University of Michigan Michigan Chemical Process Dynamics and Controls.
Comparing Distributions II: Bayes Rule and Acceptance Sampling By Peter Woolf University of Michigan Michigan Chemical Process Dynamics.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Comparing Distributions III: Chi squared test, ANOVA By Peter Woolf University of Michigan Michigan Chemical Process Dynamics and Controls.
Software Quality Control Methods. Introduction Quality control methods have received a world wide surge of interest within the past couple of decades.
Dynamical Systems Analysis I: Fixed Points & Linearization By Peter Woolf University of Michigan Michigan Chemical Process Dynamics.
8-1 Quality Improvement and Statistics Definitions of Quality Quality means fitness for use - quality of design - quality of conformance Quality is.
Process Improvement Dr. Ron Tibben-Lembke. Quality Dimensions  Quality of Design Quality characteristics suited to needs and wants of a market at a given.
Dynamical Systems Analysis II: Evaluating Stability, Eigenvalues By Peter Woolf University of Michigan Michigan Chemical Process Dynamics.
Chapter 6 The Normal Distribution and Other Continuous Distributions
Chapter Sampling Distributions and Hypothesis Testing.
Sampling Distributions
CHAPTER 6 Statistical Analysis of Experimental Data
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
PSY 307 – Statistics for the Behavioral Sciences Chapter 8 – The Normal Curve, Sample vs Population, and Probability.
Central Tendency and Variability
The Bell Shaped Curve By the definition of the bell shaped curve, we expect to find certain percentages of the population between the standard deviations.
Statistical Process Control
Copyright © 2012 Pearson Education. All rights reserved Copyright © 2012 Pearson Education. All rights reserved. Chapter 10 Sampling Distributions.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Statistical Applications in Quality and Productivity Management Sections 1 – 8. Skip 5.
Overview Definition Hypothesis
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 14 Sampling Variation and Quality.
Continuous Probability Distributions Continuous random variable –Values from interval of numbers –Absence of gaps Continuous probability distribution –Distribution.
PARAMETRIC STATISTICAL INFERENCE
Agronomic Spatial Variability and Resolution What is it? How do we describe it? What does it imply for precision management?
Basic Statistics for Engineers. Collection, presentation, interpretation and decision making. Prof. Dudley S. Finch.
1 Lecture 19: Hypothesis Tests Devore, Ch Topics I.Statistical Hypotheses (pl!) –Null and Alternative Hypotheses –Testing statistics and rejection.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
To accompany Quantitative Analysis for Management, 9e by Render/Stair/Hanna 17-1 © 2006 by Prentice Hall, Inc. Upper Saddle River, NJ Chapter 17.
Brian Macpherson Ph.D, Professor of Statistics, University of Manitoba Tom Bingham Statistician, The Boeing Company.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Distributions of the Sample Mean
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Two Main Uses of Statistics: 1)Descriptive : To describe or summarize a collection of data points The data set in hand = the population of interest 2)Inferential.
MORE THAN MEETS THE EYE Wayne Gaul, Ph.D., CHP, CHMM Tidewater Environmental Columbia, SC SRHPS Technical Seminar, April 15, 2011.
Statistical Quality Control
1 Six Sigma Green Belt Introduction to Control Charts Sigma Quality Management.
Inferences from sample data Confidence Intervals Hypothesis Testing Regression Model.
Production and Operations Management: Manufacturing and Services PowerPoint Presentation for Chapter 7 Supplement Statistical Quality Control © The McGraw-Hill.
Basic Business Statistics
Sampling Fundamentals 2 Sampling Process Identify Target Population Select Sampling Procedure Determine Sampling Frame Determine Sample Size.
Inspection- “back-end quality control” BUT, Start by designing quality into the front end of the process- the design QFD (Quality Function Deployment)
1 SMU EMIS 7364 NTU TO-570-N Control Charts Basic Concepts and Mathematical Basis Updated: 3/2/04 Statistical Quality Control Dr. Jerrell T. Stracener,
From the population to the sample The sampling distribution FETP India.
Warsaw Summer School 2015, OSU Study Abroad Program Normal Distribution.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions Basic Business.
10 March 2016Materi ke-3 Lecture 3 Statistical Process Control Using Control Charts.
LSM733-PRODUCTION OPERATIONS MANAGEMENT By: OSMAN BIN SAIF LECTURE 30 1.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
© 2010 Pearson Prentice Hall. All rights reserved Chapter Hypothesis Tests Regarding a Parameter 10.
Sources of Variation and Error in Wafer Fab Processing
Agenda Review homework Lecture/discussion Week 10 assignment
What is the point of these sports?
Process Capability.
Xbar Chart By Farrokh Alemi Ph.D
Presentation transcript:

Introductory Statistics By Peter Woolf University of Michigan Michigan Chemical Process Dynamics and Controls Open Textbook version 1.0 Creative commons

“A foolish consistency is the hobgoblin of little minds” R. W. Emerson But is this always true??

Consistency Why might we want consistency? –Integration of products within a larger system –Examples: want parts to fit together, want consistent chemical feeds, want consistent material properties, want consistent energy content, want consistent flavor

Consistency What can be the downsides of consistency? –Make something consistently bad, but consistent. –Sometimes people trade consistency for quality--this is not the goal. –Examples: Fast food vs home made food (depends the cook)

Measures of “Quality” (or lack there of): Six Sigma: “Number of defects per million opportunities” Genichi Taguchi: “Uniformity around a target value” or “The loss a product imposes on society after it is shipped” Process control is a central tool for reducing variability by adjusting and correcting for variations. Key Questions: How can we know if our control system is working well enough? How can we measure variability?

Process Specific Questions 1)Do recent data indicate that the process is broken or changed? 2)Is the process “out of control”? 3)What are the odds that two samples come from the same distribution? 4)What factors influence this outcome?

Detecting if a process has changed Scenario: You are a small Acai juice vendor trying to expand to a world market with a consistent product.

Acai juice production Acai berries in the market Berry crusher juice

Acai juice production juice A key selling point of your acai juice is that it contains a large concentration of antioxidants. With your berry crusher you get a good quality product most of the time, but not always. You don’t want to waste berries if your crusher is hurting your product, but how can you know if it is not working right? How can you test this?

Acai juice production juiceHow can you test this? 1) Gather many samples from your current process and measure the antioxidant concentration N sample values: 40.1, 41.3, 44.3, 39.3, 38.6,….. How do we summarize this?

juice N sample values: 40.1, 41.3, 44.3, 39.3, 38.6,….. How do we summarize this? Average: Deviation from the average: (std deviation)

Deviation from the average: (std deviation) Interpretation: The average distance from the mean OR the width of the dispersion around the mean Problem: What if I have only one sample (e.g. N=1)?  =0!! Does this mean that the underlying process has no variation or that I have not sampled it sufficiently? Result:When N is small, the standard deviation will underestimate the true variation Solution: sample standard deviation

Population standard deviation (“Real deviation”) Sample standard deviation (“Observed deviation”) With a measure of the mean and standard deviation, you have enough information to define a Gaussian distribution Bell curve shape based on a model of a large number of random, uncorrelated changes

Gaussian or Normal Distribution: From previous lecture on Noise: Approximate Gaussian distribution in Excel by: =RAND()+RAND()+RAND()-RAND()-RAND()-RAND() The approximation is better and better for larger numbers of pairs of add and subtract Gaussian distribution is the basis of much of statistical quality control, six sigma, and quality engineering in general. 22 33 -2  -3  66 How do we mathematically define a normal distribution?

How can we calculate this? Excel: Error function is Erf(), thus the solution above could be expressed as =1/2*(1+erf((z-m)/(s*sqrt(2)))) Mathematica: Nintegrate[ f(x), {x,start, end}] Or N[1/2*(1+Erf[(z-m)/(s*Sqrt[2])])] General numerical integration Using analytical solution with error function

juice Acai juice problem revisited From 100 samples of the current process we calculate the following: Mean=40 units Standard deviation= 2 units From these data, what are the odds that the next batch will have an antioxidant value of 37.5 or less?

Mean=40 units Standard deviation= 2 units From these data, what are the odds that the next batch will have an antioxidant value of 37.5 or less? In Mathematica: short hand notation Answer: ~10% of the time we expect this situation

Acai juice production as a function of time time Antioxidant value  Is this process “out of control”? Yes: “It is unusual to see so many batches with such a high value-- this is strange and suggests something has changed.” No: “This is just normal variation-- nothing is fundamentally different.” Key question: How do we define “unusual”

One definition: Variation outside of the six sigma window is unusual 22 33 -2  -3  66 What are the odds of finding something that falls out of this bound by chance? Find by integration! For both tails the probability is ~ or 1 in 370 Common confusion: The “Six Sigma” process defines unusual as 3.4 defects out of 1 million, not within 6 standard deviations (more like 10.2 deviations)

Acai juice production as a function of time time Antioxidant value  Is this process “out of control”? Translation: if we assume outside of 6 sigma variation is “unusual”: Is this pattern expected to happen less than 1 in 370 of our samples? Solution: Control charts!

Image from wikipedia western_electric_rules Control charts determine if a process is behaving in an unusual way.

Image from wikipedia western_electric_rules Control charts determine if a process is behaving in an unusual way. What are the odds? If each dot is a single measurement, and UCL is +3 sigma then UCL=Upper control limit X-bar= average LCL=Lower control limit For both tails the probability is ~ or 1 in 370 Rule 1:

Control charts determine if a process is behaving in an unusual way. What are the odds? UCL=Upper control limit X-bar= average LCL=Lower control limit Rule 2:Can do using probability theory. Assuming each sample is independent, then can find the total probability of: 2*[P1(out+)P2(out+)P3(out+)+P1(out+)P2(out+)P3(in) +P1(out+)P2(in)P3(out+)+P1(in)P2(out+)P3(out+)] =P(out+)P(in)=1-P(out+) = or 1 in in in 326

What are the odds?Alternative solution by sampling 1 in in 326 Approach: Generate thousands of samples and test to see how many satisfy the rule ~ similar to 1 in 370

What are the odds? 1 in 326 Alternative solution by sampling Rule 2: ~ similar to 1 in 326 Message: Many complex decision processes can be evaluated numerically with good accuracy (see mathematica code on website under Lecture 21.nb)

Odds 1 in 370 Odds 1 in 326 Odds 1 in 256 Odds 1 in 180 In all cases these represent somewhat “rare” cases in a statistical sense, but they are not all equally rare. These are not only constrained on statistics though.. e.g. What are the odds of finding 15 consecutive samples in zone c? =Odds 1 in 306 Thus is this system “out of control”? Yes, but in a good way.

Acai juice problem revisited What if you know that each batch of berries has some variation, but you are unsure if the machine is behaving strangely? Can you still use your control charts? Solution: Take samples from each batch, average them and plot these average values and statistics on a control chart. Problem: The process of averaging out different samples will change your odds--averaging reduces out variation. Day 1: 40.36, 39.36, 38.43, Day 2: 39.96, 40.32, 39.88, …

Acai process control using X-bar charts Raw Data: Plotting the raw data, it is hard to say if anything is going on..

To get something like this need UCL and LCL Acai process control using X-bar charts Raw Data: Data in excel example online Lecture.21.xls

To get something like this need UCL and LCL UCL= grand avg+ A3*(avg stdev) = *0.55 =40.76

To get something like this need UCL and LCL UCL= grand avg+ A3*(avg stdev) = *0.55 =40.76 Note: If you use A2, you use the average R. The result is nearly the same. LCL=grand avg-A3*(avg stdev) =38.96 UCL represents 3 standard deviations away from the mean, so the line between zones A/B is 2 standard deviations away: A/B line=grand avg+ A3*(avg stdev)*(2/3)= 40.46

X-bar chart Is it “in control”? Rule 2: fail: points 9 and 10 are in zone A Rule 1: okay, no points outside of zone A Rules 3 and 4: okay Conclusion: Not in statistical control.

Take Home Messages Statistical process control is a method for systematically identifying inconsistencies. Probabilities are often based on a Gaussian process Control charts provide a systematic method for evaluating if a process is under control.

“A foolish consistency is the hobgoblin of little minds” --R. W. Emerson “An intelligent consistency is a virtue in an integrated global economy” --Anonymous