Cost-optimal control charts (Markov- chains in industrial statistics) András Zempléni Eötvös Loránd University, Budapest some parts are joint work with.

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

Chapter 7 Title and Outline 1 7 Sampling Distributions and Point Estimation of Parameters 7-1 Point Estimation 7-2 Sampling Distributions and the Central.
Confidence Intervals This chapter presents the beginning of inferential statistics. We introduce methods for estimating values of these important population.
Chap 9: Testing Hypotheses & Assessing Goodness of Fit Section 9.1: INTRODUCTION In section 8.2, we fitted a Poisson dist’n to counts. This chapter will.
Introduction to Statistical Quality Control, 4th Edition Chapter 7 Process and Measurement System Capability Analysis.
1 Detection and Analysis of Impulse Point Sequences on Correlated Disturbance Phone G. Filaretov, A. Avshalumov Moscow Power Engineering Institute, Moscow.
CHAPTER 16 MARKOV CHAIN MONTE CARLO
5. CONCLUSIONS Initially constructed and applied for the New York State’s rainfall stations (Wilks, 1998), DUM was able to reproduce its good performance.
Statistical Inference Chapter 12/13. COMP 5340/6340 Statistical Inference2 Statistical Inference Given a sample of observations from a population, the.
Software Quality Control Methods. Introduction Quality control methods have received a world wide surge of interest within the past couple of decades.
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
Section 7.2 Tables and summarize hypothesis tests about one mean. (These tests assume that the sample is taken from a normal distribution,
Evaluating Hypotheses
8-1 Quality Improvement and Statistics Definitions of Quality Quality means fitness for use - quality of design - quality of conformance Quality is.
Chapter 6 The Normal Distribution and Other Continuous Distributions
Probability & Statistics for Engineers & Scientists, by Walpole, Myers, Myers & Ye ~ Chapter 10 Notes Class notes for ISE 201 San Jose State University.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
Lecture Slides Elementary Statistics Twelfth Edition
Inferences About Process Quality
Lehrstuhl für Informatik 2 Gabriella Kókai: Maschine Learning 1 Evaluating Hypotheses.
15 Statistical Quality Control CHAPTER OUTLINE
Sampling Theory Determining the distribution of Sample statistics.
The Neymann-Pearson Lemma Suppose that the data x 1, …, x n has joint density function f(x 1, …, x n ;  ) where  is either  1 or  2. Let g(x 1, …,
Dimitrios Konstantas, Evangelos Grigoroudis, Vassilis S. Kouikoglou and Stratos Ioannidis Department of Production Engineering and Management Technical.
Chapter 10 Hypothesis Testing
9/3/2015 IENG 486 Statistical Quality & Process Control 1 IENG Lecture 11 Hypothesis Tests to Control Charts.
Chapter 5 Sampling and Statistics Math 6203 Fall 2009 Instructor: Ayona Chatterjee.
Introduction to Statistical Quality Control, 4th Edition Chapter 7 Process and Measurement System Capability Analysis.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Random Sampling, Point Estimation and Maximum Likelihood.
MS 305 Recitation 11 Output Analysis I
Statistical Interval for a Single Sample
Stochastic Linear Programming by Series of Monte-Carlo Estimators Leonidas SAKALAUSKAS Institute of Mathematics&Informatics Vilnius, Lithuania
1 Statistical Distribution Fitting Dr. Jason Merrick.
Testing Models on Simulated Data Presented at the Casualty Loss Reserve Seminar September 19, 2008 Glenn Meyers, FCAS, PhD ISO Innovative Analytics.
A Process Control Screen for Multiple Stream Processes An Operator Friendly Approach Richard E. Clark Process & Product Analysis.
What Does the Likelihood Principle Say About Statistical Process Control? Gemai Chen, University of Calgary Canada July 10, 2006.
Graphical Models for Machine Learning and Computer Vision.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
Chapter 7. Control Charts for Attributes
Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.
Section 6-3 Estimating a Population Mean: σ Known.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
ENGM 620: Quality Management Session 8 – 30 October 2012 Process Capability.
Point Estimation of Parameters and Sampling Distributions Outlines:  Sampling Distributions and the central limit theorem  Point estimation  Methods.
Chapter 8. Process and Measurement System Capability Analysis
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
Chapter 1 Introduction to Statistics. Section 1.1 Fundamental Statistical Concepts.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 6-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Copyright © Cengage Learning. All rights reserved. 5 Joint Probability Distributions and Random Samples.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
Copyright © 2009 Pearson Education, Inc. 9.2 Hypothesis Tests for Population Means LEARNING GOAL Understand and interpret one- and two-tailed hypothesis.
1 Lecture Plan Modelling Profit Distribution from Wind Production (Excel Case: Danish Wind Production and Spot Prices) Reasons for copula.
Bayesian Estimation and Confidence Intervals Lecture XXII.
Lecture 1.31 Criteria for optimal reception of radio signals.
Bayesian Estimation and Confidence Intervals
Combining Random Variables
CONCEPTS OF HYPOTHESIS TESTING
Remember that our objective is for some density f(y|) for observations where y and  are vectors of data and parameters,  being sampled from a prior.
Hidden Markov Models Part 2: Algorithms
Discrete Event Simulation - 4
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 9 Hypothesis Testing: Single Population
Further Topics on Random Variables: Derived Distributions
Further Topics on Random Variables: Derived Distributions
Further Topics on Random Variables: Derived Distributions
Presentation transcript:

Cost-optimal control charts (Markov- chains in industrial statistics) András Zempléni Eötvös Loránd University, Budapest some parts are joint work with Belmiro Duarte and Pedro Saraiva Inst. Superior de Engenharia Department of Chemical de Coimbra Engineering, Univ. of Coimbra

2 Introduction: statistical process control Control charts: developed for signaling deviations from the normal operation (originated by Shewhart, 1930) Classical version: x (x-bar) chart, with control limits based on the 3  -rule.

3 Introduction: objective To design process control charts, which are –cost-effective –easy to implement Needed knowledge to the cost-calculations: –(approximate) cost of sampling, errors of type I and II –frequency and distribution of shift in process mean. Similar approach is to be found in the work of Duncan (for a recent summary see Kenett-Zacks, 2000), but our solutions are more general.

4 Introduction: this talk Methods and results for optimization Applications –for real processes –in teaching (simulated data)

5 Assumptions Observations: not only the normal, but other distributions can be investigated Shift in process mean (two-sided): it is supposed to be randomly distributed. Other characteristics remain unchanged. The process control is based on the x-chart, with Taguchian approach (wide range of options) for both types of error.

6 X-chart: basic parameters d: probability of shift in a unit interval (stationarity is assumed), different shift size distributions k: the cost of a single sample element. l: cost (per unit interval) of a not recognized shift of unit size. For shift size z, the loss is lz 2. f: cost of a false alarm. As we suppose a continuous change in the mean, we have chosen an exponentially decreasing, but nonnull, value for this term. For shift size z, this cost is The total cost for a time period is simply the sum of the components above.

7

8 Prior information needed The parameters d, k, l and f must be estimated from experience knowledge. A reasonable shift size distribution can also be proposed (including its parameters). Our method is determined by the following: c: UCL (-c=LCL) t: interval between sampling (a one-element sample is taken j=1/t times in a unit time interval)

9 Parameters of the shift distribution It is not easy to estimate the parameters d and s (in Zempléni et al, 2004 a maximum likelihood estimator was presented for the one-sided case). Graphical methods or maximum likelihood can be based on the values X i+1 - X i.

10 The cases under investigation Shift size distribution is now supposed to be i.i.d. N(0,s), which allows for an easy computation of its convolution powers (needed if there is more than one shift during an interval of length t). We investigate d=0.2 in this presentation (analogous results are expected for other shift intensities).

11 Alarm (with actual shift value) Simulated shifts, s=1, d=0.2, j=1

12 Shift and the observations, d=0.2, j=5, s=1, no control chart applied

13 Preparations to cost-optimisation Shift intensity: d, this implies that the number of shifts within an interval of length  has a Poisson distribution with parameter d . Markov-chain approach: the process behaviour at consecutive instants (captured through sampling) is characterized by the pair (shift size, action) where action can either be „alarm” or „no action” Discretisation: m=150 equidistant classes were used for positive shifts x: it belongs to class i  m if iM/m<x  (i+1)M/m, similarly for another m classes for negative shifts. The number of classes was doubled in order to distinguish between alarm and non-alarm cases. The choice of the maximal value M=s(c+4) turned out to be suitable.

14 Optimization Transition probability for non-alarm cases: (  (c-x)-  (-c-x))P(y-x-  <Z t <y-x) where the first term ensures that there is no alarm, and Z t denotes the shift distribution over an interval of length t. Other cases: if there is an alarm, the mean is supposed to be moved to 0 in the next step. Stationary distribution of the Markov chain can be calculated, allowing the cost function evaluation. Cost function per time unit: lE(Z 1 2 )+fE(e -10Z 1 )+k/t. Minimization took 8-10 minutes per case.

15 P(Z=0) for the stationary distribution (c=2)

16 Parameters of the optimal charts: cost function (d=0.2, k=1)

17 Parameters of the optimal charts: sampling frequency (d=0.2, k=1)

18 Parameters of the optimal charts: critical value (d=0.2, k=1)

19 Robustness of the methods Simulation studies: the optimal charts were used for different parameter values. For l=100, f=100, k=1, s=0.7, d=0.2 the optimum is achieved for j=5.9, c=1.9 (cost, based on the stationary distribution: 18.1) The cost of this setting for different parameter combinations (based on 100 simulations, each of length T=1000): The minimal cost for this last case: parameterss=0.7, d=0.2 s=0.7, d=0.33 s=1, d=0.33 cost function (std. dev) (0.97) (1.22) (1.45)

20 The uniform shift-distribution The distribution function of its convolution powers were approximated by simulation (by a sample of 20000). Results: for shift distributions with the same variance, and f=l, the uniform is slightly more dangerous, so –sampling frequency: higher; –critical value: about the same. If f<l, this difference diminishes.

21 Real-life applications Here we present results for a precipitated calcium carbonate production plant, where for the centered data we have USL=0.25, LSL=-0.25 and the cost function elements were defined as follows: l: continuous function, such as 200 for values outside specification, a(x-0.22) 2 for |x|> for |x|  0.22 f=20 k=10 (due to labor needed).

22 Real-life applications: the parameters of the process It was estimated that d=0.2 and s=0.05 (for the original data; after standardisation it corresponds to s=0.57). The optimal chart has c=1.25 and time between consecutive sample elements is t=1.91. The corresponding value of the cost function : 11.5 For a slightly simplified version of the current method: t=1, c=3, its cost is 47.3 The difference is impressive!

23 Real-life applications: results Different values as cost of the false alarm has been tried, with the following results

24 Simulations for the real case Simulation studies: the optimal chart corresponding to s=0.57, d=0.2 were used for different parameter values The cost of this optimal chart for different parameter combinations (based on 100 simulations, each of length T=10000): parameterss=0.75, d=0.2 s=0.75, d=0.33 increase of cost function 7%11.2%

25 Conclusions We presented a methodology to design optimal control charts by using additional knowledge about the process characteristics and related costs, tested over both simulated and industrial data sets. Our approach has proven that one can achieve substantial cost-reduction by choosing the optimal methods.

26 The use of our methods in teaching Sequence of exercises (based on simulated data) Costs are supposed to be known

27 Next step Shift intensity and size has to be estimated (based on the observed data) Optimization is carried out for these estimated values

28 Finalisation “Optimal” cost value: 12 The results with the “optimal” parameters

29 References 1.Duncan, A.J. (1974): Quality Control and Industrial Statistics. 4 th edition, Homewood, Illinois. 2.Kenett, R and Zacks, S: Modern Industrial Statistics, 2000, Duxbury Press 3.Zempléni, A., Véber, M., Duarte B. and Saraiva, P. Control Charts: a cost-optimization approach for processes with random shifts (Appl. Stoch. Models in Business and Industry, 2004)

30 Acknowledgement This work was developed by members of the Pro-ENBIS network. The Pro-Enbis project is supported by funding under the European Commission's Fifth Framework 'Growth' Programme via the Thematic Network "Pro-ENBIS“, contract reference: G6RT-CT