Cost-optimal control charts (Markov- chains in industrial statistics) András Zempléni Eötvös Loránd University, Budapest some parts are joint work with Belmiro Duarte and Pedro Saraiva Inst. Superior de Engenharia Department of Chemical de Coimbra Engineering, Univ. of Coimbra
2 Introduction: statistical process control Control charts: developed for signaling deviations from the normal operation (originated by Shewhart, 1930) Classical version: x (x-bar) chart, with control limits based on the 3 -rule.
3 Introduction: objective To design process control charts, which are –cost-effective –easy to implement Needed knowledge to the cost-calculations: –(approximate) cost of sampling, errors of type I and II –frequency and distribution of shift in process mean. Similar approach is to be found in the work of Duncan (for a recent summary see Kenett-Zacks, 2000), but our solutions are more general.
4 Introduction: this talk Methods and results for optimization Applications –for real processes –in teaching (simulated data)
5 Assumptions Observations: not only the normal, but other distributions can be investigated Shift in process mean (two-sided): it is supposed to be randomly distributed. Other characteristics remain unchanged. The process control is based on the x-chart, with Taguchian approach (wide range of options) for both types of error.
6 X-chart: basic parameters d: probability of shift in a unit interval (stationarity is assumed), different shift size distributions k: the cost of a single sample element. l: cost (per unit interval) of a not recognized shift of unit size. For shift size z, the loss is lz 2. f: cost of a false alarm. As we suppose a continuous change in the mean, we have chosen an exponentially decreasing, but nonnull, value for this term. For shift size z, this cost is The total cost for a time period is simply the sum of the components above.
7
8 Prior information needed The parameters d, k, l and f must be estimated from experience knowledge. A reasonable shift size distribution can also be proposed (including its parameters). Our method is determined by the following: c: UCL (-c=LCL) t: interval between sampling (a one-element sample is taken j=1/t times in a unit time interval)
9 Parameters of the shift distribution It is not easy to estimate the parameters d and s (in Zempléni et al, 2004 a maximum likelihood estimator was presented for the one-sided case). Graphical methods or maximum likelihood can be based on the values X i+1 - X i.
10 The cases under investigation Shift size distribution is now supposed to be i.i.d. N(0,s), which allows for an easy computation of its convolution powers (needed if there is more than one shift during an interval of length t). We investigate d=0.2 in this presentation (analogous results are expected for other shift intensities).
11 Alarm (with actual shift value) Simulated shifts, s=1, d=0.2, j=1
12 Shift and the observations, d=0.2, j=5, s=1, no control chart applied
13 Preparations to cost-optimisation Shift intensity: d, this implies that the number of shifts within an interval of length has a Poisson distribution with parameter d . Markov-chain approach: the process behaviour at consecutive instants (captured through sampling) is characterized by the pair (shift size, action) where action can either be „alarm” or „no action” Discretisation: m=150 equidistant classes were used for positive shifts x: it belongs to class i m if iM/m<x (i+1)M/m, similarly for another m classes for negative shifts. The number of classes was doubled in order to distinguish between alarm and non-alarm cases. The choice of the maximal value M=s(c+4) turned out to be suitable.
14 Optimization Transition probability for non-alarm cases: ( (c-x)- (-c-x))P(y-x- <Z t <y-x) where the first term ensures that there is no alarm, and Z t denotes the shift distribution over an interval of length t. Other cases: if there is an alarm, the mean is supposed to be moved to 0 in the next step. Stationary distribution of the Markov chain can be calculated, allowing the cost function evaluation. Cost function per time unit: lE(Z 1 2 )+fE(e -10Z 1 )+k/t. Minimization took 8-10 minutes per case.
15 P(Z=0) for the stationary distribution (c=2)
16 Parameters of the optimal charts: cost function (d=0.2, k=1)
17 Parameters of the optimal charts: sampling frequency (d=0.2, k=1)
18 Parameters of the optimal charts: critical value (d=0.2, k=1)
19 Robustness of the methods Simulation studies: the optimal charts were used for different parameter values. For l=100, f=100, k=1, s=0.7, d=0.2 the optimum is achieved for j=5.9, c=1.9 (cost, based on the stationary distribution: 18.1) The cost of this setting for different parameter combinations (based on 100 simulations, each of length T=1000): The minimal cost for this last case: parameterss=0.7, d=0.2 s=0.7, d=0.33 s=1, d=0.33 cost function (std. dev) (0.97) (1.22) (1.45)
20 The uniform shift-distribution The distribution function of its convolution powers were approximated by simulation (by a sample of 20000). Results: for shift distributions with the same variance, and f=l, the uniform is slightly more dangerous, so –sampling frequency: higher; –critical value: about the same. If f<l, this difference diminishes.
21 Real-life applications Here we present results for a precipitated calcium carbonate production plant, where for the centered data we have USL=0.25, LSL=-0.25 and the cost function elements were defined as follows: l: continuous function, such as 200 for values outside specification, a(x-0.22) 2 for |x|> for |x| 0.22 f=20 k=10 (due to labor needed).
22 Real-life applications: the parameters of the process It was estimated that d=0.2 and s=0.05 (for the original data; after standardisation it corresponds to s=0.57). The optimal chart has c=1.25 and time between consecutive sample elements is t=1.91. The corresponding value of the cost function : 11.5 For a slightly simplified version of the current method: t=1, c=3, its cost is 47.3 The difference is impressive!
23 Real-life applications: results Different values as cost of the false alarm has been tried, with the following results
24 Simulations for the real case Simulation studies: the optimal chart corresponding to s=0.57, d=0.2 were used for different parameter values The cost of this optimal chart for different parameter combinations (based on 100 simulations, each of length T=10000): parameterss=0.75, d=0.2 s=0.75, d=0.33 increase of cost function 7%11.2%
25 Conclusions We presented a methodology to design optimal control charts by using additional knowledge about the process characteristics and related costs, tested over both simulated and industrial data sets. Our approach has proven that one can achieve substantial cost-reduction by choosing the optimal methods.
26 The use of our methods in teaching Sequence of exercises (based on simulated data) Costs are supposed to be known
27 Next step Shift intensity and size has to be estimated (based on the observed data) Optimization is carried out for these estimated values
28 Finalisation “Optimal” cost value: 12 The results with the “optimal” parameters
29 References 1.Duncan, A.J. (1974): Quality Control and Industrial Statistics. 4 th edition, Homewood, Illinois. 2.Kenett, R and Zacks, S: Modern Industrial Statistics, 2000, Duxbury Press 3.Zempléni, A., Véber, M., Duarte B. and Saraiva, P. Control Charts: a cost-optimization approach for processes with random shifts (Appl. Stoch. Models in Business and Industry, 2004)
30 Acknowledgement This work was developed by members of the Pro-ENBIS network. The Pro-Enbis project is supported by funding under the European Commission's Fifth Framework 'Growth' Programme via the Thematic Network "Pro-ENBIS“, contract reference: G6RT-CT