Selected Statistics Examples from Lectures

Slides:



Advertisements
Similar presentations
Class 6: Hypothesis testing and confidence intervals
Advertisements

Hypothesis Testing. To define a statistical Test we 1.Choose a statistic (called the test statistic) 2.Divide the range of possible values for the test.
Lecture 5 1 Continuous distributions Five important continuous distributions: 1.uniform distribution (contiuous) 2.Normal distribution  2 –distribution[“ki-square”]
Presentation on Probability Distribution * Binomial * Chi-square
Ch8 Inference concerning variance
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Statistics review of basic probability and statistics.
1 Midterm Review. 2 Econ 240A  Descriptive Statistics  Probability  Inference  Differences between populations  Regression.
Chapter 5 Discrete Random Variables and Probability Distributions
Sampling Distributions (§ )
Chapter 11 Problems of Estimation
Simulation Modeling and Analysis
LARGE SAMPLE TESTS ON PROPORTIONS
BHS Methods in Behavioral Sciences I
Horng-Chyi HorngStatistics II127 Summary Table of Influence Procedures for a Single Sample (I) &4-8 (&8-6)
Chap 9-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 9 Estimation: Additional Topics Statistics for Business and Economics.
CHAPTER 6 Statistical Analysis of Experimental Data
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 14 Goodness-of-Fit Tests and Categorical Data Analysis.
Lecture 7 1 Statistics Statistics: 1. Model 2. Estimation 3. Hypothesis test.
Quantitative Business Methods for Decision Making Estimation and Testing of Hypotheses.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics, A First Course.
1/49 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 9 Estimation: Additional Topics.
Confidence Interval Estimation
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 8-6 Testing a Claim About a Standard Deviation or Variance.
STATISTICAL INFERENCE PART VII
Today’s lesson Confidence intervals for the expected value of a random variable. Determining the sample size needed to have a specified probability of.
L Berkley Davis Copyright 2009 MER301: Engineering Reliability Lecture 5 1 MER301: Engineering Reliability LECTURE 5: Chapter 3: Probability Plotting,The.
Matlab: Statistics Probability distributions Hypothesis tests
Statistics Galton’s Heights of Hypothetical Men (1869)
Modeling and Simulation CS 313
5-1 Business Statistics: A Decision-Making Approach 8 th Edition Chapter 5 Discrete Probability Distributions.
1. Population Versus Sample 2. Statistic Versus Parameter 3. Mean (Average) of a Sample 4. Mean (Average) of a Population 5. Expected Value 6. Expected.
Statistical Review We will be working with two types of probability distributions: Discrete distributions –If the random variable of interest can take.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Mid-Term Review Final Review Statistical for Business (1)(2)
Probability Theory 1.Basic concepts 2.Probability 3.Permutations 4.Combinations.
Multinomial Distribution
Tests for Random Numbers Dr. Akram Ibrahim Aly Lecture (9)
OPIM 5103-Lecture #3 Jose M. Cruz Assistant Professor.
Ch9. Inferences Concerning Proportions. Outline Estimation of Proportions Hypothesis concerning one Proportion Hypothesis concerning several proportions.
 A probability function is a function which assigns probabilities to the values of a random variable.  Individual probability values may be denoted by.
Testing Hypothesis That Data Fit a Given Probability Distribution Problem: We have a sample of size n. Determine if the data fits a probability distribution.
Selecting Input Probability Distribution. Simulation Machine Simulation can be considered as an Engine with input and output as follows: Simulation Engine.
Probability Distributions
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
GG 313 Lecture 9 Nonparametric Tests 9/22/05. If we cannot assume that our data are at least approximately normally distributed - because there are a.
The final exam solutions. Part I, #1, Central limit theorem Let X1,X2, …, Xn be a sequence of i.i.d. random variables each having mean μ and variance.
Probability –classical approach P(event E) = N e /N, where N = total number of possible outcomes, N e = total number of outcomes in event E assumes equally.
Sampling and estimation Petter Mostad
Applied Quantitative Analysis and Practices LECTURE#14 By Dr. Osman Sadiq Paracha.
1 ES Chapter 18 & 20: Inferences Involving One Population Student’s t, df = 5 Student’s t, df = 15 Student’s t, df = 25.
Section 6.4 Inferences for Variances. Chi-square probability densities.
Elementary Probability.  Definition  Three Types of Probability  Set operations and Venn Diagrams  Mutually Exclusive, Independent and Dependent Events.
Environmental Modeling Basic Testing Methods - Statistics II.
Unit 4 Review. Starter Write the characteristics of the binomial setting. What is the difference between the binomial setting and the geometric setting?
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Business Statistics,
Objectives (BPS chapter 12) General rules of probability 1. Independence : Two events A and B are independent if the probability that one event occurs.
Confidence Intervals. Point Estimate u A specific numerical value estimate of a parameter. u The best point estimate for the population mean is the sample.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
CHI SQUARE DISTRIBUTION. The Chi-Square (  2 ) Distribution The chi-square distribution is the probability distribution of the sum of several independent,
Selected Statistics Examples from Lectures. Matlab: Histograms >> y = [
Probability Theory Basic concepts Probability Permutations
Chapter Five The Binomial Probability Distribution and Related Topics
MECH 373 Instrumentation and Measurements
Chapter 4. Inference about Process Quality
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Chapter 9: Inferences Involving One Population
Lecture 13 Sections 5.4 – 5.6 Objectives:
Goodness-of-Fit Tests
When You See (This), You Think (That)
Presentation transcript:

Selected Statistics Examples from Lectures

Matlab: Histograms >> y = [1 3 5 8 2 4 6 7 8 3 2 9 4 3 6 7 4 1 5 3 5 8 9 6 2 4 6 1 5 6 9 8 7 5 3 4 5 2 9 6 5 9 4 1 6 7 8 5 4 2 9 6 7 9 2 5 3 1 9 6 8 4 3 6 7 9 1 3 4 7 5 2 9 8 5 7 4 5 4 3 6 7 9 3 1 6 9 5 6 7 3 2 1 5 7 8 5 3 1 9 7 5 3 4 7 9 1]’; >> mean(y)  ans = 5.1589 >> var(y)  ans = 6.1726 >> std(y)  ans = 2.4845 >> hist(y,9)  histogram plot with 9 bins >> n = hist(y,9)  store result in vector n >> x = [2 4 6 8]’ >> n = hist(y,x)  create histogram with bin centers specified by vector x

Probability Examples Probability that at least one coin will turn heads up from five tossed coins Number of outcomes: 25 = 32 Probability of each outcome: 1/32 Probability of no heads: P(AC) = 1/32 Probability at least one head: P(A) = 1-P(AC) = 31/32 Probability of getting an odd number or a number less than 4 from a single dice toss Probability of odd number: P(A) = 3/6 Probability of number less than 4: P(B) = 3/6 Probability of both: Probability of either: complement set

Permutation Examples Process for manufacturing polymer thin films Compute the probability that the first 6 films will be too thin and the next 4 films will be too thick if the thickness is a random variable with the mean equal to the desired thickness n1 = 6, n2 = 4 Probability Encryption cipher Letters arranged in five-letter words: n = 26, k = 5 Total number of different words: nk = 265 = 11,881,376 Total number of different words containing each letter no more than once:

Combination Examples Effect of repetitions Three letters a, b, c taken two at a time (n = 3, k = 2) Combinations without repetition Combinations with repetitions 500 thin films taken 5 at a time Combinations without repetitions

Matlab: Permutations and Combinations >> perms([2 4 6])  all possible permutations of 2, 4, 6 6 4 2 6 2 4 4 6 2 4 2 6 2 4 6 2 6 4 >> randperm(6)  returns one possible permutation of 1-6 5 1 2 3 4 6 >> nchoosek(5,4)  number of combinations of 5 things taken 4 at a time without repetitions ans = 5 >> nchoosek(2:2:10,4)  all possible combinations of 2, 4, 6, 8, 10 taken 4 at a time without repetitions 2 4 6 8 2 4 6 10 2 4 8 10 2 6 8 10 4 6 8 10

Continuous Distribution Example Probability density function Cumulative distribution function Probability of events

Matlab: Normal Distribution Normal distribution: normpdf(x,mu,sigma) normpdf(8,10,2)  ans = 0.1210 normpdf(9,10,2)  ans = 0.1760 normpdf(8,10,4)  ans = 0.0880 Normal cumulative distribution: normcdf(x,mu,sigma) normcdf(8,10,2)  ans = 0.1587 normcdf(12,10,2)  ans = 0.8413 Inverse normal cumulative distribution: norminv(p,mu,sigma) norminv([0.025 0.975],10,2)  ans = 6.0801 13.9199 Random number from normal distribution: normrnd(mu,sigma,v) normrnd(10,2,[1 5])  ans = 9.1349 6.6688 10.2507 10.5754 7.7071

Matlab: Normal Distribution Example The temperature of a bioreactor follows a normal distribution with an average temperature of 30oC and a standard deviation of 1oC. What percentage of the reactor operating time will the temperature be within +/-0.5oC of the average? Calculate probability at 29.5oC and 30.5oC, then calculate the difference: p=normcdf([29.5 30.5],30,1) p = [0.3085 0.6915] p(2) – p(1) 0.3829 The reactor temperature will be within +/- 0.5oC of the average ~38% of the operating time

Discrete Distribution Example Matlab function: unicdf(x,n), >> x = (0:6); >> y = unidcdf(x,6); >> stairs(x,y)

Poisson Distribution Example Probability of a defective thin polymer film p = 0.01 What is the probability of more than 2 defects in a lot of 100 samples? Binomial distribution: m = np = (100)(0.01) = 1 Since p <<1, can use Poisson distribution to approximate the solution.

Matlab: Maximum Likelihood In a chemical vapor deposition process to make a solar cell, there is an 87% probability that a sufficient amount of silicon will be deposited in each cell. Estimate the maximum likelihood of success in 5000 cells. s = binornd(n,p) – randomly generate the number of positive outcomes given n trials each with probability p of success s = binornd(5000,0.87) s=4338 phat = binofit(s,n) – returns the maximum likelihood estimate given s successful outcomes in n trials phat = binofit(s,5000) phat = 0.8676

Mean Confidence Interval Example Measurements of polymer molecular weight (scaled by 10-5) Confidence interval

Variance Confidence Interval Example Measurements of polymer molecular weight (scaled by 10-5) Confidence interval

Matlab: Confidence Intervals >> [muhat,sigmahat,muci,sigmaci] = normfit(data,alpha) data: vector or matrix of data alpha: confidence level = 1-alpha muhat: estimated mean sigmahat: estimated standard deviation muci: confidence interval on the mean sigmaci: confidence interval on the standard deviation >> [muhat,sigmahat,muci,sigmaci] = normfit([1.25 1.36 1.22 1.19 1.33 1.12 1.27 1.27 1.31 1.26],0.05) muhat = 1.2580 sigmahat = 0.0697 muci = 1.2081 1.3079 sigmaci = 0.0480 0.1273

Mean Hypothesis Test Example Measurements of polymer molecular weight Hypothesis: m0 = 1.3 instead of the alternative m1 < m0 Significance level: a = 0.10 Degrees of freedom: m = 9 Critical value Sample t Reject hypothesis

Polymerization Reactor Control Monomers Catalysts Solvent Hydrogen On-line measurements Lab Polymer Monomer Content Viscosity Lab measurements Viscosity and monomer content of polymer every four hours Three monomer content measurements: 23%, 18%, 22% Mean content control Expected mean and known variance: m0 = 20%, s2 = 1 Control limits: Sample mean: 21%  process is in control

Goodness of Fit Example Maximum likelihood estimates

Goodness of Fit Example cont.

Matlab: Goodness of Fit Example Find maximum likelihood estimates for µ and σ of a normal distribution with function mle >> data=[320 … 360]; >> phat = mle(data) phat = 364.7 26.7 Analyze dataset for goodness of fit using function chi2gof – usage: [h,p,stats]=chi2gof(x,’edges’,edges) for data in vector x divided into intervals with the specified edges If h = 1, reject hypothesis at 5% significance If h = 0, accept hypothesis at 5% significance p: probability of observing given statistic stats: includes chi-square statistic and degrees of freedom >> [h,p,stats]=chi2gof(data,’edges’,[-inf,325:10:405,inf]); >> h = 0 >> p = 0.8990 >> chi2stat = 2.8440 >> df = 7