Proportions for the Binomial Distribution ©2005 Dr. B. C. Paul.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

The T Distribution ©Dr. B. C. Paul Wasn’t the Herby Assembly Line Problem Fun But there is one little problem But there is one little problem We.
Introduction to Regression ©2005 Dr. B. C. Paul. Things Favoring ANOVA Analysis ANOVA tells you whether a factor is controlling a result It requires that.
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Warm Up The box-and-whisker plot shows the test scores in Mrs. Howard’s first period math class. 1. Find the minimum, maximum, median, and quartile values.
8-4 Significance of Experimental Results Warm Up Lesson Presentation
Objectives Use simulations and hypothesis testing to compare treatments from a randomized experiment.
12.1 Inference for A Population Proportion.  Calculate and analyze a one proportion z-test in order to generalize about an unknown population proportion.
Chapter 10: Hypothesis Testing
Sampling Distributions for Proportions Allow us to work with the proportion of successes rather than the actual number of successes in binomial experiments.
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
8-3 Testing a Claim about a Proportion
SW388R6 Data Analysis and Computers I Slide 1 One-sample T-test of a Population Mean Confidence Intervals for a Population Mean.
7.1 Lecture 10/29.
8/15/2015Slide 1 The only legitimate mathematical operation that we can use with a variable that we treat as categorical is to count the number of cases.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
8/23/2015Slide 1 The introductory statement in the question indicates: The data set to use: GSS2000R.SAV The task to accomplish: a one-sample test of a.
Confidence Intervals and Hypothesis Testing - II
The smokers’ proportion in H.K. is 40%. How to testify this claim ?
1 Chapter Seven Introduction to Sampling Distributions Section 3 Sampling Distributions for Proportions.
Unequal Variance and ANOVA ©2005 Dr. B. C. Paul. ANOVA Assumptions ANOVA assumes the populations sampled in each class are normally distributed Also assumes.
1 Psych 5500/6500 Data Transformations Fall, 2008.
STATISTICAL INFERENCE PART VII
5.5 Distributions for Counts  Binomial Distributions for Sample Counts  Finding Binomial Probabilities  Binomial Mean and Standard Deviation  Binomial.
Binomial Distributions Calculating the Probability of Success.
Two Way ANOVA ©2005 Dr. B. C. Paul. ANOVA Application ANOVA allows us to review data and determine whether a particular effect is changing our results.
Hypothesis Tests with Proportions Chapter 10. Write down the first number that you think of for the following... Pick a two-digit number between 10 and.
CHAPTER 18: Inference about a Population Mean
MS 305 Recitation 11 Output Analysis I
NOTES The Normal Distribution. In earlier courses, you have explored data in the following ways: By plotting data (histogram, stemplot, bar graph, etc.)
Psy B07 Chapter 4Slide 1 SAMPLING DISTRIBUTIONS AND HYPOTHESIS TESTING.
90288 – Select a Sample and Make Inferences from Data The Mayor’s Claim.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Chi-Square Goodness-of-Fit Test PowerPoint Prepared.
Confidence intervals and hypothesis testing Petter Mostad
Section 10.1 Confidence Intervals
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
Multiple Regression ©2005 Dr. B. C. Paul. Problems with Regression So Far We have only been able to consider one factor as controlling at a time Everything.
Non-Parametric Statistics ©2005 Dr. B. C. Paul. The Normal Problem Techniques we have used so far relied mostly on underlying distribution to be normal.
Essential Statistics Chapter 91 Introducing Probability.
STT 315 Ashwini maurya This lecture is based on Chapter 5.4 Acknowledgement: Author is thankful to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
Overview Of Probability Distribution. Standard Distributions  Learning Objectives  Be familiar with the standard distributions (normal, binomial, and.
Welcome to MM570 Psychological Statistics
Dan Piett STAT West Virginia University Lecture 12.
P. 403 – 404 #71 – 73, 75 – 78, 80, 82, 84 #72B: Binary? Yes – Success is a person is left-handed. I: Independent? Yes, since students are selected randomly,
12.1 Inference for A Population Proportion.  Calculate and analyze a one proportion z-test in order to generalize about an unknown population proportion.
Essential Questions How do we use simulations and hypothesis testing to compare treatments from a randomized experiment?
Math 3680 Lecture #13 Hypothesis Testing: The z Test.
Quadratic Regression ©2005 Dr. B. C. Paul. Fitting Second Order Effects Can also use least square error formulation to fit an equation of the form Math.
Simulation Chapter 16 of Quantitative Methods for Business, by Anderson, Sweeney and Williams Read sections 16.1, 16.2, 16.3, 16.4, and Appendix 16.1.
Logistic regression. Recall the simple linear regression model: y =  0 +  1 x +  where we are trying to predict a continuous dependent variable y from.
Math 3680 Lecture #15 Confidence Intervals. Review: Suppose that E(X) =  and SD(X) = . Recall the following two facts about the average of n observations.
Simulation. Simulation is a way to model random events, such that simulated outcomes closely match real-world outcomes. By observing simulated outcomes,
Problems with Variance ©2005 Dr. B. C. Paul. Determining What To Do We have looked at techniques that depend on normally distributed data with variance.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
The T-Test Are our results reliable enough to support a conclusion?
Lesson Runs Test for Randomness. Objectives Perform a runs test for randomness Runs tests are used to test whether it is reasonable to conclude.
Welcome to MM570 Psychological Statistics Unit 5 Introduction to Hypothesis Testing Dr. Ami M. Gates.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 7 l Hypothesis Tests 7.1 Developing Null and Alternative Hypotheses 7.2 Type I & Type.
Hypothesis Tests l Chapter 7 l 7.1 Developing Null and Alternative
Social Science Research Design and Statistics, 2/e Alfred P
CHAPTER 9 Testing a Claim
Chapter 7 (Probability)
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 18: Inference about a Population Mean
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Presentation transcript:

Proportions for the Binomial Distribution ©2005 Dr. B. C. Paul

We Did an Example in which we somehow knew all the values for p How do we really know the proportion of successes and failures Done with statistical tests like all the other values

Example We saw the gassifier system was the IGCC Achilles Heal last time around Coolstick Candles has a metal candle they believe can replace ceramic candles and increase available up time  Coolstick claims they can get 96% availability  You go to actual operations using the Coolstick Candles and look at availability statistics Taking random checks of availability you enter 1 if the candles were available and 0 if not.

First Enter Data in SPSS

Next Test the Proportion Highlight and Click on Analyze to Pull down the menu Go to Non Parametric Tests and Highlight to bring out the pop-out Menu Highlight and click on Binomial

Set Up the Test Select Candleswork for our variable We will need to change the test Proportion since the program wants To compare every proportion to 0.5

Set the Proportion and Look at Options I reset the proportion to 0.95 (the company’s claimed Availability) I clicked on options to bring up The option menu I will check-off descriptive statistics And hit continue.

After Clicking OK I get my output Our 60 observations Showed 90% Availability not 95%

Looking at Our Test Table The test tells us that our chances of pulling a sample of 60 with a proportion of 90% when the actual value of p is 95% is just under 8% (Since 5% is standard in statistics we probably will not reject that 0.95 is possible)

Paying Attention to the Footnotes The program does a one tailed test on the odds of the observed value being in Which ever direction chosen from the actual value. It also mentions a Z Approximation.

The Z Approximation If you have a p value of 0.5 and a decent size sample (above 50 or more)  And you plot the number of cases with 0 positives, 1 positive etc.  The distribution comes out looking like discrete value points under a normal distribution curve This suggests that the normal distribution can be used for getting odds on binomial events  Program uses that fact.

Problems with the Z approximation It works well when p=0.5  Which is the reason SPSS tried to default to 0.5 Odds are actually skew compared to normal if p is different from 0.5  P=0.95 is an extreme case  We would need probably over 1000 samples to get a good approximation with the normal distribution SPSS has exact options for people who know their Z approximations are at risk of being screwed up

Using An Exact Option I clicked on exact to bring up the next menu I can approach it two ways Monte Carlo Exact The default is asymptotic (ie meaning as Your data set gets large enough you will tend To converge toward a normal distribution)

Explaining the Options Monte Carlo means making a large number of random trials and then counting results Exact means exhaustively computing every possibility  Better have a fast machine with a lot of memory

For a Sample of 60 I could get exhaustive calculations fairly easily Note that in this case the Z approximation gave us amazingly good results We got just under 8% as an exact value

Z Approximation Allows one to Build Confidence Intervals Have not found an option to do this in SPSS Doing it manually We already know for our Data set r= 54 (running) n= 60 (tries) p’=0.9

Setting Up a 95% Confidence Interval Plugging and Chugging – p is between and

Interesting Conclusions 0.9 seems suspiciously below 0.95 but I cannot rule out that it might be 0.95 Flip side is that regular ceramic candles do 0.87 availability which is also in the confidence interval  Thus I not only cannot say these candles are better than ordinary ceramics – I cannot say they are not worse

May be times I will want to compare two binomial data sets Example  Gasifiers tend to eat refractory (causes down time)  Suppose I try a new refractory and then compare availability before and after the change I pick the same 24 days for months before and after the change and compare.

Enter my Before and After Availability Data in SPSS

Set Up to Analyze My Data Pull down the Analyze Menu Select Nonparametric Highlight two related samples

Set Up My Test Highlight and Click on Before Note it appeared in the variable 1 box

Continue With My Set-Up Highlight the After Note that both selections are now highlighted And variable two is listed as after

More of the Set-Up I clicked the arrow to move the Group over into the test variables list I reset my test to a McNemar test (which is the type used for this kind Of problem)

Clicking on Ok We Get Results We have two days of the month That had run the previous month That did not run this month And three days that ran this month That had not run the previous Month. The significance is nothing (We cannot say the refractory change Has helped based on this data)

Basic Portfolio of tests on proportions SPSS can also test proportions on two sample sets to see if they are the same These are the methods of measuring proportions with binomial data