GOSSET, William Sealy 1876-1937 How shall I deal with these small batches of brew?

Slides:



Advertisements
Similar presentations
Tests of Significance for Regression & Correlation b* will equal the population parameter of the slope rather thanbecause beta has another meaning with.
Advertisements

Sampling: Final and Initial Sample Size Determination
Chap 8-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 8 Estimation: Single Population Statistics for Business and Economics.
Statistics for Business and Economics
An “app” thought!. VC question: How much is this worth as a killer app?
Single Sample t-test Purpose: Compare a sample mean to a hypothesized population mean. Design: One group.
Estimating the Population Mean Assumptions 1.The sample is a simple random sample 2.The value of the population standard deviation (σ) is known 3.Either.
Final Jeopardy $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 $100 $200 $300 $400 $500 LosingConfidenceLosingConfidenceTesting.
Overview of Lecture Parametric Analysis is used for
BHS Methods in Behavioral Sciences I
Tuesday, October 22 Interval estimation. Independent samples t-test for the difference between two means. Matched samples t-test.
Chapter 8 Estimation: Single Population
Chapter Topics Confidence Interval Estimation for the Mean (s Known)
Chapter 11: Inference for Distributions
1 Inference About a Population Variance Sometimes we are interested in making inference about the variability of processes. Examples: –Investors use variance.
Chapter 10, sections 1 and 4 Two-sample Hypothesis Testing Test hypotheses for the difference between two independent population means ( standard deviations.
PSY 307 – Statistics for the Behavioral Sciences
5-3 Inference on the Means of Two Populations, Variances Unknown
Quiz 6 Confidence intervals z Distribution t Distribution.
Hypothesis Testing Using The One-Sample t-Test
Confidence Intervals for the Mean (σ Unknown) (Small Samples)
AM Recitation 2/10/11.
Two Sample Tests Ho Ho Ha Ha TEST FOR EQUAL VARIANCES
II.Simple Regression B. Hypothesis Testing Calculate t-ratios and confidence intervals for b 1 and b 2. Test the significance of b 1 and b 2 with: T-ratios.
T-test Mechanics. Z-score If we know the population mean and standard deviation, for any value of X we can compute a z-score Z-score tells us how far.
Education 793 Class Notes T-tests 29 October 2003.
Stats 95 t-Tests Single Sample Paired Samples Independent Samples
X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ X _ μ.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
T-TEST Statistics The t test is used to compare to groups to answer the differential research questions. Its values determines the difference by comparing.
Approximate letter grade assignments ~ D C B 85 & up A.
Tests of Hypotheses Involving Two Populations Tests for the Differences of Means Comparison of two means: and The method of comparison depends on.
Confidence Intervals Lecture 3. Confidence Intervals for the Population Mean (or percentage) For studies with large samples, “approximately 95% of the.
CHAPTER SEVEN ESTIMATION. 7.1 A Point Estimate: A point estimate of some population parameter is a single value of a statistic (parameter space). For.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
8.2 Testing the Difference Between Means (Independent Samples,  1 and  2 Unknown) Key Concepts: –Sampling Distribution of the Difference of the Sample.
Monday, October 22 Hypothesis testing using the normal Z-distribution. Student’s t distribution. Confidence intervals.
Math 4030 – 9b Comparing Two Means 1 Dependent and independent samples Comparing two means.
Confidence Intervals for a Population Mean, Standard Deviation Unknown.
_ z = X -  XX - Wow! We can use the z-distribution to test a hypothesis.
8.1 Estimating µ with large samples Large sample: n > 30 Error of estimate – the magnitude of the difference between the point estimate and the true parameter.
Statistics: Unlocking the Power of Data Lock 5 Section 6.4 Distribution of a Sample Mean.
Section 6.4 Inferences for Variances. Chi-square probability densities.
Chapter 9 Inferences Based on Two Samples: Confidence Intervals and Tests of Hypothesis.
Copyright © 2015, 2012, and 2009 Pearson Education, Inc. 1 Section 6.2 Confidence Intervals for the Mean (  Unknown)
Monday, October 21 Hypothesis testing using the normal Z-distribution. Student’s t distribution. Confidence intervals.
ESTIMATION OF THE MEAN. 2 INTRO :: ESTIMATION Definition The assignment of plausible value(s) to a population parameter based on a value of a sample statistic.
Essential Statistics Chapter 171 Two-Sample Problems.
Inference for distributions: - Comparing two means.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Introduction to the t statistic. Steps to calculate the denominator for the t-test 1. Calculate variance or SD s 2 = SS/n-1 2. Calculate the standard.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
Confidence Intervals Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016.
Chapter 10: The t Test For Two Independent Samples.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
Chapter 6 Confidence Intervals.
Math 4030 – 10a Tests for Population Mean(s)
Independent samples t-test for the difference between two means.
Wednesday, October 20 Sampling distribution of the mean.
Problem: If I have a group of 100 applicants for a college summer program whose mean SAT-Verbal is 525, is this group of applicants “above national average”?
Independent samples t-test for the difference between two means.
Monday, October 19 Hypothesis testing using the normal Z-distribution.
Chapter 6 Confidence Intervals.
XY XY XY XY XY XY ρ.
Basic Practice of Statistics - 3rd Edition Two-Sample Problems
Independent samples t-test for the difference between two means.
Elementary Statistics: Picturing The World
Presentation transcript:

GOSSET, William Sealy How shall I deal with these small batches of brew?

GOSSET, William Sealy A series of distributions of sample means drawn from a population of standardized scores (therefore the mean of mean is 0 and sd=1), in which the shape of the distribution varies systematically, depending on the size of the sample. The larger the sample, the more it matches a normal distribution; and the smaller the sample, the fatter the tails.

Jeremy Jimenez A T distribution is derived by approximating the mean of a normally distributed population, particularly useful with an unknown standard deviation and a small sample size. To visualize how it functions, imagine an inverted letter T placed under this standard probability distribution, where the Two Tails of this inverted T are nestled within the Two Tails of the standard probability distribution, thus making the Two Tails Thicker than a Typical distribution. As the sample size rises, the range of potential outlier sample means derived from the population shortens; therefore, the Two Tails of the probability distribution shorten (along with the standard deviation) and Thus Thrust into the Two Tails of the inverted T, also causing the inverted T to rise and create a greater kurTosis

ρ XY

Population Sample A Sample B Sample E Sample D Sample C _  XY r XY

The t distribution, at N-2 degrees of freedom, can be used to test the probability that the statistic r was drawn from a population with  = 0. Table C. H 0 :  XY = 0 H 1 :  XY  0 where r N r 2 t =

Monday, November 7 Independent samples t-test for the difference between two means.

Monday, November 7 Independent samples t-test for the difference between two means. signal-to-noise ratio

Monday, November 7 Independent samples t-test for the difference between two means. signal-to-noise ratiosafety in numbers

μ 1 - μ 2 X 1 -X 2 _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

H 0 :  1 -  2 = 0 H 1 :  1 -  2  0

X boys =53.75 _ X girls =51.16 _ How do we know if the difference between these means, of = 2.59, is reliably different from zero?

X boys =53.75 _ X girls =51.16 _ 95CI:   boys  CI:   girls  We could find confidence intervals around each mean...

H 0 :  1 -  2 = 0 H 1 :  1 -  2  0 But we can directly test this hypothesis...

H 0 :  1 -  2 = 0 H 1 :  1 -  2  0 To test this hypothesis, you need to know … …the sampling distribution of the difference between means.  X 1 -X 2 --

H 0 :  1 -  2 = 0 H 1 :  1 -  2  0 To test this hypothesis, you need to know … …the sampling distribution of the difference between means.  X 1 -X 2 -- …which can be used as the error term in the test statistic.

 X 1 -X 2 =    2 X 1 +  2 X 2 The sampling distribution of the difference between means. This reflects the fact that two independent variances contribute to the variance in the difference between the means. ----

 X 1 -X 2 =    2 X 1 +  2 X 2 The sampling distribution of the difference between means. This reflects the fact that two independent variances contribute to the variance in the difference between the means Your intuition should tell you that the variance in the differences between two means is larger than the variance in either of the means separately.

The sampling distribution of the difference between means, at n = , would be: z = (X 1 - X 2 )  X 1 -X

The sampling distribution of the difference between means. Since we don’t know , we must estimate it with the sample statistic s.  X 1 -X 2 =   2 1   2 2 n 1 n

The sampling distribution of the difference between means. Rather than using s 2 1 to estimate  2 1 and s 2 2 to estimate  2 2, we pool the two sample estimates to create a more stable estimate of  2 1 and  2 2 by assuming that the variances in the two samples are equal, that is,  2 1 =  2 2.  X 1 -X 2 =  s 2 1  s 2 2 n 1 n

s X1-X2 = s p 2 s p 2 N 1 N 2 +

s X1-X2 = s p 2 s p 2 N 1 N 2 +

s X1-X2 = s p 2 s p 2 N 1 N 2 + s p 2 = SS w SS 1 + SS 2 N-2 =

Because we are making estimates that vary by degrees of freedom, we use the t-distribution to test the hypothesis. t = (X 1 - X 2 ) - (  1 -  2 )  s X 1 -X 2 …at (n 1 - 1) + (n 2 - 1) degrees of freedom (or N-2)

Assumptions X 1 and X 2 are normally distributed. Homogeneity of variance. Samples are randomly drawn from their respective populations. Samples are independent.