Statistics for Social and Behavioral Sciences Session #19: Estimation and Hypothesis Testing, Wrap-up & p-value (Agresti and Finlay, from Chapter 5 to.

Slides:



Advertisements
Similar presentations
Inference in the Simple Regression Model
Advertisements

Statistics for Social and Behavioral Sciences Session #16: Confidence Interval and Hypothesis Testing (Agresti and Finlay, from Chapter 5 to Chapter 6)
Topics Today: Case I: t-test single mean: Does a particular sample belong to a hypothesized population? Thursday: Case II: t-test independent means: Are.
Copyright © 2011 Pearson Education, Inc. Statistical Tests Chapter 16.
Statistics for Social and Behavioral Sciences Part IV: Causality Randomized Experiments, ANOVA Chapter 12, Section 12.1 Prof. Amine Ouazad.
Inference Sampling distributions Hypothesis testing.
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Statistics for Social and Behavioral Sciences Session #11: Random Variable, Expectations (Agresti and Finlay, Chapter 4) Prof. Amine Ouazad.
1 Introduction to Biostatistics (BIO/EPI 540) Lecture 11: Hypothesis Testing Acknowledgement: Thanks to Professor Pagano (Harvard School of Public Health)
Inferences About Means of Single Samples Chapter 10 Homework: 1-6.
Lecture 5 Outline: Thu, Sept 18 Announcement: No office hours on Tuesday, Sept. 23rd after class. Extra office hour: Tuesday, Sept. 23rd from 12-1 p.m.
Lecture Inference for a population mean when the stdev is unknown; one more example 12.3 Testing a population variance 12.4 Testing a population.
T-test.
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
Inference about a Mean Part II
Statistics 101 Class 9. Overview Last class Last class Our FAVORATE 3 distributions Our FAVORATE 3 distributions The one sample Z-test The one sample.
Slide Copyright © 2009 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Intro Stats Third Edition.
Chapter 9 Hypothesis Testing.
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
Inference about Population Parameters: Hypothesis Testing
Statistics for Social and Behavioral Sciences Part IV: Causality Association and Causality Session 22 Prof. Amine Ouazad.
Statistics for Social and Behavioral Sciences Session #15: Interval Estimation, Confidence Interval (Agresti and Finlay, Chapter 5) Prof. Amine Ouazad.
Statistics for Social and Behavioral Sciences Session #17: Hypothesis Testing: The Confidence Interval Method and the T-Statistic Method (Agresti and Finlay,
Statistics for Social and Behavioral Sciences Part IV: Causality Multivariate Regression Chapter 11 Prof. Amine Ouazad.
AM Recitation 2/10/11.
Statistics for Social and Behavioral Sciences Session #18: Literary Analysis using Tests (Agresti and Finlay, from Chapter 5 to Chapter 6) Prof. Amine.
Statistics for Social and Behavioral Sciences Session #14: Estimation, Confidence Interval (Agresti and Finlay, Chapter 5) Prof. Amine Ouazad.
Education 793 Class Notes T-tests 29 October 2003.
1 Level of Significance α is a predetermined value by convention usually 0.05 α = 0.05 corresponds to the 95% confidence level We are accepting the risk.
More About Significance Tests
Lecture 3: Review Review of Point and Interval Estimators
Statistics for Social and Behavioral Sciences
Estimates and Sample Sizes Lecture – 7.4
Statistics and Quantitative Analysis U4320
Inferential Statistics 2 Maarten Buis January 11, 2006.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Inference We want to know how often students in a medium-size college go to the mall in a given year. We interview an SRS of n = 10. If we interviewed.
Statistics for Social and Behavioral Sciences Part IV: Causality Multivariate Regression R squared, F test, Chapter 11 Prof. Amine Ouazad.
Statistics for Social and Behavioral Sciences Part IV: Causality Inference for Slope and Correlation Section 9.5 Prof. Amine Ouazad.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 16 Statistical Tests.
Confidence intervals and hypothesis testing Petter Mostad
Large sample CI for μ Small sample CI for μ Large sample CI for p
Objectives (BPS chapter 19) Comparing two population means  Two-sample t procedures  Examples of two-sample t procedures  Using technology  Robustness.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
Statistics for Social and Behavioral Sciences Part IV: Causality Comparison of two groups Chapter 7 Prof. Amine Ouazad.
Chapter 8 Parameter Estimates and Hypothesis Testing.
1 CHAPTER 4 CHAPTER 4 WHAT IS A CONFIDENCE INTERVAL? WHAT IS A CONFIDENCE INTERVAL? confidence interval A confidence interval estimates a population parameter.
Testing Differences between Means, continued Statistics for Political Science Levin and Fox Chapter Seven.
Chapter 9: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Type I and II Errors Testing the difference between two means.
R. G. Bias | School of Information | SZB 562BB | Phone: | i 1 INF397C Introduction to Research in Information Studies.
Inferential Statistics 4 Maarten Buis 18/01/2006.
Monday, October 22 Hypothesis testing using the normal Z-distribution. Student’s t distribution. Confidence intervals.
- We have samples for each of two conditions. We provide an answer for “Are the two sample means significantly different from each other, or could both.
Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
366_8. Estimation: Chapter 8 Suppose we observe something in a random sample how confident are we in saying our observation is an accurate reflection.
Monday, October 21 Hypothesis testing using the normal Z-distribution. Student’s t distribution. Confidence intervals.
Chapter 8 Single Sample Tests Part II: Introduction to Hypothesis Testing Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social &
Inference for distributions: - Comparing two means.
If we fail to reject the null when the null is false what type of error was made? Type II.
Hypothesis Testing and Statistical Significance
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
1 Basics of Inferential Statistics Mark A. Weaver, PhD Family Health International Office of AIDS Research, NIH ICSSC, FHI Lucknow, India, March 2010.
Final project questions Review sessions tomorrow: 1:00-2:00 pm, 3:30-5:00 pm, 6:30-7:30 pm… all in SSC 107 You can start the exam as early as 8:30 am,
Tests of Significance The reasoning of significance tests
Practice For an SAT test  = 500  = 100
Statistical inference
Hypothesis Testing.
Estimates and Sample Sizes Lecture – 7.4
Presentation transcript:

Statistics for Social and Behavioral Sciences Session #19: Estimation and Hypothesis Testing, Wrap-up & p-value (Agresti and Finlay, from Chapter 5 to Chapter 6) Prof. Amine Ouazad

Outline 1.True or False? 1.The p-value 2.One sided t tests 3.Type II error Next time:Mid term covering Chapters 1—6 and 9 of A&F

True or False ?? For the test of a null hypothesis, the confidence interval method can only be applied when sample size N is large. For the test of a null hypothesis, the t test method can only be applied when the sample size is small. The sampling distribution of a t statistic is left-skewed, or right skewed, depending on the number of degrees of freedom. When testing a null hypothesis with 95% confidence level, the probability of a Type I error (i.e. when the null hypothesis is true) is 5%.

Outline 1.True or False? 1.The p-value 2.One sided t tests 3.Type II error Next time:Mid term covering Chapters 1—6 and 9 of A&F

The p-value is the probability (in other potential samples, unobserved) that the absolute value of the t statistic is greater than the observed t statistic in our sample. A low p value means….. – that larger absolute values of the t test are unlikely. In practice: Reject the null hypothesis H0 at 95% if the p- value is lower than Similarly for 90% and 99%! (0.10 and 0.01 resp.)

Back to Amartya Sen: Are there really fewer women per man in India? “More than 100 million women are missing” Dataset collected by Simple Random Sampling. Population: Indian residents. Sample: 2,878,380 respondents. Little response bias. Non response bias is possible. Thomas W. Lamont University Professor, and Professor of Economics and Philosophy, at Harvard University

Back to Amartya Sen: Are there really fewer women per man in India? Obtaining Table 5.1’s t scores values in Stata: type `display invttail(df,0.025)’ for 95%. Replace df by the degrees of freedom (N-1)

Either with the confidence interval method, Or with the t test method, We reject the null hypothesis: “H0: the fraction of women in India is 50%” At 99% (and thus at 95%, 90%). Draw the p-value on a graph of the distribution. Back to Amartya Sen: Are there really fewer women per men in India?

Outline 1.True or False? 1.The p-value 2.One sided t tests 3.Type II error Next time:Mid term covering Chapters 1—6 and 9 of A&F

Back to Cory Gardner With 1850 respondents, the polling company PPP found that 48% of respondents would vote for Gardner. Pollsters think that Gardner wins if the vote share is > 46%. Can you test the null hypothesis that the true vote share is = 46%, with H a : vote share > 46% ?

One sided test of H 0 :  =v, H a :  >=v When testing  =v vs  >=v, use the t statistic method. – avoid the confidence interval method. Assume that  =v and build t = (m-v)/SE. ☞ Reject the null hypothesis at 95% if the t statistic is greater than +t ☞ Do not reject the null if t < 0. Similarly for 90% and 99% (t 0.10 and t 0.01 ).

Can we reject the null hypothesis at 95% with a one sided test? Can we reject the null hypothesis at 95% with a two sided test? ☞ Beware of the dangers of a one-sided test !! Back to Cory Gardner

Why it is objectionable to use one-sided tests…. Assuming that the parameter is higher than a certain value…. – Lowers the threshold t necessary to reject the null hypothesis. – For a one sided test, we use t 0.05 for a test at 95% – For a two sided test, we use t for a test at 95%. Ask yourself: Can I assume that the  will be necessarily higher than v?

But here, not such a big issue: Are there really fewer women per man in India? Obtaining one sided t scores values in Stata: type `display invttail(df,0.05)’ for 95%. Replace df by the degrees of freedom (N-1)

Outline 1.True or False? 1.The p-value 2.One sided t tests 3.Type II error Next time:Mid term covering Chapters 1—6 and 9 of A&F

Type II error Type I error is well-known. But what is the level of type II error?

Wrap up Confidence interval method for the test of H 0 :  = v. H a :  ≠ v. CI = [ m – t * SE ; m + t * SE ] – Reject the H 0 with significance level 1% if the 99% confidence interval for the sample mean m does not include v. – Reject the H 0 with significance level 5% if the 95% confidence interval for the sample mean m does not include v. – Reject the H 0 with significance level 10% if the 90% confidence interval for the sample mean m does not include v. t test method for the test of H0 :  = v. H a :  ≠ v. – Build the t statistic (m-v)/SE – Reject the H0 with significance level 1% if the t statistic is outside the range [-t 0.005, t ] – Reject the H0 with significance level 5% if the t statistic is outside the range [-t 0.025, t ] – Reject the H0 with significance level 10% if the t statistic is outside the range [-t 0.05, t 0.05 ] I will not ask for the one sided t-test, but good to know for your future life.

Coming up: Readings: Mid term on Tuesday, November 25. – Coverage: up to Chapter 6 inclusive. No online quiz this week. Make sure you come to sessions and recitations. For help: Amine Ouazad Office 1135, Social Science building Office hour: Tuesday from 5 to 6.30pm. GAF: Irene Paneda Sunday recitations. At the Academic Resource Center, Monday from 2 to 4pm.