Statistics for Social and Behavioral Sciences Session #17: Hypothesis Testing: The Confidence Interval Method and the T-Statistic Method (Agresti and Finlay,

Slides:



Advertisements
Similar presentations
“Students” t-test.
Advertisements

Chapter 10, part D. IV. Inferences about differences between two population proportions You will have two population proportions, p1 and p2. The true.
Chapter 6 Sampling and Sampling Distributions
Statistics for Social and Behavioral Sciences Session #16: Confidence Interval and Hypothesis Testing (Agresti and Finlay, from Chapter 5 to Chapter 6)
Statistics for Social and Behavioral Sciences Part IV: Causality Randomized Experiments, ANOVA Chapter 12, Section 12.1 Prof. Amine Ouazad.
Sampling: Final and Initial Sample Size Determination
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
1 Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Type I and Type II Errors One-Tailed Tests About a Population Mean: Large-Sample.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses Type I and Type II Errors Type I and Type II Errors.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.
Statistics for Social and Behavioral Sciences Session #11: Random Variable, Expectations (Agresti and Finlay, Chapter 4) Prof. Amine Ouazad.
Statistics for Social and Behavioral Sciences Session #9: Linear Regression and Conditional distribution Probabilities (Agresti and Finlay, Chapter 9)
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Elementary hypothesis testing Purpose of hypothesis testing Type of hypotheses Type of errors Critical regions Significant levels Hypothesis vs intervals.
Today Today: Chapter 10 Sections from Chapter 10: Recommended Questions: 10.1, 10.2, 10-8, 10-10, 10.17,
BCOR 1020 Business Statistics
Inferences About Means of Single Samples Chapter 10 Homework: 1-6.
1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 21 = Start Chapter “Confidence Interval Estimation” (CIE)
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
ESTIMATION AND HYPOTHESIS TESTING: TWO POPULATIONS
Quantitative Business Methods for Decision Making Estimation and Testing of Hypotheses.
Statistics for Social and Behavioral Sciences Part IV: Causality Association and Causality Session 22 Prof. Amine Ouazad.
Statistics for Social and Behavioral Sciences Session #15: Interval Estimation, Confidence Interval (Agresti and Finlay, Chapter 5) Prof. Amine Ouazad.
Comparing Two Groups’ Means or Proportions
Statistics for Social and Behavioral Sciences Part IV: Causality Multivariate Regression Chapter 11 Prof. Amine Ouazad.
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
Statistics for Social and Behavioral Sciences Session #18: Literary Analysis using Tests (Agresti and Finlay, from Chapter 5 to Chapter 6) Prof. Amine.
Tests of significance & hypothesis testing Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Section #4 October 30 th Old: Review the Midterm & old concepts 1.New: Case II t-Tests (Chapter 11)
Statistics for Social and Behavioral Sciences Session #14: Estimation, Confidence Interval (Agresti and Finlay, Chapter 5) Prof. Amine Ouazad.
More About Significance Tests
Statistics for Social and Behavioral Sciences
Two Sample Tests Nutan S. Mishra Department of Mathematics and Statistics University of South Alabama.
Estimates and Sample Sizes Lecture – 7.4
Statistics and Quantitative Analysis U4320
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Agresti/Franklin Statistics, 1 of 122 Chapter 8 Statistical inference: Significance Tests About Hypotheses Learn …. To use an inferential method called.
Statistics for Social and Behavioral Sciences Part IV: Causality Multivariate Regression R squared, F test, Chapter 11 Prof. Amine Ouazad.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Inference about Two Means: Independent Samples 11.3.
STT 315 Ashwini Maurya Acknowledgement: Author is indebted to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil Roy for allowing him to use/edit many.
Statistics for Social and Behavioral Sciences Part IV: Causality Inference for Slope and Correlation Section 9.5 Prof. Amine Ouazad.
Jeopardy Statistics Edition. Terms Calculator Commands Sampling Distributions Confidence Intervals Hypothesis Tests: Proportions Hypothesis Tests: Means.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
4 Hypothesis & Testing. CHAPTER OUTLINE 4-1 STATISTICAL INFERENCE 4-2 POINT ESTIMATION 4-3 HYPOTHESIS TESTING Statistical Hypotheses Testing.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
Statistics for Social and Behavioral Sciences Part IV: Causality Comparison of two groups Chapter 7 Prof. Amine Ouazad.
© Copyright McGraw-Hill 2004
Statistics for Social and Behavioral Sciences Session #19: Estimation and Hypothesis Testing, Wrap-up & p-value (Agresti and Finlay, from Chapter 5 to.
Chapter 1 Introduction to Statistics. Section 1.1 Fundamental Statistical Concepts.
Essential Statistics Chapter 191 Comparing Two Proportions.
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Comparing Two Proportions Chapter 21. In a two-sample problem, we want to compare two populations or the responses to two treatments based on two independent.
Slides by JOHN LOUCKS St. Edward’s University.
Significance Test for the Difference of Two Proportions
Inference: Conclusion with Confidence
Statistics in Applied Science and Technology
CONCEPTS OF HYPOTHESIS TESTING
Chapter 9 Hypothesis Testing.
Elementary Statistics
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
Presentation transcript:

Statistics for Social and Behavioral Sciences Session #17: Hypothesis Testing: The Confidence Interval Method and the T-Statistic Method (Agresti and Finlay, from Chapter 5 to Chapter 6) Prof. Amine Ouazad

Statistics Course Outline P ART I. I NTRODUCTION AND R ESEARCH D ESIGN P ART II. D ESCRIBING DATA P ART III. D RAWING CONCLUSIONS FROM DATA : I NFERENTIAL S TATISTICS P ART IV. : C ORRELATION AND C AUSATION : R EGRESSION A NALYSIS Week 1 Weeks 2-4 Weeks 5-9 Weeks This is where we talk about Zmapp and Ebola! Firenze or Lebanese Express’s ratings are within a MoE of each other!

Last Session Hypothesis testing is the foundation of (social) sciences. Three typical types of hypothesis: – A parameter is equal to ….. – A parameter is greater than …. – A parameter is lower than ….. Null hypothesis (to be rejected), and alternative hypothesis. We provide evidence to reject a null hypothesis. – We might not have evidence to reject the null hypothesis. For a test on the population mean m: Confidence interval method for the test of H 0 :  = v. H a :  ≠ v. – Reject the H 0 with significance level 5% if the 95% confidence interval for the sample mean m does not include v. – Reject the H 0 with significance level 10% if the 90% confidence interval for the sample mean m does not include v.

Today Hypothesis testing in Statistics: – The Confidence Interval method of testing  =v. An equivalent way of testing m = v: – The t test (also invented by Mr Student).

Outline 1.Testing hypothesis using the confidence interval method (continued) 2.Testing hypothesis using the t-test (absolutely equivalent) Next time:one-sided t test of mean and proportion Chapter 6 of A&F

Testing H 0 :  =v using confidence intervals H 0 : “The fraction of men in Abu Dhabi is 50%.” equivalently “  = 0.5”. By simple random sampling, gather N observations X i =0,1. Build a confidence interval for the sample mean m of X i. – Same methods as seen in previous sessions. If the null hypothesis is true, only 5 of the 95% confidence intervals will not include 0.5. Thus if the null hypothesis is true, there is only a 5% probability that my confidence interval will not include 0.5. ☞ Reject the null hypothesis if the confidence interval for m does not include v.

Proportion Female of Juilliard Graduates, Total and By Section: 1947 to 1995

Female Share of New Hires in Four Orchestras, 1950s to 1990s

Do Orchestras Prefer Hiring Men? Orchestras in the US are overwhelmingly male. At the Royal Festival Hall We know the rate at which women are hired in orchestras (the data is surprisingly good): Women reach the first stage of recruitment at a 17.1% rate. Women reach the second stage of recruitment at a 56.8% rate. Women reach the finals at a 8.7% rate. Overall, from the overall pool of all applicants, women are hired at a 1.7% rate.

Conducting a little experiment… What if we were auditioning musicians for hiring… behind a curtain, with a carpet, and no talking allowed?? Prof. Cecilia Rouse Princeton University Would that lead to a rate of hiring that is different from the usual rate of hiring? (1.7%) Orchestrating Impartiality: The Impact of “Blind” Auditions on Female Musicians, National Bureau of Economic Research, January 1997.

The data collected Rate of advancement Sample Size Rate of advancement for women in all orchestras (known v) Preliminaries21.6% % Semi-Finals38.5%6556.8% Finals23.5%178.7% Hired2.7%4451.7%

Building the confidence interval The confidence interval is noted: [ m – z 0.05 * SE, m + z 0.05 *SE ] Or [ m – t 0.05 * SE, m + t 0.05 *SE ] The standard error SE = s X /√N. m : sample mean (known) s X : sample standard deviation (known). t 0.05 or z 0.05 : from Table 5.1.

z or t ? We use the notation z when using the Central Limit Theorem: – Sample size is large, data was collected by simple random sampling. We use the notation t when using the t distribution: – Distribution of X is normal (applies to height, weight, but not to superstar distributions). z=t when the sample size is large (when df = ∞). – Thus t is encountered more frequently than z.

t Table

Outline 1.Testing hypothesis using the confidence interval method 2.Testing hypothesis using the t-test (absolutely equivalent) Next time:one-sided t test of mean and proportion Chapter 6 of A&F

From the confidence interval method …to the t-test Null hypothesis:  = v. We do not reject the null hypothesis H 0 with confidence level 95% if the 95% confidence interval for the sample mean m includes v. Do not reject H 0 at 95% if: m – t 0.05 * SE < v < m + t 0.05 * SE Notice that this is equivalent to: Do not reject H0 if: -t 0.05 < (m-v)/SE < t 0.05 t 0.05 is the 95% critical value for the t statistic. (m-v)/SE is the t statistic.

Graphically… Under the null hypothesis (  =v): (m-v)/SE follows a standard normal distribution if the sample size is large. (m-v)/SE follows a t distribution if (i) the sample size is small and (ii) X is normally distributed. Sampling distribution of the t statistic df = N-1 On this graph, indicate for which values of t we should reject the null hypothesis… With 95% confidence. With 90% confidence. And also with 99% confidence ?

Hypothesis testing Hypothesis: an empirical statement about a population parameter. Usually of the shape: – “The parameter is equal to a given value” – “The parameter is greater than a given value” – “The parameter is lower than a given value” Almost all scientific/sociological/economic statements can be reduced to one of these three types. – “The population proportion of voters for Cory Gardner is greater than 50%.” (second type of hypothesis) – “The impact of ZMapp on Ebola patients’ condition is zero.” (first type of hypothesis) This session Next session

Exercise 6.20: Literary Analysis The authorship of an old document is in doubt. A historian hypothesizes that the author was a journalist named Jacalyn Levine. Upon a thorough investigation of Levine’s known works, it is observed that one unusual feature of her writing was that she consistently began 6% of her sentences with the word whereas. To test the historian’s hypothesis, it is decided to count the number of sentences in the disputed document that begin with whereas. Out of the 300 sentences, none do. Let π denote the probability that any one sentence written by the unknown author of the document begins with whereas. Test H 0 : “π= 0.06” against Ha: “π is not equal 0.06.” What assumptions are needed for your conclusion to be valid? (F. Mosteller and D. L. Wallace conducted this type of investigation to determine whether Alexander Hamilton or James Madison authored 12 of the Federalist Papers. See Inference and Disputed Authorship:The Federalist, Addison-Wesley, 1964.)

Wrap up Confidence interval method for the test of H 0 :  = v. H a :  ≠ v. – Reject the H 0 with significance level 1% if the 99% confidence interval for the sample mean m does not include v. – Reject the H 0 with significance level 5% if the 95% confidence interval for the sample mean m does not include v. – Reject the H 0 with significance level 10% if the 90% confidence interval for the sample mean m does not include v. t test method for the test of H0 :  = v. H a :  ≠ v. – Build the t statistic (m-v)/SE – Reject the H0 with significance level 1% if the t statistic is outside the range [-t 0.01, t 0.01 ] – Reject the H0 with significance level 5% if the t statistic is outside the range [-t 0.05, t 0.05 ] – Reject the H0 with significance level 10% if the t statistic is outside the range [-t 0.10, t 0.10 ]

Coming up: Readings: Mid term on Tuesday, November 25. – Coverage: up to Chapter 6 inclusive. Online quiz due Tuesday at 9am. Deadlines are sharp and attendance is followed. For help: Amine Ouazad Office 1135, Social Science building Office hour: Tuesday from 5 to 6.30pm. GAF: Irene Paneda Sunday recitations. At the Academic Resource Center, Monday from 2 to 4pm.