Inferential Statistics 2 Maarten Buis January 11, 2006.

Slides:



Advertisements
Similar presentations
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Advertisements

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Topics Today: Case I: t-test single mean: Does a particular sample belong to a hypothesized population? Thursday: Case II: t-test independent means: Are.
Hypothesis Testing Steps in Hypothesis Testing:
PTP 560 Research Methods Week 9 Thomas Ruediger, PT.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Hypothesis: It is an assumption of population parameter ( mean, proportion, variance) There are two types of hypothesis : 1) Simple hypothesis :A statistical.
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
PSY 307 – Statistics for the Behavioral Sciences
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
T-Tests Lecture: Nov. 6, 2002.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
IENG 486 Statistical Quality & Process Control
PSY 307 – Statistics for the Behavioral Sciences
Today Concepts underlying inferential statistics
The t Tests Independent Samples.
Chapter 9: Introduction to the t statistic
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Statistical Inference Dr. Mona Hassan Ahmed Prof. of Biostatistics HIPH, Alexandria University.
AM Recitation 2/10/11.
Overview of Statistical Hypothesis Testing: The z-Test
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Hypothesis Testing II The Two-Sample Case.
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Estimation and Hypothesis Testing Now the real fun begins.
Chapter 8 Introduction to Hypothesis Testing
1 GE5 Lecture 6 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson.
Confidence Intervals and Hypothesis Testing
Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.
The Probability of a Type II Error and the Power of the Test
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Statistics and Quantitative Analysis U4320
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
Jeopardy Hypothesis Testing t-test Basics t for Indep. Samples Related Samples t— Didn’t cover— Skip for now Ancient History $100 $200$200 $300 $500 $400.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Inferential Statistics 4 Maarten Buis 18/01/2006.
Monday, October 22 Hypothesis testing using the normal Z-distribution. Student’s t distribution. Confidence intervals.
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
INTRODUCTION TO HYPOTHESIS TESTING From R. B. McCall, Fundamental Statistics for Behavioral Sciences, 5th edition, Harcourt Brace Jovanovich Publishers,
_ z = X -  XX - Wow! We can use the z-distribution to test a hypothesis.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Monday, October 21 Hypothesis testing using the normal Z-distribution. Student’s t distribution. Confidence intervals.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall 2015 Room 150 Harvill.
Lecturer’s desk Physics- atmospheric Sciences (PAS) - Room 201 s c r e e n Row A Row B Row C Row D Row E Row F Row G Row H Row A
Chapter Eleven Performing the One-Sample t-Test and Testing Correlation.
Chapter 13 Understanding research results: statistical inference.
Inferential statistics by example Maarten Buis Monday 2 January 2005.
Hypothesis Testing and Statistical Significance
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Inferential Statistics Psych 231: Research Methods in Psychology.
 What is Hypothesis Testing?  Testing for the population mean  One-tailed testing  Two-tailed testing  Tests Concerning Proportions  Types of Errors.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Chapter 9 Introduction to the t Statistic
Lecture Nine - Twelve Tests of Significance.
Inference and Tests of Hypotheses
Hypothesis Tests: One Sample
Chapter 9 Hypothesis Testing.
Hypothesis Testing.
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE
Psych 231: Research Methods in Psychology
Inferential Statistics
Psych 231: Research Methods in Psychology
Presentation transcript:

Inferential Statistics 2 Maarten Buis January 11, 2006

outline Student Recap Sampling distribution Hypotheses Type I and II errors and power testing means testing correlations

Sampling distribution PrdV example from last lecture. If H 0 is true, than the population consists of 16 million persons of which 41% (=6.56 million persons) supports de PrdV. I have drawn 100,000 random samples of 2,598 persons each and compute the average support in each sample.

Sampling distribution 5% or 50,000 samples have a mean of 39% or less. So if we reject H 0 when we find a support of 39% or less than we will have a 5% chance of making an error. Notice: We assume that the only reason we would make an error is random sampling error.

More precise approach We want to know the score below which only 5% of the samples lie. Drawing lots of random samples is a rather rough approach, an alternative approach is to use the theoretical sampling distribution. The proportion is a mean and the sampling distribution of a mean is the normal distribution with a mean equal to the H 0 and a standard deviation (called standard error) of

More precise approach For a standard normal distribution we know the z-score below which 5% of the samples lie (Appendix 2, table A): So if we compute a z-score for the observed value (.31) and it is below we can reject the H 0, and we will do so wrongly in only 5% of the cases

More precise approach  is the mean of the sampling distribution, so.41 (H 0 ) se is,  of a proportion is so the se is so the z-score is is less than -1.68, so we reject the H 0

Null Hypothesis A sampling distribution requires you to imagine what the population would look like if H 0 is true. This is possible if H 0 is one value (41%) This is impossible if H 0 is a range (<41%) So H 0 should always contain a equal sign (either = or ≤ or ≥)

Null hypothesis In practice the H 0 is almost always 0, e.g.: –difference between two means is 0 –correlation between two variables is 0 –regression coefficient is 0 This is so common that SPSS always assumes that this is the H 0.

Undirected Alternative Hypotheses Often we have an undirected alternative hypothesis, e.g.: –the difference between two means is not zero (could be either positive or negative) –the correlation between two variables is not zero (could be either positive or negative) –the regression coefficient is not zero (could be either

Directed alternative hypothesis In the PrdV example we had a directed alternative hypothesis: Support for PrdV is less than 41%, since PrdV would have still participated if his support were more than 41%.

Type I and Type II errors actual situation decisionH 0 is TrueH 0 is False reject H 0 Type I error probability =  correct decision probability = 1-  (power) do not reject H 0 correct decision probability = 1-  Type II error probability = 

Type I error rate You choose the type I error rate (  ) It is independent of sample size, type of alternative hypothesis, or model assumptions.

Type I versus type II error rate a low probability of rejecting H 0 when H 0 is true (type I error), is obtained by: rejecting the H 0 less often, Which also means a higher probability of not rejecting H 0 when H 0 is false (type II error), In other words: a lower probability of finding a significant result when you should (power).

How to increase your power: Lower type I error rate Larger sample size Use directed instead of undirected alternative hypothesis Use more assumptions in your model (non- parametric tests make less assumptions, but are also have less power)

Testing means What kind of hypotheses might we want to test: –Average rent of a room in Amsterdam is 300 euros –Average income of males is equal to the average income of females

Z versus t In the PrdV example we knew everything about the sampling distribution with only an hypothesis about the mean. In the rent example we don’t: we have to estimate the standard deviation. This adds uncertainty, which is why we use the t distribution instead of the normal Uncertainty declines when sample size becomes larger. In large samples (N>30) we can use the normal.

t-distribution It has a mean and standard error like the normal distribution. It also has a degrees of freedom, which depends on the sample size The larger the degrees of freedom the closer the t-distribution is to the normal distribution.

Data: rents of rooms rent room 1175room room 2180room room 3185room room 4190room room 5200room room 6210room room 7210room room 8210room room 9230room room 10240

Rent example H 0 :  =300, H A :  ≠ 300 We choose  to be 5% N = 19, so df= 18 We reject H 0 if we find a t less than or more than (appendix B, table 2) We do not reject H 0 if we find a t between and

Rent example We use s 2 as an estimate of  2 So is between and 2.101, so we do not reject H 0

Compare means in SPSS

Do before Monday Read Chapter 9 and 10 Do the “For solving Problems”