Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.

Slides:



Advertisements
Similar presentations
Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.
Advertisements

Topics Today: Case I: t-test single mean: Does a particular sample belong to a hypothesized population? Thursday: Case II: t-test independent means: Are.
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
Review Find the mean square (MS) based on these two samples. A.26.1 B.32.7 C.43.6 D.65.3 E M A = 14 M B = Flood.
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
Hypothesis testing Week 10 Lecture 2.
1. Estimation ESTIMATION.
BHS Methods in Behavioral Sciences I
Lecture 9: One Way ANOVA Between Subjects
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
1 Inference About a Population Variance Sometimes we are interested in making inference about the variability of processes. Examples: –Investors use variance.
Lecture 7 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Business 205. Review Survey Designs Survey Ethics.
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
Hypothesis Testing Using The One-Sample t-Test
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Confidence Intervals. Estimating the difference due to error that we can expect between sample statistics and the population parameter.
AM Recitation 2/10/11.
Hypothesis Testing:.
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Section #4 October 30 th Old: Review the Midterm & old concepts 1.New: Case II t-Tests (Chapter 11)
Statistical Analysis Statistical Analysis
Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.
1 Tests with two+ groups We have examined tests of means for a single group, and for a difference if we have a matched sample (as in husbands and wives)
T-Tests and Chi2 Does your sample data reflect the population from which it is drawn from?
Estimation Statistics with Confidence. Estimation Before we collect our sample, we know:  -3z -2z -1z 0z 1z 2z 3z Repeated sampling sample means would.
Chapter 11: Estimation Estimation Defined Confidence Levels
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Statistics and Quantitative Analysis U4320
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
Chapter 9: Testing Hypotheses
Statistics for the Behavioral Sciences Second Edition Chapter 11: The Independent-Samples t Test iClicker Questions Copyright © 2012 by Worth Publishers.
January 31 and February 3,  Some formulae are presented in this lecture to provide the general mathematical background to the topic or to demonstrate.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 10. Hypothesis Testing II: Single-Sample Hypothesis Tests: Establishing the Representativeness.
Inference We want to know how often students in a medium-size college go to the mall in a given year. We interview an SRS of n = 10. If we interviewed.
VARIABILITY. Case no.AgeHeightM/F 12368M 22264F 32369F 42571M 52764F 62272M 72465F 82366M 92366F F M F M F F F.
Preparing for the final - sample questions with answers.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Difference Between Means Test (“t” statistic) Analysis of Variance (“F” statistic)
Manijeh Keshtgary Chapter 13.  How to report the performance as a single number? Is specifying the mean the correct way?  How to report the variability.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
Jeopardy Hypothesis Testing t-test Basics t for Indep. Samples Related Samples t— Didn’t cover— Skip for now Ancient History $100 $200$200 $300 $500 $400.
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
Psych 230 Psychological Measurement and Statistics
Chapter 8 Parameter Estimates and Hypothesis Testing.
Testing Differences between Means, continued Statistics for Political Science Levin and Fox Chapter Seven.
Inferential Statistics Introduction. If both variables are categorical, build tables... Convention: Each value of the independent (causal) variable has.
© Copyright McGraw-Hill 2004
- We have samples for each of two conditions. We provide an answer for “Are the two sample means significantly different from each other, or could both.
T Test for Two Independent Samples. t test for two independent samples Basic Assumptions Independent samples are not paired with other observations Null.
Chi-Square X 2. Review: the “null” hypothesis Inferential statistics are used to test hypotheses Whenever we use inferential statistics the “null hypothesis”
Other Types of t-tests Recapitulation Recapitulation 1. Still dealing with random samples. 2. However, they are partitioned into two subsamples. 3. Interest.
Chi-Square X 2. Review: the “null” hypothesis Inferential statistics are used to test hypotheses Whenever we use inferential statistics the “null hypothesis”
366_8. Estimation: Chapter 8 Suppose we observe something in a random sample how confident are we in saying our observation is an accurate reflection.
Difference Between Means Test (“t” statistic) Analysis of Variance (F statistic)
366_7. T-distribution T-test vs. Z-test Z assumes we know, or can calculate the standard error of the distribution of something in a population We never.
1 Probability and Statistics Confidence Intervals.
Sampling Distribution (a.k.a. “Distribution of Sample Outcomes”) – Based on the laws of probability – “OUTCOMES” = proportions, means, test statistics.
Chapter Eleven Performing the One-Sample t-Test and Testing Correlation.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
Chapter 9 Introduction to the t Statistic
Other tests of significance. Independent variables: continuous Dependent variable: continuous Correlation: Relationship between variables Regression:
Inference and Tests of Hypotheses
Chi-Square X2.
Difference Between Means Test (“t” statistic)
Statistical Inference for the Mean: t-test
Presentation transcript:

Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their scores cluster around the population mean (parameter) If we take repeated random samples from a normal or near-normal population, the means of these samples will also distribute normally Placing the mean of a random sample at the center, and specifying the level of certainty that we want by setting the corresponding z score, we create a “confidence interval”, a range of means into which, to our chosen certainty, the population mean will fall percent of scores 95 percent of scores 66 pct.17 pct. of scores 2 ½ pct.

Homework Two random samples of officers tested for cynicism For each sample, we needed to specify the confidence interval into which the population parameter (mean of population) will fall, to a 95 percent certainty – In social science research we don’t want to take more than five chances in 100, or 5 percent, of being wrong Remember that 95 percent of the cases in a normally distributed population fall between a z of and (meaning that 5 percent will not) So - always use a z of 1.96

Sample 1 (n=10) OfficerScoreMeanDiff.Sq Sum 8.90 Variance (sum of squares / n-1) s 2.99 Standard deviation (sq. root of variance) s.99

Sample 2 (n=10) OfficerScoreMeanDiff.Sq Sum 8.40 Variance (sum of squares / n-1) s 2.93 Standard deviation (sq. root of variance) s.97

Standard error of the mean of the population of patrol officers Estimated difference, due to chance, between the means of repeated random samples (in this example, of 10 patrol officers) and the population parameter (mean of the population) s S  x = ___  n-1 Estimate based on sample 1 =.33 Estimate based on sample 2 =.32

Confidence interval into which the population mean should fall with a 95 percent certainty (probability 5 percent that population parameter - mean - falls outside this range) ci =  x +/- z (S  x ) Results based on sample 1: Mean = 2.9 S  x =.33 (z = 1.96) 2.9 +/-.65 Left limit = 2.25 Right limit = 3.55 Results based on sample 2: Mean = 2.4 S  x =.32 (z = 1.96) 2.4 +/-.63 Left limit = 1.77 Right limit = 3.03

Answer for final exam There will be one word question on the final for each statistical technique covered during the period Here is the question for confidence interval: – Explain what you would tell the police chief that the confidence interval that you computed actually means Here is a good answer: – There are ninety-five chances in one-hundred that, based on the mean __(insert variable)___ calculated from a random sample of (insert sample size) officers, the mean for all the officers in the department would fall between _______ and _______.

Review - Confidence Interval Our first sample had a mean of 2.9. The second sample mean was 2.4. We used a z of 1.96, which set the probability that the population mean would fall within our confidence interval at 95 percent Based on sample 1, there are 95 chances in 100 that the population mean (parameter) falls between 2.25 and Or, there are 5 chances in 100 that it doesn’t. Based on sample 2, there are 95 chances in 100 that the population mean (parameter) falls between 1.77 and Or, there are 5 chances in 100 that it doesn’t. z scores percent of scores 2 ½ pct Sample 1 Sample 2

How to narrow the confidence interval #1 Increase the probability of being wrong! That is, the chance that the population mean (parameter) actually falls OUTSIDE the calculated range First, select a smaller z-score. Say, z = 1.0 – According to the z-table,.3413 of cases (34%) fall between the mean and z = 1.0 – So, another 34% of the cases must fall between the mean and z = -1.0 – That’s 68% of the cases. So, 32% of the cases must fall outside this range (68% + 32% = 100%) – This presents a much bigger chance of being wrong than by using a z of 1.96, where only 5% of the cases fall outside the range In social science research, we always use a z of 1.96 to set confidence intervals, because we don’t want to take a greater than 5 in 100 chance of being wrong ci =  x +/- z (S  x ) Results based on sample 1: – Mean = 2.9 S  x =.33 z = 1.00 – 2.9 +/-.33 Left limit = 2.57 Right limit = 3.23 Results based on sample 2: – Mean = 2.4 S  x =.32 z = 1.00 – 2.4 +/-.32 Left limit = 2.08 Right limit = 2.72

How to narrow the confidence interval #2 Increase the sample size! – This lets you use z = 1.96, so that the probability that the population parameter falls within the confidence interval is the social science required 95 percent ci =  x +/- z (S  x ) Increase sample 1 size to 30, and to 100 To keep things simple, the sum of squares (8.9) was tripled, and also multiplied by 10. The mean (2.9) was kept the same n = 30 New sum of squares = 26.7 s 2 (variance) =.92 s (standard deviation) =.96 S  x (standard error of the mean) =.18 z (S  x ) =.35 Confidence interval = 2.55  3.25 Old ci was 2.25  3.55 n = 100 New sum of squares = 89 s 2 (variance) =.9 s (standard deviation) =.95 S  x (standard error of the mean) =.1 z (S  x ) =.2 Confidence interval = 2.7  3.1 Old ci was 2.25  3.55

Diff. Between Means Test Independent variable is categorical and the dependent variable is continuous To determine if these variables are associated compare the means of two randomly drawn samples Is the difference between the means so large that we can reject the null hypothesis (that the difference was produced by chance?) – Must calculate standard error of the difference between means The difference between all possible pairs of means, due to chance alone – To do so: Obtain the “pooled sample variance” (for us, half-way between the obtained variances) Compute the S.E. of the Diff. Between Means Compute the t-statistic

Pooled sample variance S p 2 Simplified method: midpoint between the two sample variances S p 2 = Standard error of the difference between means   x1 -  x2   x 1 -  x 2 =  S p 2 ( ) T-Test for significance of the difference between means  x 1 -  x 2 t =   x 1 -  x 2 s s n 1 n 2 +

 x 1 -  x 2 _____________________   x 1 -  x 2 The result is a ratio Our obtained difference between means is on top The predicted difference due to error is on the bottom The greater our difference, the larger the t The larger the t, the more likely we are to reject the null hypothesis (see the t- table in the back of the book) When we reject the null hypothesis, we say that the difference between means is statistically significant, meaning that the samples were not drawn from the same population – we only thought they were t = df = (n1 + n2) - 2

Class exercise -- Difference Between Means Test H 1 : Male officers more cynical than females (1 - tailed) H 2 : Officer gender determines cynicism (2 - tailed) 1.What is the null hypothesis? 2.Draw one sample of male officers, one of females 3.Compute the Standard Error of the Difference Between Means 4.Perform a t-test 5.Use the table in the back of the book to check for significance (no more than 5 chances in 100 that the null hypo. is correct) Note: one tailed hypotheses (direction of the effect on the dependent variable is predicted) require a smaller “t” to reach statistical significance than two-tailed hypotheses, where only an effect is predicted, not its direction Discuss homework assignment