1 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson GE 5 Tutorial 5.

Slides:



Advertisements
Similar presentations
Inferential Statistics
Advertisements

Chapter 16 Inferential Statistics
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Inference Sampling distributions Hypothesis testing.
Chapter 10: Hypothesis Testing
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
1 Hypothesis Testing In this section I want to review a few things and then introduce hypothesis testing.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Chapter 9 Hypothesis Testing.
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Chapter 5For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Suppose we wish to know whether children who grow up in homes without access to.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Statistical Inference Dr. Mona Hassan Ahmed Prof. of Biostatistics HIPH, Alexandria University.
AM Recitation 2/10/11.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Inferential Statistics & Test of Significance
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
1 GE5 Lecture 6 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson.
Confidence Intervals and Hypothesis Testing
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Jan 17,  Hypothesis, Null hypothesis Research question Null is the hypothesis of “no relationship”  Normal Distribution Bell curve Standard normal.
EDUC 200C Friday, October 26, Goals for today Homework Midterm exam Null Hypothesis Sampling distributions Hypothesis testing Mid-quarter evaluations.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
1 Statistical Inference Greg C Elvers. 2 Why Use Statistical Inference Whenever we collect data, we want our results to be true for the entire population.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Comparing two sample means Dr David Field. Comparing two samples Researchers often begin with a hypothesis that two sample means will be different from.
Statistical Power 1. First: Effect Size The size of the distance between two means in standardized units (not inferential). A measure of the impact of.
User Study Evaluation Human-Computer Interaction.
1 GE5 Tutorial 4 rules of engagement no computer or no power → no lessonno computer or no power → no lesson no SPSS → no lessonno SPSS → no lesson no.
The Practice of Statistics Third Edition Chapter 10: Estimating with Confidence Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Distributions of the Sample Mean
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
Tuesday, April 8 n Inferential statistics – Part 2 n Hypothesis testing n Statistical significance n continued….
Intro to Psychology Statistics Supplement. Descriptive Statistics: used to describe different aspects of numerical data; used only to describe the sample.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
1 Probability and Statistics Confidence Intervals.
Chapter 13 Understanding research results: statistical inference.
Hypothesis Testing and Statistical Significance
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
Statistical Inference
INF397C Introduction to Research in Information Studies Spring, Day 12
Inference and Tests of Hypotheses
Statistical inference: distribution, hypothesis testing
Spearman’s Rank Correlation Test
Inferential Statistics
Inferential Statistics
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE
15.1 The Role of Statistics in the Research Process
Hypothesis Testing and Confidence Intervals (Part 2): Cohen’s d, Logic of Testing, and Confidence Intervals Lecture 9 Justin Kern April 9, 2018.
Presentation transcript:

1 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson GE 5 Tutorial 5

8 Topics samples and populations significance for the correlation coefficient standard error SPSS

74 1.Quiz 2.Discussion homework 3.Sampling and samples 4.Significance 5.Standard error 6.SPSS workshop 7.Discussion homework next week Content tutorial 5

1. Quiz

2. Discussion homework

Quick refresher - correlation - measures of association

13 Scatter plot When we look at this scatterplot what can we say about the correlation?

Levels of measurement nominalordinal interval or ratio dichotomy nominal Cramérs V or eta Cramérs V ordinal Cramérs VSpearman rho Spearman rho Spearman rho interval or ratio Cramérs V or eta Spearman rho Pearson r dichotomy Cramérs VSpearman rho and measures of association phi

3. Chapters 9, 10, 11 of Howitt & Cramer

Chapter 9: Samples & Populations We talked about large and small correlations 0.3 is small; 0.8 is large; -0.8 is just as large But how large is an “interesting” correlation? Statistical answer: When it is a “significant” correlation Significant = Unlikely to have occurred by chance

Example “This is not an organized demonstration, we just all happened to be wearing the same thing today” Probably not true, because unlikely to have happened by chance.

Chapter 9: Samples & Populations The same question can be asked about samples and populations : – we measure the enthusiasm about a TV show in a sample of 100 people – we find much higher ratings than we expected – how unlikely is it that we found this by chance – if unlikely enough, we conclude that the TV show is more popular than we thought. Inferential Statistics How can we use a sample to draw conclusions that apply to the population?

DEFINITIONS We are not necessarily talking about people, but about scores - age of husband/wife, each couple is a score - “phone brands owned over life time”: multiple scores per person Population: entire set of scores that we are interested in - so only the target group Sample: set of scores that we select for research - because we simply cannot measure every score - a good sample is a random sample

Samples - Statistical inference generally assumes samples are drawn randomly from population - Randomness does not mean just doing something - Often you have to take special measures to give every score the same chance: if you are asking people in the street, don't only do this in the mornings - Random sample: “each score in the population has an equal chance of being selected”

Samples can approximate population Table 9.1 of the book has a population of 100 scores  Mean = 5.52 Table 9.2 has the means of 40 small random samples (size 5)  The mean of sample means is: 5.20, min: 2.00, max: 8.80  SD: 1.60  Sample means from small samples are not such good estimates of the population mean.

larger samples Table 9.3 has means of 40 large samples (of size 20) drawn from same population What can we observe?  The mean of sample means is: 5.33, min: 5.25, max: 6.85  SD: 0.60  Larger sample size and much better estimate of population

Conclusions If we draw a random sample the outcome will not be the same as the outcome in a population. The mean of the sample will not be the same as the mean of the population But the larger the sample, the larger the chance that the sample outcome will be close to the population outcome As we saw: - very small samples are not reliable - larger samples can get close to the population values

SPSS random samples 10% OPEN THE BIG SMARTIE SURVEY! Clean Up the results first: negative results and 100> need to be deleted use data; select cases; IF; weight > 0 & weight < 100 choose “remove unselected cases” and press OK Take a random sample (10% of the cases) data; select cases; random sample; 10 % choose “filter unselected cases” Calculate descriptives

Aside about computers and randomness Computers cannot be really random (or creative) We all got the same “random” sample True random numbers can be found at random.org You can “seed” the SPSS random numbers with your own random number (I use , use your birthday) 1) Go to transform->Random number generators and choose for the twister and a random starting point (for example ) 2) OR Open a syntax window (New; Syntax) and type and run this command: 1)SET SEED Then reselect all cases (Data; select cases; use all; OK) And select another random 10% (Data; select cases;...)

Distribution of means In this (unrealistic) case, we can take 40 samples from the same population We can plot the means we obtained and we find: A normal distribution of sample means Around the mean of means With standard deviation Which we call: standard error

standard deviation: for cases and one sample the average distance from the cases to the mean of the sample standard error: for a number of samples and a mean of means the average distance from this sample's mean to the mean of means But: We can hardly ever obtain more than one sample We also have a formula to estimate standard error from ONE sample only standard error: for one sample and a population mean (via formula, population mean unknown) the estimated 'average' distance from this sample mean to the population mean

The error in the measurement is called standard error Every measurement has a standard error The smaller the standard error the better the measurement. A smaller standard error means a better estimate of the mean of the population! So we could report: The sample mean of age in IMEM year 1 is 20 with standard error of 2.

Confidence interval What is confidence interval? - The confidence interval is a better way to report a mean and standard error - The confidence interval is a good way to understand significance.

Confidence Interval (CI) – report a sample mean The sample mean of age in IMEM year 1 is 20 with standard error of 2. The 68% CI of age was 18 – 22 years The 95% CI of age was 16 – 24 year ← most common The 99% CI of age was 14 – 26 years Meaning of the 95% confidence interval (95% CI): In 95 % of the cases, the population mean will be between 16 and 24.

Chapter 10: Statistical significance for correlation coefficient Remember what null hypothesis (H0) and alternative hypothesis (H1) is? E.g.: H0: there is no relationship between brain size and intelligence H1: there is a relationship between brain size and intelligence - What are the two variables in the previous H0? - What does this imply for the correlation between those? - What would a positive correlation imply?

Under H0 We measured brain size and IQ in 120 volunteers. We compute the Pearson r and the standard error Assume that the SE is 0.2 Under H0, there is no association, we should find r=0 under H0, 95% CI for r = [-0.4 ; +0.4] If we find a value outside 95% CI, we reject H0 sample r = 0.1 → H0 ok sample r = -0.3 → H0 ok sample r = 0.5 → reject H0, accept H1, there is a relationship sample r = -0.5 → reject H0, accept H1, there is an inverse relationship

ASIDE: How to compute estimated standard error Left: Estimated SE for a mean - computed from the standard deviation - here they used 's' for the standard deviation of the sample, we have used sigma ( σ) before. Right: Estimated SE for a correlation coefficient Neither formula is on the exam

Another example H1: Students in this year are smarter than students from last year H0: this year is just as smart as last year Last year’s exam mark was H0: the average exam mark will again be a 7.2 We take a sample of 64 students and find mean 7.7, standard deviation 1.6 the average student is within 1.6 marks of 7.6 estimated standard error = sd / √n = 1.6 / 8 = 0.2 Under H0, mean = 7.2, 95% CI = [6.8 ; 7.6] Observed sample mean is outside of 95% CI We reject H0, this year is smarter! What if we had found mean 7.5, standard deviation 1.6? What if we had found mean 7.7, standard deviation 2.4?

Type I & II Errors Finding a significant results is usually a good thing But keep this in mind: We could make a Type I error Type I: decide that H0 is false when it is true Type II: decide that H0 is true when it is false Powerful statistical tests are those in which Type II has less chance You will never know whether you have made a Type I or II error

4. SPSS workshop

77 Open Big Smartie survey Use two scale type variables Use the correct correlation measure Define how strong the relationship is Check if the relationship is significant. SPSS Topics

77 Take a random sample of 20% of the population Do the same measurement again Calculate your measurements again Repeat this for a 50% sample SPSS Topics

77 Open Big Smartie survey Use two ordinal type variables Use the correct correlation measure Define how strong the relationship is Check if the relationship is significant. SPSS Topics

6. Homework

65