Sampling, sample size estimation, and randomisation

Slides:



Advertisements
Similar presentations
Chapter 9 Choosing the Right Research Design Chapter 9.
Advertisements

Tests of Significance for Regression & Correlation b* will equal the population parameter of the slope rather thanbecause beta has another meaning with.
ANCOVA Workings of ANOVA & ANCOVA ANCOVA, Semi-Partial correlations, statistical control Using model plotting to think about ANCOVA & Statistical control.
PSY 307 – Statistics for the Behavioral Sciences
Using Statistics in Research Psych 231: Research Methods in Psychology.
Analysis – Regression The ANOVA through regression approach is still the same, but expanded to include all IVs and the interaction The number of orthogonal.
Factorial ANOVA 2-Way ANOVA, 3-Way ANOVA, etc.. Factorial ANOVA One-Way ANOVA = ANOVA with one IV with 1+ levels and one DV One-Way ANOVA = ANOVA with.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 10: Hypothesis Tests for Two Means: Related & Independent Samples.
Lecture 6: Multiple Regression
Lecture 9: One Way ANOVA Between Subjects
One-way Between Groups Analysis of Variance
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE © 2012 The McGraw-Hill Companies, Inc.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 14: Factorial ANOVA.
Today Concepts underlying inferential statistics
2-Way ANOVA, 3-Way ANOVA, etc.
Prepared By: Soad Khalil.  A questionnaire is a written or printed form used in gathering information on some subject or subjects consisting of a list.
Chapter 14 Inferential Data Analysis
Two-Way Balanced Independent Samples ANOVA Overview of Computations.
Repeated Measures ANOVA Used when the research design contains one factor on which participants are measured more than twice (dependent, or within- groups.
Basic Analysis of Variance and the General Linear Model Psy 420 Andrew Ainsworth.
Review for Final Exam Some important themes from Chapters 9-11 Final exam covers these chapters, but implicitly tests the entire course, because we use.
Inferential Statistics
Scot Exec Course Nov/Dec 04 Ambitious title? Confidence intervals, design effects and significance tests for surveys. How to calculate sample numbers when.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
One-Factor Experiments Andy Wang CIS 5930 Computer Systems Performance Analysis.
ANOVA Greg C Elvers.
1 of 46 MGMT 6970 PSYCHOMETRICS © 2014, Michael Kalsher Michael J. Kalsher Department of Cognitive Science Inferential Statistics IV: Factorial ANOVA.
Introduction To Biological Research. Step-by-step analysis of biological data The statistical analysis of a biological experiment may be broken down into.
Statistical Power 1. First: Effect Size The size of the distance between two means in standardized units (not inferential). A measure of the impact of.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Experimental Design: One-Way Correlated Samples Design
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
The Campbell Collaborationwww.campbellcollaboration.org C2 Training: May 9 – 10, 2011 Introduction to meta-analysis.
Introduction to sample size and power calculations Afshin Ostovar Bushehr University of Medical Sciences.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Review Hints for Final. Descriptive Statistics: Describing a data set.
Human-Computer Interaction. Overview What is a study? Empirically testing a hypothesis Evaluate interfaces Why run a study? Determine ‘truth’ Evaluate.
Adjusted from slides attributed to Andrew Ainsworth
Analysis of covariance. When… ANCOVA is an ‘extra’ on an ANOVA ANOVA outcome = number of words learned IV = Sex ANCOVA adds a covariate covariate = size.
AP Statistics Chapter 24 Comparing Means.
Chapter 12 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 12: One-Way Independent ANOVA What type of therapy is best for alleviating.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
ANCOVA. What is Analysis of Covariance? When you think of Ancova, you should think of sequential regression, because really that’s all it is Covariate(s)
Chapter 13 Repeated-Measures and Two-Factor Analysis of Variance
Finishing up: Statistics & Developmental designs Psych 231: Research Methods in Psychology.
Paired Samples and Blocks
Bangor Transfer Abroad Programme Marketing Research SAMPLING (Zikmund, Chapter 12)
Research planning. Planning v. evaluating research To a large extent, the same thing Plan a study so that it is capable of yielding data that could possibly.
Power Analysis: Sample Size, Effect Size, Significance, and Power 16 Sep 2011 Dr. Sean Ho CPSY501 cpsy501.seanho.com Please download: SpiderRM.sav.
ANCOVA.
Sample Size Mahmoud Alhussami, DSc., PhD. Sample Size Determination Is the act of choosing the number of observations or replicates to include in a statistical.
Chapter 13 Understanding research results: statistical inference.
BHS Methods in Behavioral Sciences I May 9, 2003 Chapter 6 and 7 (Ray) Control: The Keystone of the Experimental Method.
Rerandomization to Improve Covariate Balance in Randomized Experiments Kari Lock Harvard Statistics Advisor: Don Rubin 4/28/11.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Sampling Design and Procedure
Inferential Statistics Psych 231: Research Methods in Psychology.
Copyright ©2011 Brooks/Cole, Cengage Learning Gathering Useful Data for Examining Relationships Observation VS Experiment Chapter 6 1.
STA248 week 121 Bootstrap Test for Pairs of Means of a Non-Normal Population – small samples Suppose X 1, …, X n are iid from some distribution independent.
The 2 nd to last topic this year!!.  ANOVA Testing is similar to a “two sample t- test except” that it compares more than two samples to one another.
Welcome to the class! set.seed(843) df <- tibble::data_frame(
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE
Psych 231: Research Methods in Psychology
Presentation transcript:

Sampling, sample size estimation, and randomisation PS302

Overview Sampling representative sampling (e.g. for surveys) homogenous sampling (e.g. for experiments) Sample size estimation Based on power Gathering the information you need Power calculations (G*Power software) - ANOVA - regression Rules of thumb for multivariate tests Presentation of power analysis in your report Practical randomising Random selection (e.g. for surveys) Random allocation (e.g. for experiments)

Getting a representative sample Survey of UK Households want a sample from each SES group each age group each sex Proportions should match the population

Matching the population Percent of population  percent of sample Assume, sample size = 1200 Population = Women 60%, Men 40%  Sample: Women 720, Men 480

Problem for you to try Population figures: Men 65 years+ = 1 million Women 65 years+ = 1.5 million Men 25-65 years = 8 million Women 25-65 years = 8.5 million Men < 25 years = 5 million Women < 25 years = 5.2 million Total population size = 29.2 million Percent W25-65 = (5.2 / 29.2) * 100 = 17.8% Given a sample size of 200, how many women <25 years should be included?

quota sampling Recruiters are given a quota of each stratum Problem – biased selection by recruiter/interviewer Advantage – random selection very difficult to achieve, quota sampling a good compromise

Homogenous sampling Restrict sampling to a narrow group Sample only Warwick students Sample only one Sex Sample only one Age group Advantages reduces error variance by reducing individual differences

Homogenous sampling ctd Disadvantage – may reduce generalisability generalisability will need to be considered and assessed separately Suitability experimental work studies where individual differences are not directly relevant and power is more important concern

power Probability that any particular (random) sample will produce a statistically significant effect Eg. power = 0.9  90% chance of detecting an effect if there really is an effect Researchers usually aim to have power at 80-90%

Power and sample size All else being equal, to get more power you need more participants Where “all else” means: reliability of measures other sources of error variance p-value the true size of the effect

These concepts are inter-related Desired power ↑ N ↑ Acceptable p-value ↓ N ↑ Effect size to detect ↓ N ↑ Reliability of measures ↓ N ↑ Other error variance ↑ N ↑

if you know these… effect size variance of measures you can often work out what the sample size should be So where can you find them? Previous research studies

Calculating using G-power First step, assemble the figures needed For between subjects ANOVA: Effect size (Cohen’s f, or partial eta squared) Significance level [.05, usually] Power [.8, usually] Numerator degrees of freedom (df) Number of cells in design (groups)

1. Effect size … from previous studies Easy – they reported effect sizes “There was not a significant main effect of Sex on response time, F(1, 42) = 2.03, p = .16, η2 = 0.046” Harder – they reported only the F and df, so you have to make a calculation partial η2 = (dfeffect * F) / [(dfeffect * F) + dferror] = (1 * 2.03) / [(1 * 2.03) + 42] = 0.046

measures of effect size for ANOVA Roughly, the correlation between an effect and the outcome (DV) eta squared The proportion of variance in the outcome variable (DV) that is explained by the IV SSeffect / SS[corrected] total partial eta squared (SPSS prints this out) The proportion of the effect + error variance explained by the effect SSeffect / (SSeffect + SSerror)

4. Numerator df “There was a non-significant main effect of Gender on response time, F(1, 42) = 2.03, p < .05, η2 = 0.09”

5. Number of cells (groups) Two way ANOVA 2 x 3 ANOVA  6 cells 4 x 2 ANOVA  8 cells Etc.

Calculating using G-power First step, assemble the figures needed For this 2 X 3 between subjects ANOVA: Effect size (η2 = 0.046) Significance level [.05, as usual] Power [.8, normal] Numerator degrees of freedom (df = 1, 2 for the respective main effects, or 2 for the interaction) Number of cells in design (groups = 6)

tip: power & ANOVA Each effect in the ANOVA has its own power Eg. 2 x 3 ANOVA Main effect A Main effect B Interaction effect A * B Tip: power is lower for interactions than for main effects

Sample size – ethical issues Too small a sample -- can’t detect significant effects  waste all participants’ time Too large a sample -- waste resources -- waste the extra participants’ time

Sample size – practical issues Resources Time Cost of running each participant Availability Clinical populations are often small Access can take time & require permission

Choosing an appropriate sample size for established laboratory paradigms Shortcut Base sample size on sample size used in previous research This is often perfectly appropriate (but make sure the previous research is of high quality!)

Rules of thumb for multivariate tests multiple regression cases (N) / predictors (p) N at least 50 + 8p for R2 N at least 104 + p for testing a predictor Need more cases if outcome is skewed, anticipated effect size is small, measures less reliable…

Rules of thumb for multivariate tests PCA (exploratory FA) 50 no good 100 poor 300 good, but ideally need more

Random allocation For example 3 between subjects conditions (e.g. control, happy, sad) Who does which condition? first come? Interviewer choice? Must avoid confounds. But can’t check all possible. Solution is random allocation.

Random allocation needs truly random numbers Different ways to do that SPSS random.org Research randomiser scripting language like python

to randomly assign 9 participants to 3 conditions: Python to randomly assign 9 participants to 3 conditions: from random import shuffle numbers = [1,1,1,2,2,2,3,3,3] shuffle(numbers) numbers [3, 2, 1, 2, 3, 1, 3, 1, 2]

Research randomiser http://www.randomizer.org/form.htm 3 conditions, 48 planned participants randomly: allocate each participant (identified by order of recruitment) to one of 3 conditions How many sets of numbers to generate? [1] numbers per set? [48] Number range? [From 1 To 3] Do you wish each number in a set to remain unique? [No] [Don’t “sort”!]

Result Set #1:3, 3, 1, 3, 3, 1, 2, 2, 2, 1, 2, 2, 1, 1, 2, 1, 3, 3, 1, 3, 2, 1, 3, 3, 1, 2, 1, 3, 1, 2, 2, 2, 3, 3, 1, 3, 3, 1, 1, 3, 1, 3, 3, 2, 3, 3, 1, 2

Research randomiser http://www.randomizer.org/form.htm 3 sentence types, 48 sentences 16 in each group, create a random sequence, but limit runs of the same type How many sets of numbers to generate? [16] numbers per set? [3] Number range? [From 1 To 3] Do you wish each number in a set to remain unique? [Yes]

Research randomiser http://www.randomizer.org/form.htm 3 types, 48 sentences, 16 of each type limit run of a given type, while still randomising order of presentation 16 Sets of 3 Unique Numbers Per Set Range: From 1 to 3 -- Unsorted Job Status:       Set #1:2, 3, 1            Set #2:3, 1, 2            Set #3: ….

Web links http://www.randomizer.org/ http://www.random.org/

measures of effect size for ANOVA Roughly, the correlation between an effect and the outcome (DV) eta squared The proportion of variance in the outcome variable (DV) that is explained by the IV SSeffect / SS[corrected] total partial eta squared (SPSS prints this out) The proportion of the effect + error variance explained by the effect SSeffect / (SSeffect + SSerror)