1
Experiment Basics: Designs
Psych 231: Research Methods in Psychology
2
Announcements
Quiz 7 is due Friday.
Don’t forget that Exam 2 is coming up (Mon. Oct 24).
FYI: 538’s article, Debate Interruptions. Read it and think about how to measure your DVs, inter-rater reliability, and operational definitions.
3
Experimental Control: Weight analogy
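Variability in a simple experiment: T = NRexp + NRother + R
(T = total variability, NRexp = non-random variability from the experimental manipulation, NRother = non-random variability from other sources, R = random variability.)
The bigger the weight, the more variability from that source.
Treatment group vs. control group: the control group is the absence of the treatment (NRexp = 0).
[Figure: a balance scale, the "difference detector," holding the R and NR (exp, other) weights for the two groups]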
4
Potential Problems: Excessive random variability
Variability in a simple experiment: T = NRexp + NRother + R
The bigger the weight, the more variability from that source.
If R is large relative to NRexp, then detecting a difference may be difficult: the experiment can’t detect the effect of the treatment.
But if we reduce the size of NRother and R relative to NRexp, then detecting the difference gets easier: our experiment can detect the effect of the treatment.
[Figure: two difference detectors, one where R is large relative to NRexp and one where it is small]
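As a rough illustration of the weight analogy, the sketch below compares two made-up data sets that have the same 10-point treatment effect (standing in for NRexp) but different amounts of random variability (standing in for R). The scores and the `signal_to_noise` helper are hypothetical, chosen only to show why the same difference is easier to detect when R is small.

```python
from statistics import mean, stdev

def signal_to_noise(treatment, control):
    """Mean difference divided by the average within-group standard deviation."""
    difference = mean(treatment) - mean(control)      # stands in for NRexp
    noise = (stdev(treatment) + stdev(control)) / 2   # stands in for R
    return difference / noise

# Hypothetical scores: both pairs of groups differ by 10 points on average.
small_r = signal_to_noise([78, 80, 82, 79, 81], [68, 70, 72, 69, 71])
large_r = signal_to_noise([60, 80, 100, 70, 90], [50, 70, 90, 60, 80])

print(f"small R: {small_r:.2f}")  # difference is large relative to the noise
print(f"large R: {large_r:.2f}")  # same difference, buried in the noise
```

The mean difference is identical in both cases; only the spread within the groups changes how clearly it stands out.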
5
Potential Problems: Confounding
If an EV co-varies with the IV, then the NRother component of the data will be present, and it may lead to misattribution of the effect to the IV.
IV = independent variable, DV = dependent variable, EV = extraneous variable.
[Figure: the IV (NRexp) and the EV (NRother) co-vary together and both point to the DV; the relationship between the IV and the DV may or may not exist. Example labels: men vs. women; matched vs. mismatched lists]
6
Potential Problems: Confounding
It is hard to detect the effect of NRexp because the effect looks like it could be from NRexp but could be due to NRother.
The experiment can detect an effect, but can’t tell where it is from.
[Figure: difference detector with R, NRother, and NRexp weights]
7
Potential Problems: Confounding
It is hard to detect the effect of NRexp because the effect looks like it could be from NRexp but could be due to NRother.
These two situations look the same:
There is an effect of the IV (R, NRexp, and NRother weights).
There is not an effect of the IV (R and NRother weights only).
[Figure: two difference detectors, one for each situation]
8
Removing Confounding
It is hard to detect the effect of NRexp because the effect looks like it could be from NRexp but could be due to NRother (the confound).
Use experimental control to spread the variability equally across conditions.
Use experimental control to eliminate the variability from the confound.
[Figure: difference detectors before and after the NRother weight is controlled]
9
Controlling Variability
How do we introduce control?
Methods of experimental control: constancy/randomization, comparison, production.
10
Methods of Controlling Variability
Constancy/Randomization
Use when there is a variable that may be related to the DV that you can’t (or don’t want to) manipulate.
Control variable: hold it constant (so there isn’t any variability from that variable, no weight from that variable).
Random variable: let it vary randomly across all of the experimental conditions (so the R weight from that variable is the same for all conditions).
11
Methods of Controlling Variability
Comparison
An experiment always makes a comparison, so it must have at least two groups (the 2 sides of our scale in the weight analogy).
Sometimes there are control groups. This is often the absence of the treatment (e.g., training group vs. no training (control) group).
Without control groups it is harder to see what is really happening in the experiment. It is easier to be swayed by plausibility or inappropriate comparisons (see diet crystal example).
Control groups are useful for eliminating potential confounds (think about our list of threats to internal validity).
12
Methods of Controlling Variability
Comparison
An experiment always makes a comparison, so it must have at least two groups.
Sometimes there are control groups. This is often the absence of the treatment.
Sometimes there is a range of values of the IV (e.g., a 1 week of training group, a 2 weeks of training group, and a 3 weeks of training group).
13
Methods of Controlling Variability
Production
The experimenter selects the specific values of the independent variables (e.g., a 1 week of training group, a 2 weeks of training group, and a 3 weeks of training group).
[Figure: variability in the duration of taking the training program; the experimenter selects the specific values of 1 week, 2 weeks, and 3 weeks]
14
Methods of Controlling Variability
Production
The experimenter selects the specific values of the independent variables (e.g., a 1 week of training group, a 2 weeks of training group, and a 3 weeks of training group).
Need to do this carefully. Suppose that you don’t find a difference in the DV across your different groups. Is this because the IV and DV aren’t related? Or is it because your levels of the IV weren’t different enough?
15
Experimental designs
So far we’ve covered a lot of the general details of experiments. Now let’s consider some specific experimental designs.
Some bad (but not uncommon) designs (and potential fixes).
Some good designs: 1 factor, two levels; 1 factor, multi-levels; factorial (more than 1 factor); between & within factors.
16
Poorly designed experiments
Bad design example 1: Does standing close to somebody cause them to move? (theory of personal space)
“hmm… that’s an empirical question. Let’s see what happens if …”
Design: you stand close to people and see how long it takes before they move.
Problem: no control group to establish the comparison (this design is sometimes called a “one-shot case study design”).
Fix: introduce a (or some) comparison group(s), e.g., Very Close (.1 m), Close (.5 m), Not Close (1.0 m).
17
Poorly designed experiments
Bad design example 2: Does a relaxation program decrease the urge to smoke?
2 groups: a relaxation training group and a no relaxation training (control) group.
The participants choose which group to be in.
18
Poorly designed experiments
Bad design example 2: non-equivalent control groups.
Design: participants -> self assignment -> training group or no training (control) group (independent variable) -> measure (dependent variable).
Problem: selection bias for the two groups.
Fix: need to do random assignment to groups (random assignment replaces self assignment).
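A minimal sketch of the fix, assuming a simple shuffle-and-deal procedure: the participant IDs, group names, and the `randomly_assign` helper are all hypothetical. The point is only that chance, not the participant, decides who ends up in which group.

```python
import random

def randomly_assign(participants, conditions, seed=None):
    """Shuffle the participants, then deal them out to the conditions in turn."""
    rng = random.Random(seed)      # a seed makes the assignment reproducible
    shuffled = list(participants)
    rng.shuffle(shuffled)
    groups = {condition: [] for condition in conditions}
    for index, person in enumerate(shuffled):
        groups[conditions[index % len(conditions)]].append(person)
    return groups

# Hypothetical sample of 20 participants and two groups.
participants = [f"P{i:02d}" for i in range(1, 21)]
groups = randomly_assign(participants, ["training", "no training"], seed=231)

for condition, members in groups.items():
    print(condition, len(members), members)
```

Dealing from a shuffled list keeps the group sizes equal, which a coin flip per participant would not guarantee.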
19
Poorly designed experiments
Bad design example 3: Does a relaxation program decrease the urge to smoke?
Pre-test desire to smoke.
Give relaxation training program.
Post-test desire to smoke.
20
Poorly designed experiments
Bad design example 3: one-group pretest-posttest design.
Design: participants -> pre-test measure (dependent variable) -> training group (independent variable: pre vs. post) -> post-test measure (dependent variable).
Problems include: history, maturation, testing, and more.
Fix: add another factor, i.e., a no training group that also gets the pre-test and post-test measures.
22
1 factor - 2 levels
1 factor (independent variable), two levels. Basically you want to compare two treatments (conditions). The statistics are pretty easy: a t-test.
Good design example: How does anxiety level affect test performance?
Two groups take the same test.
Grp1 (low anxiety group): 5 min lecture on how good grades don’t matter, just trying is good enough.
Grp2 (moderate anxiety group): 5 min lecture on the importance of good grades for success.
What are our IV and DV?
23
1 factor - 2 levels
Good design example: How does anxiety level affect test performance?
Design: participants -> random assignment -> low or moderate anxiety (IV: anxiety) -> test (dependent variable).
24
1 factor - 2 levels
Good design example: How does anxiety level affect test performance?
One factor (anxiety), two levels (low, moderate).
Use a t-test to see if these points are statistically different.
t = (observed difference between conditions) / (difference expected by chance)
[Figure: test performance by anxiety; low = 60, moderate = 80]
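The ratio above can be sketched as an independent-samples t statistic. This is an illustration under stated assumptions, not the course's own analysis: the individual scores are invented so that the group means match the 60 and 80 on the slide, the test assumes equal variances (pooled), and `independent_t` is a hypothetical helper name.

```python
from math import sqrt
from statistics import mean, variance

def independent_t(group_a, group_b):
    """Pooled-variance t statistic and degrees of freedom for two independent groups."""
    n_a, n_b = len(group_a), len(group_b)
    df = n_a + n_b - 2
    pooled_variance = ((n_a - 1) * variance(group_a)
                       + (n_b - 1) * variance(group_b)) / df
    # Difference expected by chance: standard error of the mean difference.
    standard_error = sqrt(pooled_variance * (1 / n_a + 1 / n_b))
    # Observed difference between conditions.
    observed_difference = mean(group_b) - mean(group_a)
    return observed_difference / standard_error, df

# Hypothetical test scores with means of 60 (low) and 80 (moderate).
low_anxiety = [50, 55, 60, 65, 70]
moderate_anxiety = [70, 75, 80, 85, 90]

t, df = independent_t(low_anxiety, moderate_anxiety)
print(f"t({df}) = {t:.2f}")
```

The resulting t would then be compared against a critical value for the given degrees of freedom to decide whether the two points are statistically different.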
25
1 factor - 2 levels
Advantages:
Simple, relatively easy to interpret the results.
Is the independent variable worth studying? If there is no effect, then you usually don’t bother with a more complex design.
Sometimes two levels is all you need: one theory predicts one pattern and another predicts a different pattern.
26
1 factor - 2 levels
Disadvantages:
The “true” shape of the function is hard to see.
Interpolation and extrapolation are not a good idea.
Interpolation: what happens within the range that you test?
[Figure: test performance by anxiety (low, moderate)]
27
1 factor - 2 levels
Disadvantages:
The “true” shape of the function is hard to see.
Interpolation and extrapolation are not a good idea.
Extrapolation: what happens outside of the range that you test (e.g., at high anxiety)?
[Figure: test performance by anxiety (low, moderate, high)]
29
1 Factor - multilevel experiments
For more complex theories you will typically need more complex designs (more than two levels of one IV).
1 factor, more than two levels: basically you want to compare more than two conditions.
The statistics are a little more difficult: an ANOVA (Analysis of Variance).
30
1 Factor - multilevel experiments
Good design example (similar to the earlier example): How does anxiety level affect test performance?
Groups take the same test.
Grp1 (low anxiety group): 5 min lecture on how good grades don’t matter, just trying is good enough.
Grp2 (moderate anxiety group): 5 min lecture on the importance of good grades for success.
Grp3 (high anxiety group): 5 min lecture on how the students must pass this test to pass the course.
31
1 factor - 3 levels
Design: participants -> random assignment -> low, moderate, or high anxiety (IV: anxiety) -> test (dependent variable).
32
1 Factor - multilevel experiments
[Figure: test performance by anxiety level (low, mod, high); plotted values 80, 60, 60]
33
1 Factor - multilevel experiments
Advantages:
Gives a better picture of the relationship (functions other than just straight lines).
Generally, the more levels you have, the less you have to worry about your range of the independent variable.
[Figure: test performance by anxiety with 2 levels (low, moderate) and with 3 levels (low, mod, high)]
34
1 Factor - multilevel experiments
Disadvantages:
Needs more resources (participants and/or stimuli).
Requires more complex statistical analysis (ANOVA [Analysis of Variance] & follow-up pair-wise comparisons).
35
Pair-wise comparisons
The ANOVA just tells you that not all of the groups are equal.
If this is your conclusion (you get a “significant ANOVA”), then you should do further tests to see where the differences are:
High vs. Low
High vs. Moderate
Low vs. Moderate
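The list of follow-up comparisons can be generated mechanically. The sketch below (the helper name `pairwise_comparisons` is hypothetical) enumerates every pair of levels, which also shows how quickly the number of comparisons grows as levels are added.

```python
from itertools import combinations

def pairwise_comparisons(levels):
    """Every unordered pair of levels: k levels give k * (k - 1) / 2 pairs."""
    return list(combinations(levels, 2))

pairs = pairwise_comparisons(["Low", "Moderate", "High"])
for first, second in pairs:
    print(f"{first} vs. {second}")

# The count grows quickly with the number of levels.
for k in (3, 4, 5):
    print(k, "levels ->", len(pairwise_comparisons(range(k))), "comparisons")
```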
37
Factorial experiments
Two or more factors. Some vocabulary:
Factors: the independent variables.
Levels: the levels of your independent variables.
A 2 x 4 design means two independent variables, one with 2 levels (A1, A2) and one with 4 levels (B1, B2, B3, B4).
The number of “conditions” or “groups” is calculated by multiplying the levels, so a 2 x 4 design has 8 different conditions.
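To make the multiplication concrete, this short sketch crosses the levels of the two factors in the 2 x 4 example. The labels A1 to B4 come from the slide; the variable names are just for illustration.

```python
from itertools import product

factor_a = ["A1", "A2"]
factor_b = ["B1", "B2", "B3", "B4"]

# Each condition is one combination of a level of A with a level of B.
conditions = list(product(factor_a, factor_b))

for a_level, b_level in conditions:
    print(a_level + b_level)

print(len(conditions), "conditions")  # 2 levels x 4 levels
```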
38
Factorial experiments
Two or more factors (cont.)
Main effects: the effects of your independent variables, ignoring (collapsed across) the other independent variables.
Interaction effects: how the effect of one independent variable depends on the level of another independent variable.
Example: 2 x 2 design, factors A and B. Interaction: at A1, B1 is bigger than B2; at A2, B1 and B2 don’t differ.
Everyday interaction = “it depends on …”
39
Interaction effects
Rate how much you would want to see a new movie (1 = no interest, 5 = high interest).
Ask men and women, looking for an effect of gender.
Result: not much of a difference.
40
Interaction effects
Maybe the gender effect depends on whether you know who is in the movie. So you add another factor: suppose that George Clooney might star. You rate the preference if he were to star and if he were not to star.
The effect of gender depends on whether George stars in the movie or not. This is an interaction.
(A video lecture from ThePsychFiles.com podcast.)
41
Results of a 2x2 factorial design
The complexity & number of outcomes increases:
A = main effect of factor A
B = main effect of factor B
AB = interaction of A and B
With 2 factors there are 8 basic possible patterns of results:
1) No effects at all
2) A only
3) B only
4) AB only
5) A & B
6) A & AB
7) B & AB
8) A & B & AB
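The count of 8 follows from each of the three effects (A, B, AB) being either present or absent, so 2 x 2 x 2 patterns. A small sketch that enumerates them (the order differs from the numbered list on the slide):

```python
from itertools import product

effects = ["A", "B", "AB"]

# Each effect is either absent (False) or present (True): 2 ** 3 = 8 patterns.
patterns = [
    [effect for effect, present in zip(effects, flags) if present]
    for flags in product([False, True], repeat=len(effects))
]

for pattern in patterns:
    print(" & ".join(pattern) if pattern else "No effects at all")
```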
42
2 x 2 factorial design
                 A1                     A2                     Marginal means
B1               Condition mean A1B1    Condition mean A2B1    B1 mean
B2               Condition mean A1B2    Condition mean A2B2    B2 mean
Marginal means   A1 mean                A2 mean
Main effect of A: compare the A1 and A2 marginal means.
Main effect of B: compare the B1 and B2 marginal means.
Interaction of AB: What’s the effect of A at B1? What’s the effect of A at B2?
43
Examples of outcomes
Dependent variable (condition means, with marginal means):
          A1    A2    B mean
B1        30    60    45
B2        30    60    45
A mean    30    60
Main effect of A: ✓   Main effect of B: X   Interaction of A x B: X
44
Examples of outcomes
Dependent variable (condition means, with marginal means):
          A1    A2    B mean
B1        60    60    60
B2        30    30    30
A mean    45    45
Main effect of A: X   Main effect of B: ✓   Interaction of A x B: X
45
Examples of outcomes
Dependent variable (condition means, with marginal means):
          A1    A2    B mean
B1        60    30    45
B2        30    60    45
A mean    45    45
Main effect of A: X   Main effect of B: X   Interaction of A x B: ✓
46
Examples of outcomes
Dependent variable (condition means, with marginal means):
          A1    A2    B mean
B1        30    60    45
B2        30    30    30
A mean    30    45
Main effect of A: ✓   Main effect of B: ✓   Interaction of A x B: ✓
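The outcome patterns in these examples can be checked with a little arithmetic on the four condition means. The sketch below is descriptive only: it reports differences between means, and deciding whether such differences are statistically reliable would still need an ANOVA. The `summarize_2x2` helper is hypothetical; the two sets of condition means are the "A only" and "A, B, and AB" examples.

```python
def summarize_2x2(cells):
    """Return (effect of A, effect of B, interaction) from four condition means.

    `cells` maps (A level, B level) -> condition mean.
    """
    # Marginal means: average over the levels of the other factor.
    a1 = (cells["A1", "B1"] + cells["A1", "B2"]) / 2
    a2 = (cells["A2", "B1"] + cells["A2", "B2"]) / 2
    b1 = (cells["A1", "B1"] + cells["A2", "B1"]) / 2
    b2 = (cells["A1", "B2"] + cells["A2", "B2"]) / 2
    # Interaction: does the effect of A differ between B1 and B2?
    effect_of_a_at_b1 = cells["A2", "B1"] - cells["A1", "B1"]
    effect_of_a_at_b2 = cells["A2", "B2"] - cells["A1", "B2"]
    return a2 - a1, b2 - b1, effect_of_a_at_b1 - effect_of_a_at_b2

a_only = {("A1", "B1"): 30, ("A2", "B1"): 60,
          ("A1", "B2"): 30, ("A2", "B2"): 60}
all_three = {("A1", "B1"): 30, ("A2", "B1"): 60,
             ("A1", "B2"): 30, ("A2", "B2"): 30}

print(summarize_2x2(a_only))     # A differs; B and the interaction are zero
print(summarize_2x2(all_three))  # all three values are non-zero
```

A non-zero third value means the effect of A is not the same at B1 and B2, which is the "it depends on" pattern.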
47
Anxiety and Test Performance
Let’s add another variable: test difficulty (easy, medium, hard).
[Table: test performance for each combination of anxiety (low, mod, high) and test difficulty (easy, medium, hard), with marginal means for the main effect of anxiety and the main effect of difficulty]
Interaction? Yes: the effect of anxiety depends on the level of test difficulty.
48
Factorial Designs
Advantages:
Interaction effects: always consider the interaction effects before trying to interpret the main effects.
Adding factors decreases the variability, because you’re controlling more of the variables that influence the dependent variable. This increases the statistical power of the statistical tests.
Increases generalizability of the results, because you have a situation closer to the real world (where all sorts of variables are interacting).
49
Factorial Designs
Disadvantages:
Experiments become very large and unwieldy.
The statistical analyses get much more complex.
Interpretation of the results can get hard, in particular for higher-order interactions (interactions among more than two factors, e.g., ABC).
51
Example
What is the effect of presenting words in color on memory for those words?
So you present lists of words (e.g., Clock, Chair, Cab) for recall either in color or in black-and-white.
Two different designs to examine this question.
52
Between-Groups Factor
2 levels. Each of the participants is in only one level of the IV.
[Figure: participants are split into a colored words group and a BW words group (word list: Clock, Chair, Cab); each group then takes the test]
53
Within-Groups Factor
Sometimes called a “repeated measures” design.
2 levels. All of the participants are in both levels of the IV.
[Figure: the same participants get the colored words level and the BW words level (word list: Clock, Chair, Cab) and the test]
54
Between vs. Within Subjects Designs
Within-subjects designs: all participants participate in all of the conditions of the experiment.
Between-subjects designs: each participant participates in one and only one condition of the experiment.
[Figure: participants, colored words, BW words, and test for each type of design]
56
Between subjects designs
Advantages:
Independence of groups (levels of the IV).
Harder to guess what the experiment is about without experiencing the other levels of the IV.
Exposure to different levels of the independent variable(s) cannot “contaminate” the dependent variable.
Sometimes this is a “must,” because you can’t reverse the effects of prior exposure to other levels of the IV.
No order effects to worry about.
Counterbalancing is not required.
57
Between subjects designs
Disadvantages:
Individual differences between the people in the groups.
Excessive variability.
Non-equivalent groups.
58
Individual differences
The groups are composed of different individuals.
[Figure: participants split into colored words and BW words groups, then test]
59
Individual differences
The groups are composed of different individuals.
Excessive variability due to individual differences: it is harder to detect the effect of the IV if there is one (R is large relative to NR).
60
Individual differences
The groups are composed of different individuals.
Non-equivalent groups (possible confound): the groups may differ not only because of the IV, but also because the groups are composed of different individuals.
61
Dealing with Individual Differences
Strive for equivalent groups:
Created equally: use the same process to create both groups.
Treated equally: keep the experience as similar as possible for the two groups.
Composed of equivalent individuals:
Random assignment to groups: eliminates bias.
Matching groups: match each individual in one group to an individual in the other group on relevant characteristics.
62
Matching groups
Trying to create equivalent groups.
Also trying to reduce some of the overall variability, by eliminating variability from the variables that you matched people on.
Matched groups (matched on color, height, and age):
Group A                   Group B
Red, short, 21 yrs        Red, short, 21 yrs        (matched)
Blue, tall, 23 yrs        Blue, tall, 23 yrs        (matched)
Green, average, 22 yrs    Green, average, 22 yrs    (matched)
Brown, tall, 22 yrs       Brown, tall, 22 yrs       (matched)
63
Between vs. Within Subjects Designs
Between-subjects designs: each participant participates in one and only one condition of the experiment.
Within-subjects designs: all participants participate in all of the conditions of the experiment.
[Figure: participants, colored words, BW words, and test for each type of design]
64
Within subjects designs
Advantages:
Don’t have to worry about individual differences: the same people are in all the conditions, so the variability between conditions is smaller (statistical advantage).
Fewer participants are required.
65
Within subjects designs
Disadvantages:
Range effects.
Order effects: carry-over effects and progressive error.
Counterbalancing is probably necessary to address these order effects.
66
Within subjects designs
Range effects (context effects) can cause a problem.
The range of values for your levels may impact performance (typically best performance in the middle of the range).
Since all the participants get the full range of possible values, they may “adapt” their performance (the DV) to this range.
67
Order effects
Carry-over effects: transfer between conditions is possible. Effects may persist from one condition into another.
E.g., an alcohol vs. no alcohol experiment on the effects on hand-eye coordination. It is hard to know how long the effects of alcohol may persist.
Condition 1 -> test -> Condition 2 -> test: how long do we wait for the effects to wear off?
68
Order effects
Progressive error:
Practice effects: improvement due to repeated practice.
Fatigue effects: performance deteriorates as participants get bored, tired, or distracted.
69
Dealing with order effects
Counterbalancing is probably necessary. This is used to control for “order effects.”
Ideally, use every possible order (n!, e.g., AB = 2! = 2 orders; ABC = 3! = 6 orders; ABCD = 4! = 24 orders; etc.).
All counterbalancing assumes symmetrical transfer: the assumption that AB and BA have reverse effects and thus cancel out in a counterbalanced design.
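The n! growth is easy to see by listing the orders. A minimal sketch (the `all_orders` helper is a hypothetical name) that enumerates every possible order of the conditions for complete counterbalancing:

```python
from itertools import permutations
from math import factorial

def all_orders(conditions):
    """Every possible order of the conditions (complete counterbalancing)."""
    return ["".join(order) for order in permutations(conditions)]

print(all_orders("AB"))    # the two orders for two conditions
print(all_orders("ABC"))   # six orders for three conditions

for conditions in ("AB", "ABC", "ABCD"):
    assert len(all_orders(conditions)) == factorial(len(conditions))
    print(conditions, "->", len(all_orders(conditions)), "orders")
```

With five conditions the list already has 120 orders, which is why partial counterbalancing is often used instead.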
70
Counterbalancing
Simple case: two conditions, A & B.
Two counterbalanced orders: AB and BA.
[Figure: participants get colored words then BW words, or BW words then colored words, with tests]
71
Counterbalancing
Often it is not practical to use every possible ordering.
Partial counterbalancing: Latin square designs are a form of partial counterbalancing, in which each group of trials occurs in each position an equal number of times.
72
Partial counterbalancing
Example: consider four conditions. Recall: ABCD = 4! = 24 possible orders.
1) Unbalanced Latin square: each condition appears in each position (4 orders).
Order 1: A B C D
Order 2: D A B C
Order 3: C D A B
Order 4: B C D A
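One common way to build a square with this property is to rotate the list of conditions by one position for each new order. The sketch below uses that cyclic construction as an assumption (the slide does not say how its square was built), and `cyclic_latin_square` is a hypothetical helper name.

```python
def cyclic_latin_square(conditions):
    """Build n orders by shifting the condition list one position per order."""
    n = len(conditions)
    return [
        [conditions[(position - row) % n] for position in range(n)]
        for row in range(n)
    ]

square = cyclic_latin_square(["A", "B", "C", "D"])
for number, order in enumerate(square, start=1):
    print(f"Order {number}:", " ".join(order))

# Check the defining property: every condition appears once in every position.
for position in range(len(square)):
    assert sorted(order[position] for order in square) == ["A", "B", "C", "D"]
```

This controls for position only; it does not guarantee that each condition precedes and follows every other condition equally often, which is what the balanced version addresses.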
73
Partial counterbalancing
Example: consider four conditions. Recall: ABCD = 4! = 24 possible orders.
2) Balanced Latin square: each condition appears before and after all others (8 orders).
[Table: orders, including A B C D and A B D C]
74
Mixed factorial designs
Treat some factors as within-subjects (participants get all levels of that factor) and others as between-subjects (each level of this factor gets a different group of participants).
This only works with factorial (multi-factor) designs.