Experimental Design and Analysis Instructor: Mark Hancock February 8, 2008Slides by Mark Hancock.

Slides:



Advertisements
Similar presentations
Chapter 16 Introduction to Nonparametric Statistics
Advertisements

PSY 307 – Statistics for the Behavioral Sciences Chapter 20 – Tests for Ranked Data, Choosing Statistical Tests.
Chapter Seventeen HYPOTHESIS TESTING
Experimental Design & Analysis
Final Review Session.
Intro to Statistics for the Behavioral Sciences PSYC 1900
Probability & Statistics for Engineers & Scientists, by Walpole, Myers, Myers & Ye ~ Chapter 10 Notes Class notes for ISE 201 San Jose State University.
One-way Between Groups Analysis of Variance
T-Tests Lecture: Nov. 6, 2002.
Chapter 2 Simple Comparative Experiments
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
The t Tests Independent Samples.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
Chapter 14 Inferential Data Analysis
Richard M. Jacobs, OSA, Ph.D.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Chapter 12: Analysis of Variance
T Test for One Sample. Why use a t test? The sampling distribution of t represents the distribution that would be obtained if a value of t were calculated.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
AM Recitation 2/10/11.
Estimation and Hypothesis Testing Faculty of Information Technology King Mongkut’s University of Technology North Bangkok 1.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Basic Statistics Michael Hylin. Scientific Method Start w/ a question Gather information and resources (observe) Form hypothesis Perform experiment and.
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
Copyright © 2012 by Nelson Education Limited. Chapter 8 Hypothesis Testing II: The Two-Sample Case 8-1.
Statistical Analysis Statistical Analysis
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
1 Level of Significance α is a predetermined value by convention usually 0.05 α = 0.05 corresponds to the 95% confidence level We are accepting the risk.
Statistics & Biology Shelly’s Super Happy Fun Times February 7, 2012 Will Herrick.
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
Chapter 11 HYPOTHESIS TESTING USING THE ONE-WAY ANALYSIS OF VARIANCE.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
One-way Analysis of Variance 1-Factor ANOVA. Previously… We learned how to determine the probability that one sample belongs to a certain population.
Basic concept Measures of central tendency Measures of central tendency Measures of dispersion & variability.
Between-Groups ANOVA Chapter 12. >When to use an F distribution Working with more than two samples >ANOVA Used with two or more nominal independent variables.
Chapter 16 The Chi-Square Statistic
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
Research Seminars in IT in Education (MIT6003) Quantitative Educational Research Design 2 Dr Jacky Pow.
AP STATS EXAM REVIEW Chapter 8 Chapter 13 and 14 Chapter 11 and 12 Chapter 9 and Chapter 10 Chapter 7.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
Chapter 14 Repeated Measures and Two Factor Analysis of Variance
Assignment 1 February 15, 2008Slides by Mark Hancock.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Chapter 10 The t Test for Two Independent Samples
Chapter 12 Introduction to Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick.
Chapter 13 Repeated-Measures and Two-Factor Analysis of Variance
CD-ROM Chap 16-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition CD-ROM Chapter 16 Introduction.
© Copyright McGraw-Hill 2004
T Test for Two Independent Samples. t test for two independent samples Basic Assumptions Independent samples are not paired with other observations Null.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Chapter 13 Understanding research results: statistical inference.
Introduction to the t statistic. Steps to calculate the denominator for the t-test 1. Calculate variance or SD s 2 = SS/n-1 2. Calculate the standard.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 4 Investigating the Difference in Scores.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Chapter 14 Repeated Measures and Two Factor Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Seventh.
Chapter 9 Introduction to the t Statistic
The 2 nd to last topic this year!!.  ANOVA Testing is similar to a “two sample t- test except” that it compares more than two samples to one another.
Six Easy Steps for an ANOVA 1) State the hypothesis 2) Find the F-critical value 3) Calculate the F-value 4) Decision 5) Create the summary table 6) Put.
Chapter 2 Simple Comparative Experiments
Reasoning in Psychology Using Statistics
What are their purposes? What kinds?
Chapter 10 – Part II Analysis of Variance
Statistical Inference for the Mean: t-test
Presentation transcript:

Experimental Design and Analysis Instructor: Mark Hancock February 8, 2008Slides by Mark Hancock

What is an experiment? February 8, 2008Slides by Mark Hancock

You will be able to describe some of the key elements of an experiment that can be analyzed with a statistical test. February 8, 2008Slides by Mark Hancock

Elements of an Experiment People Data – Measurement Hypothesis (There are more… we’ll learn about them later) February 8, 2008Slides by Mark Hancock

People Sample (participants) – People in your study Population – E.g., Canadians, computer scientists, artists – People we want to generalize to February 8, 2008Slides by Mark Hancock

Data Variable – E.g., technique, task time, number of errors Statistic – Mean, median, mode, standard deviation, etc. – Taken from the sample Parameter – Taken from the population February 8, 2008Slides by Mark Hancock

Hypothesis (examples) The average temperature in Calgary is less than -20˚C. A pair of dice will result in a roll of 7 more than it will result in a roll of 10. Canadians prefer Hockey to Baseball. February 8, 2008Slides by Mark Hancock

Hypotheses Carman is a great foosball player. Carman is better at foosball than Mark. Carman wins more foosball games than Mark. Mark scores more points in foosball than Carman. February 8, 2008Slides by Mark Hancock

Hypotheses TreeMaps are easy to use. TreeMaps are better than Phylotrees. People find leaf nodes faster with TreeMaps than with Phylotrees. People find sibling nodes faster with Phylotrees than with TreeMaps. February 8, 2008Slides by Mark Hancock

Null Hypothesis “… the null hypothesis is a pinpoint statement as to the unknown quantitative value of the parameter in the population[s] of interest.” Huck, S.W. Reading Statistics and Research February 8, 2008Slides by Mark Hancock

Null Hypothesis Calgary Temperature: μ Calgary = -20˚C Dice Rolling: μ 7 = μ 10 μ 7 - μ 10 = 0 Foosball: μ Carman = μ Mark Tree Vis: μ TreeMap = μ Phylotrees February 8, 2008Slides by Mark Hancock

What is the null hypothesis? Hypothesis: The average temperature in Vancouver is higher than the average temperature in Calgary H 0 : μ Calgary = μ Vancouver February 8, 2008Slides by Mark Hancock

Elements of an Experiment People – Sample/Participants – Population Data – Variables – Statistics – Parameters Hypotheses – Null Hypothesis February 8, 2008Slides by Mark Hancock

Label this graph: Independent Variable? Dependent Variable? February 8, 2008Slides by Mark Hancock

Variables Independent Variables (Factors) – What you (the experimenter) are changing during the experiment Dependent Variables (Measures) – What is being measured Constants – What is the same for all participants February 8, 2008Slides by Mark Hancock

Variables (Exercise) Questionnaire: ask computer science students to rate their favourite teacher. What independent variables would you use? What dependent variables would you use? What would you keep constant? February 8, 2008Slides by Mark Hancock

Problem: hypotheses are about population, but we only have access to data from a sample. February 8, 2008Slides by Mark Hancock

You will be able to describe why the Law of Large Numbers and the Central Limit Theorem allow us to make general statements about a population based on information about a sample. February 8, 2008Slides by Mark Hancock

Dice Rolling Roll one die – Predictions? Roll one die 5 times and take the average? – Predictions? Roll one die 100 times and take the average? – Predictions? February 8, 2008Slides by Mark Hancock

Dice Rolling Roll one die n times: – Possible outcomes: 1, 2, 3, 4, 5, 6 – Probability of rolling X: P(X) = 1/6 = 16.7% – Expected Value: E(X) = 1(1/6) + 2(1/6) + … + 6(1/6) = 3.5 February 8, 2008Slides by Mark Hancock

Law of Large Numbers “Given a sample of independent and identically distributed random variables with a finite expected value, the average of these observations will eventually approach and stay close to the expected value.” "Law of large numbers." Wikipedia February 8, 2008Slides by Mark Hancock

Experiment “Law of Large Numbers”, Wikipedia February 8, 2008Slides by Mark Hancock

Central Limit Theorem “…if the sum of independent identically distributed random variables has a finite variance, then it will be approximately normally distributed.” “Central Limit Theorem." Wikipedia February 8, 2008Slides by Mark Hancock

Gaussian/Normal Distribution Standard Deviation Mean February 8, 2008Slides by Mark Hancock

LLN vs. CLT Law of Large Numbers “Given a sample of independent and identically distributed random variables with a finite expected value, the average of these observations will eventually approach and stay close to the expected value.” Central Limit Theorem “…if the sum of independent identically distributed random variables has a finite variance, then it will be approximately normally distributed.” February 8, 2008Slides by Mark Hancock

Generalize from μ sample to μ population What information do we have? – Sample mean – Sample variance What information do we seek? – Population mean – Population variance February 8, 2008Slides by Mark Hancock

Generalize from μ sample to μ population Assumptions: – population has expected value of μ population – population has finite variance σ population Conclude: – provided we have enough people (N is large): μ sample  μ population (by LLN) σ sample  σ population (by CLT) February 8, 2008Slides by Mark Hancock

Summary Dependent/independent variables Constants Law of Large Numbers: – eventually data tends to the expected value Central Limit Theorem: – most data tends toward a normal distribution February 8, 2008Slides by Mark Hancock

Break: 15 Minutes February 8, 2008Slides by Mark Hancock

Significance and Power February 8, 2008Slides by Mark Hancock

You will be able to identify two types of errors and be able to avoid these errors when running a study. February 8, 2008Slides by Mark Hancock

Types of mistakes 1.Find a difference when there isn’t one 2.Find no difference when there is one February 8, 2008Slides by Mark Hancock

Rejecting the Null Hypothesis (H 0 ) Reality H 0 trueH 0 false Decision H 0 t rue Type II H 0 false Type I February 8, 2008Slides by Mark Hancock

Rejecting the Null Hypothesis (H 0 ) Reality H 0 trueH 0 false Decision H 0 t rue β H 0 false α or p February 8, 2008Slides by Mark Hancock

Significance (α) – calculated after the experiment Power (1 - β) – calculated before (a priori) or after (post hoc) – depends on effect size and sample size February 8, 2008Slides by Mark Hancock

How do we avoid these errors? 1.Decide before the analysis how acceptable this would be (e.g., p <.05). 2.The smaller the effect size you expect, the larger sample size you need. February 8, 2008Slides by Mark Hancock

(Student’s) T-Test February 8, 2008Slides by Mark Hancock

Who is attributed with the discovery of the Student’s T-Test? February 8, 2008Slides by Mark Hancock

A student! – William Sealy Gosset Guinness Brewery employee Monitored beer quality February 8, 2008Slides by Mark Hancock

You will be able to formulate the appropriate null hypothesis and calculate the t-value for data from a sample. February 8, 2008Slides by Mark Hancock

Null Hypotheses μ = μ 0 (constant value) μ A = μ B February 8, 2008Slides by Mark Hancock

Assumptions Data is distributed normally Equal variance: σ A = σ B (for second H 0 ) February 8, 2008Slides by Mark Hancock

Example Independent Variable: – TreeMap vs. Phylotrees Dependent Variable: – Time to find a leaf node Data: – 30 people used TreeMap, 30 used Phylotrees – Found one leaf node each February 8, 2008Slides by Mark Hancock

Check the Null Hypothesis Null Hypothesis (for population): μ T = μ P or μ T – μ P = 0 Test A (for sample): check value of μ T – μ P February 8, 2008Slides by Mark Hancock

How is the data distributed? Sample Distribution February 8, 2008Slides by Mark Hancock

How do you account for the differences in variance? February 8, 2008Slides by Mark Hancock

February 8, 2008Slides by Mark Hancock

February 8, 2008Slides by Mark Hancock

a.k.a. Standard Error of difference between means February 8, 2008Slides by Mark Hancock

February 8, 2008Slides by Mark Hancock

Interpreting the T Ratio What makes the ratio large? 1.Larger difference 2.Smaller variance Large t => more likely to be a real difference February 8, 2008Slides by Mark Hancock

How do we find significance? Look up in a table (the math is too hard for humans to do) Pick a level of significance (e.g., α =.05) and find the row corresponding to your sample size (df = n – 1). If t > (value in that cell), then p < α February 8, 2008Slides by Mark Hancock

df … … February 8, 2008Slides by Mark Hancock

Break: 20 Minutes February 8, 2008Slides by Mark Hancock

Analysis of Variance (ANOVA) February 8, 2008Slides by Mark Hancock

You will be able to formulate the appropriate null hypothesis and calculate the F-score for data from a sample. February 8, 2008Slides by Mark Hancock

Null Hypotheses μ A = μ B = μ c = … Remember: “the null hypothesis is a pinpoint statement “ σ μ = 0 February 8, 2008Slides by Mark Hancock

Assumptions Data is distributed normally Homogeneity of variance: σ A = σ B = σ C = … A, B, C, … are independent from one another February 8, 2008Slides by Mark Hancock

Example Independent Variable: – TreeMap vs. Phylotrees vs. ArcTrees Dependent Variable: – Time to find a leaf node Data: – 30 people used TreeMap, 30 used Phylotrees, 30 used ArcTrees – Found one leaf node each February 8, 2008Slides by Mark Hancock

Check the Null Hypothesis Null Hypothesis (for population): μ T = μ P = μ A Test A (for sample): check H 0 for sample we know this is not enough! February 8, 2008Slides by Mark Hancock

February 8, 2008Slides by Mark Hancock

February 8, 2008Slides by Mark Hancock

Degrees of Freedom (df) How many more pieces of data you need e.g., if you have μ, you need n-1 pieces of data to find the missing piece of data February 8, 2008Slides by Mark Hancock

Sum of Squares (SS) Measure of variance February 8, 2008Slides by Mark Hancock

Mean Square “Mean” of the sum of squares February 8, 2008Slides by Mark Hancock

F-Score (Fisher’s Test) February 8, 2008Slides by Mark Hancock

F-Score (Fisher’s Test) February 8, 2008Slides by Mark Hancock

Example Degrees of Freedom Sum of Squares Mean Square F Between Groups Within Groups Total How many groups (i.e., how many means are we comparing)? How many total participants? Report as: F(3,146) = 4.5 February 8, 2008Slides by Mark Hancock

Example “…The results of a one-way ANOVA indicated that UFOV [useful field of view] reduction increased with dementia severity, F(2,52) = 15.36, MSe = , p < How many groups of participants were there? How many total participants were there? Fill in the table from before… February 8, 2008Slides by Mark Hancock

Example Degrees of Freedom Sum of Squares Mean Square F Between Groups2165, , Within Groups52279,3185,371.5 Total54444, February 8, 2008Slides by Mark Hancock

Example Independent Variable: – TreeMap vs. Phylotrees vs. ArcTrees Data: – 30 people used TreeMap, 30 used Phylotrees, 30 used ArcTrees – Found one leaf node each Fill in the degrees of freedom column February 8, 2008Slides by Mark Hancock

Example Degrees of Freedom Sum of Squares Mean Square F Between Groups2 Within Groups87 Total89 February 8, 2008Slides by Mark Hancock

What does large F mean? Remember: Consider the null hypothesis What does each value estimate? February 8, 2008Slides by Mark Hancock

F-table (α =.05) df between df within February 8, 2008Slides by Mark Hancock

Summary Analysis of Variance (ANOVA) is used to compare 2 or more means The F-score and df indicate the probability of a Type I error in rejecting the null hypothesis February 8, 2008Slides by Mark Hancock

Summary of First Day Elements of an experiment Null Hypothesis Variables (independent/dependent) Law of Large Numbers/Central Limit Theorem Significance and Power T-Test One-way ANOVA February 8, 2008Slides by Mark Hancock

Next Week Two-way & three-way ANOVA Non-parametric tests February 8, 2008Slides by Mark Hancock