Department of Cognitive Science Michael J. Kalsher PSYC 4310 COGS 6310 MGMT 6969 © 2015, Michael Kalsher Unit 1B: Everything you wanted to know about basic.

Slides:



Advertisements
Similar presentations
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Advertisements

Hypothesis Testing Steps in Hypothesis Testing:
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
Everything You Ever Wanted to Know about Statistics
1 SSS II Lecture 1: Correlation and Regression Graduate School 2008/2009 Social Science Statistics II Gwilym Pryce
Statistical Issues in Research Planning and Evaluation
Chapter 7: Statistical Applications in Traffic Engineering
1. Estimation ESTIMATION.
Review: What influences confidence intervals?
Topics: Inferential Statistics
Experimental Design, Statistical Analysis CSCI 4800/6800 University of Georgia Spring 2007 Eileen Kraemer.
Experimental Design & Analysis
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
BHS Methods in Behavioral Sciences I
Lect 10b1 Histogram – (Frequency distribution) Used for continuous measures Statistical Analysis of Data ______________ statistics – summarize data.
Today Concepts underlying inferential statistics
Intro to Parametric Statistics, Assumptions & Degrees of Freedom Some terms we will need Normal Distributions Degrees of freedom Z-values of individual.
Chapter 9: Introduction to the t statistic
Inferential Statistics
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
AM Recitation 2/10/11.
Section #4 October 30 th Old: Review the Midterm & old concepts 1.New: Case II t-Tests (Chapter 11)
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
Statistics Primer ORC Staff: Xin Xin (Cindy) Ryan Glaman Brett Kellerstedt 1.
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
F OUNDATIONS OF S TATISTICAL I NFERENCE. D EFINITIONS Statistical inference is the process of reaching conclusions about characteristics of an entire.
1 Statistical Inference Greg C Elvers. 2 Why Use Statistical Inference Whenever we collect data, we want our results to be true for the entire population.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Descriptive Statistics: Variability Lesson 5. Theories & Statistical Models n Theories l Describe, explain, & predict real- world events/objects n Models.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Introduction to Linear Regression
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
E VERYTHING Y OU E VER WANTED T O K NOW A BOUT S TATISTICS Chapter 2 Or why there’s no need for this face 
Confidence intervals and hypothesis testing Petter Mostad
Department of Cognitive Science Michael J. Kalsher Adv. Experimental Methods & Statistics PSYC 4310 / COGS 6310 Regression 1 PSYC 4310/6310 Advanced Experimental.
Chapter 9 Introduction to the t Statistic. 9.1 Review Hypothesis Testing with z-Scores Sample mean (M) estimates (& approximates) population mean (μ)
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 6 Hypothesis Tests with Means.
Review I A student researcher obtains a random sample of UMD students and finds that 55% report using an illegally obtained stimulant to study in the past.
Chapter 10 The t Test for Two Independent Samples
© Copyright McGraw-Hill 2004
T Test for Two Independent Samples. t test for two independent samples Basic Assumptions Independent samples are not paired with other observations Null.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
CHAPTER 1 EVERYTHING YOU EVER WANTED TO KNOW ABOUT STATISTCS.
1 Probability and Statistics Confidence Intervals.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
Chapter 13 Understanding research results: statistical inference.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Hypothesis Testing and Statistical Significance
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 4 Investigating the Difference in Scores.
Building Statistical Models Lesson 4. Theories & Models n Theories l Describe, explain, & predict real-world events/objects n Models l Replicas of real-world.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chapter 9 Introduction to the t Statistic
Learning Objectives Describe the hypothesis testing process Distinguish the types of hypotheses Explain hypothesis testing errors Solve hypothesis testing.
INF397C Introduction to Research in Information Studies Spring, Day 12
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Fall 2016 Room 150 Harvill Building 10: :50 Mondays, Wednesdays.
Summary descriptive statistics: means and standard deviations:
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Fall 2016 Room 150 Harvill Building 10: :50 Mondays, Wednesdays.
Statistics in Applied Science and Technology
Review: What influences confidence intervals?
Some statistics questions answered:
Statistical Inference for the Mean: t-test
Presentation transcript:

Department of Cognitive Science Michael J. Kalsher PSYC 4310 COGS 6310 MGMT 6969 © 2015, Michael Kalsher Unit 1B: Everything you wanted to know about basic statistics …

Slide 2 Learning Outcomes After completing this section, students will be able to: Demonstrate knowledge concerning what a statistical model is and why we use them (e.g., the mean) Demonstrate knowledge of what the ‘fit’ of a model is and why it is important (e.g., the standard deviation) Distinguish models for samples and populations. Describe and discuss problems with NHST and modern approaches (e.g., reporting confidence intervals and effect sizes).

Slide 3 The Research Process

Slide 4 Building Statistical Models

Slide 5 Populations and Samples Population –The collection of units (be they people, plankton, plants, cities, suicidal authors, etc.) to which we want to generalize a set of findings or a statistical model. Sample –A smaller (but hopefully representative) collection of units from a population used to determine truths about that population

Slide 6 The Only Equation You Will Ever Need … only partly kidding!

Slide 7 A Simple Statistical Model In statistics we fit models to our data (i.e. we use a statistical model to represent what is happening in the real world). The mean is a hypothetical value (i.e. it doesn’t have to be a value that actually exists in the data set). As such, the mean is simple statistical model.

Slide 8 The Mean The mean (or average) is the value from which the (squared) scores deviate least (it has the least error).

Slide 9 The Mean as a Model

Slide 10 Measuring the ‘Fit’ of the Model The mean is a model of what happens in the real world: the typical score. It is not a perfect representation of the data. How can we assess how well the mean represents reality?

Slide 11 A Perfect Fit Rater Rating (out of 5)

Slide 12 Calculating ‘Error’ A simple deviation is the difference between the mean and an actual data point. Deviations can be calculated by taking each score and subtracting the mean from it:

Slide 13

Slide 14 Use the Total Error? We could just take the error between the mean and each data point and add them up. ScoreMeanDeviation Total =0 But, this is what we would get!

Slide 15 Sum of Squared Errors Merely summing the deviations to find out the total error is problematic. This is because deviations cancel out because some are positive and others negative. Therefore, we square each deviation. If we add these squared deviations we get the Sum of Squared Errors (SS).

Slide 16 ScoreMeanDeviation Squared Deviation Total5.20 Sum of Squared Errors (SS)

Slide 17 Mean Squared Error (MS e ) Although the SS is a good measure of the accuracy of our model, it depends on the amount of data collected. To overcome this problem, we use the average, or mean, squared error, but to compute the average we use the degrees of freedom (df) as the denominator.

Slide 18 Degrees of Freedom Sample ? Population The number of degrees of freedom generally refers to the number of independent observations in a sample minus the number of population parameters that must be estimated from sample data (please refer to Jane Superbrain 2.2 on p. 49 of the Field textbook).

Slide 19 The Standard Error The standard deviation (SD) tells us how well the mean represents the sample data. If we wish to estimate this parameter in the population (μ is the symbol used to connote the population mean) then we need to know how well the sample mean represents the values in the population. The standard error of the mean (SE or standard error) tells us the extent to which our sample mean is representative of the population parameter.

Slide 20 The SD and the Shape of a Distribution

Slide 21 Samples vs. Populations Sample –Here, the Mean and SD describe only the sample from which they were calculated. Population –Here, the Mean and SD are intended to describe the entire population (very rare in practice). Sample to Population: –Mean and SD are obtained from a sample, but are used to estimate the mean and SD of the population (very common in practice).

Slide 22 Sampling Variation

Slide 23  = 10 M = 8 M = 10 M = 9 M = 11 M = 12 M = 11 M = 9 M = 10 The SE can be estimated by dividing the sample standard deviation by the square root of the sample size.

Slide 24 Confidence Intervals In statistical inference, we attempt to estimate population parameters using observed sample data. A confidence interval gives an estimated range of values which is likely to include an unknown population parameter, the estimated range being calculated from a given set of sample data. Confidence intervals are constructed at a confidence level, such as 95 %, selected by the user. What does this mean? It means that if the same population is sampled on numerous occasions and interval estimates are made on each occasion, the resulting intervals would bracket the true population parameter in approximately 95 % of the cases.

Slide 25 Confidence Intervals (see discussion starting on p. 54 of Field textbook) Domjan et al. (1998) –‘Conditioned’ sperm release in Japanese Quail. True Mean –15 Million sperm Sample Mean – 17 Million sperm Interval estimate –12 to 22 million (contains true value) –16 to 18 million (misses true value) –CIs constructed such that 95% contain the true value.

Slide 26

Slide 27 Showing Confidence Intervals Visually

Slide 28 Types of Hypotheses Null hypothesis, H 0 –There is no effect. –Example: Big Brother contestants and members of the public will not differ in their scores on personality disorder questionnaires The alternative hypothesis, H 1 –AKA the experimental hypothesis –Example: Big Brother contestants will score higher on personality disorder questionnaires than members of the public

Slide 29 Test Statistics A Statistic for which the frequency of particular values is known. Observed values can be used to test hypotheses.

Slide 30 What does Null Hypothesis Significance Testing (NHST) Tell Us? The importance of an effect? –No, significance depends on the sample size. That the null hypothesis is false? –No, it is always false. That the null hypothesis is true? –No, it is never true. One problem with NHST is that it encourages all or nothing thinking.

Slide 31 One- and Two-Tailed Tests

Slide 32 Type I and Type II Errors Type I error –occurs when we believe that there is a genuine effect in our population, when in fact there isn’t. –The probability is the α-level (usually.05) Type II error –occurs when we believe that there is no effect in the population when, in reality, there is. –The probability is the β-level (often.2)

Slide 33 Confidence Intervals & Statistical Significance

Slide 34 Effect Sizes An effect size is a standardized measure of the size of an effect: –Standardized = so comparable across studies –Not (as) reliant on the sample size –Allows people to objectively evaluate the size of observed effect.

Slide 35 Effect Size Measures There are several effect size measures that can be used: –Cohen’s d –Pearson’s r –Glass’ Δ –Hedges’ g –Odds Ratio/Risk rates Pearson’s r is a good intuitive measure –Oh, apart from when group sizes are different …

Slide 36 Effect Size Measures r =.1, d =.2 (small effect): –the effect explains 1% of the total variance. r =.3, d =.5 (medium effect): –the effect accounts for 9% of the total variance. r =.5, d =.8 (large effect): –the effect accounts for 25% of the variance. Beware of these ‘canned’ effect sizes though: –The size of effect should be placed within the research context.

Slide 37 Important Concepts Alternative hypothesisStandard error of the mean (SE) Confidence intervalSum of squared errors (SS) Degrees of freedomTest statistic DeviationUnexplained variance Effect Size (measures) Explained variance Frequency distribution Mean Mean Squared Error (MSe) Model Null hypothesis Null Hypothesis Significance Testing (NHST) Population Sample Standard deviation