Statistics Primer ORC Staff: Xin Xin (Cindy) Ryan Glaman Brett Kellerstedt 1.

Slides:



Advertisements
Similar presentations
Introduction to Hypothesis Testing
Advertisements

Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
8-2 Basics of Hypothesis Testing
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Today Concepts underlying inferential statistics
Hypothesis Testing Using The One-Sample t-Test
Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Inferential Statistics
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
AM Recitation 2/10/11.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Hypothesis Testing:.
Overview of Statistical Hypothesis Testing: The z-Test
Lecture Slides Elementary Statistics Twelfth Edition
Overview Definition Hypothesis
Descriptive statistics Inferential statistics
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Sections 8-1 and 8-2 Review and Preview and Basics of Hypothesis Testing.
Chapter 8 Introduction to Hypothesis Testing
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Copyright © 2008 by Pearson Education, Inc. Upper Saddle River, New Jersey All rights reserved. John W. Creswell Educational Research: Planning,
Statistical Analysis Statistical Analysis
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
Statistics. Intro to statistics Presentations More on who to do qualitative analysis Tututorial time.
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Statistical Power The ability to find a difference when one really exists.
Overview Basics of Hypothesis Testing
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
Chapter 15 Data Analysis: Testing for Significant Differences.
Copyright © 2012 by Nelson Education Limited. Chapter 7 Hypothesis Testing I: The One-Sample Case 7-1.
Chapter 8 Introduction to Hypothesis Testing
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Analyzing and Interpreting Quantitative Data
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 10. Hypothesis Testing II: Single-Sample Hypothesis Tests: Establishing the Representativeness.
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Quick Review Central tendency: Mean, Median, Mode Shape: Normal, Skewed, Modality Variability: Standard Deviation, Variance.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview.
Inferential Statistics. Coin Flip How many heads in a row would it take to convince you the coin is unfair? 1? 10?
Three Broad Purposes of Quantitative Research 1. Description 2. Theory Testing 3. Theory Generation.
Chapter 9: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Type I and II Errors Testing the difference between two means.
Chapter Eight: Using Statistics to Answer Questions.
Data Analysis.
Chapter 6: Analyzing and Interpreting Quantitative Data
© Copyright McGraw-Hill 2004
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
T tests comparing two means t tests comparing two means.
Hypothesis Testing and Statistical Significance
Data Analysis. Qualitative vs. Quantitative Data collection methods can be roughly divided into two groups. It is essential to understand the difference.
Chapter 9 Introduction to the t Statistic
Outline Sampling Measurement Descriptive Statistics:
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Analyzing and Interpreting Quantitative Data
Chapter Nine: Using Statistics to Answer Questions
Presentation transcript:

Statistics Primer ORC Staff: Xin Xin (Cindy) Ryan Glaman Brett Kellerstedt 1

Quick Overview of Statistics 2

Descriptive vs. Inferential Statistics  Descriptive Statistics: summarize and describe data (central tendency, variability, skewness)  Inferential Statistics: procedure for making inferences about population parameters using sample statistics SamplePopulation 3

Measures of Central Tendency Raw dataSimple frequency distribution Group frequency distribution Notations Mode Pick out the value (s) occurring more than any other value. Pick out the value (s) with the highest frequency. = Difference between the freq. of modal class and the freq. of the next lower class. = Difference between the freq. of modal class and the freq. of the next higher class. L 1 = Lower class boundary of the modal class c = class width of the modal class Median 1. Order data 2. Determine median position = (n+1)/2 3. Locate median based on step Order data 2. Determine median position = (n+1)/2 3. Locate median based on step 2 using the freq. column L m =lower class boundary of median class n = sample size C.F. = sum of all frequencies lower than the median class f med = frequency of the median class c = class width of the median class Mean Add up all the data values and divide by the number of values. Find the product of all the values and their frequencies ; then add all the products; and finally divide by the total frequency. Find the product of all the midpoints and their frequencies ; then add all the products; and finally divide by the total frequency. X = the actual values (for raw data and ungrouped freq. dist.) = midpoints (for group freq. dist.) f = frequency n = sample size N = population size = summation or sum of 4

DescriptionApplicabilityAdvantageDisadvantage Range Difference between the largest and the smallest value in the data. 1. Interval/ratio 2. No outliers exist 1. Simple to calculate 1. Highly influenced by outliers. 2. Does not use all data Mean deviation It measures the average absolute deviations from the mean. Uncommonly used 1. Interval/ratio 2. When no outliers exist 1. Use all the data 2. Easy to interpret 1. Not resistant to outliers 2. Does not yield any further useful statistical properties. Variance/ standard deviation Variance is the average squared deviations from the mean. Standard deviation is square root of the variance. Commonly used. 1. Interval/ratio 2. When no outliers exist 1. Provides good statistical properties, by avoiding the use of absolute values. 2. Use all the data 1. Not resistant to outliers. 2. Variance depends on the units of measurement, therefore not easy to make comparisons. Sum of Squares Measures variability of the scores, the total variation of all scores 1. Interval/ratio 2. When no outliers exist 1. Effect size calculation 1. Not resistant to outliers. 5 Measures of Variability 5

Variance and Sum of Squares 6

Empirical Rule  The empirical rule states that symmetric or normal distribution with population mean μ and standard deviation σ have the following properties. 7

OutcomeBall 1Ball 2Mean All possible outcomes are shown below in Table 1. Table 1. All possible outcomes when two balls are sampled with replacement. Sampling Distribution 8

Sampling Error As has been stated before, inferential statistics involve using a representative sample to make judgments about a population. Lets say that we wanted to determine the nature of the relationship between county and achievement scores among Texas students. We could select a representative sample of say 10,000 students to conduct our study. If we find that there is a statistically significant relationship in the sample we could then generalize this to the entire population. However, even the most representative sample is not going to be exactly the same as its population. Given this, there is always a chance that the things we find in a sample are anomalies and do not occur in the population that the sample represents. This error is referred as sampling error. 9

Sampling Error A formal definition of sampling error is as follows: Sampling error occurs when random chance produces a sample statistic that is not equal to the population parameter it represents. Due to sampling error there is always a chance that we are making a mistake when rejecting or failing to reject our null hypothesis. Remember that inferential procedures are used to determine which of the statistical hypotheses is true. This is done by rejecting or failing to reject the null hypothesis at the end of a procedure. 10

Sampling Distribution and Standard Error (SE) 

Hypothesis Testing  Null Hypothesis Statistical Significance Testing (NHSST)  Testing p-values using statistical significance tests  Effect Size  Measure magnitude of the effect (e.g., Cohen’s d) 12

Null Hypothesis Statistical Significance Testing  Statistical significance testing answers the following question:  Assuming the sample data came from a population in which the null hypothesis is exactly true, what is the probability of obtaining the sample statistic one got for one’s sample data with the given sample size? (Thompson, 1994)  Alternatively:  Statistical significance testing is used to examine a statement about a relationship between two variables. 13

Hypothetical Example  Is there a difference between the reading abilities of boys and girls?  Null Hypothesis (H 0 ): There is not a difference between the reading abilities of boys and girls.  Alternative Hypothesis (H 1 ): There is a difference between the reading abilities of boys and girls.  Alternative hypotheses may be non-directional (above) or directional (e.g., boys have a higher reading ability than girls). 14

Testing the Hypothesis  Use a sampling distribution to calculate the probability of a statistical outcome.  p calc = likelihood of the sample’s result  p calc < p critical : reject H 0  p calc ≥ p critical : fail to reject H 0 15

Level of Significance (p crit )  Alpha level (α) determines:  The probability at which you reject the null hypothesis  The probability of making a Type I error (typically.05 or.01) True Outcome in Population Reject H 0 is trueH 0 is false Observed Outcome Reject H 0 Type I error (α)Correct Decision Fail to reject H 0 Correct DecisionType II error (β) 16

Example: Independent t-test  Research Question: Is there a difference between the reading abilities of boys and girls?  Hypotheses:  H 0 : There is not a difference between the reading abilities of boys and girls.  H 1 : There is a difference between the reading abilities of boys and girls. 17

Dataset  Reading test scores (out of 100) BoysGirls

Significance Level  α =.05, two-tailed test  df = n 1 + n 2 – 2 = – 2 = 18  Use t-table to determine t crit  t crit = ±

Decision Rules  If t calc > t crit, then p calc < p crit  Reject H 0  If t calc ≤ t crit, then p calc ≥ p crit  Fail to reject H p =

Computations BoysGirls Frequency (N)10 Sum (Σ) Variance (S 2 ) Standard Deviation (S)

Computations cont.  Pooled variance  Standard Error = =

Computations cont.  Compute t calc  Decision: Reject H 0. Girls scored statistically significantly higher on the reading test than boys did. =

Confidence Intervals 24

Statistical Significance vs. Importance of Effect  Does finding that p <.05 mean the finding is relevant to the real world?  Not necessarily…   Effect size provides a measure of the magnitude of an effect  Practical significance  Cohen’s d, η 2, and R 2 are all types of effect sizes 25

Cohen’s d  Equation:  Guidelines:  d =.2 = small  d =.5 = moderate  d =.8 = large  Not only is our effect statistically significant, but the effect size is large. =