Inferential statistics. Why statistics are important Statistics are concerned with difference – how much does one feature of an environment differ.

Slides:



Advertisements
Similar presentations
Chapter 3 Properties of Random Variables
Advertisements

Copyright © Allyn & Bacon (2007) Statistical Analysis of Data Graziano and Raulin Research Methods: Chapter 5 This multimedia product and its contents.
Unit 1: Science of Psychology
Statistical Tests Karen H. Hagglund, M.S.
QUANTITATIVE DATA ANALYSIS
PSY 307 – Statistics for the Behavioral Sciences
Chap 3-1 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 3 Describing Data: Numerical.
Descriptive Statistics Primer
Chapter 14 Conducting & Reading Research Baumgartner et al Chapter 14 Inferential Data Analysis.
BHS Methods in Behavioral Sciences I
Analysis of Differential Expression T-test ANOVA Non-parametric methods Correlation Regression.
Lecture 9: One Way ANOVA Between Subjects
Today Concepts underlying inferential statistics
Statistics for CS 312. Descriptive vs. inferential statistics Descriptive – used to describe an existing population Inferential – used to draw conclusions.
Central Tendency and Variability
Inferential Statistics
The Data Analysis Plan. The Overall Data Analysis Plan Purpose: To tell a story. To construct a coherent narrative that explains findings, argues against.
Introduction to Statistics February 21, Statistics and Research Design Statistics: Theory and method of analyzing quantitative data from samples.
AM Recitation 2/10/11.
Statistics.
Statistical Analysis Statistical Analysis
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
Statistics. Intro to statistics Presentations More on who to do qualitative analysis Tututorial time.
Statistics Primer ORC Staff: Xin Xin (Cindy) Ryan Glaman Brett Kellerstedt 1.
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Statistics 1 Measures of central tendency and measures of spread.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
Research Methods Chapter 8 Data Analysis. Two Types of Statistics Descriptive –Allows you to describe relationships between variables Inferential –Allows.
Hypothesis Testing Using the Two-Sample t-Test
Statistics. Intro to statistics Presentations More on who to do qualitative analysis Tututorial time.
Descriptive Statistics
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Trial Group AGroup B Mean P value 2.8E-07 Means of Substances Group.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
L. Liu PM Outreach, USyd.1 Survey Analysis. L. Liu PM Outreach, USyd.2 Types of research Descriptive Exploratory Evaluative.
I271B The t distribution and the independent sample t-test.
Statistical analysis. Types of Analysis Mean Range Standard Deviation Error Bars.
 Two basic types Descriptive  Describes the nature and properties of the data  Helps to organize and summarize information Inferential  Used in testing.
Chapter Twelve The Two-Sample t-Test. Copyright © Houghton Mifflin Company. All rights reserved.Chapter is the mean of the first sample is the.
Introduction to Basic Statistical Tools for Research OCED 5443 Interpreting Research in OCED Dr. Ausburn OCED 5443 Interpreting Research in OCED Dr. Ausburn.
Chapter Eight: Using Statistics to Answer Questions.
Data Analysis.
Describing Data Descriptive Statistics: Central Tendency and Variation.
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
STATISTICS FOR SCIENCE RESEARCH (The Basics). Why Stats? Scientists analyze data collected in an experiment to look for patterns or relationships among.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Statistical Analysis of Data. What is a Statistic???? Population Sample Parameter: value that describes a population Statistic: a value that describes.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Kin 304 Inferential Statistics Probability Level for Acceptance Type I and II Errors One and Two-Tailed tests Critical value of the test statistic “Statistics.
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
S519: Evaluation of Information Systems Social Statistics Inferential Statistics Chapter 9: t test.
Dr.Rehab F.M. Gwada. Measures of Central Tendency the average or a typical, middle observed value of a variable in a data set. There are three commonly.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Chapter 15 Analyzing Quantitative Data. Levels of Measurement Nominal measurement Involves assigning numbers to classify characteristics into categories.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Statistical analysis.
AP Biology Intro to Statistics
Statistical analysis.
Central Tendency and Variability
Introduction to Statistics
Myers EXPLORING PSYCHOLOGY (6th Edition in Modules)
Basic Statistical Terms
STATISTICS Topic 1 IB Biology Miss Werba.
Chapter Nine: Using Statistics to Answer Questions
Presentation transcript:

Inferential statistics

Why statistics are important Statistics are concerned with difference – how much does one feature of an environment differ from another Magnitude: The comparative strength of two variables. Reliability. The degree to which the measure of the magnitude of a variable can be replicated with other samples drawn from the same population.

Why statistics are important Relationships – how does much one feature of the environment change as another measure changes Correlation or regression r=0.73 N=20 p<0.01

Arithmetic mean or average Mean (M or X), is the sum (  X) of all the sample values ((X 1 + X 2 + X 3.…… X 22 ) divided by the sample size (N).  X = 45, N = 22. M =  X/N = 45/22 = 2.05

The median median is the "middle" value of the sample. There are as many sample values above the sample median as below it. If the sample size is odd (say, 2a + 1), then the median is the (a+1)st largest data value. If the sample size is even (say, 2a), then the median is defined as the average of the ath and (a+1)st largest data values.

Other measures of central tendency The mode is the single most frequently occurring data value. The midrange is the midpoint of the sample -- the average of the smallest and largest data values in the sample. Find the Mean, Median and Mode

The underlying distribution of the data

Normal distribution

All normal distributions have similar properties. The percentage of the scores that is between one standard deviation (s) below the mean and one standard deviation above is always 68.26% Mean =77.48 SD=7.15 N=62 - 2SD -1SD 0 +1SD +2SD

Is there a difference between Rich and poor scores

Is there a significant difference between Polynesian and “other” scores Mean =75.0 SD=6.8 N=20 Mean =81.9 SD=6.5 N=20

Three things we must know before we can say events are different 1.the difference in mean scores of two or more events - the bigger the gap between means the greater the difference 2.the degree of variability in the data - the less variability the better

Variance and Standard Deviation These are estimates of the spread of data. They are calculated by measuring the distance between each data point and the mean variance (s 2 ) is the average of the squared deviations of each sample value from the mean = s 2 =  X-M) 2 /(N-1) The standard deviation (s) is the square root of the variance.

Calculating the Variance and the standard deviation for the Rich sample RichX-M(X-M) Total Mean (Mx)81.9variance(x)41.9 Nx=20Standard deviation (Sx)6.5

Three things we must know before we can say events are different 3.The extent to which the sample is representative of the population from which it is drawn - the bigger the sample the greater the likelihood that it represents the population from which it is drawn - small samples have unstable means. Big samples have stable means.

Estimating difference The measure of stability of the mean is the Standard Error of the Mean = standard deviation/the square root of the number in the sample. So stability of mean is determined by the variability in the sample (this can be affected by the consistency of measurement) and the size of the sample. The standard error of the mean (SEM) is the standard deviation of the normal distribution of the mean if we were to measure it again and again

Yes it’s significant. The Standard Errors of the Mean = 1.45 and 1.53, so the 95% confidence interval will be about 3 points (1.96*1.5) either side of the mean. The means falls outside each other’s confidence intervals

Is the difference between means significant? What is clear is that the mean of the Rich group is well outside of the area where there is a 95% chance that the mean for the Poor Group will fall, so it is likely that the Rich mean comes from a different population than the Poor mean. The convention is to say that if mean 2 falls outside of the area (the confidence interval) where 95% of mean 1 scores is estimated to be, then mean 2 is significantly different from mean 1. We say the probability of mean 1 and mean 2 being the same is less than 0.05 (p<0.05) and the difference is significant p

The significance of significance Not an opinion A sign that very specific criteria have been met A standardised way of saying that there is a There is a difference between two groups – p<0.05; There is no difference between two groups – p>0.05; There is a predictable relationship between two groups – p<0.05; or There is no predictable relationship between two groups - p>0.05. A way of getting around the problem of variability

If you argue for a one tailed test – saying the difference can only be in one direction, then you can add 2.5% error from side where no data is expected to the side where it is 2.5% of M1 distri- bution 2.5% of M1 distri=b ution 95% of M1 distri- bution 2-tailed test 1-tailed test

T-test results t-Test: Two-Sample Assuming Equal Variances PoorRich Mean Variance Observations20 Pooled Variance46.6 Hypothesized Mean Difference0 df38 t Stat-3.2 P(T<=t) one-tail0 t Critical one-tail1.69 P(T<=t) two-tail0 t Critical two-tail2.02

Tests of significance Tests of difference – t-tests, analysis of variance, chi-square, odds ratios Tests of relationship – correlation, regression analysis Tests of difference and relationship – analysis of covariance, multiple regression analysis.

Chi-squared (  ) comparison of age in the sample vs the Waitakere population Participants in each category Obse rved Sam ple Expec ted Waita kere Age OEO-E(O-E) 2 (O-E) 2 /E years N=4DF=3 75 and older p=0.05     0.56 NS=not significant

Values of chi-square for the research project The fact that two groups are not significant means that there is no significant difference between the sample and Waitakere population except for culture and qualifications Chi-squared GroupobtainedcriterionPsignificance Occupation p<0.05NS Age p<0.05NS Family context p<0.05NS Culture p>0.05Significant Gender p<0.05NS Qualifications p>0.05Significant

Person Height (inches) - X Self Esteem score/5 - Y Person Height (inches) - X Self Esteem score/5 -Y

r =(  (X – M X )*((Y – M Y ))/(N*S X *S Y ) r =correlation coefficient X = Height Y= Self Esteem M X =Mean of X M Y =Mean of Y S X =Standard deviation of X S Y =Standard deviation of y

r=0.73 N=20

Level of Significance Two-Tailed Probabilities Probability of error Chance of not being correlated 10% or 1/10 5% or 1/20 1% or 1 / % or 1/1000 r value when n=

One or two tails? What degrees of freedom What level of significance should be chosen?

Correlations

The perfect positive correlation

The perfect negative correlation

No correlation at all

A perfect relationship, but not a correlation

How correlation is used and misused

Normality of residuals, Linearity, & Homoscedasticity