Presentation is loading. Please wait.

Presentation is loading. Please wait.

Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.

Similar presentations


Presentation on theme: "Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population."— Presentation transcript:

1 Statistical Analysis

2 Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population in general

3 Normal distribution

4 Standard deviation u Defined as square root of the variance. u Measure of the dispersion of the data. u 68-95-99 rule for σ-2σ-3σ u Denoted by letter σ (lower case sigma).

5 Reporting descriptive statistics

6 Box plots

7 p values u Value that gives the confidence that the test results occurred by chance. u Typically must be less than.1 or.05. u Must always be reported as part of the data.

8 Reporting the statistics

9 Tests u T-test u ANOVA u Regression u Correlation u Non-parametric tests

10 T-test u Tests two different sets of values u Assumes a normal distribution u Different forms if the variance of the samples are different u Different forms for independent or dependent samples (whether the two samples data can be paired up)

11 T-test

12 ANOVA u Observed variance between different dependent variables in the experiment u Assumes a normal distribution and also assumes the treatment only effects the mean and not the variance

13 Correlation u Degree of fit between actual scores for a dependent variable and the predicted values based on a regression u Measures the degree of relationship u Correlation coefficients can range from -1.00 to +1.00. The value of -1.00 represents a perfect negative correlation while a value of +1.00 represents a perfect positive correlation. A value of 0.00 represents a lack of correlation.

14 Correlation

15 u This line is called the regression line or least squares line, because it is determined such that the sum of the squared distances of all the data points from the line is the lowest possible.

16 Regression u Prediction of the dependent variable value based on one or more independent variables u Measures the type of relationship between multiple values u Gives the percent of the variance accounted for by each element

17 Regression u But the world is complex and, in most cases, we are interested in comparisons that can’t be captured adequately using just two variables. Accordingly, analogues of the methods we’ve discussed so far have been developed to analyze relations between suites of variables. Because these suites are composed of multiple variables— as opposed to pairs of variables—the family of methods we’re now going to discuss are useful for ‘multiple variable’ or ‘multivariate’ analysis

18 Regression

19 u Performing a regression on the previous data gives:

20 Non-parametric tests u Don’t assume a normal distribution u Can be used with ordinal or nominal data u Weaker test, but less restrictions u Chi-square test u the Mann-Whitney U test u Wilcoxon signed-rank test

21 Mann-Whitney U test u Non-parametric test for assessing whether the medians between 2 samples are the same u for independent data u http://geographyfieldwork.com/Mann%20 Whitney.htm

22 Wilcoxon signed-rank u Used for related samples u No assumptions on distribution

23 Confidence intervals u How sure are we that we have enough people in the sample u Methods of calculating either –how big the sample should be –how much confidence you can place in an existing sample

24 Confidence intervals u Since there are no comparable studies, estimates of the standard deviation was difficult. We used the values obtained by Cardinal & Siedler (1995) in their study of readability of healthcare material: sd = 12 for low groups and sd = 10 for high groups. They also saw a difference of 14 percent in total score between groups. Thus, the numbers we used for the power analysis were: control mean = 53 sd = 12 and experimental group mean = 67 sd = 10. For a significance level of.05 and a power of.9, this gives a value of 12 in each cell of the test design.

25 Outliers u Data that looks to not be part of the set. Want to remove it, but no real standards for what makes it real or an error. u For example, if one is calculating the average temperature of 10 objects in a room, and most are between 20-25° Celsius, but an oven is at 350° C, the median of the data may be 23 but the mean temperature will be 55 u http://www.statsoft.com/textbook/stbasic.ht ml#Correlations

26 u significant digits u writing up the statistics in an article

27 End


Download ppt "Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population."

Similar presentations


Ads by Google