Statistical Evaluation of Data


Statistical Evaluation of Data Chapter 15

Descriptive / inferential Descriptive statistics are methods that help researchers organize, summarize, and simplify the results obtained from research studies. Inferential statistics are methods that use the results obtained from samples to help make generalizations about populations.

Statistic / parameter A summary value that describes a sample is called a statistic (for example, M = 25, s = 2). A summary value that describes a population is called a parameter (for example, µ = 25, σ = 2).

Frequency Distributions One method of simplifying and organizing a set of scores is to group them into an organized display that shows the entire set.
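As a rough illustration of the idea, the sketch below groups a set of scores into a frequency distribution. The scores are invented for this example and do not come from the slides; `frequency_distribution` is a hypothetical helper name.

```python
from collections import Counter

# Hypothetical quiz scores, invented for illustration only.
scores = [8, 9, 8, 7, 10, 9, 6, 4, 9, 8, 7, 8, 10, 9, 8, 6, 9, 7, 8, 8]


def frequency_distribution(values):
    """Return (score, frequency) pairs ordered from highest to lowest score."""
    counts = Counter(values)
    return sorted(counts.items(), reverse=True)


table = frequency_distribution(scores)

# Print one row per distinct score: the score, then how often it occurred.
for score, freq in table:
    print(f"{score:>2}  {freq}")
```

The same counts are what a histogram or polygon would plot on its vertical axis.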

Example

Histogram & Polygon

Bar Graphs

How to interpret?

http://www.transparency.org/cpi2014/results

Other types of graphs https://freedomhouse.org/report/freedom-net/freedom-net-2014#.VIDtzDEc7Ak

Central tendency The goal of central tendency is to identify the value that is most typical or most representative of the entire group.

Central tendency The mean is the arithmetic average. The median measures central tendency by identifying the score that divides the distribution in half. The mode is the most frequently occurring score in the distribution.
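The three measures can be compared on one small data set. This is a minimal sketch using Python's standard `statistics` module; the scores are hypothetical and chosen so that the three measures differ.

```python
import statistics

# Hypothetical scores with one extreme value (15) that pulls the mean upward.
scores = [1, 2, 2, 3, 4, 5, 6, 7, 15]

mean = statistics.mean(scores)      # arithmetic average: 45 / 9
median = statistics.median(scores)  # middle score of the sorted list
mode = statistics.mode(scores)      # most frequently occurring score

print(mean, median, mode)
```

In this made-up set the extreme score raises the mean above the median, which is one reason the median is often preferred for skewed distributions.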

Variability Variability is a measure of the spread of scores in a distribution. The range is the difference between the maximum and minimum scores. The standard deviation describes the average distance from the mean. The variance measures variability by computing the average squared distance from the mean.

Variance = (Sum of Squares) / N
Variance is the index of variability. SD = SQRT(Variance)

X | X-M | (X-M)²
10 | 4 | 16
7 | 1 | 1
7 | 1 | 1
9 | 3 | 9
8 | 2 | 4
6 | 0 | 0
5 | -1 | 1
4 | -2 | 4
3 | -3 | 9
1 | -5 | 25

Total = 60, N = 10, Mean = 6
SS = 70
Variance = 70/10 = 7
SD = SQRT(7) ≈ 2.65
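The arithmetic on this slide can be checked with a few lines of code. This sketch assumes the ten scores are 10, 7, 7, 9, 8, 6, 5, 4, 3, 1; the second 7 is an inference from the stated Total = 60 and SS = 70, not a value read directly from the slide. It divides by N, as the slide does, rather than by N - 1.

```python
import math

# Assumed scores: the second 7 is inferred from Total = 60 and SS = 70.
scores = [10, 7, 7, 9, 8, 6, 5, 4, 3, 1]

n = len(scores)
mean = sum(scores) / n                       # M = 60 / 10
deviations = [x - mean for x in scores]      # X - M for each score
ss = sum(d ** 2 for d in deviations)         # sum of squared deviations
variance = ss / n                            # divides by N, as on the slide
sd = math.sqrt(variance)

print(f"M = {mean}, SS = {ss}, variance = {variance}, SD = {sd:.2f}")
```

Dividing by N - 1 instead of N would give the sample variance that is normally used when estimating a population value.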

Non-numerical Data Non-numerical data are summarized by the proportion or percentage in each category. For example, 43% prefer the Democratic candidate, 28% prefer the Republican candidate, and 29% are undecided.
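A short sketch of how such percentages are obtained from raw category labels. The 100 responses below are hypothetical, constructed only so that they reproduce the percentages in the example.

```python
from collections import Counter

# Hypothetical survey of 100 respondents, built to match the example percentages.
responses = ["Democrat"] * 43 + ["Republican"] * 28 + ["Undecided"] * 29

counts = Counter(responses)
total = sum(counts.values())

# Percentage of respondents in each category.
percentages = {category: 100 * count / total for category, count in counts.items()}

for category, pct in percentages.items():
    print(f"{category}: {pct:.0f}%")
```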

Correlations A correlation is a statistical value that measures and describes the direction and degree of relationship between two variables.

Types of correlation

Variable Y\X | Quantitative X | Ordinal X | Nominal X
Quantitative Y | Pearson r | Biserial rb | Point Biserial rpb
Ordinal Y | Biserial rb | Spearman rho / Tetrachoric rtet | Rank Biserial rrb
Nominal Y | Point Biserial rpb | Rank Biserial rrb | Phi, C, λ (Lambda)

Phi: for dichotomous data only
C: Pearson's contingency coefficient
V: Cramer's V coefficient
λ: Goodman and Kruskal lambda coefficient

http://www.andrews.edu/~calkins/math/edrm611/edrm13.htm
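For the most common case, two quantitative variables, the Pearson r can be computed directly from its definition. The following is a minimal sketch with invented data; `pearson_r` is a hypothetical helper written for this example, not a library function.

```python
import math


def pearson_r(x, y):
    """Pearson correlation: SP / sqrt(SS_x * SS_y)."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sum of products of deviations (SP) and sums of squares (SS).
    sp = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return sp / math.sqrt(ss_x * ss_y)


# Hypothetical paired scores, invented for illustration.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

r = pearson_r(x, y)
print(f"r = {r:.3f}")
```

The sign of r gives the direction of the relationship and its absolute value gives the degree, from 0 (none) to 1 (perfect).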

Regression

Regression Whenever a linear relationship exists, it is possible to compute the equation for the straight line that provides the best fit for the data points. The process of finding the linear equation is called regression, and the resulting equation is called the regression equation.

Where is the regression line? [Scatterplot: STRENGTH (vertical axis, 70 to 120) against WEIGHT (horizontal axis, 140 to 220)]

Which one is the regression line? [Scatterplot: STRENGTH (vertical axis, 70 to 120) against WEIGHT (horizontal axis, 140 to 220)]

regression equation All linear equations have the same general structure and can be expressed as Y = bX + a, where b is the slope and a is the Y-intercept. For example, Y = 2X + 1.
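One common way to find b and a is the least-squares solution, b = SP / SS_x and a = M_Y - b·M_X. The sketch below assumes that method (the slides do not state which one is used) and applies it to hypothetical points that lie exactly on Y = 2X + 1; `fit_line` is a name chosen for this example.

```python
def fit_line(x, y):
    """Least-squares slope b and intercept a for Y = bX + a."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sp = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    ss_x = sum((xi - mean_x) ** 2 for xi in x)
    b = sp / ss_x               # slope
    a = mean_y - b * mean_x     # intercept
    return b, a


# Hypothetical points chosen to lie exactly on Y = 2X + 1.
x = [1, 2, 3, 4]
y = [3, 5, 7, 9]

b, a = fit_line(x, y)
print(f"Y = {b}X + {a}")
```

With real data the points scatter around the line, and the same formulas give the line that minimizes the squared vertical distances.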

standardized form Often the regression equation is reported in standardized form, which means that the original X and Y scores were standardized, or transformed into z-scores, before the equation was computed. z_Y = β·z_X
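With a single predictor, the standardized slope β equals the Pearson correlation between X and Y, and the intercept is zero. The sketch below checks this numerically on invented data; the helper names are hypothetical and population standard deviations are assumed for the z-scores.

```python
import math
import statistics


def z_scores(values):
    """Standardize values using the population standard deviation."""
    mean = sum(values) / len(values)
    sd = statistics.pstdev(values)
    return [(v - mean) / sd for v in values]


def slope(x, y):
    """Least-squares slope of y on x."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sp = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    return sp / ss_x


def pearson_r(x, y):
    """Pearson correlation computed from raw scores."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sp = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return sp / math.sqrt(ss_x * ss_y)


# Hypothetical paired scores, invented for illustration.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

beta = slope(z_scores(x), z_scores(y))  # slope after standardizing both variables
r = pearson_r(x, y)
print(f"beta = {beta:.4f}, r = {r:.4f}")
```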

Multiple Regression

INFERENTIAL STATISTICS

Sampling Error Random samples No treatment

Is the difference due to a sampling error? Random samples Violent /Nonviolent TV

Is the difference due to a sampling error? Sampling error is the naturally occurring difference between a sample statistic and the corresponding population parameter. The problem for the researcher is to decide whether the 4-point difference was caused by the treatments (the different television programs) or is just a case of sampling error.

Hypothesis testing A hypothesis test is a statistical procedure that uses sample data to evaluate the credibility of a hypothesis about a population.

5 elements of a hypothesis test
1. The Null Hypothesis. The null hypothesis is a statement about the population, or populations, being examined, and always says that there is no effect, no change, or no relationship.
2. The Sample Statistic. The data from the research study are used to compute the sample statistic.

5 elements of a hypothesis test
3. The Standard Error. Standard error is a measure of the average, or standard, distance between a sample statistic and the corresponding population parameter. The "standard error of the mean, s_M" refers to the standard deviation of the distribution of sample means taken from a population.
4. The Test Statistic. A test statistic is a mathematical technique for comparing the sample statistic with the null hypothesis, using the standard error as a baseline.
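As one concrete case, a single-sample t statistic divides the distance between the sample mean and the hypothesized population mean by the standard error of the mean, s_M = s / √n. The sketch below is an illustration under assumptions: the sample and the null value of 25 are invented, and s is the sample standard deviation (dividing by n - 1).

```python
import math
import statistics

# Hypothetical sample and null-hypothesis mean, invented for illustration.
sample = [22, 24, 26, 28, 30]
mu_null = 25

n = len(sample)
sample_mean = statistics.mean(sample)          # the sample statistic, M
s = statistics.stdev(sample)                   # sample SD (divides by n - 1)
standard_error = s / math.sqrt(n)              # s_M = s / sqrt(n)
t = (sample_mean - mu_null) / standard_error   # the test statistic

print(f"M = {sample_mean}, s_M = {standard_error:.3f}, t = {t:.3f}")
```

A t value near zero means the sample mean is about as far from the null value as sampling error alone would typically produce.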

5 elements of a hypothesis test
5. The Alpha Level (Level of Significance). The alpha level, or level of significance, for a hypothesis test is the maximum probability the researcher will accept that a result like the one obtained would occur simply by chance when the null hypothesis is true. A hypothesis test with an alpha level of .05, for example, demands that there is less than a 5% (.05) probability of obtaining such a result by chance alone if there is no real effect.

Reporting Results from a Hypothesis Test In the literature, significance levels are reported as p values. For example, a research paper may report a significant difference between two treatments with p < .05. The expression p < .05 means that, if the null hypothesis were true, there would be less than a .05 probability of obtaining a result at least this extreme by chance alone.

Errors in Hypothesis Testing If a researcher is misled by the results from the sample, it is likely that the researcher will reach an incorrect conclusion. Two kinds of errors can be made in hypothesis testing.

Type I Errors A Type I error occurs when a researcher finds evidence for a significant result when, in fact, there is no effect (no relationship) in the population. The error occurs because the researcher has, by chance, selected an extreme sample that appears to show the existence of an effect when there is none. The consequence of a Type I error is a false report. This is a serious mistake. Fortunately, the likelihood of a Type I error is kept small: it is limited by the alpha level, which is stated in the research report, so the exact probability of this kind of mistake is known to everyone who reads it.

Type II error A Type II error occurs when sample data do not show evidence of a significant effect when, in fact, a real effect does exist in the population. This often occurs when the effect is so small that it does not show up in the sample.

Factors that Influence the Outcome of a Hypothesis Test
1. The Sample Size. The difference found with a large sample is more likely to be significant than the same difference found with a small sample.
2. The Size of the Variance. When the variance is small, the data show a clear mean difference between the two treatments.
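Both factors can be seen in the formula for an independent-samples t statistic, where the standard error shrinks as n grows or as the variance falls. The sketch below assumes two groups of equal size with equal standard deviations; the 4-point difference echoes the earlier television example, while the SD and sample sizes are invented.

```python
import math


def independent_t(mean_difference, sd, n_per_group):
    """t for two equal-sized groups that share the same standard deviation."""
    standard_error = math.sqrt(sd ** 2 / n_per_group + sd ** 2 / n_per_group)
    return mean_difference / standard_error


# Same 4-point difference under different hypothetical conditions.
t_small_sample = independent_t(4, sd=10, n_per_group=10)
t_large_sample = independent_t(4, sd=10, n_per_group=100)
t_low_variance = independent_t(4, sd=5, n_per_group=10)

print(f"n = 10,  SD = 10: t = {t_small_sample:.2f}")
print(f"n = 100, SD = 10: t = {t_large_sample:.2f}")
print(f"n = 10,  SD = 5:  t = {t_low_variance:.2f}")
```

A larger t is further from what the null hypothesis predicts, so the same mean difference is more likely to reach significance with more participants or less variable scores.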

Effect Size Knowing the significance of difference is not enough. We need to know the size of the effect.

Measuring Effect Size with Cohen’s d

Measuring Effect Size as a Percentage of Variance (r²) The effect size can also be measured by calculating the percentage of variance in the scores that is accounted for by the treatment. For two independent samples, r² = t² / (t² + df), where df = (n1 - 1) + (n2 - 1).
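Both effect-size measures can be computed from two groups of scores. The sketch below is a hedged illustration: the data are invented, Cohen's d is taken as the mean difference divided by the pooled standard deviation, and r² is obtained from the independent-samples t as t² / (t² + df).

```python
import math

# Hypothetical treatment and control scores, invented for illustration.
group1 = [10, 12, 14, 16, 18]
group2 = [6, 8, 10, 12, 14]


def mean(values):
    return sum(values) / len(values)


def sum_of_squares(values):
    m = mean(values)
    return sum((v - m) ** 2 for v in values)


n1, n2 = len(group1), len(group2)
df = (n1 - 1) + (n2 - 1)

# Pooled variance combines the variability of both groups.
pooled_variance = (sum_of_squares(group1) + sum_of_squares(group2)) / df
mean_difference = mean(group1) - mean(group2)

cohens_d = mean_difference / math.sqrt(pooled_variance)
t = mean_difference / math.sqrt(pooled_variance * (1 / n1 + 1 / n2))
r_squared = t ** 2 / (t ** 2 + df)

print(f"d = {cohens_d:.2f}, t = {t:.2f}, df = {df}, r^2 = {r_squared:.2f}")
```

Unlike the t statistic, d does not grow with sample size, which is why it is reported alongside the significance test.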

Questions?

Group Discussion Identify the two basic concerns with using a correlation to measure split-half reliability and explain how these concerns are addressed by Spearman-Brown, K-R 20, and Cronbach’s alpha. Identify the basic concern with using the percentage of agreement as a measure of inter-rater reliability and explain how this concern is addressed by Cohen’s kappa.