Sunee Raksakietisak Srinakharinwirot University

Slides:



Advertisements
Similar presentations
Hypothesis Testing Steps in Hypothesis Testing:
Advertisements

Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Quantitative Data Analysis: Hypothesis Testing
Statistical Tests Karen H. Hagglund, M.S.
Data Analysis Statistics. Inferential statistics.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Final Review Session.
Data Analysis Statistics. Inferential statistics.
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Chapter 14 Inferential Data Analysis
Richard M. Jacobs, OSA, Ph.D.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Statistics for the Social Sciences Psychology 340 Fall 2013 Thursday, November 21 Review for Exam #4.
Chapter 13: Inference in Regression
Class Meeting #11 Data Analysis. Types of Statistics Descriptive Statistics used to describe things, frequently groups of people.  Central Tendency 
Descriptive Statistics e.g.,frequencies, percentiles, mean, median, mode, ranges, inter-quartile ranges, sds, Zs Describe data Inferential Statistics e.g.,
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Which Test Do I Use? Statistics for Two Group Experiments The Chi Square Test The t Test Analyzing Multiple Groups and Factorial Experiments Analysis of.
Statistics 11 Correlations Definitions: A correlation is measure of association between two quantitative variables with respect to a single individual.
TAUCHI – Tampere Unit for Computer-Human Interaction ERIT 2015: Data analysis and interpretation (1 & 2) Hanna Venesvirta Tampere Unit for Computer-Human.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Data Analysis (continued). Analyzing the Results of Research Investigations Two basic ways of describing the results Two basic ways of describing the.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Examining Relationships in Quantitative Research
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 16 Data Analysis: Testing for Associations.
Academic Research Academic Research Dr Kishor Bhanushali M
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Chapter Eight: Using Statistics to Answer Questions.
Chapter 6: Analyzing and Interpreting Quantitative Data
Research Methodology Lecture No :26 (Hypothesis Testing – Relationship)
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Chapter 15 Analyzing Quantitative Data. Levels of Measurement Nominal measurement Involves assigning numbers to classify characteristics into categories.
Appendix I A Refresher on some Statistical Terms and Tests.
Chapter 18 Data Analysis Overview Yandell – Econ 216 Chap 18-1.
COM 295 STUDY Inspiring Minds/com295study.com
32931 Technology Research Methods Autumn 2017 Quantitative Research Component Topic 4: Bivariate Analysis (Contingency Analysis and Regression Analysis)
Data measurement, probability and Spearman’s Rho
Dependent-Samples t-Test
BINARY LOGISTIC REGRESSION
Lecture Slides Elementary Statistics Twelfth Edition
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Chapter 10 CORRELATION.
Statistics.
APPROACHES TO QUANTITATIVE DATA ANALYSIS
Georgi Iskrov, MBA, MPH, PhD Department of Social Medicine
Kin 304 Inferential Statistics
Correlation and Regression
Stats Club Marnie Brennan
Introduction to Statistics
Ass. Prof. Dr. Mogeeb Mosleh
Chi Square (2) Dr. Richard Jackson
1.3 Data Recording, Analysis and Presentation
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Statistics II: An Overview of Statistics
Inferential Statistics
15.1 The Role of Statistics in the Research Process
Parametric versus Nonparametric (Chi-square)
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Chapter Nine: Using Statistics to Answer Questions
Psych 231: Research Methods in Psychology
RES 500 Academic Writing and Research Skills
COMPARING VARIABLES OF ORDINAL OR DICHOTOMOUS SCALES: SPEARMAN RANK- ORDER, POINT-BISERIAL, AND BISERIAL CORRELATIONS.
Analyzing and Interpreting Quantitative Data
Presentation transcript:

Sunee Raksakietisak Srinakharinwirot University sunee@g.swu.ac.th Statistical Techniques for Data Analysis : Overview of Statistical Techniques Selection Sunee Raksakietisak Srinakharinwirot University sunee@g.swu.ac.th

About this presentation These set of slides were last used in the presentation titled “Reporting Statistical Data Analysis Results” At the seminar and workshop on “Writing Scientific Articles for Publication IV” on 19-20 July 2008 Organized by Thai-Australian Technological Services Center (TATSC) (http://www.tatsc.or.th/index.php/events/110- seminar-workshop-writing-scientific-articles-for- publication-iv-19-20-july-2008)

Introduction Article ScienceAsia Vol.32 No.1 march 2006 Understanding Data: Important for All Scientists, and Where Any Nation Might Excel http://www.scienceasia.org/2006.32.n1/001.php My work is in response to this article and for my presentation: Back to the basic of statistical methods

Research for my talk Investigate how statistical methods have been used in scientific papers The articles in ScienceAsia (online) were investigated: Current issue (Vol.33 No.2 June 2007) back to Vol.31 No1. March 2005 Total of 154 articles The keyword “statistic” was used for searching covers words like “statistics”, “statistically” some articles need the word “significant”

Research for my talk (cont’d) Results: 40 articles out of 154 articles used statistical methods (26%)

Statistical Methods What are the statistical methods that have been used? And how often? Results: t-test (28% ) One way ANOVA (68%) Two way ANOVA (10%) Correlation (13%) Regression (15%) Others (3%)

Further questions? How good is the writing/reporting of statistical part? Evaluation in a 10 points scale: Title: 0 – 1 point Abstract: 0 – 2 points Method: 0 – 3 points Results: 0 – 4 points How much is the misconception of writing/reporting statistical results/conclusion?

Overview of Statistical Methods Descriptive statistics Qualitative data (nominal, ordinal) : Frequency and percentages Quantitative data (interval, ratio / scale / numeric) : Mean and SD Distinguish between SD and SEM (Standard Error of the Mean) ! Inferential statistics Hypothesis testing

Statistical Methods t-test : compare two means One way ANOVA : compare two or more means One factor (the effect of the factor on the measured variables) Two-way ANOVA Two factors (the effect of 2 factors on the measured variables)

Steps in statistical hypothesis testing Formulate hypothesis: Ho and H1 Set level of significance (α = 0.05, 0.01, 0.10) Statistics used to test hypothesis in (1) This statistics is called “Inferential statistics” Formulae (don’t need to know) Has distribution (Z, t, F, χ2) Decision rule: Reject Ho if P-value < α Calculate statistics and p-value Statistical package gives these values Make decision: Reject Ho or Do not reject Ho Reject Ho means that the test is significant

Normal t Chi-square F

What is P-value P-value is the probability from the value of statistics to tails of distribution (either one tail or two tails) Web page to calculate the p-value of various distribution: http://vassarstats.net/tabs.html (http://faculty.vassar.edu/lowry/tabs.html)

P-value (cont’d) P-value can never be zero !!! Often found misconception since the statistical package gives value up to some decimal places e.g. for 3 decimal places, if p-value is very small--smaller than .001--the package will show .000 hence we have to say P < .001 instead of P = .000

Misconception about Alpha and P-value The frequency of cell division was calculated after 2 weeks of culture and was statistically analyzed by analysis of variance (ANOVA) at p ≤ 0.05 (correct??) Means within a column followed by the same letter are not significantly different at P ≤ 0.05 according to DMRT (correct??)

Misconception (cont’d) Statistical significance was defined as ρ < 0.05 (correct??) The repeated measurements of L’ value and rehydration ratio of the dehydrated products from different pre- treatments were subject to analysis of variance (p=0.05) (correct??) Significant difference at p < .05 (correct??)

Correct concept Collected data were statistically analyzed and mean separation was calculated according to the Least Significant Difference (LSD) method at the 5% level of significance (Correct) Results were considered to be statistically significant when p<0.05 (Correct)

Correct concept * P < 0.05, ** P< 0.01, *** P<0.001; ns not significant (Correct) ** = significant at 1% level, ns = non-significant (Correct) The bars with the same letter are not significantly different (P>0.05) (Correct)

About t-test Two variables Dependent (variable to compare mean): scale Independent (group variable): nominal Has 2 levels/groups Common mistakes: Independent variable has more than 2 groups, did t-test for many pairs (should do ANOVA)

About t-test (cont’d) Statistical test: test whether the two means are different significantly The test is significant when the null hypothesis (mean the same) is rejected; that is the means are different T-test has 2 formula: variances equal and variances not equal

Reporting results (Example) See worksheet N, Mean, and SD for each group, t, and p-value In journal articles, different ways of reporting Report Mean and SD (no N) Report Mean and SEM Report as Mean ± SD, Mean ± SEM Note: SEM = SD/√N SEM gives the picture of Confidence Interval (C.I.)

About One way ANOVA Two variables Dependent (variable to compute mean): scale One independent (factor): nominal Has 2 levels/groups or more t-test is a special case of one way ANOVA T2 = F

One way ANOVA (cont’d) Statistical test: test whether there is any effect of factor on dependent variable (or are all the means equal?) F – test (test statistics has F distribution) The test is significant means that there is an effect of factor on dependent variables; at least one pair of the mean is different Multiple comparisons of all pairs of mean by LSD, Duncan, SNK, Tukey, Bonferroni, Scheffé

Reporting Results (Example) See worksheet N, Mean, and SD for each group F, and p-value Symbol indicating the difference in means from multiple comparisons

About Two way ANOVA Three variables Dependent (variable to compute mean): scale Two independent variables (factors): nominal

Two way ANOVA (cont’d) Statistical test: Test for interaction effect first Plot graph to visualize interaction effect If no interaction effect then test for main effect of each factor (one way ANOVA) If there is interaction effect then test for simple effect: the effect of one factor for each level of another factor

Reporting Results (Example) See worksheet N, Mean, and SD for each group Table indicating the significant of main effect of each factor and interaction effect

Stop here for a minute t-test, ANOVA is parametric statistical methods for mean comparisons It is a univariate analysis (one dependent variable) Parametric methods have an assumption that the dependent variable has normal distribution

Test for Normality Test for normality can be easily done by statistical package If not normal, try transformation If normal, then parametric test can be used If still not normal after transformation, use nonparametric statistical methods If other assumptions of parametric such as equal variances are not assumed, use nonparametric test

Nonparametric test Rank of data is used instead of raw data Robust but give lower power than parametric test Equivalent parametric , nonparametric methods see summary commands in SPSS Most of the time the conclusion by either parametric or nonparametric tests are the same

From comparison to Modeling Most of the scientific experiments, manipulated (independent) variables are quantitative variables But when doing the experiment, some values are selected for experiment Temperature (e.g. 3 levels of temperature) To see effect of temperature on …. (dependent variable) One way ANOVA Show graph of mean of dependent variable on each level of temperature

Correlation Correlation of 2 variables (both must be scale variables) Correlation often mean Pearson Correlation Assume linear correlation Assume bivariate normal distribution (It is parametric methods) Nominal variable with 2 values (level) is ok (watch out if more than 2 values, not ok) If not normal, use rank correlation (nonparametric)

Regression It is a modeling technique: cause (independent) and effect (dependent) Model: Regression equation (prediction equation) How good is the model: R2, percentage of variance of dependent variable explained (accounted for) by independent variables (predictors) Only one dependent variables, but can be many independent variables (predictors) All must be scale variables Modeling: Enter, Forward, Backward, Stepwise

Nominal Dependent Variable Variables of interest (dependent) often nominal in medical area Has lung cancer or doesn’t have Has heart attack or doesn’t have Use chi-square to test the differences (like t-test or ANOVA) Use logistic regression for modeling

Reliability Analysis Cognitive test analysis Reliability coefficient: KR20 / Cornbach Alpha Item statistics: difficulty index, discrimination index (item to total correlation / point biserial correlation) Affective test analysis (e.g. likert scale) Reliability coefficient: Cornbach Alpha Item statistics: discrimination index (item to total correlation) See details in handout article

My hope and final remark You have big picture of how to choose statistical methods for your data analysis You know how/what to report statistical data analysis results in the research journal articles See examples of research articles using various statistical methods