MST Stats Primer This is a 30 minute overview of a 4 hour tutorial presented at a National Conference. The full handouts from that tutorial can be found.

Slides:



Advertisements
Similar presentations
To Select a Descriptive Statistic
Advertisements

CHAPTER TWELVE ANALYSING DATA I: QUANTITATIVE DATA ANALYSIS.
CHOOSING A STATISTICAL TEST © LOUIS COHEN, LAWRENCE MANION & KEITH MORRISON.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Introduction to Statistics Quantitative Methods in HPELS 440:210.
Departments of Medicine and Biostatistics
Ordinal Data. Ordinal Tests Non-parametric tests Non-parametric tests No assumptions about the shape of the distribution No assumptions about the shape.
PBHL 5313 Nonparametric Methods Module III Zoran Bursac Biostatistics UAMS.
Statistical Tests Karen H. Hagglund, M.S.
MSc Applied Psychology PYM403 Research Methods Quantitative Methods I.
Basic Statistical Review
Basic Statistics for Research: Choosing Appropriate Analyses and Using SPSS Dr. Beth A. Bailey Dr. Tiejian Wu Department of Family Medicine.
Today Concepts underlying inferential statistics
Measures of Association Deepak Khazanchi Chapter 18.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Statistics Idiots Guide! Dr. Hamda Qotba, B.Med.Sc, M.D, ABCM.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Nonparametric or Distribution-free Tests
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT
Week 9: QUANTITATIVE RESEARCH (3)
Review I volunteer in my son’s 2nd grade class on library day. Each kid gets to check out one book. Here are the types of books they picked this week:
Quantitative Methods: Choosing a statistical test Summer School June 2015 Dr. Tracie Afifi.
Chapter 14: Nonparametric Statistics
Hypothesis Testing Charity I. Mulig. Variable A variable is any property or quantity that can take on different values. Variables may take on discrete.
Simple Linear Regression
Which Test Do I Use? Statistics for Two Group Experiments The Chi Square Test The t Test Analyzing Multiple Groups and Factorial Experiments Analysis of.
Copyright © 2012 Pearson Education. Chapter 23 Nonparametric Methods.
Experimental Research Methods in Language Learning Chapter 11 Correlational Analysis.
Statistical analysis Prepared and gathered by Alireza Yousefy(Ph.D)
Linear correlation and linear regression + summary of tests
Steps in Statistical Testing: 1) State the null hypothesis (Ho) and the alternative hypothesis (Ha). 2) Choose an acceptable and appropriate level of significance.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
STATISTICAL ANALYSIS FOR THE MATHEMATICALLY-CHALLENGED Associate Professor Phua Kai Lit School of Medicine & Health Sciences Monash University (Sunway.
Chapter 13 CHI-SQUARE AND NONPARAMETRIC PROCEDURES.
ANALYSIS PLAN: STATISTICAL PROCEDURES
Sample size and common statistical tests There are three kinds of lies- lies, dammed lies and statistics…… Benjamin Disraeli.
Angela Hebel Department of Natural Sciences
In Stat-I, we described data by three different ways. Qualitative vs Quantitative Discrete vs Continuous Measurement Scales Describing Data Types.
Chap 18-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 18-1 Chapter 18 A Roadmap for Analyzing Data Basic Business Statistics.
Statistics in Applied Science and Technology Chapter14. Nonparametric Methods.
Variables It is very important in research to see variables, define them, and control or measure them.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Analisis Non-Parametrik Antonius NW Pratama MK Metodologi Penelitian Bagian Farmasi Klinik dan Komunitas Fakultas Farmasi Universitas Jember.
Biostatistics Nonparametric Statistics Class 8 March 14, 2000.
Chapter 21prepared by Elizabeth Bauer, Ph.D. 1 Ranking Data –Sometimes your data is ordinal level –We can put people in order and assign them ranks Common.
Nonparametric Statistics
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Hypothesis Testing Procedures Many More Tests Exist!
Approaches to quantitative data analysis Lara Traeger, PhD Methods in Supportive Oncology Research.
Beginners statistics Assoc Prof Terry Haines. 5 simple steps 1.Understand the type of measurement you are dealing with 2.Understand the type of question.
Interpretation of Common Statistical Tests Mary Burke, PhD, RN, CNE.
Dr.Rehab F.M. Gwada. Measures of Central Tendency the average or a typical, middle observed value of a variable in a data set. There are three commonly.
Choosing and using your statistic. Steps of hypothesis testing 1. Establish the null hypothesis, H 0. 2.Establish the alternate hypothesis: H 1. 3.Decide.
Bivariate analysis. * Bivariate analysis studies the relation between 2 variables while assuming that other factors (other associated variables) would.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Nonparametric Statistics
32931 Technology Research Methods Autumn 2017 Quantitative Research Component Topic 4: Bivariate Analysis (Contingency Analysis and Regression Analysis)
Data measurement, probability and Spearman’s Rho
Non-Parametric Tests 12/1.
Non-Parametric Tests 12/1.
Non-Parametric Tests 12/6.
Statistics.
CHOOSING A STATISTICAL TEST
Non-Parametric Tests.
Medical Statistics Dr. Gholamreza Khalili
Nonparametric Statistics
Introduction to Statistics
Association, correlation and regression in biomedical research
Unit XI: Data Analysis in nursing research
Presentation transcript:

MST Stats Primer This is a 30 minute overview of a 4 hour tutorial presented at a National Conference. The full handouts from that tutorial can be found on my class web site on dupontmanual.com

KEY Most of you understand that you need to work with someone that understands how to correctly collect the data you are interested in studying. You MUST also work with someone that understands how to correctly set up/analyze the data from a statistical standpoint from the beginning stages of your work. This can save hours of needless data collection only to find that the data CANNOT be analyzed

KEY - continued Can range from something as simple as: to few subjects in a group, not valid or reliable testing instruments, etc. You may need to collapse numerous categories to allow for any statistical analysis, but now you’ve lost any meaningful practical separation of categories/groups. There are advanced ways to determine the number of subjects needed in a study; Rule of thumb, need a min of 10 in any group for any chance of showing that differences exist unless the differences are quite large.

Goal Try to design study to remove/control all influence of any variables that is not your independent variable to max the effect on your dependent variable (you want your independent variable to be the only thing that causes the change in your dependent variable). Done by controlling or eliminating the variability caused by other variables. NO perfect studies exist as it is virtually impossible to eliminate or control all the variables that influence your dependent variable. If your research is not basic research then you want to see what happens in the real world when very little is controlled. However, now it becomes very difficult to know what really caused your change.

Need Solid Test Design Must understand your data to know what test design is appropriate. First that means you MUST understand the types of variables in your study & the type of data those variables generate.

Types of Variables Indep – A var that is manipulated (the treatment var, the cause). Dependent – A var that is measured (the outcome, the effect). Categorical – A classification var that is analyzed (e.g. gender). Control – A characteristic that is restricted in the study, but not compared (e.g. only MST students). Extraneous – A var that affects the dep var, but is not part of the design, is not controlled. (e.g. Amount of sleep). Confounding – When an extraneous var is systematically related to the independent var. (e.g. ACT score & IQ). Predictor – Another name for the independent variable in regression. Response - Another name for the dependent variable in regression. Sometimes called the Criterion. Dummy – Var constructed to allow analysis within a specific models

Types of Data Nominal -No arithmetic relationship or order between different classifications. Examples: Occupation (Clerk, Police Officer, Teacher, …); Gender (Male, Female) Ordinal - Data can be ordered into discrete categories, but categories have no arithmetic relationship. Example: Survey Data (5 - Strongly Agree, 4 - Somewhat Agree, 3 - Neither Agree Nor Disagree, 2 - Somewhat Disagree, 1 - Strongly Disagree) Interval - Data on a measurement scale with an arbitrary zero point in which numerically equal intervals at different locations on the scale reflect the same quantitative difference. Example: Temperature (Fahrenheit or Celsius) Ratio - Data on a measurement scale with an absolute zero point in which numerically equal intervals at different locations on the scale reflect the same quantitative difference. Examples: Height, Weight, Pressure, Temperature (Kelvins)

Parametric vs Nonparametric Tests Choosing the right test to compare measurements is tricky, as you must choose between two families of tests (parametric and nonparametric). Many statistical tests are based upon the assumption that the data are sampled from a Normal distribution. These tests are referred to as parametric tests. Commonly used parametric tests are listed in the 2nd column of the table (e.g. t test & ANOVA). These usually include the interval & ratio data types, but not all interval & ratio data types are normally distributed

Parametric vs Nonparametric Tests Tests that do not make assumptions about the population distribution are referred to as nonparametric tests. Some of these tests are covered in the tutorial found online as well. All commonly used nonparametric tests rank the outcome variable from low to high and then analyze the ranks. These tests are listed in the 3rd column of the table (e.g. Wilcoxon, Mann-Whitney test, and Kruskal-Wallis tests). These tests are also called distribution-free tests. These usually include the nominal & ordinal data types, but some interval & ratio data types are are best analyzed as non-parametric

Choosing Between Parametric And Nonparametric Tests: The Easy Ones Choosing between them is sometimes easy. Definitely choose a parametric test if you are sure that your data were sampled from a population that follows a Normal distribution (at least approximately). Definitely select a nonparametric test in three situations: The outcome is a rank or a score and the population is clearly not Normal. e.g class ranking of students, or a manual muscle test (measured on a continuous scale where 0 is no movement and 5 is basically normal).

Choosing Between Parametric And Nonparametric Tests: The Easy Ones Some values are "off the scale," that is, too high or too low to measure. Even if the data is Normal, it is impossible to analyze such data with a parametric test since you don't know all of the values. Using a nonparametric test with these data is simple. Assign values too low to measure an arbitrary very low value and assign values too high to measure an arbitrary very high value. Then perform a nonparametric test. Since the nonparametric test uses the relative ranks of the values, it won't matter that you didn't know all the values exactly. The data are measurements, and you are sure that the population is not distributed in a Normal manner. If the data are not Normal, consider whether you can transform the values to make the distribution become Normal (e.g. take the logarithm or reciprocal of all values). There are often biological or chemical reasons (as well as statistical ones) for performing a particular transform.

Choosing Between Parametric And Nonparametric Tests: The Hard Ones If it’s not clear after reading the tutorials. Talk with your study mentor. Find a statistics mentor. Do this BEFORE you start your study so you don’t waste your time collecting data that can’t be analyzed

What’s the one thing you could do to improve your statistical analysis the most Increase your sample size It makes either stats approach give usable results

Why Increase sample size Large data sets present no problems. It is usually easy to tell if the data come from a Normal population, but it doesn't really matter because the nonparametric tests are so powerful and the parametric tests are so robust. It is the small data sets that present a dilemma. It is difficult to tell if the data come from a Normal population, but it matters a lot. The nonparametric tests are not powerful and the parametric tests are not robust.

OVERVIEW OF BASIC AVAILABLE STATISTICAL TESTS The full tutorial found on the website discusses many different statistical tests. To select the right test, ask yourself two questions: What type of data do you have? What is your goal? Then refer to the table that follows. Most of the tests described in the tutorial & the table can be performed by most advanced statistical packages.

Type of Data Goal Measurement from Normal Population Rank, Score, or Measure from Non-Normal Population Binomial (Two Possible Outcomes) Survival Time Describe one group Mean, SD Median, interquartile range Proportion Kaplan Meier survival curve Compare one group to a hypothetical value One-sample t test Wilcoxon test Chi-square or Binomial test - Compare two unpaired groups Unpaired t test Mann-Whitney test, Nominal data: Fisher's Exact (small sample), Chi-square (large sample). Ordinal Data: Wilcoxon Rank Sum Fisher's (small sample), Chi-square (large sample) Log-rank test or Mantel-Haenszel Compare two paired groups Paired t test Wilcoxon Signed Rank, Sign test (small sample), McNemar's test (large sample) Sign test (small sample), McNemar's test (large sample) Conditional proportional hazards regression Compare three or more unmatched groups One-way ANOVA Kruskal-Wallis test Chi-square test Cox proportional hazard regression Compare three or more groups (matched or unmatched) Repeated-measures ANOVA or MANOVA Friedman test Cochrane Q

Type of Data Goal Measurement from Normal Population Rank, Score, or Measure from Non-Normal Population Binomial (Two Possible Outcomes) Survival Time Compare groups with known association to other variables ANCOVA, MANOVA (Principle Components & Factor Analysis) - Quantify association between two variables Pearson’s r Nominal: Relative Risk Odds Ratio. Ordinal: Spearman Rho, Kendall’s Tau Contingency coefficients Predict value from another measured variable Simple linear regression or Nonlinear regression Nonparametric regression Simple logistic regression Cox proportional hazard regression Predict value from several measured or binomial variables Multiple linear regression or Multiple nonlinear regression Multiple logistic regression Developed from: Intuitive Biostatistics, H.J. Motulsky, Ch. 37, Oxford University Press, 1995. & Hermansen, M. Biostatistics: Some Basic Concepts. Caduceus Medical Publishers. 1990.