Biostatistics – A Revisit What are they? Why do we need them? Their relevance and importance.

Slides:



Advertisements
Similar presentations
A PowerPoint®-based guide to assist in choosing the suitable statistical test. NOTE: This presentation has the main purpose to assist researchers and students.
Advertisements

CHAPTER TWELVE ANALYSING DATA I: QUANTITATIVE DATA ANALYSIS.
Departments of Medicine and Biostatistics
Statistical Tests Karen H. Hagglund, M.S.
MSc Applied Psychology PYM403 Research Methods Quantitative Methods I.
Basic Statistical Review
1 Practicals, Methodology & Statistics II Laura McAvinue School of Psychology Trinity College Dublin.
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Social Research Methods
Biostatistics in Research Practice: Non-parametric tests Dr Victoria Allgar.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
The Research Skills exam: The four horsemen of the apocalypse: pestilence, war, famine and the RS1 exam.
Statistical Techniques in Hospital Management QUA 537
Statistics Idiots Guide! Dr. Hamda Qotba, B.Med.Sc, M.D, ABCM.
1 Introduction to biostatistics Lecture plan 1. Basics 2. Variable types 3. Descriptive statistics: Categorical data Categorical data Numerical data Numerical.
Non-Parametric Methods Professor of Epidemiology and Biostatistics
Understanding Research Results
Practical statistics for Neuroscience miniprojects Steven Kiddle Slides & data :
Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242.
Statistics for the Social Sciences Psychology 340 Fall 2013 Thursday, November 21 Review for Exam #4.
Fundamentals of Statistical Analysis DR. SUREJ P JOHN.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Exploratory Data Analysis. Height and Weight 1.Data checking, identifying problems and characteristics Data exploration and Statistical analysis.
Can I Believe It? Understanding Statistics in Published Literature Keira Robinson – MOH Biostatistics Trainee David Schmidt – HETI Rural and Remote Portfolio.
Non-Parametric Methods Professor of Epidemiology and Biostatistics
Descriptive Statistics e.g.,frequencies, percentiles, mean, median, mode, ranges, inter-quartile ranges, sds, Zs Describe data Inferential Statistics e.g.,
Choosing and using statistics to test ecological hypotheses
Introduction To Biological Research. Step-by-step analysis of biological data The statistical analysis of a biological experiment may be broken down into.
Common Nonparametric Statistical Techniques in Behavioral Sciences Chi Zhang, Ph.D. University of Miami June, 2005.
Algonquin College - Jan Ladas1 Community Dental Health Algonquin College.
2 Categorical Variables (frequencies) Testing mean differences of a continuous variable between groups (categorical variable) 2 Continuous Variables 2.
Choosing the Appropriate Statistics Dr. Erin Devers October 17, 2012.
Chapter 16 The Chi-Square Statistic
Research Seminars in IT in Education (MIT6003) Quantitative Educational Research Design 2 Dr Jacky Pow.
Descriptive statistics Petter Mostad Goal: Reduce data amount, keep ”information” Two uses: Data exploration: What you do for yourself when.
CHI SQUARE TESTS.
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
Chapter 13 CHI-SQUARE AND NONPARAMETRIC PROCEDURES.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Angela Hebel Department of Natural Sciences
Review Lecture 51 Tue, Dec 13, Chapter 1 Sections 1.1 – 1.4. Sections 1.1 – 1.4. Be familiar with the language and principles of hypothesis testing.
Fundamental Concepts of Biostatistics Cathy Jenkins, MS Biostatistician II Lisa Kaltenbach, MS Biostatistician II April 17, 2007.
SPSS Workshop Day 2 – Data Analysis. Outline Descriptive Statistics Types of data Graphical Summaries –For Categorical Variables –For Quantitative Variables.
Elang 273: Statistics. Review: Scientific Method 1. Observe something 2. Speculated why it is so and form hypothesis 3. Test hypothesis by getting data.
Principles of statistical testing
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Biostatistics Nonparametric Statistics Class 8 March 14, 2000.
1 Week 3 Association and correlation handout & additional course notes available at Trevor Thompson.
Revision of topics for CMED 305 Final Exam. The exam duration: 2 hours Marks :25 All MCQ’s. (50 questions) You should choose the correct answer. No major.
Nonparametric Statistics
Hypothesis Testing Procedures Many More Tests Exist!
Interpretation of Common Statistical Tests Mary Burke, PhD, RN, CNE.
Dr.Rehab F.M. Gwada. Measures of Central Tendency the average or a typical, middle observed value of a variable in a data set. There are three commonly.
PSY 325 AID Education Expert/psy325aid.com FOR MORE CLASSES VISIT
Appendix I A Refresher on some Statistical Terms and Tests.
Chapter 11 Summarizing & Reporting Descriptive Data.
Introduction to Marketing Research
32931 Technology Research Methods Autumn 2017 Quantitative Research Component Topic 4: Bivariate Analysis (Contingency Analysis and Regression Analysis)
EXPLORATORY DATA ANALYSIS and DESCRIPTIVE STATISTICS
LEVELS of DATA.
Statistics.
Basic Statistics Overview
Georgi Iskrov, MBA, MPH, PhD Department of Social Medicine
Bivariate Testing (Chi Square)
The Chi-Square Distribution and Test for Independence
Bivariate Testing (Chi Square)
Program This course will be dived into 3 parts: Part 1 Descriptive statistics and introduction to continuous outcome variables Part 2 Continuous outcome.
Data Analysis Module: Chi Square
Georgi Iskrov, MBA, MPH, PhD Department of Social Medicine
Presentation transcript:

Biostatistics – A Revisit What are they? Why do we need them? Their relevance and importance

The Research Question (+ scientific hypothesis) - Be specific with the question and familiar with the literature Select a Study design to answer the question – observation versus intervention Study design issue – the sample - type of sample? Random or non-random - sample size? (question of power and feasibility) Lab/Field work + Data entry - ensure quality control, most important

How should you present your data? – Descriptive Statistics -as they are i.e. as tables of raw numbers -Numerical summaries * frequencies (%) * mean and SD (or median and IQR) -visually (in figures or graphs) -Generally tables of raw data go into appendix -Tables useful for detailed and specific information (data) - Graphs/Figures for visual impact (less precise) -Frequencies (%) or means? To answer this need to know what kind of data you have

What kind of data do you have? Categorical Nominal – no order e.g. Gender, hair colour Ordinal – has an order e.g. Gene type, blood group, Age group, tumour grade Metric Continuous – measured, has units e.g. Age, Height, Systolic blood pressure Discrete – counts e.g No. of asthma attacks/month white blood cell count, no. of children

Categorical data and discrete data Numerically - Express these as frequencies (%) Graphically – use bar charts or pie charts Continuous data (show units of measurement in tables/graphs) Numerically - Mean + SD (for symmetrical distributions) - Median + IQR (75%tile - 25%tile) (for skewed data) Graphically – use histograms or boxplots

Most common research questions: Is there a difference, a change, an association? While we may see a difference, a change, an apparent association – how do we go about determining if they are statistically different? INFERENTIAL STATISTICS (excluding chance as a possible explanation of your results) Before choosing an inferential test need to know: what type of data? – categorical or continuous what type of distribution? – normal or non-normal

INFERENTIAL STATISTICS – Scenario I Have 2 categorical variables e.g. sex and gene type Research question: is there a difference in gene profile by sex? Data analysis: Run a cross-tab and apply the chi-square statistical test to examine if the two variables are related. If they are - you will see a difference in the gene frequencies between males and females. Also, the p value will be small (<.05) I.e. very unlikely that chance is accounting for the differences observed. The  2 will be large.

INFERENTIAL STATISTICS – Scenario II Have 1 categorical variable and 1 continuous variable (most common scenario in lab.) Research question: is there a difference in creatinine levels between smokers and non-smokers? Data analysis: Before choosing an appropriate test answer following questions - how many groups being compared? - are they paired or independent (non-related)? - is creatinine (continuous variable) nomally distributed? (To test for normality run a 1 K-S test)

If data is normally distributed then select a parametric test – two groups non-related so select an independent sample t-test (most common test) If data not normally distributed (e.g. very skewed) then you have a choice: i)Transform the data and use a parametric test Or ii) use a non-parametric test (e.g. Mann whitney U test) The choice here depends largely on precedent – how have others in the field analysed this type of data

Data Normal and involves two groups – related use a paired T-test (T statistic) Data Normal and involves two groups – non-related use a independent T-test (T statistic) Data Normal and involves more than two groups – use a 1-way ANOVA (F statistic) Data is NOT Normally distributed – use a non-parametric test which is equivalent to one of the above situations

Non-parametric tests are distribution free – make no assumptions about the data are good for skewed data or small samples are NOT as powerful as parametric tests and therefore not as likely to pick up statistically significant differences when they may be there

When is a test statistically significant? When P is small – less than 0.05 (p<0.05) and the test statistic is large e.g.  2, F statistic (ANOVA), T statistic (T tests). When P is large e.g. p = 0.69 then the test is not significant and chance cannot be excluded as a possible explanation of the results. Always show the exact p value – provides more information than just giving NS (not significant) N.B. Just because a difference is found to be statistically significant does not mean that it is clinically or practically significant. Be sure to distinguish these!

INFERENTIAL STATISTICS – Scenario III Have 2 continuous variables – how are they related (associated)? E.g. weight and height Research question: is there an association between weight and height? Data analysis: i)Examine this visually using a scatterplot ii)Examine this statistically using Pearson correlation coefficient r (values -1 to +1). This will assess the strength of linear association between the two variables

Association and Agreement Association and Agreement are not the same! when you have a new method and you wish to assess it against a gold standard then it is appropriate to examine the level of agreement as well as linear association. If variable is continuous then use a Bland and Altman plot and assess the statistical differences between methods using a Paired T test. Assess degree of bias by getting the mean of the difference. For categorical data use Kappa. (values >.7 indicate good agreement).