Introduction to Biostatistics

Slides:



Advertisements
Similar presentations
Population vs. Sample Population: A large group of people to which we are interested in generalizing. parameter Sample: A smaller group drawn from a population.
Advertisements

To Select a Descriptive Statistic
A PowerPoint®-based guide to assist in choosing the suitable statistical test. NOTE: This presentation has the main purpose to assist researchers and students.
STATISTICAL ANALYSIS. Your introduction to statistics should not be like drinking water from a fire hose!!
CHAPTER TWELVE ANALYSING DATA I: QUANTITATIVE DATA ANALYSIS.
Chapter 13 Conducting & Reading Research Baumgartner et al Data Analysis.
Descriptive Statistics
1 Basic statistics Week 10 Lecture 1. Thursday, May 20, 2004 ISYS3015 Analytic methods for IS professionals School of IT, University of Sydney 2 Meanings.
Chapter 19 Data Analysis Overview
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 18-1 Chapter 18 Data Analysis Overview Statistics for Managers using Microsoft Excel.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Statistics Idiots Guide! Dr. Hamda Qotba, B.Med.Sc, M.D, ABCM.
Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242.
Class Meeting #11 Data Analysis. Types of Statistics Descriptive Statistics used to describe things, frequently groups of people.  Central Tendency 
Chapter 21 Basic Statistics.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
STATISTICAL ANALYSIS FOR THE MATHEMATICALLY-CHALLENGED Associate Professor Phua Kai Lit School of Medicine & Health Sciences Monash University (Sunway.
ANALYSIS PLAN: STATISTICAL PROCEDURES
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Angela Hebel Department of Natural Sciences
Review Lecture 51 Tue, Dec 13, Chapter 1 Sections 1.1 – 1.4. Sections 1.1 – 1.4. Be familiar with the language and principles of hypothesis testing.
Analyzing and Interpreting Quantitative Data
Chap 18-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 18-1 Chapter 18 A Roadmap for Analyzing Data Basic Business Statistics.
Appendix B: Statistical Methods. Statistical Methods: Graphing Data Frequency distribution Histogram Frequency polygon.
Statistics for Neurosurgeons A David Mendelow Barbara A Gregson Newcastle upon Tyne England, UK.
1 UNIT 13: DATA ANALYSIS. 2 A. Editing, Coding and Computer Entry Editing in field i.e after completion of each interview/questionnaire. Editing again.
Power Point Slides by Ronald J. Shope in collaboration with John W. Creswell Chapter 7 Analyzing and Interpreting Quantitative Data.
Approaches to quantitative data analysis Lara Traeger, PhD Methods in Supportive Oncology Research.
Statistics and probability Dr. Khaled Ismael Almghari Phone No:
Chapter 18 Data Analysis Overview Yandell – Econ 216 Chap 18-1.
Statistics & Evidence-Based Practice
COMPLETE BUSINESS STATISTICS
Statistical Data Analysis
Inferential Statistics
Nonparametric tests, χ², logistic analysis
Practice As part of a program to reducing smoking, a national organization ran an advertising campaign to convince people to quit or reduce their smoking.
BIOSTATISTICS Qualitative variable (Categorical) DESCRIPTIVE
MATH-138 Elementary Statistics
Course Objectives Define the concepts of Biostatistics, and common terminologies used Describe the different types of Scales of measurements Populations,
Statistical tests for quantitative variables
Statistics in psychology
Non-Parametric Tests 12/1.
Non-Parametric Tests 12/6.
Statistics.
CHOOSING A STATISTICAL TEST
Basic Statistics Overview
Parametric vs Non-Parametric
Non-Parametric Tests.
Description of Data (Summary and Variability measures)
Analysis of Data Graphics Quantitative data
Social Research Methods
Georgi Iskrov, MBA, MPH, PhD Department of Social Medicine
Medical Statistics Dr. Gholamreza Khalili
SDPBRN Postgraduate Training Day Dundee Dental Education Centre
SA3202 Statistical Methods for Social Sciences
Introduction to Statistics
Basic Statistical Terms
قياس المتغيرات في المنهج الكمي
Writing the IA Report: Analysis and Evaluation
Non – Parametric Test Dr. Anshul Singh Thapa.
Unit XI: Data Analysis in nursing research
15.1 The Role of Statistics in the Research Process
Review for Exam 1 Ch 1-5 Ch 1-3 Descriptive Statistics
DESIGN OF EXPERIMENT (DOE)
InferentIal StatIstIcs
Georgi Iskrov, MBA, MPH, PhD Department of Social Medicine
PSYCHOLOGY AND STATISTICS
Introductory Statistics
Examine Relationships
Presentation transcript:

Introduction to Biostatistics Nguyen Quang Vinh – Goto Aya

What & Why is Statistics What & Why is Statistics? + Statistics, Modern society + Objectives → Statistics Applying for Data analysis + Correct scene - Dummy tables + Right tests

What & Why is Statistics?

Statistics Statistics: - science of data - study of uncertainty Biostatistics: data from: Medicine, Biological sciences (business, education, psychology, agriculture, economics...) Modern society: - Reading, Writing & - Statistical thinking: to make the strongest possible conclusions from limited amounts of data.

Objectives (1) Organize & summarize data (2) Reach inferences (sample  population) Statistics: Descriptive statistics  (1) Inferential statistics  (2)

Descriptive statistics Grouped data the frequency distribution Measures of central tendency Measures of dispersion (dispersion, variation, spread, scatter) Measures of position Exploratory data analysis (EDA) Measures of shape of distribution: graphs, skewness, kurtosis

Inferential statistics drawing of inferences Estimation Hypothesis testing  reaching a decision + Parametric statistics + Non-parametric statistics << Distribution-free statistics Modeling, Predicting

Descriptive statistics GROUPED DATA THE FREQUENCY DISTRIBUTION Tables Class Limit Frequency Relative frequency Cumulative Frequency Cumulative Relative Frequency ...

Descriptive statistics MEASURES OF CENTRAL TENDENCY The Mean (arithmetic mean) The Median (Md) The Midrange (Mr) Mode (Mo)

Descriptive statistics MEASURES OF DISPERSION (dispersion, variation, spread, scatter) Range Variance Standard Deviation Coefficient of Variance

Descriptive statistics Exploratory data analysis (EDA) Stem & Leaf displays Box-and-Whisker Plots (min, Q1, Q2, Q3, max)

Descriptive statistics MEASURES OF SHAPE OF DISTRIBUTION Graphs Frequency distribution Relative frequency of occurrence  proportion of values Nominal, Ordinal level Bar chart Pie chart Interval, Ratio level The histogram: frequency histogram & relative frequency histogram Frequency polygon: midpoint of class interval Pareto chart: bar chart with descending sorted frequency Cumulative frequency Cumulative relative frequency → OGIVE graph (Ojiv or Oh’- jive graph)

Descriptive statistics MEASURES OF SHAPE OF DISTRIBUTION Skewness, Kurtosis Skewness (Sk), Pearsonian coefficient, is a measure of asymmetry of a distribution around its mean. Kurtosis characterizes the relative peakedness or flatness of a distribution compared with the normal distribution.

Inferential statistics Estimation

Inferential statistics Hypothesis testing  reaching a decision

Inferential statistics Modeling, Predicting

What statistical calculations cannot do Choosing good sample Choosing good variables Measuring variables precisely

Goals for physicians Understand the statistics portions of most articles in medical journals. Avoid being bamboozled by statistical nonsense. Do simple statistics calculations yourself. Use a simple statistics computer program to analyze data. Be able to refer to a more advanced statistics text or communicate with a statistical consultant (without an interpreter). Not being a number 1 “statistician” among physicians! But understanding... 19

Two problems: Important differences are often obscured (biological variability and/or experimental imprecision) Overgeneralize

How to overcome Scientific & Clinical Judgment Common sense Leap of faith

Statistics encourage investigators to become thoughtful & independent problem solvers

Applying for Data analysis Very important! Have the authors set the scene correctly? → Dummy tables

Wilcoxon-Mann-Whitney test Wilcoxon signed ranks test, Sign test Choosing a test for comparing the averages of 2 or more samples of scores of experiments with one treatment factor Data Between subjects (independent samples) Within subjects (related samples) 2 samples Interval Independent t-test Paired t-test Ordinal Wilcoxon-Mann-Whitney test Wilcoxon signed ranks test, Sign test Nominal Chi-square test Mc Nemar test > 2 samples One way ANOVA Repeated measured ANOVA Kruskal-Wallis test Friedman test Cochran’s Q test (dichotomous data only)

Scheme for choosing one-sample test Nominal 2 categories >2 categories Binomial test Chi-square test Ordinal Randomness Distribution Runs test Kolmogorov-Smirnov test Interval Mean t-test

Measures of association between 2 variables Data Statistic Interval Pearson Correlation (r) Ordinal Spearman’s Rho, Kendall’s tau-a, tau-b, tau-c Nominal Phi, Cramer V

Design Data summary Statistics & Tests 2 independent groups Proportions Rank Ordered Mean Survival Chi-square, Fisher-exact Mann-Whitney U Unpaired t-test Mantel-Haenzel, Log rank 2 related groups McNemar Chi-square Sign test Wilcoxon signed rank Paired t-test More than 2 independent groups Chi-square Kruskal-Wallis ANOVA Log rank More than 2 related groups Cochran Q Friedman Repeated ANOVA Study of Causation; one independent variable (univariate) Proportion Relative Risk Odd Ratios Correlation coefficient Study of Causation; more than one independent variable (Multivariate) Discriminant Analysis Multiple Logistic Regression Log Linear Model Regression Analysis Multiple Classification Analysis

How to interpret statistical results Example

Example 113 newborns, Male:Female = 50:63, were weighted (grams) as follow: Male: 3500, 3700, 3400, 3400, 3400, 3100, 4100, 3600, 3600, 3400, 3800, 3100, 2400, 2800, 2600, 2100, 1800, 2700, 2400, 2400, 2200, 2600, 4600, 4400, 4400, 2100, 4300, 3000, 3300, 3100, 3400, 3300, 4100, 2300, 3000, 4400, 3100, 2900, 2400, 3500, 3400, 3400, 3100, 3600, 3400, 3100, 2800, 2800, 2600, 2100. Female: 3900, 2800, 3300, 3000, 3200, 3600, 3400, 3300, 3300, 3300, 4200, 4500, 4200, 4100, 2400, 3100, 3500, 3100, 2800, 3500, 3800, 2300, 3200, 2300, 2400, 2200, 4400, 4100, 3700, 4400, 3900, 4100, 4300, 4100, 2900, 2500, 2200, 2400, 2300, 2500, 2200, 4100, 3700, 4000, 4000, 3800, 3800, 3300, 3000, 2900, 2000, 2800, 2300, 2400, 2100, 3700, 3400, 3900, 4100, 3600, 3800, 2400, 1800.

Questions % of F ≠ 50% Mean of weights ≠ 3000g

Descriptive statistics n= 113 Gender: Female (n,%) 63 (0.56%)

Descriptive statistics n= 113 Weight: Mean: 3217.7g (S.D.= 0.499g) Median: 3300g (Min: 1800g, Max: 4600g)

Analytic statistics Binomial test Test of p = 0.5 vs. p not = 0.5 The results indicate that there is no statistically significant difference (p = 0.259). In other words, the proportion of females in this sample does not significantly differ from the hypothesized value of 50%. f/n Sample p 95% CI p-value Female 63/113 0.56 0.46-0.65 0.259

Analytic statistics One sample t-test Test of μ = 3000 vs. not = 3000 The mean of the variable weight 3217.70g, which is statistically significantly different from the test value of 3000g. Conclusion: this group of newborns has a significantly higher weight mean. n= 113 Mean SD SEM 95% CI t p Weight 3217.70 711.42 66.92 3085.10-3350.30 3.25 0.002

References Intuitive Biostatistics. Harvey Motulsky. Oxford University Press, 2010. Business Statistics Textbook. Alan H. Kvanli, Robert J. Pavur, C. Stephen Guynes. University of North Texas, 2000. Biostatistics: A Foundation for Analysis in the Health Sciences. Wayne W. Daniel. Georgia State University, 1991.