Introduction to Medical Statistics. Why Do Statistics? Extrapolate from data collected to make general conclusions about larger population from which.

Slides:



Advertisements
Similar presentations
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Advertisements

CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Inference Sampling distributions Hypothesis testing.
Jo Sweetland Research Occupational Therapist
Introduction to statistics in medicine – Part 1 Arier Lee.
Statistics. Review of Statistics Levels of Measurement Descriptive and Inferential Statistics.
Statistical Tests Karen H. Hagglund, M.S.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
Sampling Distributions
Statistics By Z S Chaudry. Why do I need to know about statistics ? Tested in AKT To understand Journal articles and research papers.
1 Introduction to biostatistics Lecture plan 1. Basics 2. Variable types 3. Descriptive statistics: Categorical data Categorical data Numerical data Numerical.
(a brief over view) Inferential Statistics.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
1. Statistics: Learning from Samples about Populations Inference 1: Confidence Intervals What does the 95% CI really mean? Inference 2: Hypothesis Tests.
Statistical Analysis I have all this data. Now what does it mean?
Statistical Analysis Statistical Analysis
Statistics in psychology Describing and analyzing the data.
Inferential Stats, Discussions and Abstracts!! BATs Identify which inferential test to use for your experiment Use the inferential test to decide if your.
Statistical Analysis I have all this data. Now what does it mean?
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Data Analysis (continued). Analyzing the Results of Research Investigations Two basic ways of describing the results Two basic ways of describing the.
RESULTS & DATA ANALYSIS. Descriptive Statistics  Descriptive (describe)  Frequencies  Percents  Measures of Central Tendency mean median mode.
Introduction to Biostatistics, Harvard Extension School © Scott Evans, Ph.D.1 Descriptive Statistics, The Normal Distribution, and Standardization.
1 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson GE 5 Tutorial 5.
DATA IDENTIFICATION AND ANALYSIS. Introduction  During design phase of a study, the investigator must decide which type of data will be collected and.
Day 2 Session 1 Basic Statistics Cathy Mulhall South East Public Health Observatory Spring 2009.
Final review - statistics Spring 03 Also, see final review - research design.
CONFIDENCE INTERVAL It is the interval or range of values which most likely encompasses the true population value. It is the extent that a particular.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Research Seminars in IT in Education (MIT6003) Quantitative Educational Research Design 2 Dr Jacky Pow.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
Statistical test for Non continuous variables. Dr L.M.M. Nunn.
The use of statistics in psychology. statistics Essential Occasionally misleading.
Experimental Psychology PSY 433 Appendix B Statistics.
Medical Statistics as a science
How confident are we in the estimation of mean/proportion we have calculated?
Introduction to Statistics Santosh Kumar Director (iCISA)
The exam is of 2 hours & Marks :40 The exam is of two parts ( Part I & Part II) Part I is of 20 questions. Answer any 15 questions Each question is of.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
Medical Statistics as a science. Меdical Statistics: To do this we must assume that all data is randomly sampled from an infinitely large population,
AP Statistics Section 11.1 B More on Significance Tests.
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 6 Putting Statistics to Work.
Copyright © 2005 Pearson Education, Inc. Slide 6-1.
PCB 3043L - General Ecology Data Analysis.
Tuesday, April 8 n Inferential statistics – Part 2 n Hypothesis testing n Statistical significance n continued….
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
STATISTICS FOR SCIENCE RESEARCH (The Basics). Why Stats? Scientists analyze data collected in an experiment to look for patterns or relationships among.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
Statistics Nik Bobrovitz BHSc, MSc PhD Student University of Oxford December 2015
Chapter 13 Understanding research results: statistical inference.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
AP PSYCHOLOGY: UNIT I Introductory Psychology: Statistical Analysis The use of mathematics to organize, summarize and interpret numerical data.
Agenda n Probability n Sampling error n Hypothesis Testing n Significance level.
Outline Sampling Measurement Descriptive Statistics:
Data Analysis.
Medical Statistics as a science
STATISTICS FOR SCIENCE RESEARCH
Dr.MUSTAQUE AHMED MBBS,MD(COMMUNITY MEDICINE), FELLOWSHIP IN HIV/AIDS
Statistics in psychology
How Psychologists Ask and Answer Questions Statistics Unit 2 – pg
Statistics.
Inferential statistics,
AP Biology Intro to Statistics
Data analysis and basic statistics
Presentation transcript:

Introduction to Medical Statistics

Why Do Statistics? Extrapolate from data collected to make general conclusions about larger population from which data sample was derived Extrapolate from data collected to make general conclusions about larger population from which data sample was derived Allows general conclusions to be made from limited amounts of data Allows general conclusions to be made from limited amounts of data To do this we must assume that all data is randomly sampled from an infinitely large population, then analyse this sample and use results to make inferences about the population To do this we must assume that all data is randomly sampled from an infinitely large population, then analyse this sample and use results to make inferences about the population

Walter Frank Raphael Weldon

Karl Pearson

Data Categorical data:  values belong to categories Categorical data:  values belong to categories Nominal data: there is no natural order to the categories e.g. blood groups Nominal data: there is no natural order to the categories e.g. blood groups Ordinal data: there is natural order e.g. Adverse Events (Mild/Moderate/Severe/Life Threatening) Ordinal data: there is natural order e.g. Adverse Events (Mild/Moderate/Severe/Life Threatening) Binary data: there are only two possible categories e.g. alive/dead Binary data: there are only two possible categories e.g. alive/dead Numerical data:  the value is a number (either measured or counted) Numerical data:  the value is a number (either measured or counted) Continuous data: measurement is on a continuum e.g. height, age, haemoglobin Continuous data: measurement is on a continuum e.g. height, age, haemoglobin Discrete data: a “count” of events e.g. number of pregnancies Discrete data: a “count” of events e.g. number of pregnancies

Descriptive Statistics: Descriptive Statistics: concerned with summarising or describing a sample eg. mean, median Inferential Statistics: Inferential Statistics: concerned with generalising from a sample, to make estimates and inferences about a wider population eg. T-Test, Chi Square test

Statistical Terms Mean:  the average of the data  sensitive to outlying data Mean:  the average of the data  sensitive to outlying data Median:  the middle of the data  not sensitive to outlying data Median:  the middle of the data  not sensitive to outlying data Mode:  most commonly occurring value Mode:  most commonly occurring value Range:  the spread of the data Range:  the spread of the data IQ range:  the spread of the data  commonly used for skewed data IQ range:  the spread of the data  commonly used for skewed data Standard deviation:  a single number which measures how much the observations vary around the mean Standard deviation:  a single number which measures how much the observations vary around the mean Symmetrical data:  data that follows normal distribution  (mean=median=mode)  report mean & standard deviation & n Symmetrical data:  data that follows normal distribution  (mean=median=mode)  report mean & standard deviation & n Skewed data:  not normally distributed  (mean  median  mode)  report median & IQ Range Skewed data:  not normally distributed  (mean  median  mode)  report median & IQ Range

Standard Normal Distribution

Mean +/- 1 SD  encompasses 68% of observations Mean +/- 2 SD  encompasses 95% of observations Mean +/- 3SD  encompasses 99.7% of observations

Steps in Statistical Testing Null hypothesis Ho: there is no difference between the groups Null hypothesis Ho: there is no difference between the groups Alternative hypothesis H1: there is a difference between the groups Alternative hypothesis H1: there is a difference between the groups Collect data Collect data Perform test statistic eg T test, Chi square Perform test statistic eg T test, Chi square Interpret P value and confidence intervals Interpret P value and confidence intervals P value  0.05 Reject Ho P value > 0.05 Accept Ho Draw conclusions Draw conclusions

Meaning of P P Value: the probability of observing a result as extreme or more extreme than the one actually observed from chance alone P Value: the probability of observing a result as extreme or more extreme than the one actually observed from chance alone Lets us decide whether to reject or accept the null hypothesis Lets us decide whether to reject or accept the null hypothesis P > 0.05Not significant P > 0.05Not significant P = 0.01 to 0.05Significant P = 0.01 to 0.05Significant P = to 0.01Very significant P = to 0.01Very significant P < 0.001Extremely significant P < 0.001Extremely significant

T Test T test checks whether two samples are likely to have come from the same or different populations T test checks whether two samples are likely to have come from the same or different populations Used on continuous variables Used on continuous variables Example: Age of patients in the APC study (APC/placebo) Example: Age of patients in the APC study (APC/placebo) PLACEBO: APC: mean age 60.6 years mean age 60.5 years SD+/- 16.5SD +/ SD+/- 16.5SD +/ n= 840n= 850 n= 840n= % CI % CI % CI % CI What is the P value? What is the P value? P =  not significant  patients from the same population (groups designed to be matched by randomisation so no surprise!!) P =  not significant  patients from the same population (groups designed to be matched by randomisation so no surprise!!)

T Test: SAFE “Serum Albumin” Q: Are these albumin levels different? Ho = Levels are the same (any difference is there by chance) H1 =Levels are too different to have occurred purely by chance Statistical test: T test  P < (extremely significant) Reject null hypothesis (Ho) and accept alternate hypothesis (H1) ie. 1 in chance that these samples are both from the same overall group therefore we can say they are very likely to be different PLACEBO ALBUMIN n mean28 30 SD % CI

RANDOMIZED CONTROLLED TRIALS

Reducing Sample Size Same results but using much smaller sample size (one tenth) Same results but using much smaller sample size (one tenth) ALIVE DEAD TOTAL % DEAD ALIVE DEAD TOTAL % DEAD PLACEBO 58 (69.2%) 26 (30.8%) 84 (100%) 30.8 DEAD 64 (75.3%) 21 (24.7%) 85 (100%) 24.7 TOTAL 122 (72.2%) 47 (27.8%) 169 (100%)  Reduction in death rate = 6.1% (still the same)  Perform Chi Square test  P = in 100 times this difference in mortality could have happened by chance therefore results not significant  Again, power of a study to find a difference depends a lot on sample size for binary data as well as continuous data

Summary Size matters=BIGGER IS BETTER Size matters=BIGGER IS BETTER Spread matters=SMALLER IS BETTER Spread matters=SMALLER IS BETTER Bigger difference=EASIER TO FIND Bigger difference=EASIER TO FIND Smaller difference=MORE DIFFICULT TO FIND Smaller difference=MORE DIFFICULT TO FIND To find a small difference you need a big study To find a small difference you need a big study