INTRODUCTION TO BIOSTATISTICS DR.S.Shaffi Ahamed Asst. Professor Dept. of Family and Comm. Medicine KKUH.

Slides:



Advertisements
Similar presentations
Population Population
Advertisements

Chapter 2 Summarizing and Graphing Data
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Statistics for Managers Using Microsoft® Excel 5th Edition
IB Math Studies – Topic 6 Statistics.
Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.
The controlled assessment is worth 25% of the GCSE The project has three stages; 1. Planning 2. Collecting, processing and representing data 3. Interpreting.
Statistical Tests Karen H. Hagglund, M.S.
Uses of Biostatistics in Epidemiology (1) Amornrath Podhipak, Ph.D. Department of Epidemiology Faculty of Public Health Mahidol University 2006.
1 Introduction to biostatistics By Dr. S. Shaffi Ahamed Asst. Professor Dept. of Family & Community Medicine KKUH.
INTRODUCTION TO BIOSTATISTICS DR.S.Shaffi Ahamed Asst. Professor Dept. of Family and Comm. Medicine KKUH.
DESCRIPTIVE STATISTICS: GRAPHICAL AND NUMERICAL SUMMARIES
INTRODUCTION TO BIOSTATISTICS
Introduction to Statistics
Thomas Songer, PhD with acknowledgment to several slides provided by M Rahbar and Moataza Mahmoud Abdel Wahab Introduction to Research Methods In the Internet.
Introduction to Biostatistics
Statistical Techniques in Hospital Management QUA 537
The Stats Unit.
PSYCHOLOGY 820 Chapters Introduction Variables, Measurement, Scales Frequency Distributions and Visual Displays of Data.
Medical Statistics (full English class) Ji-Qian Fang School of Public Health Sun Yat-Sen University.
The exam is of 2 hours & Marks :40 The exam is of two parts ( Part I & Part II) Part I is of 20 questions. Answer any 15 questions Each question is.
Collecting, Presenting, and Analyzing Research Data By: Zainal A. Hasibuan Research methodology and Scientific Writing W# 9 Faculty.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 3 Organizing and Displaying Data.
Data Presentation.
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
Smith/Davis (c) 2005 Prentice Hall Chapter Four Basic Statistical Concepts, Frequency Tables, Graphs, Frequency Distributions, and Measures of Central.
1 Introduction to biostatistics By Dr. S. Shaffi Ahamed Asst. Professor Dept. of Family & Community Medicine KKUH.
Chapter 2 Describing Data.
Dr.Shaikh Shaffi Ahamed Ph.D., Associate Professor Dept. of Family & Community Medicine.
The exam duration: 1hour 30 min. Marks :25 All MCQ’s. You should choose the correct answer. No major calculations, but simple maths IQ is required. No.
Introduction to Statistics Mr. Joseph Najuch Introduction to statistical concepts including descriptive statistics, basic probability rules, conditional.
Biostatistics Class 1 1/25/2000 Introduction Descriptive Statistics.
Areej Jouhar & Hafsa El-Zain Biostatistics BIOS 101 Foundation year.
Medical Statistics Medical Statistics Tao Yuchun Tao Yuchun 1
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Biostatistics.
Categorical vs. Quantitative…
Biostatistics, statistical software I. Basic statistical concepts Krisztina Boda PhD Department of Medical Informatics, University of Szeged.
Dr.Shaikh Shaffi Ahamed Ph.D., Associate Professor Dept. of Family & Community Medicine.
Chapter Eight: Using Statistics to Answer Questions.
By: Asma Al-Oneazi Supervised by… Dr. Amal Fatani.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Presenting Data in Charts, Graphs and Tables #1-8-1.
Postgraduate books recommended by Degree Management and Postgraduate Education Bureau, Ministry of Education Medical Statistics (the 2nd edition) 孙振球 主.
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
Data, Type and Methods of representation Dr Hidayathulla Shaikh.
Measurements Statistics WEEK 6. Lesson Objectives Review Descriptive / Survey Level of measurements Descriptive Statistics.
Revision of topics for CMED 305 Final Exam. The exam duration: 2 hours Marks :25 All MCQ’s. (50 questions) You should choose the correct answer. No major.
Elementary Statistics (Math 145) June 19, Statistics is the science of collecting, analyzing, interpreting, and presenting data. is the science.
Types of Data Dr.Lely Lubna Alaydrus Community Medicine Department Aimst University.
Dr.Shaikh Shaffi Ahamed Ph.D., Associate Professor Dept. of Family & Community Medicine.
Graphical Presentation Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016.
1 By maintaining a good heart at every moment, every day is a good day. If we always have good thoughts, then any time, any thing or any location is auspicious.
Biostatistics Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016.
2 NURS/HSCI 597 NURSING RESEARCH & DATA ANALYSIS GEORGE MASON UNIVERSITY.
Introduction to Biostatistics Lecture 1. Biostatistics Definition: – The application of statistics to biological sciences Is the science which deals with.
Data Presentation Numerical Summary Measures Chung-Yi Li, PhD Dept. of Public Health, College of Med. NCKU.
INTRODUCTION AND DEFINITIONS
Introduction to biostatistics & Basic Concepts By Dr. S
Introduction to Bio-Medical statistics
CHAPTER 5 Basic Statistics
Chapter 5 STATISTICS (PART 1).
Description of Data (Summary and Variability measures)
Introduction to Statistics
Basic Statistical Terms
Biostatistics College of Medicine University of Malawi 2011.
THE STAGES FOR STATISTICAL THINKING ARE:
Chapter Nine: Using Statistics to Answer Questions
Biostatistics Lecture (2).
Essentials of Statistics 4th Edition
Presentation transcript:

INTRODUCTION TO BIOSTATISTICS DR.S.Shaffi Ahamed Asst. Professor Dept. of Family and Comm. Medicine KKUH

This session covers:  Background and need to know Biostatistics  Origin and development of Biostatistics  Definition of Statistics and Biostatistics  Types of data  Graphical representation of a data  Frequency distribution of a data

 “ Statistics is the science which deals with collection, classification and tabulation of numerical facts as the basis for explanation, description and comparison of phenomenon” Lovitt Lovitt

“BIOSTATISICS ”  (1) Statistics arising out of biological sciences, particularly from the fields of Medicine and public health.  (2) The methods used in dealing with statistics in the fields of medicine, biology and public health for planning, conducting and analyzing data which arise in investigations of these branches.

Origin and development of statistics in Medical Research  In 1929 a huge paper on application of statistics was published in Physiology Journal by Dunn.  In 1937, 15 articles on statistical methods by Austin Bradford Hill, were published in book form.  In 1948, a RCT of Streptomycin for pulmonary tb., was published in which Bradford Hill has a key influence.  Then the growth of Statistics in Medicine from 1952 was a 8-fold increase by 1982.

Douglas Altman Ronald FisherKarl Pearson C.R. Rao Gauss -

Basis

Sources of Medical Uncertainties 1.Intrinsic due to biological, environmental and sampling factors 2.Natural variation among methods, observers, instruments etc. 3.Errors in measurement or assessment or errors in knowledge 4.Incomplete knowledge

Intrinsic variation as a source of medical uncertainties  Biological due to age, gender, heredity, parity, height, weight, etc. Also due to variation in anatomical, physiological and biochemical parameters  Environmental due to nutrition, smoking, pollution, facilities of water and sanitation, road traffic, legislation, stress and strains etc.,  Sampling fluctuations because the entire world cannot be studied and at least future cases can never be included  Chance variation due to unknown or complex to comprehend factors

Natural variation despite best care as a source of uncertainties  In assessment of any medical parameter  Due to partial compliance by the patients  Due to incomplete information in conditions such as the patient in coma

Medical Errors that cause Uncertainties  Carelessness of the providers such as physicians, surgeons, nursing staff, radiographers and pharmacists.  Errors in methods such as in using incorrect quantity or quality of chemicals and reagents, misinterpretation of ECG, using inappropriate diagnostic tools, misrecording of information etc.  Instrument error due to use of non-standardized or faulty instrument and improper use of a right instrument.  Not collecting full information  Inconsistent response by the patients or other subjects under evaluation

Incomplete knowledge as a source of Uncertainties  Diagnostic, therapeutic and prognostic uncertainties due to lack of knowledge  Predictive uncertainties such as in survival duration of a patient of cancer  Other uncertainties such as how to measure positive health

Biostatistics is the science that helps in managing medical uncertainties Biostatistics is the science that helps in managing medical uncertainties

Reasons to know about biostatistics:  Medicine is becoming increasingly quantitative.  The planning, conduct and interpretation of much of medical research are becoming increasingly reliant on the statistical methodology.  Statistics pervades the medical literature.

CLINICAL MEDICINE  Documentation of medical history of diseases.  Planning and conduct of clinical studies.  Evaluating the merits of different procedures.  In providing methods for definition of “normal” and “abnormal”.

Role of Biostatistics in patient care  In increasing awareness regarding diagnostic, therapeutic and prognostic uncertainties and providing rules of probability to delineate those uncertainties  In providing methods to integrate chances with value judgments that could be most beneficial to patient  In providing methods such as sensitivity-specificity and predictivities that help choose valid tests for patient assessment  In providing tools such as scoring system and expert system that can help reduce epistemic uncertainties

PREVENTIVE MEDICINE  To provide the magnitude of any health problem in the community.  To find out the basic factors underlying the ill-health.  To evaluate the health programs which was introduced in the community (success/failure).  To introduce and promote health legislation.

Role of Biostatics in Health Planning and Evaluation  In carrying out a valid and reliable health situation analysis, including in proper summarization and interpretation of data.  In proper evaluation of the achievements and failures of a health programme

Role of Biostatistics in Medical Research  In developing a research design that can minimize the impact of uncertainties  In assessing reliability and validity of tools and instruments to collect the infromation  In proper analysis of data

Example: Evaluation of Penicillin (treatment A) vs Penicillin & Chloramphenicol (treatment B) for treating bacterial pneumonia in children< 2 yrs.  What is the sample size needed to demonstrate the significance of one group against other ?  Is treatment A is better than treatment B or vice versa ?  If so, how much better ?  What is the normal variation in clinical measurement ? (mild, moderate & severe) ?  How reliable and valid is the measurement ? (clinical & radiological) ?  What is the magnitude and effect of laboratory and technical error ? error ?  How does one interpret abnormal values ?

WHAT DOES STAISTICS COVER ? Planning Planning Design Design Execution (Data collection) Execution (Data collection) Data Processing Data Processing Data analysis Data analysis Presentation Presentation Interpretation Interpretation Publication Publication

BASIC CONCEPTS Data : Set of values of one or more variables recorded on one or more observational units Categories of data 1. Primary data: observation, questionnaire, record form, interviews, survey, 2. Secondary data: census, medical record,registry Sources of data 1. Routinely kept records 2. Surveys (census) 3. Experiments 4. External source

TYPES OF DATA  QUALITATIVE DATA  DISCRETE QUANTITATIVE  CONTINOUS QUANTITATIVE

QUALITATIVE Nominal Example: Sex ( M, F) Example: Sex ( M, F) Exam result (P, F) Exam result (P, F) Blood Group (A,B, O or AB) Blood Group (A,B, O or AB) Color of Eyes (blue, green, Color of Eyes (blue, green, brown, black) brown, black)

ORDINAL ORDINAL Example: Example: Response to treatment Response to treatment (poor, fair, good) (poor, fair, good) Severity of disease Severity of disease (mild, moderate, severe) (mild, moderate, severe) Income status (low, middle, Income status (low, middle, high) high)

QUANTITATIVE (DISCRETE) Example: The no. of family members Example: The no. of family members The no. of heart beats The no. of heart beats The no. of admissions in a day The no. of admissions in a day QUANTITATIVE (CONTINOUS) Example: Height, Weight, Age, BP, Serum Example: Height, Weight, Age, BP, Serum Cholesterol and BMI Cholesterol and BMI

Discrete data -- Gaps between possible values Continuous data -- Theoretically, no gaps between possible values Number of Children Hb

CONTINUOUS DATA CONTINUOUS DATA QUALITATIVE DATA QUALITATIVE DATA wt. (in Kg.) : under wt, normal & over wt. wt. (in Kg.) : under wt, normal & over wt. Ht. (in cm.): short, medium & tall Ht. (in cm.): short, medium & tall

Table 1 Distribution of blunt injured patients according to hospital length of stay

Scale of measurement Qualitative variable: A categorical variable Nominal (classificatory) scale - gender, marital status, race Ordinal (ranking) scale - severity scale, good/better/best

Scale of measurement Quantitative variable: A numerical variable: discrete; continuous Interval scale : Data is placed in meaningful intervals and order. The unit of measurement are arbitrary. - Temperature (37º C -- 36º C; 38º C-- 37º C are equal) and No implication of ratio (30º C is not twice as hot as 15º C)

Ratio scale: Data is presented in frequency distribution in logical order. A meaningful ratio exists. - Age, weight, height, pulse rate - pulse rate of 120 is twice as fast as 60 - person with weight of 80kg is twice as heavy as the one with weight of 40 kg.

Scales of Measure  Nominal – qualitative classification of equal value: gender, race, color, city  Ordinal - qualitative classification which can be rank ordered: socioeconomic status of families  Interval - Numerical or quantitative data: can be rank ordered and sizes compared : temperature  Ratio - Quantitative interval data along with ratio: time, age.

CLINIMETRICS A science called clinimetrics in which qualities are converted to meaningful quantities by using the scoring system. Examples: (1) Apgar score based on appearance, pulse, grimace, activity and respiration is used for neonatal prognosis. (2) Smoking Index: no. of cigarettes, duration, filter or not, whether pipe, cigar etc., (3) APACHE( Acute Physiology and Chronic Health Evaluation) score: to quantify the severity of condition of a patient

INVESTIGATION Data Colllection Data Presentation Tabulation Diagrams Graphs Descriptive Statistics Measures of Location Measures of Dispersion Measures of Skewness & Kurtosis Inferential Statistiscs Estimation Hypothesis Testing Ponit estimate Inteval estimate Univariate analysis Multivariate analysis

Frequency Distributions Frequency Distributions  data distribution – pattern of variability.  the center of a distribution  the ranges  the shapes  simple frequency distributions  grouped frequency distributions  midpoint

Patien t No Hb(g/dl) Hb(g/dl) Hb(g/dl) Tabulate the hemoglobin values of 30 adult male patients listed below

Steps for making a table Step1 Find Minimum (9.1) & Maximum (15.7) Step2 Calculate difference 15.7 – 9.1 = 6.6 Step3 Decide the number and width of the classes (7 c.l) , ,---- the classes (7 c.l) , ,---- Step4 Prepare dummy table – Hb (g/dl), Tally mark, No. patients Hb (g/dl), Tally mark, No. patients

Hb (g/dl)Tall marksNo. patients 9.0 – – – – – – – 15.9 Total Hb (g/dl) Tall marksNo. patients 9.0 – – – – – – – 15.9 l lll llll lll ll Total-30 DUMMY TABLE Tall Marks TABLE

Hb (g/dl)No. of patients 9.0 – – – – – – – Total30 Table Frequency distribution of 30 adult male patients by Hb

Table Frequency distribution of adult patients by Hb and gender: Hb and gender: Hb (g/dl) GenderTotal MaleFemale < – – – – – – – Total30 60

Elements of a Table Ideal table should have Number Title Column headings Foot-notes Number – Table number for identification in a report Title,place - Describe the body of the table, variables, Time period (What, how classified, where and when) Column - Variable name, No., Percentages (%), etc., Heading Foot-note(s) - to describe some column/row headings, special cells, source, etc.,

Table II. Distribution of 120 (Madras) Corporation divisions according to annual death rate based on registered deaths in 1975 and 1976 Figures in parentheses indicate percentages

DIAGRAMS/GRAPHS Discrete data --- Bar charts (one or two groups) --- Bar charts (one or two groups) Continuous data --- Histogram --- Histogram --- Frequency polygon (curve) --- Frequency polygon (curve) --- Stem-and –leaf plot --- Stem-and –leaf plot --- Box-and-whisker plot --- Box-and-whisker plot

Example data

Histogram Figure 1 Histogram of ages of 60 subjects

Polygon

Example data

Stem and leaf plot Stem-and-leaf of Age N = 60 Leaf Unit = (11)

Box plot

Descriptive statistics report: Boxplot - minimum score - maximum score - lower quartile - upper quartile - median - mean - the skew of the distribution: positive skew: mean > median & high-score whisker is longer negative skew: mean < median & low-score whisker is longer

The prevalence of different degree of Hypertension in the population Pie Chart Circular diagram – total -100% Divided into segments each representing a category Decide adjacent category The amount for each category is proportional to slice of the pie

Bar Graphs The distribution of risk factor among cases with Cardio vascular Diseases Heights of the bar indicates frequency Frequency in the Y axis and categories of variable in the X axis The bars should be of equal width and no touching the other bars

HIV cases enrolment in USA by gender Bar chart

HIV cases Enrollment in USA by gender Stocked bar chart

Graphic Presentation of Data the histogram (quantitative data) the bar graph (qualitative data) the frequency polygon (quantitative data)

General rules for designing graphs  A graph should have a self-explanatory legend  A graph should help reader to understand data  Axis labeled, units of measurement indicated  Scales important. Start with zero (otherwise // break)  Avoid graphs with three-dimensional impression, it may be misleading (reader visualize less easily

Any Questions

Origin and development of statistics in Medical Research  In 1929 a huge paper on application of statistics was published in Physiology Journal by Dunn.  In 1937, 15 articles on statistical methods by Austin Bradford Hill, were published in book form.  In 1948, a RCT of Streptomycin for pulmonary tb., was published in which Bradford Hill has a key influence.  Then the growth of Statistics in Medicine from 1952 was a 8-fold increase by 1982.