O RGANIZING & DISPLAYING DATA Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.

Slides:



Advertisements
Similar presentations
Comparing Two Proportions (p1 vs. p2)
Advertisements

Copyright ©2005 Brooks/Cole, a division of Thomson Learning, Inc. Relationships Between Categorical Variables Chapter 12.
Analysis of frequency counts with Chi square
Extension Article by Dr Tim Kenny
Understanding real research 3. Assessment of risk.
ChiSq Tests: 1 Chi-Square Tests of Association and Homogeneity.
Module 2: Data analysis key concepts. Module 2: Learning objectives  Understand the definition and purpose of data analysis  Define statistical and.
CHAPTER 2 Basic Descriptive Statistics: Percentages, Ratios and rates, Tables, Charts and Graphs.
Alok Srivastava Chapter 2 Describing Data: Graphs and Tables Basic Concepts Frequency Tables and Histograms Bar and Pie Charts Scatter Plots Time Series.
Understanding study designs through examples Manish Chaudhary MPH (BPKIHS)
CHAPTER 2 Basic Descriptive Statistics: Percentages, Ratios and rates, Tables, Charts and Graphs.
Are exposures associated with disease?
The Bahrain Branch of the UK Cochrane Centre In Collaboration with Reyada Training & Management Consultancy, Dubai-UAE Cochrane Collaboration and Systematic.
Chapter 10 Analyzing the Association Between Categorical Variables
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.3 Determining.
Medical Statistics (full English class) Ji-Qian Fang School of Public Health Sun Yat-Sen University.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.3 Determining.
Categorical Variables, Relative Risk, Odds Ratios STA 220 – Lecture #8 1.
Absolute, Relative and Attributable Risks. Outcomes or differences that we are interested in:  Differences in means or proportions  Odds ratio (OR)
INTRODUCTION TO EPIDEMIOLO FOR POME 105. Lesson 3: R H THEKISO:SENIOR PAT TIME LECTURER INE OF PRESENTATION 1.Epidemiologic measures of association 2.Study.
STA 2023 Chapter 1 Notes. Terminology  Data: consists of information coming from observations, counts, measurements, or responses.  Statistics: the.
Ana Jerončić, PhD Department for Research in Biomedicine and Health.
Objectives (BPS chapter 1)
Stat 1080 “Elementary Probability and Statistics” By Dr. AFRAH BOSSLY
Analysis of Categorical Data
Ratios,Proportions and Rates MAE Course Measures of frequency The basic tools to describe quantitatively the causes and patterns of disease, or.
Copyright © 2012 by Nelson Education Limited.2-1 Chapter 2 Basic Descriptive Statistics: Percentages, Ratios and Rates, Tables, Charts, and Graphs.
PU515 – Applied Biostatistics Dana Colbert-Wheeler, MHA, MCHES
Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 14 Screening and Prevention of Illnesses and Injuries: Research Methods.
Estimation of Various Population Parameters Point Estimation and Confidence Intervals Dr. M. H. Rahbar Professor of Biostatistics Department of Epidemiology.
Please turn off cell phones, pagers, etc. The lecture will begin shortly.
Measures of Association
S TATISTICS Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Descriptive statistics I Distributions, summary statistics.
Understanding real research 4. Randomised controlled trials.
S TATISTICS Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
S TATISTICS Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
R Programming Risk & Relative Risk 1. Session 2 Overview 1.Risk 2.Relative Risk 3.Percent Increase/Decrease Risk 2.
MBP1010 – Lecture 8: March 1, Odds Ratio/Relative Risk Logistic Regression Survival Analysis Reading: papers on OR and survival analysis (Resources)
Statistical test for Non continuous variables. Dr L.M.M. Nunn.
Please turn off cell phones, pagers, etc. The lecture will begin shortly.
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
Relative Values. Statistical Terms n Mean:  the average of the data  sensitive to outlying data n Median:  the middle of the data  not sensitive to.
Aim: How do we analyze data with a two-way table?
1 Chapter 11: Analyzing the Association Between Categorical Variables Section 11.1: What is Independence and What is Association?
PU 515 Midterm Review Dana Colbert-Wheeler, MHA, MCHES Instructor, Kaplan Health Sciences.
Stat 100 Feb 11 Read Chapter 12, try 1-9. Problem 14 of Chapter 12 Case control study: 239 lung cancer patients and 429 controls 98 of the lung cancer.
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
Risks & Odds Professor Kate O’Donnell. When talking about the chance of something happening, e.g. death, hip fracture, we can talk about: risk and relative.
Postgraduate books recommended by Degree Management and Postgraduate Education Bureau, Ministry of Education Medical Statistics (the 2nd edition) 孙振球 主.
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
Ch1 Larson/Farber 1 1 Elementary Statistics Larson Farber Introduction to Statistics As you view these slides be sure to have paper, pencil, a calculator.
Probability and odds Suppose we a frequency distribution for the variable “TB status” The probability of an individual having TB is frequencyRelative.
Chapter 3 Descriptive Statistics for Qualitative Data.
Fall 2002Biostat Inference for two-way tables General R x C tables Tests of homogeneity of a factor across groups or independence of two factors.
Handbook for Health Care Research, Second Edition Chapter 11 © 2010 Jones and Bartlett Publishers, LLC CHAPTER 11 Statistical Methods for Nominal Measures.
Methods of quantifying disease Stuart Harris Public Health Intelligence Analyst Course – Day 3.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
2 3 انواع مطالعات توصيفي (Descriptive) تحليلي (Analytic) مداخله اي (Interventional) مشاهده اي ( Observational ) كارآزمايي باليني كارآزمايي اجتماعي كارآزمايي.
Chapter 2. **The frequency distribution is a table which displays how many people fall into each category of a variable such as age, income level, or.
Epidemiological measureas. How do we determine disease frequency for a population?
Descriptive statistics (2)
Statistics 200 Lecture #7 Tuesday, September 13, 2016
The binomial applied: absolute and relative risks, chi-square
Chapter 2 Describing Data: Graphs and Tables
Warmup Which part- time jobs employed 10 or more of the students?
Introduction & 1.1: Analyzing categorical data
Interpreting Basic Statistics
Measurements of Risk & Association …
Basic measurements in Demography
Presentation transcript:

O RGANIZING & DISPLAYING DATA Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics

S UMMARIZING AND DISPLAYING BINARY AND CATEGORICAL DATA Binary data: data which takes two values. Example of binary data is yes/no, … etc Summary of binary: First step in analyzing categorical data is to count the number of observations in each category Express them in proportions of the total sample size. Then calculate the following metrics : ratio, proportion, percentage, risk and rates.

Ratio: is simply one number divided by another. Example : Weight of person (in kg) and height (in meters) then the ratio of weight and height is Body Mass Index. Proportions: are ratio of counts where the numerator (the top number) is subset of the denominator ( the bottom number). Example: in study of 50 patients, 30 are depressed, the the proportion is 30/50 or 0.6. Percentage : is the proportion multiplied by * 100 = 60% of the patients are depressed. Risk : is a proportion if the numerator counts events which happens prospectively (likely ) Example: 300 students start nursing school and 15 drop out before final, the risk of dropping is 15/300=0.05 or 5%.

Rates: it is proportion where time period is attached to it. Example people die in one year out of population of the death rate is / = 0.01 death per person per year. This known as crude death rate ( ignoring important factors like age).

Type of delivery FrequencyPercentageGender Male n (%) Female n (%) Standard Vaginal delivery (33)23 (43) Assisted vaginal delivery10 4 (9)6 (11) Elective caesarean section884 (9)4 (8) Emergency caesarean section13 6 (13)7 (13 ) Emergency caesarean section not in labour (36)13 (25) Total (100)53(100) Distribution of modes of delivery within gender

Contingency table OutcomeTreatment TestControl PositiveAB NegativeCD A+CB+D

C OMPARING BINARY DATA We use the summary measures to show how the groups differ. By using the difference in proportions and relative risk Proportion for positive outcome is

The difference in proportion is In prospective (likely) studies the proportion is known as a risk. If we ignore the sign we got the absolute risk difference (ARD) If we anticipate that the treatment to reduce some bad outcome (negative) (such as death) then it may be known as absolute risk reduction (ARR)

Risk Ratio or relative risk (RR) Inverse of ARD is the number needed to treat/harm

E XAMPLE SUMMARIZING RESULTS OF S MOKING CESSATION Stopped Smoking Nurse led InterventionUsual care (Control group) N%N% Yes No Smoking cessation rates at one year in the coronary heart disease Proportion values are 57/100 = 57% and 44/118 = 37% and the difference is 20%. If we started with 100 women we expect 20 patients between two groups.

Relative Risk is 57/37=1.5 this is the risk of stopping smoking (a good thing) with the intervention compared to the control group. Thus patients with coronary heart disease are 1.5 times more likely to stop smoking in the intervention group than the control group.

S UMMARIZING BINARY DATA - ODDS AND ODDS RATIO Instead of using the probability we use the odds which defined as the ratio of the probability of occurrence of the event to the probability of non- occurrence

From the contingency table we can find the odds ratio When the probability of an event happened is rare, the odds and probability are close, because a is much smaller than c. Thus OR approximate RR when the success are rare.

C OMPARISON OF RR AND OR FOR DIFFERENT BASE LINE P(test)P(control)RRORRR and OR Close Close Not close Not close Close Close

E XAMPLE (2437 RANDOM SAMPLE, 4 YEARS ) PsychosisCannabisTotal YesNo Yes No The risk of non-cannabis users is 342/2117 = 0.16 while the risk of cannabis users is 82/320 =0.26 Thus the relative risk of psychosis is 0.26/0.16= or an increased risk of 62.5% (1/1.6525) The odds ratio of psychosis for cannabis smokers is (82 * 1175 )/ (238 * 342 ) = 1.79, which is close to relative risk. Because the chance of getting psychosis is reasonably small. The OR of not developing psychosis is 1/1.79 = 0.56.

D ISPLAYING CATEGORICAL DATA Categorical data can be displayed either by bar charts or pie chart