Categorical Data By Farrokh Alemi, Ph.D.

Slides:



Advertisements
Similar presentations
Displaying Data Objectives: Students should know the typical graphical displays for the different types of variables. Students should understand how frequency.
Advertisements

Analyzing Data: Bivariate Relationships Chapter 7.
PAIRED TEST Farrokh Alemi Ph.D.. Framework for Hypothesis Testing.
VARIANCE & STANDARD DEVIATION By Farrokh Alemi, Ph.D. This lecture is organized by Dr. Alemi and narrated by Yara Alemi. The lecture is based on the OpenIntro.
Chapter 14 Chi-Square Tests.  Hypothesis testing procedures for nominal variables (whose values are categories)  Focus on the number of people in different.
Displaying and Describing Categorical Data Chapter 3.
Two-Way Relative Frequency Tables
Displaying Categorical Data
Descriptive Statistics: Tabular and Graphical Methods
Second factor: education
Statistics 200 Lecture #7 Tuesday, September 13, 2016
CHAPTER 1 Exploring Data
Chapter 9: Non-parametric Tests
Inference about a population proportion.
2.2 Bar Charts, Pie Charts, and Stem and Leaf Diagram
Chapter 25 Comparing Counts.
Lesson – Teacher Notes Standard:
Displaying and Describing Categorical Data
Chapter 2 Describing Data: Graphs and Tables
Normal Distribution Farrokh Alemi Ph.D.
Analysis of two-way tables - Data analysis for two-way tables
CHAPTER 1 Exploring Data
Looking at Data - Relationships Data analysis for two-way tables
Chapter 11 Goodness-of-Fit and Contingency Tables
Log Linear Modeling of Independence
Probability Calculus Farrokh Alemi Ph.D.
Observations, Variables and Data Matrices
AP STATISTICS LESSON 4 – 3 ( DAY 1 )
Chapter 1 Data Analysis Section 1.1 Analyzing Categorical Data.
CHAPTER 6: Two-Way Tables
Stratification Matters: Analysis of 3 Variables
Test of Independence in 3 Variables
Relations in Categorical Data
Contingency Tables and Association
Chapter 1: Exploring Data
Warmup Which part- time jobs employed 10 or more of the students?
The Table Categorization
Comparing two Rates Farrokh Alemi Ph.D.
Expectation And Variance of Random Variables
CHAPTER 1 Exploring Data
1.1 Analyzing Categorical Data.
CHAPTER 1 Exploring Data
Good Morning AP Stat! Day #2
Categorical Data National Opinion Research Center’s General Social Survey In 2006 a sample of 1928 adults in the U.S. were asked the question “When is.
CHAPTER 1 Exploring Data
Wednesday, September 21, 2016 Farrokh Alemi, PhD.
Analyzing the Association Between Categorical Variables
Benchmarking Clinicians using Data Balancing
CHAPTER 1 Exploring Data
Chapter 26 Comparing Counts.
Section 4-3 Relations in Categorical Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Chapter 26 Comparing Counts Copyright © 2009 Pearson Education, Inc.
Inference for Two Way Tables
CHAPTER 1 Exploring Data
Displaying and Describing Categorical data
CHAPTER 1 Exploring Data
Chapter 26 Comparing Counts.
CHAPTER 1 Exploring Data
Two-Way Frequency Tables
CHAPTER 1 Exploring Data
Benchmarking Clinicians using Data Balancing
Categorical Data In Chapter 3 we introduced the idea of categorical data. In Chapter 15 we explored probability rules and when events are independent.
CHAPTER 1 Exploring Data
Displaying and Describing Categorical Data
CHAPTER 1 Exploring Data
Univariate Description
Presentation transcript:

Categorical Data By Farrokh Alemi, Ph.D. This lecture is organized by Dr. Alemi and narrated by Yara Alemi. The lecture is based on the OpenIntro Statistics book.

Count of Categories Nominal, ordinal and categorical variables can be analyzed by counting the frequency of occurrence of each category.

Frequency Table A frequency table counts the number of observations that fall within different categories of the variable. Here we have classified the severity of the comorbidities of the patient. We count how many patients fall within each category. For example, of the 3921 heart failure patients examined 545 had low severity of comorbidities.

Frequency Table A bar plot is a common way to display a frequency table for a single categorical variable. The categories are given on the x-axis and the frequency is given in the Y-axis.

Relative Frequency Table A relative frequency replaces the count of categories with the portion that fall within the category. This figure shows the relative frequency of severity of discharge comorbidities. The bar plot looks the same but the Y-axis has been changed with portion of the sample replacing counts of patients.

Contingency Table A table that counts the co-occurrences of two categorical data is called a contingency table. Here we have counted the number of patients with low. Medium and severe conditions at discharge that are readmitted to the hospital within 30 days.

Number with severe comorbidity & readmitted Contingency Table For example, 149 patients had both severe comorbidities at discharge and were readmitted to the hospital within 30 days. Number with severe comorbidity & readmitted

Row Proportions Row proportions can be calculated by dividing the value of the cell in the table by the row total. 149 / 367

Row Proportions This shows a contingency table with the count of patients in the table replaced with row proportions.

Row Proportions In this table, the total row is always one.

Row Proportions Now we can more easily see that among patients readmitted in 30 days 41 percent had severe comorbidities at discharge.

Column Proportions One can calculate column proportions by dividing each cell in the table by its column total count.

Column Proportions This yields 1 for the total row in each column. This is to say that 100% of the patients fall in the column total. By dividing by the total we are examining the portion of the patients that fall within different categories among the total patients falling in the column.

Column Proportions Now we can see that among patients with severe comorbidities, 27% are readmitted within 30 days. This provides evidence that knowledge of severity of comorbidities could help us predict which patient might be readmitted.

Take Home Lesson Categorical variables are examined by counting the portion that fall within them

Do One: Using the following data, (1) create bar plots of frequencies, (2) calculate the row and column proportions, (3) find the portion of patients with no comorbidities who were readmitted See if you can answer these questions. Pay attention to the last part of the question. To answer this part, do you need row or column proportions?