M28- Categorical Analysis 1  Department of ISM, University of Alabama, 1992-2003 Categorical Data.

Slides:



Advertisements
Similar presentations
AP Statistics Course Review.
Advertisements

Probability Unit 3.
Basic Statistics The Chi Square Test of Independence.
© 2002 Prentice-Hall, Inc.Chap 4-1 Statistics for Managers Using Microsoft Excel (3 rd Edition) Chapter 4 Basic Probability and Discrete Probability Distributions.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 4-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
BCOR 1020 Business Statistics Lecture 6 – February 5, 2007.
Chapter 4 Using Probability and Probability Distributions
CHAPTER 1 Exploring Data 1.1 Analyzing Categorical Data.
Chapter 4 Basic Probability
Visualizing Events Contingency Tables Tree Diagrams Ace Not Ace Total Red Black Total
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 4-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
CHAPTER 1 Exploring Data
Statistical Analysis Pedro Flores. Conditional Probability The conditional probability of an event B is the probability that the event will occur given.
CEEN-2131 Business Statistics: A Decision-Making Approach CEEN-2130/31/32 Using Probability and Probability Distributions.
Copyright ©2011 Pearson Education 4-1 Chapter 4 Basic Probability Statistics for Managers using Microsoft Excel 6 th Global Edition.
Chapter 4 Basic Probability
Probability and Probability Distributions
Chapter 4 Probability See.
Analyzing Categorical Data Categorical data is data divided in categories and each category has an associated value Ways to display categorical data: Bar.
10/3/20151 PUAF 610 TA Session 4. 10/3/20152 Some words My –Things to be discussed in TA –Questions on the course and.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Theory of Probability Statistics for Business and Economics.
Statistical Analysis Topic – Math skills requirements.
Copyright ©2014 Pearson Education Chap 4-1 Chapter 4 Basic Probability Statistics for Managers Using Microsoft Excel 7 th Edition, Global Edition.
Using Probability and Discrete Probability Distributions
Warm-Up List all of the different types of graphs you can remember from previous years:
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 4-1 Chapter 4 Basic Probability Business Statistics: A First Course 5 th Edition.
Copyright © 2009 Pearson Education, Inc LEARNING GOAL Interpret and carry out hypothesis tests for independence of variables with data organized.
Chap 4-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 4 Using Probability and Probability.
Handout week 1 course Renske Doorenspleet 1 Chapter 1 -A. The role of statistics in the research process -B. Statistical applications -C. Types of variables.
Chapter 1: Exploring Data Sec. 1.1 Analyzing Categorical Data.
Statistics: Analyzing 2 Categorical Variables MIDDLE SCHOOL LEVEL  Session #1  Presented by: Dr. Del Ferster.
M21- Scatterplots 1  Department of ISM, University of Alabama, Lesson Objectives  Learn to visually assess the relationship between two quantitative.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 1 Exploring Data 1.0 Introduction Data Analysis:
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
The two way frequency table The  2 statistic Techniques for examining dependence amongst two categorical variables.
 Some variables are inherently categorical, for example:  Sex  Race  Occupation  Other categorical variables are created by grouping values of a.
Basic Business Statistics Assoc. Prof. Dr. Mustafa Yüzükırmızı
Aim: How do we analyze data with a two-way table?
Copyright © Cengage Learning. All rights reserved. 4 Probability.
Statistical Analysis Topic – Math skills requirements.
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved. Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and.
Correlation/Regression - part 2 Consider Example 2.12 in section 2.3. Look at the scatterplot… Example 2.13 shows that the prediction line is given by.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 4-1 Chapter 4 Basic Probability Basic Business Statistics 11 th Edition.
1 M04- Graphical Displays 2  Department of ISM, University of Alabama, 2003 Graphical Displays of Data.
Statistics.  Probability experiment: An action through which specific results (counts, measurements, or responses) are obtained.  Outcome: The result.
Chapter 4 Probability Concepts Events and Probability Three Helpful Concepts in Understanding Probability: Experiment Sample Space Event Experiment.
+ Warm Up Which of these variables are categorical? Which are quantitative?
2.5 Additive Rules: Theorem 2.10: If A and B are any two events, then: P(A  B)= P(A) + P(B)  P(A  B) Corollary 1: If A and B are mutually exclusive.
5-Minute Check on section 7-1a Click the mouse button or press the Space Bar to display the answers. Convert these statements into discrete probability.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 4-1 Chapter 4 Basic Probability Business Statistics: A First Course 5 th Edition.
AP Stats Review day 1 April 2, Basics Two Parts (90 Minutes each part) – 40 Multiple Choice Content Questions (10-15) Calculation Questions(25-30)
+ Chapter 1: Exploring Data Section 1.1 Analyzing Categorical Data The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Chap 4-1 Chapter 4 Using Probability and Probability Distributions.
Compilation of student responses on last Wednesday’s warm up “Statistics is…” The larger the word, the more often it was used in a student’s definition.
Copyright © 2009 Pearson Education, Inc LEARNING GOAL Interpret and carry out hypothesis tests for independence of variables with data organized.
4.5 through 4.9 Probability continued…. Today’s Agenda Go over page 158 (49 – 52, 54 – 58 even) Go over 4.5 and 4.6 notes Class work: page 158 (53 – 57.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Probability Distributions Chapter 6.
1.1 ANALYZING CATEGORICAL DATA. FREQUENCY TABLE VS. RELATIVE FREQUENCY TABLE.
Howard Community College
MATH-138 Elementary Statistics
Chapter 4 Using Probability and Probability Distributions
Chapter 4 Basic Probability.
Business Statistics Topic 4
Warmup Which part- time jobs employed 10 or more of the students?
Chapter 2 Looking at Data— Relationships
Elementary Statistics 8th Edition
Presentation transcript:

M28- Categorical Analysis 1  Department of ISM, University of Alabama, Categorical Data

M28- Categorical Analysis 2  Department of ISM, University of Alabama, Lesson Objective  Understand basic rules of probability.  Calculate marginal and conditional probabilities.  Determine if two categorical variables are independent.

M28- Categorical Analysis 3  Department of ISM, University of Alabama,   Recall Rule of Thumb: Quantitative variables: averages or differences have meaning. Ex: weight, height, income, age

M28- Categorical Analysis 4  Department of ISM, University of Alabama,   Recall Rule of Thumb: Categorical variables: classify people or things. Ex: gender, race, occupation, political affiliation, country of origin

M28- Categorical Analysis 5  Department of ISM, University of Alabama, Note: Sometimes quantitative variables are expressed as categorical. Income (Family Economic Income) : Class Definition 1. Less than $30, $30,000 but less than $100, $100,000 or more.

M28- Categorical Analysis 6  Department of ISM, University of Alabama, Relationships between variables

M28- Categorical Analysis 7  Department of ISM, University of Alabama, Relationship between two quantitative variables? Is relationship linear (scatterplot)?  Use Correlation &  Least Squares Regression.   Data transformations.

M28- Categorical Analysis 8  Department of ISM, University of Alabama, Best graphical tool for examining the relationship between a quantitative variable and a categorical variable, (i.e., comparing distributions). Recall: Boxplots USFar EastEurope Weight “Do the distributions of weights vary for different countries of origin?” Example: Weight vs. Country of Origin Boxplot can be used to answer:

M28- Categorical Analysis 9  Department of ISM, University of Alabama, Relationship between two categorical variables? Use two-way frequency tables: Look at marginal probabilities and conditional probabilities.

10 Data M28- Categorical Data  Department of ISM, University of Alabama, STATISTICSSTATISTICS is the science of transforming data into information to make decisions in the face of uncertainty.

M28- Categorical Analysis 11  Department of ISM, University of Alabama, A numerical measure of the likelihood that an outcome or an event occurs. P(A) = probability of event A Probability How do we measure "uncertainty"?

M28- Categorical Analysis 12  Department of ISM, University of Alabama, Three Methods for Assessing Probability  Classical  Relative Frequency  Subjective

M28- Categorical Analysis 13  Department of ISM, University of Alabama, P(A) = 0  impossible event P(A) = 1  certain event 2. Sum of the probabilities of all possible outcomes must equal 1. (Binomial, Poisson) 1.0 < P(A) < 1 _ _ Probability requirements for discrete variables:

M28- Categorical Analysis 14  Department of ISM, University of Alabama, Conditional probability: The chance one event happens, given that another event will occur. P(A | B) = P(A and B) P(B) All outcomes belonging to BOTH A AND B Those outcomes in the restricted group, B =

M28- Categorical Analysis 15  Department of ISM, University of Alabama, Problem: Credit Card Manager New credit test to determine credit worthiness. Credit test checked against 500 previous customers.

M28- Categorical Analysis 16  Department of ISM, University of Alabama, Passed (P) Failed (F) Good (G) Default (D) Credit Test A Credit History

M28- Categorical Analysis 17  Department of ISM, University of Alabama, P(D)  What is the probability of a customer defaulting given that he fails test A? What is the probability of a customer defaulting? P(D | F)  P(Defaults given failed test A) = P(Defaults) = PF G D

M28- Categorical Analysis 18  Department of ISM, University of Alabama, General Rules: P(A and B) = P(A)  P(B|A) = P(B)  P(A|B) P(A or B) = P(A) + P(B) - P(A and B)

M28- Categorical Analysis 19  Department of ISM, University of Alabama, P(Fails AND Defaults) = P(F)  P(D|F) PF G D

M28- Categorical Analysis 20  Department of ISM, University of Alabama, P(Fails OR Defaults) = P(F) + P(D)  -  P(D AND F) Note: The “overlap” group would be counted twice if no subtraction PF G D

M28- Categorical Analysis 21  Department of ISM, University of Alabama, Does knowledge of “test A result” help you make a better decision? P(D)  P(D | F)  Do you want to know the test A results before you give the loan? “Credit test A results” and “defaulting” are ____________ on each other.

M28- Categorical Analysis 22  Department of ISM, University of Alabama, A “Newer” Credit Test. Is it even better? A different sample of 500 credit records

M28- Categorical Analysis 23  Department of ISM, University of Alabama, Passed (P) Failed (F) Good (G) Default (D) Credit Test B Credit History

M28- Categorical Analysis 24  Department of ISM, University of Alabama, P(D)  What is the probability of a customer defaulting given that he fails test B? What is the probability of a customer defaulting? P(D | F)  P(Defaults given failed test B) = P(Defaults) = PF G D

M28- Categorical Analysis 25  Department of ISM, University of Alabama, Does knowledge of “test B result” help you make a better decision? P(D)  P(D | F)  Test B tells me. “Credit test B results” and “defaulting” are of each other.

M28- Categorical Analysis 26  Department of ISM, University of Alabama, Independence

M28- Categorical Analysis 27  Department of ISM, University of Alabama, Two events are independent if the occurrence, or non-occurrence, of one does not affect the chances of the other occurring, or not occurring. Otherwise, we say the events are dependent.

M28- Categorical Analysis 28  Department of ISM, University of Alabama, independent If A and B independent, then P(A and B) = P(A)  P(B) P(A or B) = P(A) + P(B) - P(A)  P(B) P(A|B) = P(A) P(B|A) = P(B) Note: The condition does NOT change the probability.

M28- Categorical Analysis 29  Department of ISM, University of Alabama, Survey of randomly selected people voters in Jan. 2001: Q1: Did you vote in the 2000 election? Q2: Do you favor an amendment to require a balanced budget? Q3: To which political party do you belong ?

M28- Categorical Analysis 30  Department of ISM, University of Alabama, Political Party: Republican Democrat Other Total Do you favor amendment for a balanced budget? Yes No Total

Sample size Republican Democrat Other Total Party: Favor amendment Yes No Total Marginal totals for opinion. Marginal totals for Party.

What proportion favor the amend.? What proportion claim to be Rep? and What proportion favor the amend. and are Other? Yes No Total Party Favor amend Repub Demo Other Total

What proportion favor the amend, given those that claim to be Rep? Of those that claim to be Democrat, what proportion favor the amend. Considering only those opposed, what proportion are not Republican? Yes No Total Party Favor amend Repub Demo Other Total

M28- Categorical Analysis 34  Department of ISM, University of Alabama, Restrict subjects to only those that meet a condition. Within this restricted group, what is the distribution of some other var.? Distribution of “opinion” given those that claim to be Republican: P( Yes | Rep. ) =.523 P( No | Rep. ) = “given that” Conditional Distribution:

M28- Categorical Analysis 35  Department of ISM, University of Alabama, Is there a relationship between the party and the opinion on the amendment? What would you expect to happen if no relationship existed?

M28- Categorical Analysis 36  Department of ISM, University of Alabama, Three Conditional Distributions: P( Yes | Rep.) =.523, P( No | Rep.) = P( Yes | Demo) =.297, P( No | Demo) = P( Yes | Other) =.600, P( No | Other) = Marginal Distribution: P( Yes ) =.455, P( No ) =.545 Is there a relationship? Why? or Why not?

M28- Categorical Analysis 37  Department of ISM, University of Alabama, If there is NO relationship (i.e., independence) between the party and the opinion, then “the three conditional probabilities should be the close to each other and close to the marginal probability.”

M28- Categorical Analysis 38  Department of ISM, University of Alabama, Three Conditional Probabilities: P( Yes | Rep.) =.523 P( Yes | Demo) =.297 P( Yes | Other) =.600 Marginal Probability: P( Yes ) =.455 Not close; therefore, party” and the “opinion” are Not close; therefore, “party” and the “opinion” are ____________. Are these close to each other? AND close to the “marginal”?

M28- Categorical Analysis 39  Department of ISM, University of Alabama, Visual Displays Create with “Pivot Tables” in Excel.

M28- Categorical Analysis 40  Department of ISM, University of Alabama, Rep. Demo. Other Barchart- Clustered Frequency Yes

M28- Categorical Analysis 41  Department of ISM, University of Alabama, Rep. Demo. Other Barchart- Stacked Frequency Yes

M28- Categorical Analysis 42  Department of ISM, University of Alabama, Rep. Demo. Other Barchart- Percents Percent Yes

M28- Categorical Analysis 43  Department of ISM, University of Alabama, Summary For two categorical variables:  Must use conditional probabilities to determine if a relationship exists.  Cannot use correlation.  Visual display: Stacked percentage bar charts

M28- Categorical Analysis 44  Department of ISM, University of Alabama, Quant. vs. Quant numerical graphical LS regression line, r, r-sq, std error Scatterplot, residual plots X-bar and s for each category Side-by-side box plots Two-way table, conditional & marginal distributions Bar chart : stacked, percent. Cat. vs. Cat. Quant. vs. Cat. Variables Associations between TWO Variables

M28- Categorical Analysis 45  Department of ISM, University of Alabama, The End