Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Probability SECTIONS 11.1, 11.2 Probability (11.1, 11.2) Odds, Odds Ratio.

Slides:



Advertisements
Similar presentations
Inferential Statistics
Advertisements

Conditional Probability Target Goal: I can find the probability of a conditional event. 5.3b h.w: p 329: odd, 73, 83, 91 – 95 odd.
1 Chapter 3 Probability 3.1 Terminology 3.2 Assign Probability 3.3 Compound Events 3.4 Conditional Probability 3.5 Rules of Computing Probabilities 3.6.
Processing physical evidence discovering, recognizing and examining it; collecting, recording and identifying it; packaging, conveying and storing it;
1 Probability Part 1 – Definitions * Event * Probability * Union * Intersection * Complement Part 2 – Rules Part 1 – Definitions * Event * Probability.
Categorical Data. To identify any association between two categorical data. Example: 1,073 subjects of both genders were recruited for a study where the.
Analysis of frequency counts with Chi square
Extension Article by Dr Tim Kenny
Probability. Probability Definitions and Relationships Sample space: All the possible outcomes that can occur. Simple event: one outcome in the sample.
MGMT 242 Fall, 1998 Probability Essentials Chapter 3 “A pinch of probably is worth a pound of perhaps.” James Thurber, American Humorist.
Calculating & Reporting Healthcare Statistics
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan Bayesian Inference SECTION 11.1, 11.2 Bayes rule (11.2) Bayesian inference.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan Probability SECTION 11.1 Events and, or, if Disjoint Independent Law of total.
Breast Screening. NHS Breast Screening Programme Introduced in 1988 Invites women from age group for screening every 3 yrs. Age extension roll-out.
Statistics: Unlocking the Power of Data Lock 5 1 in 8 women (12.5%) of women get breast cancer, so P(breast cancer if female) = in 800 (0.125%)
Statistics: Unlocking the Power of Data Lock 5 Inference for Proportions STAT 250 Dr. Kari Lock Morgan Chapter 6.1, 6.2, 6.3, 6.7, 6.8, 6.9 Formulas for.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Probability SECTIONS 11.1 Probability (11.1) Odds, odds ratio (not in book)
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 101 Dr. Kari Lock Morgan SECTION 4.1 Statistical test Null and alternative.
1 of 27 PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2013, Michael Kalsher Michael J. Kalsher Department of Cognitive Science Adv. Experimental.
6 Probability Chapter6 p Operations on events and probability An event is the basic element to which probability can be applied. Notations Event:
5.3A Conditional Probability, General Multiplication Rule and Tree Diagrams AP Statistics.
Ratios,Proportions and Rates MAE Course Measures of frequency The basic tools to describe quantitatively the causes and patterns of disease, or.
CHP400: Community Health Program - lI Research Methodology. Data analysis Hypothesis testing Statistical Inference test t-test and 22 Test of Significance.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Multiple Regression SECTIONS 10.1, 10.3 (?) Multiple explanatory variables.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Probability part 2 Chapter 7.
Introduction To Biological Research. Step-by-step analysis of biological data The statistical analysis of a biological experiment may be broken down into.
Chapter 11 Chi-Square Procedures 11.3 Chi-Square Test for Independence; Homogeneity of Proportions.
Measures of Association
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 12/4/12 Bayesian Inference SECTION 11.1, 11.2 More probability rules (11.1)
Previous Lecture: Data types and Representations in Molecular Biology.
Bayesian Inference I 4/23/12 Law of total probability Bayes Rule Section 11.2 (pdf)pdf Professor Kari Lock Morgan Duke University.
Copyright © 2009 Pearson Education, Inc LEARNING GOAL Interpret and carry out hypothesis tests for independence of variables with data organized.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Relationships Between Categorical Variables Chapter 6.
Probability. Statistical inference is based on a Mathematics branch called probability theory. If a procedure can result in n equally likely outcomes,
Chapter 3:Basic Probability Concepts Probability: is a measure (or number) used to measure the chance of the occurrence of some event. This number is between.
Chapter 12 A Primer for Inferential Statistics What Does Statistically Significant Mean? It’s the probability that an observed difference or association.
Probability Definition: Probability: the chance an event will happen. # of ways a certain event can occur # of possible events Probability =  Probability.
Please turn off cell phones, pagers, etc. The lecture will begin shortly.
1 RES 341 RESEARCH AND EVALUATION WORKSHOP 4 By Dr. Serhat Eren University OF PHOENIX Spring 2002.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Describing Data: Categorical Variables SECTIONS 2.1 One categorical variable.
Bayesian Inference, Review 4/25/12 Frequentist inference Bayesian inference Review The Bayesian Heresy (pdf)pdf Professor Kari Lock Morgan Duke University.
Lecture: Forensic Evidence and Probability Characteristics of evidence Class characteristics Individual characteristics  features that place the item.
Conditional Probability: the likelihood that an event will occur GIVEN that another event has already occurred. A two way table & tree diagrams can represent.
Please turn off cell phones, pagers, etc. The lecture will begin shortly. There will be a very easy quiz at the end of today’s lecture.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan SECTION 7.1 Testing the distribution of a single categorical variable : χ.
BASIC STATISTICAL CONCEPTS Chapter Three. CHAPTER OBJECTIVES Scales of Measurement Measures of central tendency (mean, median, mode) Frequency distribution.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 11/29/12 Probability SECTION 11.1 Events and, or, if Disjoint Independent.
AP Statistics Notes Chapter 14 and 15.
Section Conditional Probability Objectives: 1.Understand the meaning of conditional probability. 2.Learn the general Multiplication Rule:
Postgraduate books recommended by Degree Management and Postgraduate Education Bureau, Ministry of Education Medical Statistics (the 2nd edition) 孙振球 主.
Results: How to interpret and report statistical findings Today’s agenda: 1)A bit about statistical inference, as it is commonly described in scientific.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Multiple Regression SECTIONS 10.1, 10.3 Multiple explanatory variables (10.1,
Probability and odds Suppose we a frequency distribution for the variable “TB status” The probability of an individual having TB is frequencyRelative.
Statistics: Unlocking the Power of Data Lock 5 Inference for Means STAT 250 Dr. Kari Lock Morgan Sections 6.4, 6.5, 6.6, 6.10, 6.11, 6.12, 6.13 t-distribution.
Copyright ©2011 Brooks/Cole, Cengage Learning Relationships Between Categorical Variables – Risk Class 26 1.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan SECTION 7.2 χ 2 test for association (7.2) Testing for an Association between.
Conditional Probability If two events are not mutually exclusive, the fact that we know that B has happened will have an effect on the probability of A.
Probability 4/18/12 Probability Conditional probability Disjoint events Independent events Section 11.1 (pdf)pdf Professor Kari Lock Morgan Duke University.
HL2 Math - Santowski Lesson 93 – Bayes’ Theorem. Bayes’ Theorem  Main theorem: Suppose we know We would like to use this information to find if possible.
Chapter 11 Sequences, Induction, and Probability Copyright © 2014, 2010, 2007 Pearson Education, Inc Probability.
Probability Probability Day 3 Introduction to Probability Probability of Independent Events.
Statistics 200 Lecture #7 Tuesday, September 13, 2016
Lecture: Forensic Evidence and Probability Characteristics of evidence
Copyright © 2014, 2010, 2007 Pearson Education, Inc.
Conditional Probability and the Multiplication Rule
Breast Cancer Screening in High-Risk Men: A 12-Year Longitudinal Observational Study of Male Breast Imaging Utilization and Outcomes Mammography screening.
Presentation transcript:

Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Probability SECTIONS 11.1, 11.2 Probability (11.1, 11.2) Odds, Odds Ratio (not in book)

Statistics: Unlocking the Power of Data Lock 5 Event An event is something that either happens or doesn’t happen, or something that either is true or is not true Examples: You get cancer A randomly selected person is obese A particular mutation occurs It snows tonight

Statistics: Unlocking the Power of Data Lock 5 Probability The probability of event A, P(A), is the probability that A will happen Probability always refers to an event Probability is always between 0 and 1 P(A) = 1 means A will definitely happen P(A) = 0 means A will definitely not happen

Statistics: Unlocking the Power of Data Lock 5 Questions of the Day How common is breast cancer? How does breast cancer risk depend on other variables?

Statistics: Unlocking the Power of Data Lock 5

Ways of Expressing Probability 1 in 8 women will get breast cancer 1/8 of women will get breast cancer The proportion of women who will get breast cancer is 1/8, or % of women will get breast cancer The probability of breast cancer for a female is These statements are all saying the same thing

Statistics: Unlocking the Power of Data Lock 5 Probability statements can be directly translated into a relative frequency table: Relative Frequency Table Get Breast CancerDo not Get Breast CancerTOTAL

Statistics: Unlocking the Power of Data Lock 5 Probability statements can also be translated into a frequency table, although unlike data description, the total is arbitrary: Either of these are equally valid for probability calculations! Frequency Table Get Breast CancerDo not Get Breast CancerTOTAL 178 Get Breast CancerDo not Get Breast CancerTOTAL

Statistics: Unlocking the Power of Data Lock 5 Odds If p denotes the probability of an event, the odds are defined as Interpreting odds  Odds of 1 indicate 50/50  p < 0.5 yield odds < 1  p > 0.5 yield odds > 1 Odds of 3, or 3:1, mean that out of 4 times, we would expect the variable to be in that category 3 times and out of that category 1 time

Statistics: Unlocking the Power of Data Lock 5 Breast Cancer Odds Commonly expressed as 1:7 or 1/7 The odds of a woman getting breast cancer are 1:7 For every one woman who will get breast cancer, 7 women will not.

Statistics: Unlocking the Power of Data Lock 5 Conditional Probability P(A if B) is the probability of A, if we know B has happened or is true This is read in multiple ways: “probability of A if B” “probability of A given B” “probability of A conditional on B” You may also see this written as P(A | B)

Statistics: Unlocking the Power of Data Lock 5 Conditional Probability The probability and odds that we calculated are restricted only to females, so we are implicitly conditioning on the fact that gender is female. For females, what’s the probability of getting breast cancer? What proportion of women will get breast cancer? Conditional probability is the probability of an event, conditional on (or given) that another variable takes a specific value (gender = female)

Statistics: Unlocking the Power of Data Lock 5 P(survival if advanced stage) = 0.27 P(survival if early detection) = 0.98

Statistics: Unlocking the Power of Data Lock 5 What does 0.43% represent? a) P(breast cancer if 30 – 39 years old) b) P(30 – 39 years old if breast cancer)

Statistics: Unlocking the Power of Data Lock 5 What does this tell us? a) P(breast cancer if first-degree relative) b) P(first degree relative if breast cancer)

Statistics: Unlocking the Power of Data Lock 5 What does this tell us? a) P(breast cancer if no family history) b) P(no family history if breast cancer)

Statistics: Unlocking the Power of Data Lock 5 What does this tell us? a) P(family history if breast cancer) = 0.15 b) P(breast cancer if family history) = 0.15 c) P(breast cancer if no family history) = 0.15

Statistics: Unlocking the Power of Data Lock 5 These are given as facts, but clearly there is some uncertainty! (or different populations being studied…)

Statistics: Unlocking the Power of Data Lock 5 P(breast health routine if female) = P(breast health routine if male) = 0.212

Statistics: Unlocking the Power of Data Lock 5 1 in 8 women (12.5%) of women get breast cancer, so P(breast cancer if female) = in 800 (0.125%) of men get breast cancer, so P(breast cancer if male) =

Statistics: Unlocking the Power of Data Lock 5 Conditional Probability P(breast cancer if female) = What’s P(female if breast cancer)? P(A if B) is NOT the same as P(B if A)!!! This is an important point, and something that is easily confused. Be careful with conditioning! 0.99

Statistics: Unlocking the Power of Data Lock 5 Odds Ratio The odds ratio (OR) is the ratio of the odds in one group to the odds in the other group: Odds ratios of 1 indicate no difference between the groups (no relationship between the two variables)

Statistics: Unlocking the Power of Data Lock 5 Odds Ratio

Statistics: Unlocking the Power of Data Lock 5 Cats and Schizophrenia

Statistics: Unlocking the Power of Data Lock 5 1% of women at age 40 who participate in routine screening have breast cancer. 80% of women with breast cancer get positive mammographies. 9.6% of women without breast cancer get positive mammographies. A 40-year old woman participates in routine screening and has a positive mammography. What’s the probability she has cancer? Breast Cancer Screening

Statistics: Unlocking the Power of Data Lock 5 A 40-year old woman participates in routine screening and has a positive mammography. What’s the probability she has cancer? a) 0-10% b) 10-25% c) 25-50% d) 50-75% e) % Breast Cancer Screening

Statistics: Unlocking the Power of Data Lock 5 Breast Cancer Screening A 40-year old woman participates in routine screening and has a positive mammography. What’s the probability she has cancer? What is this asking for? a)P(cancer if positive mammography) b)P(positive mammography if cancer) c)P(positive mammography if no cancer) d)P(positive mammography) e)P(cancer)

Statistics: Unlocking the Power of Data Lock 5 CancerCancer-free Positive Result Negative Result If we randomly pick a ball from the Cancer bin, it’s more likely to be red/positive. If we randomly pick a ball the Cancer-free bin, it’s more likely to be green/negative. Everyone We randomly pick a ball from the Everyone bin. C C C C C FFFFFFFFFFFF FFFFFFFFFFF FFFFFFFFFF FFFFFFFFF FFFFFFFF FFFFFFF FFFFFF FFFFF If the ball is red/positive, is it more likely to be from the Cancer or Cancer-free bin?

Statistics: Unlocking the Power of Data Lock 5 100,000 women in the population 1% 1000 have cancer99,000 cancer-free 99% 80%20% 800 test positive 200 test negative 9.6%90.4% 9,504 test positive 89,496 test negative Thus, 800/(800+9,504) = 7.8% of positive results have cancer (you could also get this by creating a two-way table)

Statistics: Unlocking the Power of Data Lock 5

To Do Do HW 2.1 (due Friday, 9/25)