CHAPTER 3 Key Principles of Statistical Inference.

Slides:



Advertisements
Similar presentations
Inferential Statistics
Advertisements

Chapter 9 Hypothesis Testing Understandable Statistics Ninth Edition
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Statistical Issues in Research Planning and Evaluation
Inferential Statistics & Hypothesis Testing
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
Review: What influences confidence intervals?
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Behavioural Science II Week 1, Semester 2, 2002
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
Chapter 9 Hypothesis Testing.
PSY 307 – Statistics for the Behavioral Sciences
Today Concepts underlying inferential statistics
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Richard M. Jacobs, OSA, Ph.D.
Inferential Statistics
(Medical) Diagnostic Testing. The situation Patient presents with symptoms, and is suspected of having some disease. Patient either has the disease or.
Testing Hypotheses.
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
AM Recitation 2/10/11.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Hypothesis Testing:.
Overview of Statistical Hypothesis Testing: The z-Test
Overview Definition Hypothesis
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Inference for Proportions(C18-C22 BVD) C19-22: Inference for Proportions.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
CENTRE FOR INNOVATION, RESEARCH AND COMPETENCE IN THE LEARNING ECONOMY Session 2: Basic techniques for innovation data analysis. Part I: Statistical inferences.
Statistical Analysis Statistical Analysis
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Estimation of Statistical Parameters
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
Making decisions about distributions: Introduction to the Null Hypothesis 47:269: Research Methods I Dr. Leonard April 14, 2010.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 10. Hypothesis Testing II: Single-Sample Hypothesis Tests: Establishing the Representativeness.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Issues concerning the interpretation of statistical significance tests.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Chapter 9: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Type I and II Errors Testing the difference between two means.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Review I A student researcher obtains a random sample of UMD students and finds that 55% report using an illegally obtained stimulant to study in the past.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
© Copyright McGraw-Hill 2004
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University.
Chapter 13 Understanding research results: statistical inference.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Hypothesis Testing and Statistical Significance
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Understanding Results
Presentation transcript:

CHAPTER 3 Key Principles of Statistical Inference

Objectives for Chapter 3  Describe the principles of statistical inference  Describe characteristics of a normal distribution  Discuss types of hypothesis-testing  Discuss Type 1 & Type II statistical errors

Objectives for Chapter 3  Define sensitivity, specificity, predictive value & efficiency  Discuss tests of significance  Interpret a confidence interval  Examine the components of sample size estimation for study populations

Statistical Inference  Involves obtaining information from sample of data about population from which sample was drawn & setting up a model to describe this population  When random sample is drawn from population, every member of population has equal chance of being selected in the sample

Finite Population  Each person in population can be listed  Random sample can be drawn using: Table of random numbers Research Randomizer Random. Org

Types of Statistical Inference  Parameter Estimation takes two forms Point Estimation: when estimate of population parameter is single number  Ex. Mean, median, variance & SD  Hypothesis-Testing: More common type

Normal Curve  Theoretically perfect frequency polygon in which mean, median & mode all coincide in center  Takes form of symmetrical bell-shaped curve  Called Gaussian Curve  Most important distribution in statistics

Normal Curve 68% of cases fall within + 1 SD of the mean 95% of cases fall within + 2 SD of the mean 99.7% of cases fall within + 3 SD of the mean.

Normal Distribution & Z score  When variable’s mean & SD are known, any set of scores can be transformed into z- scores with Mean = 0 SD = 1  Two Important Z scores: z = 95% confidence interval z = 99% confidence interval

USEFULNESS OF Z-SCORES  Can make comparisons across different normal distributions, across different samples of individuals or different groups.  The Z score standardizes all normal curves, makes them comparable even when the means and standard deviations are different.

Percentiles  Tells the relative position of a given score  Allows us to compare scores on tests with different means & SDs.  Calculated as  (# of scores - given score) X 100 total # of scores

Percentile  25th percentile = 1st quartile  50th percentile = 2nd quartile Also the median  75th percentile = 3rd quartile

Standard Scores  Way of expressing a score in terms of its relative distance from the mean z-score is example of standard score  Standard scores are used more often than percentiles  Transformed standard scores often called T-scores Usually has M = 50 & SD = 10

Normal Distribution & Drawing Samples  When many samples are drawn from a population, the means of those samples tend to be normally distributed  Larger the number of samples, the more the distribution approximates the normal curve

Standard Error of Mean (SE)  Is standard deviation of the population  Constant relationship between SD of a distribution of sample means (SE), the SD of population from which samples were drawn & size of samples  As size of sample increases, size of error decreases  The greater the variability, the greater the error

Probability Axioms  Fall between 0% & 100%  No negative probabilities  Probability of an event is 100% less the probability of the opposite event

Definitions of Probability  Frequency Probability based on number of times an event occurred in a given sample (n) # of times event occurred X 100 total # of people in n

Definitions of Probability  Subjective Probability:percentage expressing personal, subjective belief that event will occur  p values of.05, often used as a probability cutoff in hypothesis-testing to indicate something unusual happening in the distribution

Hypothesis-Testing  Prominent feature of quantitative research  Hypotheses originate from theory that underpins research  Two types of hypotheses: Null H o Alternative

Null Hypothesis - H o  H o proposes no difference or relationship exists between the variables of interest  Foundation of the statistical test  When you statistically test an hypothesis, you assume that H o correctly describes the state of affairs between the variables of interest

Null Hypothesis - H o  If a statistically significant relationship is found (p <.05), H o is rejected  If no statistically significant relationship is found (p. >.05), H o is rejected

Alternative Hypothesis - H a  A hypothesis that contradicts H o  Can indicate the direction of the difference or relationship expected  Often called the research hypothesis & represented by H r

Sampling Error  Inferences from samples to populations are always probabilistic, meaning we can never be certain our inference was correct  Drawing the wrong conclusion is called an error of inference, defined in terms of H o as Type I and Type II

Types of Errors  We summarize these in a 2x2 box: Right decision  Wrong decision 1   Retain H 0 Wrong decision  Right decision  Reject H 0 H 0 FalseH 0 True

Types of Errors  Type I error occurs when you reject a true H o Called alpha error  Type II error occurs when you accept a false Ho Called beta error

Types of Errors  Inverse relationship between Type 1 & Type II errors. Decreasing the likelihood of one type of error increases the likelihood of the other type error This can be done by changing the significance level  Which type of error can be most tolerated in a particular study?

Screening for Diseases  Sensitivity  Specificity  Predictive Value  Efficiency

Why Screen?  Tenet of public health is that primary prevention is best  If all cases of disease cannot be prevented, then the next best strategy is the early detection of disease in asymptomatic, apparently healthy individuals

Questions to Ask  Based on disease status: If we screen a population, what proportion of people who have the disease will be correctly identified? If we screen a population, what proportion of people who do not have the disease will be correctly identified?

Questions to Ask  Based on the test results: If the test result is positive, what is the probability that the patient will have the disease? If the test result is negative, what is the probability that the patient will not have the disease?

Sensitivity & Specificity  Sensitivity: The ability of a test to correctly identify those with the disease (true positives)  Specificity: The ability of a test to correctly identify those without the disease (true negatives)

Ideal Screening Test  100% sensitive = No false negatives  100% specific = No false positives

An Ideal Screening Program… NEGATIVE (-) True Negative (TN) TEST RESULTS POSITIVE (+) True Positive (TP) Diseased Actual diagnosis CORRECT Not Diseased

NEGATIVE (-) Not Diseased True Negative (TN) TEST RESULTS POSITIVE (+) True Positive (TP) Diseased Actual diagnosis CORRECT Diseased False Negative (FN) Oops! Should not have these False Positive (FP) Not Diseased In the real world…

More Definitions  False Positive: Healthy person incorrectly receives a positive (diseased) test result.  False Negative: Diseased person incorrectly receives a negative (healthy) test result.

2 x 2 Table to Calculate Various Outcomes ab c Positive Negative Not Diseased TP FN FP TN Diseased True Diagnosis Test Result Totala + cb + d a + b c + d Total a + b + c + d d

Calculating Sensitivity  The probability of having a positive test if you are positive (diseased) ab dc Positive Negative Not Diseased TP FN FP TN Diseased True Diagnosis Test Result Totala + cb + d a + b c + d Total a + b + c + d Sensitivity (Sn) Sensitivity = a (a + c) True Positives True Positives + False Negatives =

Calculating Specificity ab dc Positive Negative Not Diseased TP FN FP TN Diseased True Diagnosis Test Result Totala + cb + d a + b c + d Total a + b + c + d Specificity (Sp) Specificity = d (b + d) True Negatives False Positives + True Negatives =  The probability of having a negative test if you are negative (not diseased)

 % Sensitivity + % False Negative = 100% Higher sensitivity  fewer false negatives Lower sensitivity  more false negatives  % Specificity + % False Positive = 100% Higher specificity  fewer false positives Lower specificity  more false positives Inverse Relationships

Normal (no disease) Diseased B Number of Individuals true negatives true positives Interrelationship Between Sensitivity and Specificity CA false negativesfalse positives

Example: 80 people had their serum level of calcium checked to determine whether they had hyperparathyroidism. 20 were ultimately shown to have the disease. Of the 20, 12 had an elevated level of calcium (positive test result). Of the 60 determined to be free of disease, 3 had an elevated level of calcium. Step 1: Fill in the boxes with the data provided ab c Positive Negative Not Diseased 12 3 Diseased True Diagnosis Test Result Total 2060 Total 80 d

ab c Positive Negative Not Diseased Diseased True Diagnosis Test Result Total Total 80 d Step 2: Complete the table Example: 80 people had their serum level of calcium checked to determine whether they had hyperparathyroidism. 20 were ultimately shown to have the disease. Of the 20, 12 had an elevated level of calcium (positive test result). Of the 60 determined to be free of disease, 3 had an elevated level of calcium.

ab c Positive Negative Not Diseased Diseased True Diagnosis Test Result Total Total 80 d Step 3: Calculating the Sensitivity Sensitivity = a (a + c) True Positives True Positives + False Negatives = Sensitivity =12/20 = 60%

ab c Positive Negative Not Diseased Diseased True Diagnosis Test Result Total Total 80 d Step 4: Calculating the Specificity Specificity = d (b + d) True Negatives False Positives + True Negatives = Specificity =57/60 = 95%

Which is Preferred: High Sensitivity or High Specificity?  If you have a fatal disease with no treatment (such as for early cases of AIDS), optimize specificity  If you are screening to prevent transmission of a preventable disease (such as screening for HIV in blood donors), optimize sensitivity

Goal  Minimize chance (probability) of false positive and false negative test results.  Or, equivalently, maximize probability of correct results.

Accuracy of tests in use  Positive predictive value: probability that a person who has a positive test result really has the disease.  Negative predictive value: probability that a person who has a negative test result really is healthy.

Example

Example (continued) Positive predictive value = 49/1467 = Negative predictive value = 1475/1511 = 0.98 Kids who test positive have small chance in having elevated lead levels, while kids who test negative can be quite confident that they have normal lead levels.

Caution about predictive values! Reading positive and negative predictive values directly from table is accurate only if the proportion of diseased people in the sample is representative of the proportion of diseased people in the population. (Random sample!)

Example

Example (continued)  Sn = 392/400 = 0.98  Sp = 576/600 = 0.96  PPV = 392/416 = 0.94  NPV = 576/584 = 0.99 Note prevalence of disease is 400/1000 or 40% Looks good?

Example

Example (continued)  Sn = 49/50 = 0.98  Sp = 912/950 = 0.96  PPV = 49/87 = 0.56  NPV = 912/913 = Sensitivity & specificity the same, but PPV smaller because prevalence of disease is smaller, namely 50/1000 or 5%.

Find correct predictive values by knowing….  True proportion of diseased people in the population.  Sensitivity of the test  Specificity of the test

Example: PPV of pap smears?  Rate of atypia in normal population is  Sensitivity = 0.70  Specificity = 0.90 Find probability that a woman will have atypical cervical cells given that she had a positive pap smear.

Example

Example (continued)  PPV = 70/10,060 =  NPV = 89,910/89,940 = Person with positive pap has tiny chance (0.6%) of truly having disease, while person with negative pap almost certainly will be disease free.

Significance Level  States risk of rejecting H o when it is true  Commonly called p value Ranges from Summarizes the evidence in the data about H o Small p value of.001 provides strong evidence against H o, indicating that getting such a result might occur 1 out of 1,000 times

Testing a Statistical Hypothesis  State H o  Choose appropriate statistic to test H o  Define degree of risk of incorrectly concluding H o is false when it is true  Calculate statistic from a set of randomly selected observations  Decide whether to accept or reject H o based on sample statistic

Power of a Test  Probability of detecting a difference or relationship if such a difference or relationship really exists  Anything that decreases the probability of a Type II error increases power & vice versa  A more powerful test is one that is likely to reject H o

One-Tailed & Two-Tailed Tests  Tails refer to ends of normal curve  When we hypothesize the direction of the difference or relationship, we state in what tail of the distribution we expect to find the difference or relationship  One-tailed test is more powerful & is used when we have a directional hypothesis

Tailedness One-Tailed Test-.05 Level of Significance Two-Tailed Test-.05 Level of Significance Significantly different from mean Tail Tail

Degrees of Freedom (df)  The freedom of a score’s value to vary given what is known about other & the sum of the scores Ex. Given three scores, we have 3 df, one for each independent item. Once you know mean, we lose one df df = n - 1, the number of items in set less 1

Confidence Interval (CI)  Degree of confidence, expressed as a percent, that the interval contains the population mean (or proportion), & for which we have an estimate calculated from sample data 95% CI = X (standard error) 99% CI = X (standard error)

Relationship Between Confidence Interval & Significance Levels  95% CI contains all the (H o ) values for which p >.05  Makes it possible to uncover inconsistencies in research reports  A value for H o within the 95% CI should have a p value >.05, & one outside of the 95% CI should have a p value less than.05

Statistical Significance VS Meaningful Significance  Common mistake is to confuse statistical significance with substantive meaningfulness  Statistically significant result simply means that if H o were true, the observed results would be very unusual  With N > 100, even tiny relationships/differences are statistically significant

Statistical Significance VS Meaningful Significance  Statistically significant results say nothing about clinical importance or meaningful significance of results  Researcher must always determine if statistically significant results are substantively meaningful.  Refrain from statistical “sanctification” of data

Sample Size Determination  Likelihood of rejecting H o (ie, avoiding a Type II error  Depends on Significance Level: P value, usually.05 Power: 1 - beta error, usually set at.80 Effect Size: degree to which H o is false (ie, the size of the effect of independent variable on dependent variable

Sample Size Determination  Given three of these parameters, the fourth (n) can be determined  Can use Sample Size tables to determine the optimal n needed for a given analysis