Basic statistics 11/09/13.

Slides:



Advertisements
Similar presentations
DiseaseNo disease 60 people with disease 40 people without disease Total population = 100.
Advertisements

© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 4. Measuring Averages.
Psychology: A Modular Approach to Mind and Behavior, Tenth Edition, Dennis Coon Appendix Appendix: Behavioral Statistics.
Table of Contents Exit Appendix Behavioral Statistics.
Appendix A. Descriptive Statistics Statistics used to organize and summarize data in a meaningful way.
QUANTITATIVE DATA ANALYSIS
Epidemiology in Medicine Sandra Rodriguez Internal Medicine TTUHSC.
Anthropometry Technique of measuring people Measure Index Indicator Reference Information.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
AP Biology Intro to Statistic
Statistics for Health Care
Mode Mean Range Median WHAT DO THEY ALL MEAN?.
SHOWTIME! STATISTICAL TOOLS IN EVALUATION DESCRIPTIVE VALUES MEASURES OF VARIABILITY.
1 Measures of Central Tendency Greg C Elvers, Ph.D.
Measures of Central Tendency
Today: Central Tendency & Dispersion
Statistics in Screening/Diagnosis
Mode Mean Range Median WHAT DO THEY ALL MEAN?.
Statistics for Linguistics Students Michaelmas 2004 Week 1 Bettina Braun.
WHAT DO THEY ALL MEAN?. 4 ways to describe data The Mean – The average number in a data set. The Median – The middle number in a data set….. 50% of the.
 Mean: true average  Median: middle number once ranked  Mode: most repetitive  Range : difference between largest and smallest.
Multiple Choice Questions for discussion
ESTIMATION. STATISTICAL INFERENCE It is the procedure where inference about a population is made on the basis of the results obtained from a sample drawn.
Statistics for Health Care Biostatistics. Phases of a Full Clinical Trial Phase I – the trial takes place after the development of a therapy and is designed.
Statistics Recording the results from our studies.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
Descriptive Statistics
Sensitivity & Specificity Sam Thomson 8/12/10. Sensitivity Proportion of people with the condition who have a positive test result Proportion of people.
Measures of Central Tendency And Spread Understand the terms mean, median, mode, range, standard deviation.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
Psychology’s Statistics. Statistics Are a means to make data more meaningful Provide a method of organizing information so that it can be understood.
Determination of Sample Size: A Review of Statistical Theory
Measures of Central Tendency Foundations of Algebra.
Agenda Descriptive Statistics Measures of Spread - Variability.
Measures of Central Tendency: The Mean, Median, and Mode
FREQUANCY DISTRIBUTION 8, 24, 18, 5, 6, 12, 4, 3, 3, 2, 3, 23, 9, 18, 16, 1, 2, 3, 5, 11, 13, 15, 9, 11, 11, 7, 10, 6, 5, 16, 20, 4, 3, 3, 3, 10, 3, 2,
Basic Statistical Terms: Statistics: refers to the sample A means by which a set of data may be described and interpreted in a meaningful way. A method.
Introduction to Statistics Santosh Kumar Director (iCISA)
Evaluating Results of Learning Blaž Zupan
 Two basic types Descriptive  Describes the nature and properties of the data  Helps to organize and summarize information Inferential  Used in testing.
Chapter SixteenChapter Sixteen. Figure 16.1 Relationship of Frequency Distribution, Hypothesis Testing and Cross-Tabulation to the Previous Chapters and.
PCB 3043L - General Ecology Data Analysis. PCB 3043L - General Ecology Data Analysis.
Data Analysis.
Diagnostic Tests Studies 87/3/2 “How to read a paper” workshop Kamran Yazdani, MD MPH.
Statistical analysis Why?? (besides making your life difficult …)  Scientists must collect data AND analyze it  Does your data support your hypothesis?
7.3 Measures of Central Tendency and Dispersion. Mean – the arithmetic average is the sum of all values in the data set divided by the number of values.
Psychology’s Statistics Appendix. Statistics Are a means to make data more meaningful Provide a method of organizing information so that it can be understood.
CHAPTER 3 Key Principles of Statistical Inference.
Making Sense of Statistics: A Conceptual Overview Sixth Edition PowerPoints by Pamela Pitman Brown, PhD, CPG Fred Pyrczak Pyrczak Publishing.
PCB 3043L - General Ecology Data Analysis Organizing an ecological study What is the aim of the study? What is the main question being asked? What are.
Chapter 2 The Mean, Variance, Standard Deviation, and Z Scores.
Probability and Statistics
AP Biology Intro to Statistics
Descriptive Statistics
Measures of Central Tendency
Mode Mean Range Median WHAT DO THEY ALL MEAN?.
Mathematical Presentation of Data Measures of Dispersion
Class session 7 Screening, validity, reliability
Lecture 3.
Measures of Central Tendency
Measures of Central Tendency
2-Way Tables, Statistics, & Probability
Descriptive and inferential statistics. Confidence interval
AP Biology Intro to Statistic
AP Biology Intro to Statistic
Summary descriptive statistics: means and standard deviations:
AP Biology Intro to Statistic
Patricia Butterfield & Naomi Chaytor October 18th, 2017
Presentation transcript:

Basic statistics 11/09/13

Topics to cover Averages: Mean, Median, Mode, Range, Confidence intervals, Standard deviation Incidence and prevalence Screening tests – positive and negative predictive values, sensitivity and specificity

Central Tendencies and Spread of Data Basic statistics Central Tendencies and Spread of Data

Measures of central tendency 15 patients with epilepsy were recruited into a trial. They were asked to record the number of seizures they had in a six month period. The results are presented below. 4 2 1 1 13 1 9 2 1 1 2 3 7 5 8

Measures of central tendency Calculate the: Mean Median Mode Range

Measures of central tendency Calculate the: Mean = 4 Median = 2 Mode = 1 Range = 12

Measures of central tendency Mean: sum of the observations divided by the number of observations (1 + 1 + 1 + 1 + 1 + 2 + 2 + 2 + 3 + 4 + 5 + 7 + 8 + 9 + 13) / 15 = 4 Median: the middle value when the total observations are arranged in order of increasing value 1 1 1 1 1 2 2 2 3 4 5 7 8 9 13 Mode: the most commonly occurring value Range: the difference between the highest and lowest values in a set of data 13 – 1 = 12

Standard deviation A measure of the spread of the data Can be used to calculate confidence intervals

Normal Distribution

Confidence intervals Used to assess statistical significance Provides a measure of the extent to which a sample estimate is likely to differ from the true population value Indicates with a standard level of certainty (usually 95%), the range of values within which the true population mean is likely to lie e.g. 25±5

Confidence intervals contd. For a given level of confidence: a narrow interval indicates that the sample estimate has good (high) precision a wide interval indicates that the sample estimate has poor (low) precision Confidence intervals become narrower as: the sample size increases the variability of the data decreases the degree of confidence required for the population mean decreases e.g. 90%, 95%, 99%

Incidence and Prevalence Basic statistics Incidence and Prevalence

Incidence In the last year there have been 24 new cases of colorectal carcinoma in your practice (list size 10276). What is the incidence of colorectal carcinoma? Incidence = 24 / 10276 = 0.0023 Incidence per 1000 = 2.30 per 1000

Incidence Number of new cases diagnosed in a population per unit of time Incidence rate = (number of new cases diagnosed in a given period of time / population size) x 100, 000 (or 1000 etc.)

Prevalence 2593 of the 8725 patients registered with your practice have a BMI of 30 or more. What is the prevalence of obesity? Prevalence = (2593 / 8725) x 100 = 29.7% Prevalence = (2593 / 8725) x 1000 = 297.2 per 1000

Prevalence Total number of cases per population at a particular point in time (e.g. number per 100,000 population) Prevalence rate = (number of cases in population / total size of population) x 100,000 (or 1000 etc.) Prevalence = incidence x duration of condition

Relationship between incidence and prevalence Prevalence = incidence x duration of condition Increase incidence → increase prevalence Cure more patients → lower prevalence More patient die → lower prevalence Enhance survival → increase prevalence

Screening Test Statistics Basic statistics Screening Test Statistics

Screening tests Cervical cancer present Cervical cancer absent New test positive 100 True positives (TP) 50 False positives (FP) New test negative 10 False negatives (FN) 840 True negatives (TN) A blood test to help diagnose cervical cancer has been developed. A study is done on 1000 patients comparing this test to the standard technique

Positive and negative predictive values Positive predictive value (PPV): proportion of people who test positive who actually have the disease PPV = TP / (TP + FP) Negative predictive value (NPV): proportion of people told they don’t have the disease that really don’t have it NPV = TN / (TN + FN) Give an indication of the reliability of a positive or negative test result

PPV and NPV 100 50 10 840 PPV = 100 / (100+50) NPV = 840 / (840+10) Cervical cancer present Cervical cancer absent New test positive 100 50 New test negative 10 840 PPV = 100 / (100+50) = 0.67 = 67% NPV = 840 / (840+10) = 0.99 = 99% The higher the PPV, the more likely it is that a patient with a positive test result does have the disease The higher the NPV, the more likely it is that someone who has tested negative really doesn’t have the disease

Sensitivity and specificity Sensitivity: the proportion of people with a disease who are detected by the test (proportion of positives found) Sensitivity = TP / (TP + FN) Specificity: the proportion of people who don’t have a disease who test negative (proportion of negatives found) Specificity = TN / (TN + FP) Indicate the proportion of the population with/without the disease which will be detected by the test

Sensitivity and specificity Cervical cancer present Cervical cancer absent New test positive 100 50 New test negative 10 840 Sensitivity = 100 / (100+10) = 0.91 = 91% Specificity = 840 / (840+50) = 0.94 = 94% High sensitivity = few missed diagnoses High specificity = few false positives