Week 2 An overview Exposure and outcome (dependent and independent variables)Exposure and outcome (dependent and independent variables) Reliability and.

Slides:



Advertisements
Similar presentations
ADVANCED STATISTICS FOR MEDICAL STUDIES Mwarumba Mwavita, Ph.D. School of Educational Studies Research Evaluation Measurement and Statistics (REMS) Oklahoma.
Advertisements

The Research Consumer Evaluates Measurement Reliability and Validity
CONCEPTS UNDERLYING STUDY DESIGN
KAHS 6020 Multivariate analysis and design Dr. Alison Macpherson Website
1 Hypothesis Testing Chapter 8 of Howell How do we know when we can generalize our research findings? External validity must be good must have statistical.
Statistical Issues in Research Planning and Evaluation
1 Case-Control Study Design Two groups are selected, one of people with the disease (cases), and the other of people with the same general characteristics.
KINE 4565: The epidemiology of injury prevention Case control and case crossover studies.
Statistical Tests Karen H. Hagglund, M.S.
Analysis of frequency counts with Chi square
QUANTITATIVE DATA ANALYSIS
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Statistics By Z S Chaudry. Why do I need to know about statistics ? Tested in AKT To understand Journal articles and research papers.
Statistics for Health Care
Today Concepts underlying inferential statistics
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Chapter 14 Inferential Data Analysis
BASIC STATISTICS WE MOST OFTEN USE Student Affairs Assessment Council Portland State University June 2012.
1 of 27 PSYC 4310/6310 Advanced Experimental Methods and Statistics © 2013, Michael Kalsher Michael J. Kalsher Department of Cognitive Science Adv. Experimental.
David Yens, Ph.D. NYCOM PASW-SPSS STATISTICS David P. Yens, Ph.D. New York College of Osteopathic Medicine, NYIT l PRESENTATION.
POPULATION DYNAMICS Required background knowledge: Data and variability concepts  Data collection Measures of central tendency (mean, median, mode, variance,
CENTRE FOR INNOVATION, RESEARCH AND COMPETENCE IN THE LEARNING ECONOMY Session 2: Basic techniques for innovation data analysis. Part I: Statistical inferences.
Mohsen Askarishahi Reference: 1)Aviva Petrie. Medical Statistics at a Glance. Blackwell (2005) 2) Sheldon M. Ross. Introductory Statistics. Elsevier Inc.
 Mean: true average  Median: middle number once ranked  Mode: most repetitive  Range : difference between largest and smallest.
Multiple Choice Questions for discussion
OKU 9 Chapter 15: ORTHOPAEDIC RESEARCH Brian E. Walczak.
Epidemiology The Basics Only… Adapted with permission from a class presentation developed by Dr. Charles Lynch – University of Iowa, Iowa City.
Instrumentation.
McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. Educational Research: Fundamentals.
Basic statistics 11/09/13.
Chapter 15 Data Analysis: Testing for Significant Differences.
Introduction To Biological Research. Step-by-step analysis of biological data The statistical analysis of a biological experiment may be broken down into.
Chapter Eleven A Primer for Descriptive Statistics.
Statistics for Infection Control Practitioners Presented By: Shana O’Heron, MPH, CIC Infection Prevention and Management Associates.
User Study Evaluation Human-Computer Interaction.
Analyzing and Interpreting Quantitative Data
The exam duration: 1hour 30 min. Marks :25 All MCQ’s. You should choose the correct answer. No major calculations, but simple maths IQ is required. No.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Week 5: Logistic regression analysis Overview Questions from last week What is logistic regression analysis? The mathematical model Interpreting the β.
The binomial applied: absolute and relative risks, chi-square.
Average Arithmetic and Average Quadratic Deviation.
Chapter 2 Nature of the evidence. Chapter overview Introduction What is epidemiology? Measuring physical activity and fitness in population studies Laboratory-based.
Chapter 9 Three Tests of Significance Winston Jackson and Norine Verberg Methods: Doing Social Research, 4e.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Chapter 20 Testing Hypothesis about proportions
Medical Statistics as a science
1.1 Statistical Analysis. Learning Goals: Basic Statistics Data is best demonstrated visually in a graph form with clearly labeled axes and a concise.
Relative Values. Statistical Terms n Mean:  the average of the data  sensitive to outlying data n Median:  the middle of the data  not sensitive to.
The exam is of 2 hours & Marks :40 The exam is of two parts ( Part I & Part II) Part I is of 20 questions. Answer any 15 questions Each question is of.
Data Analysis.
Chapter 6: Analyzing and Interpreting Quantitative Data
Authenticity of results of statistical research. The Normal Distribution n Mean = median = mode n Skew is zero n 68% of values fall between 1 SD n 95%
Psychometrics. Goals of statistics Describe what is happening now –DESCRIPTIVE STATISTICS Determine what is probably happening or what might happen in.
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
A short introduction to epidemiology Chapter 6: Precision Neil Pearce Centre for Public Health Research Massey University Wellington, New Zealand.
Chapter 13 Understanding research results: statistical inference.
Beginners statistics Assoc Prof Terry Haines. 5 simple steps 1.Understand the type of measurement you are dealing with 2.Understand the type of question.
Data Analysis. Qualitative vs. Quantitative Data collection methods can be roughly divided into two groups. It is essential to understand the difference.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Dr.Rehab F.M. Gwada. Measures of Central Tendency the average or a typical, middle observed value of a variable in a data set. There are three commonly.
Choosing and using your statistic. Steps of hypothesis testing 1. Establish the null hypothesis, H 0. 2.Establish the alternate hypothesis: H 1. 3.Decide.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Understanding Results
Basic Statistics Overview
NURS 790: Methods for Research and Evidence Based Practice
15.1 The Role of Statistics in the Research Process
Presentation transcript:

Week 2 An overview Exposure and outcome (dependent and independent variables)Exposure and outcome (dependent and independent variables) Reliability and validityReliability and validity What is “statistical significance”?What is “statistical significance”? Relationships between variables -continuous variables (t-tests and z-tests) -continuous variables (correlations)Relationships between variables -continuous variables (t-tests and z-tests) -continuous variables (correlations) -the normal (gaussian) distribution -categorical variables (chi-square tests)-the normal (gaussian) distribution -categorical variables (chi-square tests) Two by two tables and confidence intervalsTwo by two tables and confidence intervals Review of the articlesReview of the articles Example 1: Children crossing streetsExample 1: Children crossing streets Measures of association between variablesMeasures of association between variables For next weekFor next week

A somewhat advanced society has figured how to package basic knowledge in pill form. A student, needing some learning, goes to the pharmacy and asks what kind of knowledge pills are available. The pharmacist says "Here's a pill for English literature." The student takes the pill and swallows it and has new knowledge about English literature! "What else do you have?" asks the student. "Well, I have pills for art history, biology, and world history, "replies the pharmacist. The student asks for these, and swallows them and has new knowledge about those subjects! Then the student asks, "Do you have a pill for statistics? "The pharmacist says "Wait just a moment", and goes back into the storeroom and brings back a whopper of a pill that is about twice the size of a jawbreaker and plunks it on the counter. "I have to take that huge pill for statistics?" inquires the student. The pharmacist understandingly nods his head and replies "Well, you know statistics always was a little hard to swallow."

Epidemiologic study designs 1.Randomized controlled trial Considered the ‘gold standard’ Exposure is assigned randomly Participants followed over time to assess outcome Analytic comparison of risk or benefit in exposed vs. not exposed Can be applied to program evaluation

Epidemiologic study design 2 2. Cohort study One group exposed Other group unexposed Participants followed over time to assess outcome Analytic comparison of risk in exposed vs. not exposed Can be applied to program evaluation

Epidemiologic study designs 3 3.Case-control study Based on outcomeBased on outcome Exposure is compared in those with and without outcomeExposure is compared in those with and without outcome Analytic comparison of risk in exposed vs. not exposedAnalytic comparison of risk in exposed vs. not exposed 4. Descriptive study Provides descriptive statistics of problem under studyProvides descriptive statistics of problem under study No analytic comparison of risk / benefitNo analytic comparison of risk / benefit Often precedes analytic studiesOften precedes analytic studies

Dependent vs independent variables Remember the exposure/outcome relationship Another way to describe it is to attribute dependent and independent variables-the outcome depends on the independent exposure variables It is the association between these variables that leads us to statistical tests The test we use depends on the type of variable

Statistical significance What is statistical significance? The probability that the observed relationship could have happed by chance The p-value and confidence interval are the usual measures of significance Set by tradition at 0.05 or 95% The higher the p value, the more likely it could have happened by chance The wider the confidence interval, the more likely it could have happened by chance Both driven by variability in the data and sample size

Types of variables Continuous variables -variables for which there is a range of responses e.g., age, blood pressure, weight Categorical variables –Variables that fall into categories –e.g, gender, smoking status

Hypothesis testing for continuous variables Mean (the average number) -calculated by summing all the numbers and dividing by n -Hypothesis testing usually done using a t-test to compare the 2 means -Significance of t-test based on sample size and variability within the data Median (the number in the middle) -not usually tested Mode (the most frequent response) -not usually tested

Hypothesis testing for categorical variables Counts (how many fall within each category) Compare using 2X2 table Proportions (what percentage fall within each category) Compare 2 proportions Frequency distributions (comparing counts and percentages between categories) Compare using chi-square test

2X2 tables: the foundation Disease or other outcome No disease or other outcome Exposed ab Not exposed cd

2X2 tables: estimating associations Disease or other outcome No disease or other outcome Exposed aba+b Not exposed cdc+d a+cb+da+b+c+d

Odds ratios and relative risks Odds ratios (ad/bc) calculate the odds of an outcome given an exposure Relative risk (a/a+b)/c/c+d) calculates the relative risk of an outcome in exposed compared to non-exposed group Statistical packages calculate confidence intervals

Confidence intervals Confidence intervals are used for hypothesis testing in 2X2 tables (and others) The width of a confidence interval is based on the variablility within the data and the sample size An OR or RR of 1 = no association A confidence interval that crosses 1 is NOT statistically significant

Regression lines and correlation Correlation is the measure of the way one variable is associated with another Can be done with 2 continuous variables The regression line is the best fit between 2 variables Ranges from -1 to 1

Article review Questions to consider: What is the research question? What is their study design? What is the exposure variable(s)? What is the outcome variable? What are the strengths and limitations? Who funded the study? How compelling are the findings?

Example # 1 Statistical associations of the number of streets crossed by children and: -socio-economic indicators -child pedestrian injury rate

Background Child pedestrian injury rate has been declining in many countries, including Canada Concern has been expressed that the decline is due to a reduction in exposure to traffic (i.e., children are driven or bussed rather than walking)

Objective The objective of this study was to measure the number of streets children cross on one day To see if the number of streets crossed varies by socio-economic status To see if the child pedestrian injury rate is associated with the number of streets crossed

Variables Number of streets crossed as reported by parents from a random sample of schools in Montreal Socio-economic status measured by: -car ownership -parental education -home ownership Injury rate in police district as reported by the police

Methods Frequency distribution of average # of streets crossed presented by age and SES Statistical testing for the differences between means for categorical variables Scatterplot generated and regression line calculated

Table 1 Number of Streets Crossed by Age and Socio-economic Indicators* AgeNMeanSD 5 & & & Number of cars Home Ownership Rent home Own home

No car1 car Average streets crossed (Mean) Standard deviation Sample size Z Test for difference between means 13.8, p<0.001 Comparing average streets crossed by car ownership

Measures of association between variables Tied in to the concept of reliability and validity Sometimes we need to test a new variable in relation to an old one For example, a new questionnaire, faster blood test, etc. Several ways to measure association: Cronbach’s alpha, kappa, sensitivity, specificity, positive predictive value, negative predictive value

Cronbach’s alpha Measures the reliability of a psychometric instrument Assesses the extent to which a set of test items can be treated as measuring a single latent variable Mean correlation between a set of items with the mean of all the other items Looks at variation between individuals compared to variation due to items Can be between – infinity and 1 (although usually only between 0 and 1) Usually considered ‘good’ if > 0.8

Kappa Measures the extent to which ratings given by 2 raters agree Often used when experts are assigning scores based on opinions (e.g., medication errors) Gives credit when scores match exactly, takes away agreement when they don’t Can be between 0 and 1 Usually considered ‘good’ if > 0.7

Sensitivity and specificity Sensitivity Measures the extent to which a test agrees with a ‘gold standard’ Often used when trying out a new diagnostic test Reports how often the new test agrees with the old when positive Captures the false negatives Calculated using a 2 X 2 table Acceptability of score depends on test qualities

Sensitivity and specificity Specificity Measures the extent to which a test agrees with a ‘gold standard’ Often used when trying out a new diagnostic test Captures the ‘false positives’ Reports how often the new test agrees with the old when negative (eg accurately reports the absence of the condition) Calculated using a 2 X 2 table Acceptability of score depends on test qualities

2X2 tables revisited Gold standard + (has condition) Gold standard – (does not have condition) New test + ab New test - cd

Calculating sensitivity and specificity Sensitivity= number who are both disease positive and test positive/number who are disease positive a/a+c Specificity = number who are both disease negative and test negative/number who are disease negative d/d+b

Understanding sensitivity and specificity Sensitivity is high when the test picks up a lot of the true disease (has few false negatives) High sensitivity is important for infectious diseases (e.g., HIV) Specificity is high when the test does not have false positives. This is important when the consequences of treating the disease are significant (e.g., cancer)

Positive and negative predictive value Tells you how good a test is at predicting whether a patient actually has the disease Positive predictive value is the probability that the patient has the disease given a positive test Depends on sensitivity, specificity and the prevalence of the disease

Overview Different types of variables are measured and presented differently P values and confidence intervals are the measure of statistical significance Tell us the probability that these results could have happened by chance Cronbach’s alpha, kappa, sensitivity and specificity tell us about relationships between measurements

For next week 1 Read Chapter 3 in the text Read the ICES privacy document ( Think about privacy and confidentiality What issues are relevant to you in your current research?

For next week 2 Identify your data set Where did it come from? How was it collected? What type of variables does it include? What is your research question? What are your exposure variables? What is your outcome variable? If you are not familiar with SPSS it is STRONGLY recommended that you complete the tutorial