Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC CHAPTER 10 Basic Statistical Concepts.

Slides:



Advertisements
Similar presentations
Richard M. Jacobs, OSA, Ph.D.
Advertisements

Copyright © Allyn & Bacon (2007) Statistical Analysis of Data Graziano and Raulin Research Methods: Chapter 5 This multimedia product and its contents.
Statistics. Review of Statistics Levels of Measurement Descriptive and Inferential Statistics.
Statistical Tests Karen H. Hagglund, M.S.
Statistics.
QUANTITATIVE DATA ANALYSIS
CHAPTER 6 Statistical Analysis of Experimental Data
Statistics 800: Quantitative Business Analysis for Decision Making Measures of Locations and Variability.
Data Analysis Statistics. Inferential statistics.
Statistics for CS 312. Descriptive vs. inferential statistics Descriptive – used to describe an existing population Inferential – used to draw conclusions.
12.3 – Measures of Dispersion
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Introduction to Statistics February 21, Statistics and Research Design Statistics: Theory and method of analyzing quantitative data from samples.
Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242.
Statistical Analysis I have all this data. Now what does it mean?
CHAPTER 4 Research in Psychology: Methods & Design
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
@ 2012 Wadsworth, Cengage Learning Chapter 5 Description of Behavior Through Numerical 2012 Wadsworth, Cengage Learning.
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
Statistics in psychology Describing and analyzing the data.
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
Statistics. Question Tell whether the following statement is true or false: Nominal measurement is the ranking of objects based on their relative standing.
Class Meeting #11 Data Analysis. Types of Statistics Descriptive Statistics used to describe things, frequently groups of people.  Central Tendency 
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Types of data and how to present them 47:269: Research Methods I Dr. Leonard March 31, :269: Research Methods I Dr. Leonard March 31, 2010.
Data Handbook Chapter 4 & 5. Data A series of readings that represents a natural population parameter A series of readings that represents a natural population.
Chapter 4 Statistics. 4.1 – What is Statistics? Definition Data are observed values of random variables. The field of statistics is a collection.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Statistical Analysis I have all this data. Now what does it mean?
UNDERSTANDING RESEARCH RESULTS: DESCRIPTION AND CORRELATION © 2012 The McGraw-Hill Companies, Inc.
METHODS IN BEHAVIORAL RESEARCH NINTH EDITION PAUL C. COZBY Copyright © 2007 The McGraw-Hill Companies, Inc.
Research Process Parts of the research study Parts of the research study Aim: purpose of the study Aim: purpose of the study Target population: group whose.
Biostatistics Class 1 1/25/2000 Introduction Descriptive Statistics.
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Introduction to Quantitative Research Analysis and SPSS SW242 – Session 6 Slides.
Research Ethics:. Ethics in psychological research: History of Ethics and Research – WWII, Nuremberg, UN, Human and Animal rights Today - Tri-Council.
Descriptive & Inferential Statistics Adopted from ;Merryellen Towey Schulz, Ph.D. College of Saint Mary EDU 496.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Chapter Eight: Using Statistics to Answer Questions.
STATISTICS AND OPTIMIZATION Dr. Asawer A. Alwasiti.
BASIC STATISTICAL CONCEPTS Chapter Three. CHAPTER OBJECTIVES Scales of Measurement Measures of central tendency (mean, median, mode) Frequency distribution.
STATISTICS FOR SCIENCE RESEARCH (The Basics). Why Stats? Scientists analyze data collected in an experiment to look for patterns or relationships among.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Statistical Analysis of Data. What is a Statistic???? Population Sample Parameter: value that describes a population Statistic: a value that describes.
Handbook for Health Care Research, Second Edition Chapter 13 © 2010 Jones and Bartlett Publishers, LLC CHAPTER 13 Statistical Methods for Continuous Measures.
Educational Research: Data analysis and interpretation – 1 Descriptive statistics EDU 8603 Educational Research Richard M. Jacobs, OSA, Ph.D.
Chapter 2 Describing and Presenting a Distribution of Scores.
Handbook for Health Care Research, Second Edition Chapter 11 © 2010 Jones and Bartlett Publishers, LLC CHAPTER 11 Statistical Methods for Nominal Measures.
Data Analysis. Qualitative vs. Quantitative Data collection methods can be roughly divided into two groups. It is essential to understand the difference.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Criminal Justice and Criminology Research Methods, Second Edition Kraska / Neuman © 2012 by Pearson Higher Education, Inc Upper Saddle River, New Jersey.
Different Types of Data
STATISTICS FOR SCIENCE RESEARCH
CHAPTER 4 Research in Psychology: Methods & Design
Statistics in psychology
APPROACHES TO QUANTITATIVE DATA ANALYSIS
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Introduction to Statistics
Basic Statistical Terms
Statistics: The Interpretation of Data
15.1 The Role of Statistics in the Research Process
Chapter Nine: Using Statistics to Answer Questions
Introductory Statistics
Presentation transcript:

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC CHAPTER 10 Basic Statistical Concepts

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC Preliminary Concepts The following terms are used frequently in this chapter: Population- Collection of data or objects (that describes some phenomenon of interest. Sample: Subset of a population that is accessible for measurement. Variable- Characteristic or entity that can take on different values. Qualitative variable-Categorical variable not placed on a meaningful number scale. Quantitative variable-One that is measurable using a meaningful scale of numbers. Discrete variable- Quantitative variable with gaps or interruptions in the values it may assume. Continuous variable- Quantitative variable that can take on any value, including fractional ones possible and limited by instrumentation or application. 2

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC Levels of Measurement Numbers (0, 1, 2, … ) have the following properties: - Distinguishability: 0, 1, 2, and so on, are different numbers. -Ranking (greater than or less than): 1 is less than 2. -Equal intervals: Between 1 and 2, we assume the same distance as between 3 and 4. Nominal- named categories without any particular order to them Ordinal- consist of discrete categories that have an order to them (no indication of equal interval) Continuous (Interval)- can assume any value, rather than just whole numbers (assume equal, uniform intervals) Continuous (Ratio) - mathematically strongest level is the ratio, where numbers represent equal intervals and start with zero 3

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC Significant Figures Number of digits used to express a measured number is a rough indication of the error Zero as Significant Figures: - Final zeros to the right of the decimal point that are used to indicate accuracy are significant - For numbers less than one, zeros between the decimal point and the fi rst digit are not significant Calculations Using Significant Figures- the least precise measurement used in a calculation determines the number of significant figures in the answer 4

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC Rounding Done so that you do not infer accuracy in the result that was not present in the measurements Universal rounding rules: If the final digits of a number are 0, 1, 2, 3, or 4, the numbers are rounded down (dropped, and the preceding figure is retained unaltered). If the final digits are 5, 6, 7, 8, or 9, the numbers are rounded up (dropped, and the preceding figure is increased by one). 5

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC Descriptive Statistics Methods for organizing data and reducing a large set of numbers to a few informative numbers Data representation- data set should be organized for inspection through use of a frequency distribution Histogram- A bar graph in which the height of the bar indicates the frequency of occurrence of a value or class of values. Frequency polygon- A graph in which a point indicates the frequency of a value, and the points are connected to form a broken line (hence a polygon) Percentage- The numerical frequency on the Y-axis is replaced with the percentage of occur­rence in this form of the polygon. 6

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC Descriptive Statistics Percentile- A percentile is the value of a variable in a data set below which a certain percent of observations fall Cumulative percentage curve- This graph plots the cumulative percentage on the Y-axis against the values of the variable on the X-axis. The curve then describes the rate of accumulation for the values of the variable.

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC Measures of the Typical Value of a Set of Numbers Summation Operator- denoted by the Greek capital letter sigma(∑) and simply indicates addition over values of variable Three statistics are used to represent the typical value (also called the central tendency) -Mean-sum of all observations divided by the number of observations -Median- is the 50th percentile of a distribution, or the point that divides the distribution into equal halves -Mode- is the most frequently occurring observation in the distribution 8

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC Measures of Dispersion Dispersion indicate the variability, or how spread out the data are Range- is the distance between the smallest and the largest values of the variable Variance - is a measure of how different the values in a set of numbers are from each other. It is calculated as the average squared deviation of the values from the mean Standard deviation-average deviation from mean Coefficient of Variation- expresses standard deviation as percentage of mean Standard scores (or z score)- deviation from mean expressed in units of standard deviation 9

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC Propagation of Errors in Calculations and Correlation and Regression Propagation of Errors in Calculations- physical quantity of interest is not measured directly but rather a function of one of more measurement made from an experiment Correlation- descriptive measure of relationship or association between two variables Regression- linear relationship between two variables, use the value of one variable to predict the value of the other variable - When we measure X and predict Y, Y is said to be regressed on X 10

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC Inferential Statistics Although a sample from a population is economical, we still wish to use the sample measure­ments (statistics) to infer to the population measures (parameters). Concept of Probability- probability of an event can be defined as the relative frequency, or proportion, of occurrence of that event out of some total number of events. -Values between 0 and 1 Normal Distribution and Standard scores- a normally distributed variable, the mean is at the center of the distribution, and therefore, the mean is also the median and the mode. -Normal distribution- z score for the mean must always be zero. 11

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC Normal Curve 12

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC Inferential Statistics Sampling Distribution- is the probability distribution of a statistic and most important concept in inferential statistics Confidence Intervals- is the range of values that are believed to contain the true parameter value Error intervals- describe the combined effects of systematic and random errors on individual measurements. -We can also say something about how much confidence should be placed in the estimate 13

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC Inferential Statistics Data Analysis for Device Evaluation Studies Step 1- create a scatter plot of the raw data to get a subjective impression of their validity. Step 2- make sure the data comply with the assumption of normality Step 3- once the data are judged to conform to the underlying assumptions, the mean and standard deviation are used to calculate error intervals Step 4- the data should be presented in graphic form and labeled with the numerical values for the error intervals 14

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC Inferential Statistics Interpreting Manufacturers’ Error Specifications- evaluating a new device, a major concern is with how much error can be expected in normal use. - Knowing that any specification of error is just an estimate, we want to know how much confidence to place in it. - Manufacturers can be rather obscure about their error specifications. Hypothesis testing- technique for quantifying our guess about a hypothesis. We never know the “real” situation. -Does drug X cause Y or not? We can figure the odds and quantify our probability of being right or wrong. -Chance difference

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC Inferential Statistics Type I and II Errors – Type 1-the error of rejecting the null hypothesis when it is true – Type 2-the error of accepting false null hypothesis Power Analysis and Sample Size- probability of correctly rejecting the null hypothesis – The most practical means to control power is to manipulate sample size 16

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC Inferential Statistics Rules of Thumb for Estimating Sample Size -Estimates Based on Mean and Standard Deviation -Estimates Based on Proportionate Change and Coefficient of Variation -Estimates for Confidence Intervals -Sample Size for Binomial Test -Unequal Sample Sizes -Rule of Threes 17

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC Inferential Statistics Clinical Importance Versus Statistical Significance – Size of the test statistic for a given difference is determined by the standard error, which in turn is determined by the sample size – Difference between two mean values (treatment group vs. control group) is significant but so small that it does not have any practical effect, then we must conclude that the results are not clinically important 18

Handbook for Health Care Research, Second Edition Chapter 10 © 2010 Jones and Bartlett Publishers, LLC Inferential Statistics Matched Versus Unmatched Data – Unmatched data (or unpaired or independent) if values in one group are unrelated in any way to the data values in the other group – Matched data (or paired or dependent) data are selected so that they will be as nearly identical as possible 19