Descriptive Statistics: Part One Farrokh Alemi Ph.D. Kashif Haqqi M.D.

Table of Contents: Objectives, Definitions, Sampling methods, Types of variables, Reliability and validity, Average, Median, Mode

Objectives
1. Define validity and reliability and explain the role of each in assessing the quality of data.
2. Distinguish among nominal, ordinal, and numeric data, as well as discrete and continuous data.
3. Given a set of numerical data, calculate the mean, median and mode, and state the relative advantages of each as a measure of central tendency.

Definition of Variables A variable is an attribute of a person or an object that varies. Measurement is the set of rules for assigning numbers to objects to represent quantities of attributes.

What Is Statistics? Statistics is the science of describing or making inferences about the world from a sample of data. Descriptive statistics are numerical estimates that organize, summarize, or present the data. Inferential statistics is the process of inferring from a sample to the population.

Definitions A datum is one observation about the variable being measured. Data are a collection of observations. A population consists of all subjects about whom the study is being conducted. A sample is a subgroup of the population that is examined.

Sampling Methods
Random sample: all subjects have an equal chance of inclusion in the study.
Systematic sampling: selecting every kth subject from a numbered list.
Stratified sample: random sampling within predefined groups of subjects.
Staged sampling: a small random sample is drawn and, if its results are ambiguous, a larger random sample is collected.
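The first three sampling methods can be sketched in Python. This is only an illustration: the function names, the numbered list of 20 subjects, and the odd/even grouping are assumptions made for the example and do not come from the slides.

```python
import random

def simple_random_sample(subjects, n, seed=0):
    """Random sample: every subject has an equal chance of inclusion."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    return rng.sample(subjects, n)

def systematic_sample(subjects, k, start=0):
    """Systematic sample: every kth subject from a numbered list."""
    return subjects[start::k]

def stratified_sample(subjects, group_of, n_per_group, seed=0):
    """Stratified sample: random sampling within predefined groups."""
    rng = random.Random(seed)
    groups = {}
    for subject in subjects:
        groups.setdefault(group_of(subject), []).append(subject)
    sample = []
    for name in sorted(groups):
        members = groups[name]
        sample.extend(rng.sample(members, min(n_per_group, len(members))))
    return sample

def parity(subject_id):
    """Hypothetical grouping rule used only for this illustration."""
    return "even" if subject_id % 2 == 0 else "odd"

subjects = list(range(1, 21))  # hypothetical subject numbers 1..20

print(simple_random_sample(subjects, 5))
print(systematic_sample(subjects, 5))
print(stratified_sample(subjects, parity, 2))
```

Staged sampling is left out because the slides do not say how to judge whether the first sample's results are ambiguous.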

Types of Variables A discrete variable has gaps between its values. For example, sex is a discrete variable. If male is 1 and female is 0, values in between have no meaning. A continuous variable has no gaps between its values. All values or fractions of values have meaning. Age is an example of a continuous variable.

Types of Variables (Continued) A nominal scale assigns numbers to attributes to name the category. The numbers have no meaning by themselves, e.g. DRG code. An ordinal scale assigns numbers so that more of an attribute has higher values, e.g. severity. In an interval scale the interval between the numbers has meaning, e.g. the Fahrenheit scale. A ratio scale is an interval scale where zero has true meaning, e.g. age.

Reliability and Validity
Reliability. Definition: consistency of results on repeat measures. Types: inter-rater, intra-rater, split half, test-retest.
Validity. Definition: measuring what is supposed to be measured. Types: face validity, construct validity, predictive validity.

To be valid, a measure must be reliable. But a measure can be reliable and still be invalid.

Example of Reliability Calculation The next page shows a table from Hayward RA, McMahon LF, Bernard AM. Evaluating the care of general medicine inpatients: how good is implicit review? Annals of Internal Medicine, volume 118(7), 1993, pp. Two reviewers rated the quality of health care delivered in the same case. The table shows inter-rater reliability.

Inter-rater Reliability

Average The mean, or arithmetic average, is found by adding the values of the data and dividing by the number of values. The mean of 3 and 4 is 3.5. The geometric average is found by multiplying the values of the data and raising the product to the power of one divided by the number of values. The geometric average of 3 and 4 is the square root of 3 times 4. Can you calculate the mean and geometric average for 3, 4, and 5?

Example The mean of 3, 4 and 5 is the sum of these numbers divided by 3, which is 4. The geometric average of 3, 4 and 5 is the cube root of 3 times 4 times 5. To calculate the cube root in Excel, write a formula like: =(3*4*5)^(1/3) The answer is approximately 3.91. (An exponent of 0.33 only approximates 1/3 and gives about 3.86.) Open Excel and verify that you can do this.
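The same calculation can be checked outside Excel. The following is a minimal Python sketch; the function names are illustrative and not part of the slides. It uses the exact exponent 1/n instead of a rounded decimal.

```python
import math

def arithmetic_mean(values):
    """Add the values and divide by the number of values."""
    return sum(values) / len(values)

def geometric_mean(values):
    """Multiply the values and raise the product to the power 1/n."""
    return math.prod(values) ** (1 / len(values))

values = [3, 4, 5]
mean_value = arithmetic_mean(values)
geo_value = geometric_mean(values)

print(mean_value)           # arithmetic mean of 3, 4 and 5
print(round(geo_value, 4))  # cube root of 60, rounded to four decimals
```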

Difference Between Mean and Geometric Average A geometric average is used when averaging probabilities. A mean is used in most other contexts.

Median The median is the halfway point in a data set. To calculate the median, arrange the data in order. Divide the number of values by 2 and round down. Skip that many values from the start of the ordered list; when the number of values is odd, the next value is the median. When the number of values is even, the median is the average of the two middle values.

Example The median age of 7 patients (23, 45, 56, 23, 34, 65, 25) is given by:
- Order the list of values: 23, 23, 25, 34, 45, 56, 65.
- There are 7 observations. Divide 7 by two, round down, and you get 3.
- Skip the first 3 values; the median is the next number. In this example, 34 is the median.
- Do this in Excel.
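The steps above can be sketched in Python as follows. The function name is illustrative. The even-count branch, which averages the two middle values, is the standard convention and is an assumption here, since the example covers only an odd number of observations.

```python
def median(values):
    """Median by ordering the data and counting to the halfway point."""
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2  # divide by two and round down
    if n % 2 == 1:
        # Odd count: skip `half` values, the next one is the median.
        return ordered[half]
    # Even count (assumed convention): average the two middle values.
    return (ordered[half - 1] + ordered[half]) / 2

ages = [23, 45, 56, 23, 34, 65, 25]
print(median(ages))  # 34
```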

Mode The most frequent value observed is the mode. The mode is always an observed value in the data set. To calculate the mode, count the number of times each value occurs. The value with the most occurrences is the mode. Do this in Excel.

Example for Mode Age data: 23, 45, 56, 23, 34, 65, 25. The value 23 occurs twice. All other values occur once. The mode is 23.
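Counting occurrences can be sketched in Python as below, assuming the same seven patient ages used in the median example. The function name is illustrative. It returns a list because a data set may have more than one mode.

```python
from collections import Counter

def modes(values):
    """Return every value that occurs the greatest number of times."""
    counts = Counter(values)          # value -> number of occurrences
    highest = max(counts.values())
    return sorted(v for v, c in counts.items() if c == highest)

ages = [23, 45, 56, 23, 34, 65, 25]
print(modes(ages))  # [23]
```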

Differences in Measures of Central Tendency The mode, median and mean can be three different numbers in asymmetrical distributions of data. For any data set there is only one mean and one median, but there may be many modes. The median is less influenced by extreme values than the mean. The mean is rarely an observed value, the median is guaranteed to be an observed value only when the number of observations is odd, and the mode is always an observed value in the data set.
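A small Python sketch can illustrate how an extreme value affects the mean and the median. The ages are the seven values from the median example. The added value of 650 is a hypothetical data-entry error invented for this illustration.

```python
import statistics

ages = [23, 23, 25, 34, 45, 56, 65]
with_outlier = ages + [650]  # hypothetical extreme value, not from the slides

mean_before = statistics.mean(ages)
median_before = statistics.median(ages)
mean_after = statistics.mean(with_outlier)
median_after = statistics.median(with_outlier)

print(mean_before, median_before)
print(mean_after, median_after)
```

With the extreme value added, the mean moves far more than the median. Because the second data set has an even number of observations, its median is the average of the two middle values and is not itself an observed value.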