Descriptive Statistics becoming familiar with the data.

Slides:



Advertisements
Similar presentations
Population vs. Sample Population: A large group of people to which we are interested in generalizing. parameter Sample: A smaller group drawn from a population.
Advertisements

Chapter 3, Numerical Descriptive Measures
1 Chapter 1: Sampling and Descriptive Statistics.
Introduction to Data Analysis
Data analysis: Explore GAP Toolkit 5 Training in basic drug abuse data management and analysis Training session 9.
MSS 905 Methods of Missiological Research
Descriptive Statistics
Statistics.
Scales of Measurement n Nominal classificationlabels mutually exclusive exhaustive different in kind, not degree.
SOWK 6003 Social Work Research Week 10 Quantitative Data Analysis
Basic Business Statistics 10th Edition
Homework Questions. Quiz! Shhh…. Once you are finished you can work on the warm- up (grab a handout)!
Measures of a Distribution’s Central Tendency, Spread, and Shape Chapter 3 SHARON LAWNER WEINBERG SARAH KNAPP ABRAMOWITZ StatisticsSPSS An Integrative.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
+ Quantitative Analysis: Supporting Concepts EDTEC 690 – Methods of Inquiry Minjuan Wang (based on previous slides)
Describing distributions with numbers
Chapter 1 Descriptive Analysis. Statistics – Making sense out of data. Gives verifiable evidence to support the answer to a question. 4 Major Parts 1.Collecting.
Exploratory Data Analysis. Height and Weight 1.Data checking, identifying problems and characteristics Data exploration and Statistical analysis.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
STAT 250 Dr. Kari Lock Morgan
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Measurement in Exercise and Sport Psychology Research EPHE 348.
Chapter 3 Statistical Concepts.
Statistics and Research methods Wiskunde voor HMI Betsy van Dijk.
Statistics. Question Tell whether the following statement is true or false: Nominal measurement is the ranking of objects based on their relative standing.
Numerical Descriptive Techniques
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Statistical Tools in Evaluation Part I. Statistical Tools in Evaluation What are statistics? –Organization and analysis of numerical data –Methods used.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Descriptive Statistics And related matters. Two families of statistics Descriptive statistics – procedures for summarizing, organizing, graphing, and,
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
Introduction to Descriptive Statistics Objectives: 1.Explain the general role of statistics in assessment & evaluation 2.Explain three methods for describing.
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
Chapter 2 Describing Data.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Biostatistics Class 1 1/25/2000 Introduction Descriptive Statistics.
Chapter 13 Descriptive Data Analysis. Statistics  Science is empirical in that knowledge is acquired by observation  Data collection requires that we.
Correlation examining relationships. Five Descriptive Questions What is the middle of the set of scores? What is the middle of the set of scores? How.
MMSI – SATURDAY SESSION with Mr. Flynn. Describing patterns and departures from patterns (20%–30% of exam) Exploratory analysis of data makes use of graphical.
Lecturer’s desk Physics- atmospheric Sciences (PAS) - Room 201 s c r e e n Row A Row B Row C Row D Row E Row F Row G Row H Row A
Numerical Measures. Measures of Central Tendency (Location) Measures of Non Central Location Measure of Variability (Dispersion, Spread) Measures of Shape.
Statistics Without Fear! AP Ψ. An Introduction Statistics-means of organizing/analyzing data Descriptive-organize to communicate Inferential-Determine.
BASIC STATISTICAL CONCEPTS Chapter Three. CHAPTER OBJECTIVES Scales of Measurement Measures of central tendency (mean, median, mode) Frequency distribution.
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
Chapter 6: Interpreting the Measures of Variability.
LIS 570 Summarising and presenting data - Univariate analysis.
Scales of Measurement n Nominal classificationlabels mutually exclusive exhaustive different in kind, not degree.
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
Measurements Statistics WEEK 6. Lesson Objectives Review Descriptive / Survey Level of measurements Descriptive Statistics.
Basic Statistics for Testing. Why we need statistics Types of scales Frequency distributions Percentile ranks.
Dr.Rehab F.M. Gwada. Measures of Central Tendency the average or a typical, middle observed value of a variable in a data set. There are three commonly.
Descriptive Statistics Printing information at: Class website:
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Statistics 200 Lecture #4 Thursday, September 1, 2016
Measurements Statistics
Different Types of Data
MATH-138 Elementary Statistics
EXPLORATORY DATA ANALYSIS and DESCRIPTIVE STATISTICS
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Statistical Reasoning
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Quartile Measures DCOVA
One Quantitative Variable: Measures of Spread
Welcome!.
Honors Statistics Review Chapters 4 - 5
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Biostatistics Lecture (2).
Presentation transcript:

Descriptive Statistics becoming familiar with the data

The Strategies Initial Screening Initial Screening Levels of Measurement Levels of Measurement Five Descriptive Questions Five Descriptive Questions Graphical Presentations Graphical Presentations Search for Outliers Search for Outliers

Initial Screening Missing values Missing values Defining labels Defining labels Key punch errors Key punch errors Valid values Valid values Understanding what you have Understanding what you have Understanding the population, sampling frame, and sample Understanding the population, sampling frame, and sample

Levels of Measurement Nominal Nominal Ordinal Ordinal Interval Interval Ratio Ratio Determining what statistics are appropriate Determining what statistics are appropriate

Nominal Naming things. Naming things. Creating groups that are qualitatively different or unique… Creating groups that are qualitatively different or unique… But not necessarily quantitatively different. But not necessarily quantitatively different.

Nominal Placing individuals or objects into categories. Placing individuals or objects into categories. Making mutually excusive categories. Making mutually excusive categories. Numbers assigned to categories are arbitrary. Numbers assigned to categories are arbitrary.

Nominal Sample variables: Sample variables: – Gender – Race – Ethnicity – Geographic location – Hair or eye color

Ordinal Rank ordering things. Rank ordering things. Creating groups or categories when only rank order is known. Creating groups or categories when only rank order is known. Numbers imply order but not exact quantity of anything. Numbers imply order but not exact quantity of anything.

Ordinal The difference between individuals with adjacent ranks, on relevant quantitative variables, is not necessarily the same across the distribution. The difference between individuals with adjacent ranks, on relevant quantitative variables, is not necessarily the same across the distribution.

Ordinal Sample variables: Sample variables: – Class Rank – Place of finish in a race (1 st, 2 nd, etc.) – Judges ratings – Responses to Likert scale items (for example – SD, D, N, A, SA)

Interval Orders observations according to the quantity of some attribute. Orders observations according to the quantity of some attribute. Arbitrary origin. Arbitrary origin. Equal intervals. Equal intervals. Equal differences expressed as equal distances. Equal differences expressed as equal distances.

Interval Sample variables: Sample variables: – Test Scores SAT SAT GRE GRE IQ tests IQ tests – Temperature Celsius Celsius Fahrenheit Fahrenheit

Ratio Quantitative measurement. Quantitative measurement. Equal intervals. Equal intervals. True zero point. True zero point. Ratios between values are useful. Ratios between values are useful.

Ratio Sample variables: Sample variables: – Financial variables – Finish times in a race – Number of units sold – Test scores scaled as percent correct or number correct

Levels of Measurement Review What level of measurement? What level of measurement? – Today is a fall day. – Today is the third hottest day of the month. – The high today was 70 o Fahrenheit. – The high today was 20 o Celsius. – The high today was 294 o Kelvin.

Levels of Measurement Review What level of measurement? What level of measurement? – Student #1256 is: – a male – from Lawrenceville, GA. – He came in third place in the race today. – He scored 550 on the SAT verbal section. – He has turned in 8 out of the 10 homework assignments.

Levels of Measurement Review What level of measurement? What level of measurement? – Student #3654 is: – in the third reading group. – Nominal? – Ordinal? – Interval? – Ratio?

Five Descriptive Questions What is the middle of the set of scores? What is the middle of the set of scores? How spread out are the scores? How spread out are the scores? Where do specific scores fall in the distribution of scores? Where do specific scores fall in the distribution of scores? What is the shape of the distribution? What is the shape of the distribution? How do different variables relate to each other? How do different variables relate to each other?

Five Descriptive Questions Middle Middle Spread Spread Rank or Relative Position Rank or Relative Position Shape Shape Correlation Correlation Descriptive Statistics Answer Sheet Descriptive Statistics Answer Sheet Descriptive Questions in Excel, SPSS, and TI-83 Descriptive Questions in Excel, SPSS, and TI-83

Middle Mean Mean Median Median Mode Mode

Spread Standard Deviation Standard Deviation Variance Variance Range Range IQR IQR

Rank or Relative Position Five number summary Five number summary Min, 25 th, 50 th, 75 th, Max Min, 25 th, 50 th, 75 th, Max Identifying specific values that have interpretive meaning Identifying specific values that have interpretive meaning Identifying where they fall in the set of scores Identifying where they fall in the set of scores Box plots Box plots Outliers Outliers

Shape Positive Skewness Positive Skewness Negative Skewness Negative Skewness Normality Normality Histograms Histograms

Shape - Normality

Shape- Positive Skewness

Shape – Negative Skewness

Correlation Direction of Relationships Direction of Relationships Positive or Negative Positive or Negative Magnitude of Relationships Magnitude of Relationships Weak, Moderate, Strong Weak, Moderate, Strong Scatterplots Scatterplots Outliers Outliers

Outliers

Outliers Boxplot shows middle 50% of scores as the box. Boxplot shows middle 50% of scores as the box. Q3 (75 th ) – Q1 (25 th ) = IQR Q3 (75 th ) – Q1 (25 th ) = IQR Data outside 1.5 IQR rule are outliers Data outside 1.5 IQR rule are outliers Q1 – (1.5*IQR) Q1 – (1.5*IQR) Q3 + (1.5*IQR) Q3 + (1.5*IQR)

Outliers

Outliers

Outliers

Outliers If normality of the population can be assumed, other rules can be used. If normality of the population can be assumed, other rules can be used. Mean +/- 2 SDs or Mean +/- 3 SDs Mean +/- 2 SDs or Mean +/- 3 SDs Empirical Rule Empirical Rule Approximately 68% within +/- 1 SD Approximately 68% within +/- 1 SD Approximately 95% within +/- 2 SD Approximately 95% within +/- 2 SD Approximately 99% within +/- 3 SD Approximately 99% within +/- 3 SD

Outliers You can also look at outliers in the bivariate case. You can also look at outliers in the bivariate case. Examine the scatterplots for values out of the pattern. Examine the scatterplots for values out of the pattern.

Outliers