Univariate Analysis/Descriptive Statistics

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

Lesson Describing Distributions with Numbers parts from Mr. Molesky’s Statmonkey website.
Appendix A. Descriptive Statistics Statistics used to organize and summarize data in a meaningful way.
Introduction to Summary Statistics
Introduction to Summary Statistics
Introduction to Data Analysis
Descriptive (Univariate) Statistics Percentages (frequencies) Ratios and Rates Measures of Central Tendency Measures of Variability Descriptive statistics.
Statistics Intro Univariate Analysis Central Tendency Dispersion.
Statistics Intro Univariate Analysis Central Tendency Dispersion.
Dr. Michael R. Hyman, NMSU Statistics for a Single Measure (Univariate)
Measures of Dispersion
1 Measures of Central Tendency Greg C Elvers, Ph.D.
Measures of Central Tendency
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 4 Summarizing Data.
Programming in R Describing Univariate and Multivariate data.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Data Analysis and Statistics. When you have to interpret information, follow these steps: Understand the title of the graph Read the labels Analyze pictures.
APPENDIX B Data Preparation and Univariate Statistics How are computer used in data collection and analysis? How are collected data prepared for statistical.
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
Data Handbook Chapter 4 & 5. Data A series of readings that represents a natural population parameter A series of readings that represents a natural population.
Statistics Recording the results from our studies.
CSCI N207: Data Analysis Using Spreadsheets Copyright ©2005  Department of Computer & Information Science Univariate Data Analysis.
Biostatistics: Measures of Central Tendency and Variance in Medical Laboratory Settings Module 5 1.
Introduction to Summary Statistics. Statistics The collection, evaluation, and interpretation of data Statistical analysis of measurements can help verify.
10a. Univariate Analysis Part 1 CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science,
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
M07-Numerical Summaries 1 1  Department of ISM, University of Alabama, Lesson Objectives  Learn when each measure of a “typical value” is appropriate.
Skewness & Kurtosis: Reference
The Central Tendency is the center of the distribution of a data set. You can think of this value as where the middle of a distribution lies. Measure.
10b. Univariate Analysis Part 2 CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science,
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Introduction to Statistics Santosh Kumar Director (iCISA)
Part II Sigma Freud and Descriptive Statistics Chapter 3 Vive La Différence: Understanding Variability.
RESEARCH & DATA ANALYSIS
LIS 570 Summarising and presenting data - Univariate analysis.
Descriptive Statistics(Summary and Variability measures)
Economics 111Lecture 7.2 Quantitative Analysis of Data.
Statistics Review  Mode: the number that occurs most frequently in the data set (could have more than 1)  Median : the value when the data set is listed.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 18.
AP Biology Resources Statistical Analysis and Graphing.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 10 Descriptive Statistics Numbers –One tool for collecting data about communication.
Exploratory Data Analysis
INTRODUCTION TO STATISTICS
Statistical Methods Michael J. Watts
MATH-138 Elementary Statistics
Analysis and Empirical Results
Statistical Methods Michael J. Watts
Descriptive measures Capture the main 4 basic Ch.Ch. of the sample distribution: Central tendency Variability (variance) Skewness kurtosis.
PA330 FEB 28, 2000.
Measures of Central Tendency
Introduction to Summary Statistics
Central Tendency and Variability
Introduction to Summary Statistics
Introduction to Summary Statistics
Description of Data (Summary and Variability measures)
Introduction to Summary Statistics
Module 8 Statistical Reasoning in Everyday Life
Introduction to Summary Statistics
Introduction to Summary Statistics
BUS7010 Quant Prep Statistics in Business and Economics
Introduction to Summary Statistics
Univariate Statistics
Introduction to Summary Statistics
MCC6.SP.5c, MCC9-12.S.ID.1, MCC9-12.S.1D.2 and MCC9-12.S.ID.3
Introduction to Summary Statistics
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Introduction to Summary Statistics
Presentation transcript:

Univariate Analysis/Descriptive Statistics Data Measurement Measurement of the data is the first step in the process that ultimately guides the final analysis. Consideration of sampling, controls, and the required precision all influence the final analysis. Validation: Instruments and methods used to measure the data must be validated for accuracy. Precision and accuracy…Determination of error Social vs. Physical Sciences

Univariate Analysis/Descriptive Statistics The Range Min/Max Average Median Mode Variance Standard Deviation Histograms and Normal Distributions

Univariate Analysis/Descriptive Statistics The Range Difference between minimum and maximum values in a data set Larger range usually (but not always) indicates a large spread or deviation in the values of the data set. (73, 66, 69, 67, 49, 60, 81, 71, 78, 62, 53, 87, 74, 65, 74, 50, 85, 45, 63, 100)

Univariate Analysis/Descriptive Statistics The Average (Mean) Sum of all values divided by the number of values in the data set. One measure of central location in the data set. Average = Average=(73+66+69+67+49+60+81+71+78+62+53+87+74+65+74+50+85+45+63+100)/20 = 68.6 Excel function: AVERAGE()

Univariate Analysis/Descriptive Statistics The Median The middle value in a sorted data set. Half the values are greater and half are less than the median. Another measure of central location in the data set. (45, 49, 50, 53, 60, 62, 63, 65, 66, 67, 69, 71, 73, 74, 74, 78, 81, 85, 87, 100) Median: 68 (1, 2, 4, 7, 8, 9, 9) Excel function: MEDIAN()

Univariate Analysis/Descriptive Statistics The Mode Most frequently occurring value. Another measure of central location in the data set. (45, 49, 50, 53, 60, 62, 63, 65, 66, 67, 69, 71, 73, 74, 74, 78, 81, 85, 87, 100) Mode: 74 Generally not all that meaninful unless a larger percentage of the values are the same number.

Univariate Analysis/Descriptive Statistics Variance One measure of dispersion (deviation from the mean) of a data set. Variance = Average value of the data set Variance = [(45 – 68.6)2 + (49 – 68.6)2 + (50 – 68.6)2 + (53 – 68.6)2 + …]/20 = 181 Excel Functions: VARP(), VAR()

Univariate Analysis/Descriptive Statistics Standard Deviation Square root of the variance. Can be thought of as the average deviation from the mean of a data set. The magnitude of the number is more in line with the values in the data set. Standard Deviation = ([(45 – 68.6)2 + (49 – 68.6)2 + (50 – 68.6)2 + (53 – 68.6)2 + …]/20)1/2 = 13.5 Excel Functions: STDEVP(), STDEV()

Univariate Analysis/Histograms Distributions Previous statistical measures are easier to understand and interpret when graphically illustrated. However, charting each data element can lead to very busy and confusing charts that do not help interpret the data. Grouping the data elements into categories and charting the frequency within these categories yields a graphical illustration of how the data is distributed throughout its range.

Univariate Analysis/Histograms Frequency Table The first step is to decide on the categories and group the data appropriately. (45, 49, 50, 53, 60, 62, 63, 65, 66, 67, 69, 71, 73, 74, 74, 78, 81, 85, 87, 100) Category Labels Frequency 0-50 3 51-60 2 61-70 6 71-80 5 81-90 >90 1

Univariate Analysis/Histograms A histogram is simply a column chart of the frequency table. Category Labels Frequency 0-50 3 51-60 2 61-70 6 71-80 5 81-90 >90 1

Univariate Analysis/Histograms Average (68.6) and Median (68) Histogram Mode (74) -1SD +1SD

Univariate Analysis/Normal Distributions Distributions that can be described mathematically as Gaussian are also called Normal The Bell curve Symmetrical/ Perfect Distribution Mean = Median Skewed Mean>Median Mean<Median