Data Transformation Data conversion Changing the original form of the data to a new format More appropriate data analysis New.

Slides:



Advertisements
Similar presentations
Introduction to Summary Statistics
Advertisements

Exploring Marketing Research William G. Zikmund Chapter 20: Basic Data Analysis.
Statistics.
Descriptive (Univariate) Statistics Percentages (frequencies) Ratios and Rates Measures of Central Tendency Measures of Variability Descriptive statistics.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Data Analysis Statistics. OVERVIEW Getting Ready for Data Collection Getting Ready for Data Collection The Data Collection Process The Data Collection.
Biostatistics Unit 2 Descriptive Biostatistics 1.
1 Basic statistics Week 10 Lecture 1. Thursday, May 20, 2004 ISYS3015 Analytic methods for IS professionals School of IT, University of Sydney 2 Meanings.
Dr. Michael R. Hyman, NMSU Statistics for a Single Measure (Univariate)
Learning Objectives for Section 11.3 Measures of Dispersion
Edpsy 511 Homework 1: Due 2/6.
MR2300: MARKETING RESEARCH PAUL TILLEY Unit 10: Basic Data Analysis.
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
Central Tendency and Variability
Exploring Marketing Research William G. Zikmund
Chapter 4 SUMMARIZING SCORES WITH MEASURES OF VARIABILITY.
Measures of Central Tendency
BPT 2423 – STATISTICAL PROCESS CONTROL.  Frequency Distribution  Normal Distribution / Probability  Areas Under The Normal Curve  Application of Normal.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Exploring Marketing Research William G. Zikmund Chapter 20: Basic Data Analysis.
Programming in R Describing Univariate and Multivariate data.
CHAPTER 3 : DESCRIPTIVE STATISTIC : NUMERICAL MEASURES (STATISTICS)
B AD 6243: Applied Univariate Statistics Understanding Data and Data Distributions Professor Laku Chidambaram Price College of Business University of Oklahoma.
Overview Summarizing Data – Central Tendency - revisited Summarizing Data – Central Tendency - revisited –Mean, Median, Mode Deviation scores Deviation.
Essentials of Marketing Research
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
BUS250 Seminar 4. Mean: the arithmetic average of a set of data or sum of the values divided by the number of values. Median: the middle value of a data.
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
Some Useful Continuous Probability Distributions.
8.3 Measures of Dispersion  In this section, you will study measures of variability of data. In addition to being able to find measures of central tendency.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Interpreting Performance Data
Skewness & Kurtosis: Reference
Essentials of Marketing Research William G. Zikmund Chapter 14: Basic Data Analysis.
Determination of Sample Size: A Review of Statistical Theory
Business Research Methods William G. Zikmund
What does Statistics Mean? Descriptive statistics –Number of people –Trends in employment –Data Inferential statistics –Make an inference about a population.
Basic Statistical Terms: Statistics: refers to the sample A means by which a set of data may be described and interpreted in a meaningful way. A method.
 Two basic types Descriptive  Describes the nature and properties of the data  Helps to organize and summarize information Inferential  Used in testing.
CHAPTER 3 : DESCRIPTIVE STATISTIC : NUMERICAL MEASURES (STATISTICS)
Chapter SixteenChapter Sixteen. Figure 16.1 Relationship of Frequency Distribution, Hypothesis Testing and Cross-Tabulation to the Previous Chapters and.
Statistical Analysis Quantitative research is first and foremost a logical rather than a mathematical (i.e., statistical) operation Statistics represent.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
BASIC STATISTICAL CONCEPTS Chapter Three. CHAPTER OBJECTIVES Scales of Measurement Measures of central tendency (mean, median, mode) Frequency distribution.
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
Statistical Analysis of Data. What is a Statistic???? Population Sample Parameter: value that describes a population Statistic: a value that describes.
Chapter 11 Data Descriptions and Probability Distributions Section 3 Measures of Dispersion.
Chapter 2 Describing and Presenting a Distribution of Scores.
Welcome to… The Exciting World of Descriptive Statistics in Educational Assessment!
PowerPoint Presentation by Charlie Cook The University of West Alabama William G. Zikmund Barry J. Babin 9 th Edition Part 5 Sampling and Fieldwork © 2007.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 2 Describing and Presenting a Distribution of Scores.
Essentials of Marketing Research William G. Zikmund
Statistical Methods Michael J. Watts
Statistical Methods Michael J. Watts
Data Analysis & Report Writing
LEARNING OUTCOMES After studying this chapter, you should be able to
Central Tendency and Variability
Descriptive Statistics
Characteristics of the Mean
BUS7010 Quant Prep Statistics in Business and Economics
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Descriptive Statistics
Statistics for a Single Measure (Univariate)
BUSINESS MARKET RESEARCH
Presentation transcript:

Data Transformation Data conversion Changing the original form of the data to a new format More appropriate data analysis New variables

Data Transformation Summative Score = VAR1 + VAR2 + VAR 3

Descriptive Analysis The transformation of raw data into a form that will make them easy to understand and interpret; rearranging, ordering, and manipulating data to generate descriptive information

Tabulation Tabulation - Orderly arrangement of data in a table or other summary format Frequency table Percentages

Frequency Table The arrangement of statistical data in a row-and-column format that exhibits the count of responses or observations for each category assigned to a variable

Measure of CentralMeasure of Type of ScaleTendencyDispersion NominalModeNone OrdinalMedianPercentile Interval or ratioMeanStandard deviation Central Tendency

Base The number of respondents or observations (in a row or column) used as a basis for computing percentages

Index Numbers Score or observation recalibrated to indicate how it relates to a base number CPI - Consumer Price Index

Measures of Central Tendency Mean - arithmetic average –µ, Population;, sample Median - midpoint of the distribution Mode - the value that occurs most often

Population Mean

Sample Mean

Measures of Dispersion or Spread Range Mean absolute deviation Variance Standard deviation

The Range as a Measure of Spread The range is the distance between the smallest and the largest value in the set. Range = largest value – smallest value

Deviation Scores The differences between each observation value and the mean:

Low Dispersion Value on Variable Frequency Low Dispersion Verses High Dispersion

Frequency High dispersion Value on Variable Low Dispersion Verses High Dispersion

Average Deviation

Mean Squared Deviation

The Variance

Variance

The variance is given in squared units The standard deviation is the square root of variance:

Sample Standard Deviation

The Normal Distribution Normal curve Bell shaped Almost all of its values are within plus or minus 3 standard deviations I.Q. is an example

2.14% 13.59% 34.13% 13.59% 2.14% Normal Distribution

Normal Curve: IQ Example

Standardized Normal Distribution Symetrical about its mean Mean identifies highest point Infinite number of cases - a continuous distribution Area under curve has a probability density = 1.0 Mean of zero, standard deviation of 1

Standard Normal Curve The curve is bell-shaped or symmetrical About 68% of the observations will fall within 1 standard deviation of the mean About 95% of the observations will fall within approximately 2 (1.96) standard deviations of the mean Almost all of the observations will fall within 3 standard deviations of the mean

z A Standardized Normal Curve

The Standardized Normal is the Distribution of Z –z+z

Standardized Scores

Standardized Values Used to compare an individual value to the population mean in units of the standard deviation

Linear Transformation of Any Normal Variable into a Standardized Normal Variable Sometimes the scale is stretched Sometimes the scale is shrunk    X