Descriptive Statistics & SPSS introduction


Descriptive Statistics & SPSS introduction Nan Yu COMM 420.8

Basic steps in conducting research Literature review Raise questions or make hypotheses Design a study Collect evidence (qualitative evidence or quantitative evidence) Analyze your evidence (data) Draw conclusions from the analysis

Tasks for the next part of your group project Collect evidence Analyze your data Results and Conclusion Prepare for the presentation

Data Analysis Requirement Understand statistics Be able to use statistical software Recommendation Take other stat classes or data analysis classes if you are truly interested in this area. This class will review very basic skills and principles about data analysis.

A review – levels of measurement Nominal/Categorical e.g. Gender, Race, Favorite Music Genre Ordinal In a typical month, how many movies do you rent on video? None 1-2 3-5 5-20 More than 20

Interval/Ratio I believe in love at first sight. Strongly Disagree Neutral Strongly Agree 1 2 3 4 5 On a typical day, how many hours of television do you watch?_______ How many siblings do you have?______

Text Book p.327, Table 14.1 Descriptive statistics --reduce and simplify the numbers to interpret the results Inferential statistics --judge whether what you observe in the sample can be generalized to the population from which the sample was drawn

Text Book p.327, Table 14.1 Nonparametric Nominal, Ordinal Parametric (continuous) Interval, Ratio

Text Book p.327, Table 14.1 Univariate One variable Bivariate Two variables Multivariate Three or more variables Note: Sometimes scholars combine bivariate and multivariate, so the distinction would be one variable (univariate) versus multiple variables (multivariate)

Descriptive Stats What to describe? What is the “center” of the data? How do the data vary? (variability)

Measures of central tendency Mean Mode Median

Mean Another name for the average. If describing a population, denoted as μ, the Greek letter “mu.” If describing a sample, denoted as x̄, called “x-bar.” “Balance point” or the “value center” of the distribution. Only use for interval/ratio.

Calculating the Mean Imagine that we have a population of 5 objects with heights of 2, 4, 6, 8, and 10 inches. X1 = 2 X2 = 4 X3 = 6 X4 = 8 X5 = 10 The mean of the population is the sum of all the numbers divided by the number of observations contributing to that sum: μ = (2 + 4 + 6 + 8 + 10) / 5 = 6
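The calculation on this slide can be checked with a few lines of Python. This is only a minimal sketch using the standard library; the variable names are illustrative and not part of the course materials.

```python
from statistics import mean

# Heights (in inches) of the five objects from the slide.
heights = [2, 4, 6, 8, 10]

# Mean = sum of all values divided by the number of values.
mu_by_hand = sum(heights) / len(heights)

# The standard library's mean() performs the same calculation.
mu = mean(heights)

print(mu_by_hand, mu)
```

Both approaches give a mean of 6 inches.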

Mode Most frequently occurring score in a distribution. One data set can have many modes. Appropriate for all types of data, but most useful for categorical data.

Mode Hometown Number of students Pennsylvania New York New Jersey Ohio Maryland Other States What is the mode in this case?

Median Middle score in a distribution (50% above, 50% below) Can use with ordinal and interval/ratio, but not with nominal

Median E.g., Scores: 4 9 2 2 1 8 10 9 7 Rank order them: 1 2 2 4 7 8 9 9 10 Median: 7 (the 5th of the 9 ranked scores)
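As a quick check of the median example, here is a minimal Python sketch using the standard library (Python 3.8 or later for `multimode`). It also reports the modes of the same scores, which illustrates that one data set can have more than one mode.

```python
from statistics import median, multimode

# Scores from the median slide.
scores = [4, 9, 2, 2, 1, 8, 10, 9, 7]

# Rank order the scores, then take the middle one.
ranked = sorted(scores)
middle = median(scores)

# multimode() returns every value tied for most frequent;
# sorting just makes the result order predictable.
modes = sorted(multimode(scores))

print(ranked)
print("median:", middle)
print("modes:", modes)
```

With nine scores the median is the fifth ranked value, 7. The values 2 and 9 each occur twice, so this set has two modes.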

Summary Choice of descriptive measures depends entirely on the level of measurement for a particular variable.

A normal distribution Symmetric, bell-shaped curve. Most values fall around the mean, but some values are smaller, and some are larger. mean median mode

But distributions are not always normal (p.345) Skewness: How far the peak is from the center of the distribution

What happens if the distributions are not normal? Impact of skewness on measures of central tendency Left (Negative) Skew: typically Mean < Median < Mode Right (Positive) Skew: typically Mode < Median < Mean
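To see how a long tail pulls the mean away from the median and mode, here is a small Python sketch. The data are hypothetical (made up for illustration), and the `skewness` helper is an assumed, simple moment-based formula; it is not necessarily the exact formula SPSS reports.

```python
from statistics import mean, median, mode, pstdev

# Hypothetical scores with a long right tail (one very high value).
values = [1, 2, 2, 2, 3, 3, 4, 5, 14]

def skewness(data):
    """Moment-based skewness: the average cubed standardized deviation."""
    m = mean(data)
    s = pstdev(data)
    return sum(((x - m) / s) ** 3 for x in data) / len(data)

center = (mode(values), median(values), mean(values))
g1 = skewness(values)

# Mirroring the data flips the tail to the left, so the sign flips too.
mirrored = [-x for x in values]
g1_mirrored = skewness(mirrored)

print("mode, median, mean:", center)
print("skewness:", g1, "mirrored:", g1_mirrored)
```

In this sample the mode (2) is below the median (3), which is below the mean (4), and the skewness is positive: a right (positive) skew. The mirrored data have a negative skewness, a left (negative) skew.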

Distribution Kurtosis (p.345) Normal (bell-shaped) peaked flat

Dispersion The mean, mode and median are not enough to understand the distribution of a variable.

Variability How variable are the scores in a distribution? Range Standard Deviation Variance

Range

Standard Deviation A measurement of variability that indicates how much all of the scores in the distribution typically deviate from the mean.

Calculating Standard Deviation

Variance
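The three measures of variability can be sketched in Python. This example reuses the five heights from the mean slide, since the original worked example is not reproduced in this transcript. It shows both the population versions (divide by N) and the sample versions (divide by n - 1); which one applies depends on whether the data are a whole population or a sample.

```python
from statistics import pstdev, pvariance, stdev, variance

# Heights (in inches) from the earlier mean example.
heights = [2, 4, 6, 8, 10]

# Range: highest score minus lowest score.
data_range = max(heights) - min(heights)

# Population variance: average squared deviation from the mean.
pop_var = pvariance(heights)
# Population standard deviation: square root of the variance.
pop_sd = pstdev(heights)

# Sample versions divide by n - 1 instead of N.
samp_var = variance(heights)
samp_sd = stdev(heights)

print("range:", data_range)
print("population variance / SD:", pop_var, pop_sd)
print("sample variance / SD:", samp_var, samp_sd)
```

The squared deviations from the mean of 6 are 16, 4, 0, 4, and 16, which sum to 40. Dividing by 5 gives a population variance of 8; dividing by 4 gives a sample variance of 10. The standard deviation is the square root of the variance in each case.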

What is the standard deviation for this distribution?

SPSS Statistical Package for the Social Sciences A list of other available statistical software, p.332 Please download the file “Class Survey” from ANGEL (week 9) and save it on your desktop

Introduction to SPSS Opening a data file STEP 1 Double click on the “class survey.sav” file. STEP 2 Select "Edit," then "Options" STEP 3 Select "Display names" and "File," then Click "OK"

Introduction to SPSS Variables Cases (People)

Variable Labels To See Value Labels Place cursor over variable name Click Value Label button

Sorting Cases Click Data -> Sort Cases Select Variable, Move to Right Box, Click on OK when ready.

Data Editor in Variable View Information About Each Variable Across Variables Listed Down Values

Frequencies/Descriptive Statistics Requesting frequencies Analyze -> Descriptive Statistics -> Frequencies

Select the variables you want to analyze, and use the arrow button to move them into the "Variables" box. Click "Statistics." Select the statistics you want, then click "Continue" and then "OK."
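For readers who want to see what a frequency table contains, here is a rough Python sketch of the same idea. It is not SPSS output and does not read the class survey file; the `hometowns` responses are hypothetical.

```python
from collections import Counter

# Hypothetical survey responses (not taken from the class survey file).
hometowns = [
    "Pennsylvania", "New York", "Pennsylvania", "Ohio",
    "New Jersey", "Pennsylvania", "New York", "Maryland",
]

counts = Counter(hometowns)
total = len(hometowns)

# One row per category: value, frequency, percent of all cases.
table = [
    (value, n, round(100 * n / total, 1))
    for value, n in counts.most_common()
]

for value, n, pct in table:
    print(f"{value:<14}{n:>5}{pct:>8}")
```

The first row of the table is the most frequent category, which is the mode of this hypothetical variable.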

Output Window You can double-click on the chart and edit the appearance… You can copy and paste the chart into other applications... Notice the new window...

Graphs A. Select "Graphs" then "Histogram" B. Put "tvhours" in the variable box. C. Select "Display normal curve" D. Select "OK"

In Your Output Window You can double-click on the graph and edit the appearance. You can copy and paste the graph into other applications.

In Your Output Window As new procedures are run, they are appended to the output and added to the outline on the left of the screen. You can navigate around your output by selecting from the outline menu. You can also delete any output by selecting it and then clicking the delete key.

Entering Your Own Data A. Open Data (Type in Data)

B. Go to “Variable View” and name and label variables Variable Names Listed Down (No spaces) Label for the Variables Labels for the Values Values Dialog Box: Click “Add” after each new value label that you include.

C. Enter data in “Data View” Cell Editor Case Cell 1. Click on a given cell where you want to add data. 2. Type the value you want to include in that cell. The value will appear in the cell editor. 3. The TAB key will take you to the next variable in the same case. 4. The ENTER (return) key will take you to the next case.

D. Save your data after each case I lost all my data because I forgot to save it. Oh-my-gosh! I think I forgot to save the data! Ha ha ha. Too bad for them. My data is safe!

Saving Your Data and Output Extensions Used for File Types Data Files: .sav e.g., "mydata.sav" Output Files: .spo e.g., "myoutput.spo"

Saving Your Data File A. From within the Data Editor, select "File" then "Save as" B. Select your floppy disk. C. Type in the name of your data file (with the extension .sav). D. Select "Save." Note: After you have saved your data for the first time, you can save it periodically by selecting "Save" rather than "Save as."

Saving Your Output File A. From within the Output window, select "File" then "Save as" B. Select your floppy disk. C. Type in the name of your output (with the extension .spo). D. Select "Save." Note: After you have saved your output for the first time, you can save it periodically by selecting "Save" rather than "Save as."

Z-scores If two measures were used to measure the liking toward “Nan Yu” not likable 1 2 3 4 5 likable not nice 1 2 3 4 5 6 7 nice The raw scores are not comparable because they are measured differently.

So we need to standardize the scores… What we do is translate each individual score into a standard score (z-score): subtract the mean, then divide by the standard deviation. The transformed scores will necessarily have a mean of zero and a standard deviation of one. The standard score indicates how many standard deviations an observation is above or below the mean.
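Standardizing can be sketched in Python as follows. The ratings are hypothetical, and the `z_scores` helper is illustrative. This sketch divides by the sample standard deviation (n - 1), which is an assumption about which standard deviation to use.

```python
from statistics import mean, stdev

def z_scores(raw):
    """Convert raw scores to z-scores: (score - mean) / standard deviation."""
    m = mean(raw)
    s = stdev(raw)  # assumption: sample standard deviation (n - 1)
    return [(x - m) / s for x in raw]

# Hypothetical ratings of the same person on two different scales.
likable = [2, 3, 3, 4, 5]   # 1-5 scale: not likable ... likable
nice = [3, 4, 5, 6, 7]      # 1-7 scale: not nice ... nice

z_likable = z_scores(likable)
z_nice = z_scores(nice)

# After standardizing, both sets have mean 0 and standard deviation 1,
# so scores from the two scales can be compared directly.
print([round(z, 2) for z in z_likable])
print([round(z, 2) for z in z_nice])
```

A z-score of 1.0 on either scale means the rating is one standard deviation above that scale's mean, regardless of how many points the scale has.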

Z-score axis (in standard deviations from the mean): -3 -2 -1 0 1 2 3

Requesting Z-scores from SPSS Open Class Survey.sav Go to Analyze -> Descriptive Statistics -> Descriptives

Z-scores in SPSS A. Put "tvpm" in the variable box. B. Check "Save standardized values as variables," then click "OK"

Z-scores in SPSS

Practice Request z-scores for “tvsit.” Sort “Ztvsit” in descending order.

In-Class Demo Run frequencies on “wwwbuy.” Request the mean, median, mode, standard deviation, range, and skewness, and a histogram with normal curve. Does it look like a normal distribution? Based on the reported mean, median and mode, explain the skewness of the distribution of “wwwbuy.”

In-class Demo answer You can compare your SPSS output with the one on ANGEL (class demo answer.spo) No, it doesn’t look like a normal curve. The mean is higher than the median and the mode. So it is a positive (right) skew.