SPSS Workshop Day 2 – Data Analysis. Outline Descriptive Statistics Types of data Graphical Summaries –For Categorical Variables –For Quantitative Variables.

Slides:



Advertisements
Similar presentations
Statistics for the Social Sciences Psychology 340 Fall 2006 Distributions.
Advertisements

Displaying Data Objectives: Students should know the typical graphical displays for the different types of variables. Students should understand how frequency.
STATISTICAL ANALYSIS. Your introduction to statistics should not be like drinking water from a fire hose!!
Describing Quantitative Variables
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
1 Frequency Distributions & Graphing Nomenclature  Frequency: number of cases or subjects or occurrences  represented with f  i.e. f = 12 for a score.
EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES
IB Math Studies – Topic 6 Statistics.
Copyright ©2011 Brooks/Cole, Cengage Learning More about Inference for Categorical Variables Chapter 15 1.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Categorical Variables Chapter 15.
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
Ch. 2: The Art of Presenting Data Data in raw form are usually not easy to use for decision making. Some type of organization is needed Table and Graph.
Chapter 2 Graphs, Charts, and Tables – Describing Your Data
Organizing Information Pictorially Using Charts and Graphs
Organization and description of data
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Descriptive statistics (Part I)
Examining Univariate Distributions Chapter 2 SHARON LAWNER WEINBERG SARAH KNAPP ABRAMOWITZ StatisticsSPSS An Integrative Approach SECOND EDITION Using.
Inferential Statistics: SPSS
PY550 Research and Statistics Dr. Mary Alberici Central Methodist University.
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
Objectives (BPS chapter 1)
Objective To understand measures of central tendency and use them to analyze data.
Let’s Review for… AP Statistics!!! Chapter 1 Review Frank Cerros Xinlei Du Claire Dubois Ryan Hoshi.
Tutor: Prof. A. Taleb-Bendiab Contact: Telephone: +44 (0) CMPDLLM002 Research Methods Lecture 9: Quantitative.
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
Statistics 3502/6304 Prof. Eric A. Suess Chapter 3.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
Chapter 15 Data Analysis: Testing for Significant Differences.
 Frequency Distribution is a statistical technique to explore the underlying patterns of raw data.  Preparing frequency distribution tables, we can.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
2 Categorical Variables (frequencies) Testing mean differences of a continuous variable between groups (categorical variable) 2 Continuous Variables 2.
Chapter 2 Describing Data.
VCE Further Maths Chapter Two-Bivariate Data \\Servernas\Year 12\Staff Year 12\LI Further Maths.
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
CADA Final Review Assessment –Continuous assessment (10%) –Mini-project (20%) –Mid-test (20%) –Final Examination (50%) 40% from Part 1 & 2 60% from Part.
1 An Introduction to SPSS for Windows Jie Chen Ph.D. 6/4/20161.
The Statistical Analysis of Data. Outline I. Types of Data A. Qualitative B. Quantitative C. Independent vs Dependent variables II. Descriptive Statistics.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 2-2 Frequency Distributions.
SPSS Instructions for Introduction to Biostatistics Larry Winner Department of Statistics University of Florida.
CHI SQUARE TESTS.
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
Displaying Distributions with Graphs. the science of collecting, analyzing, and drawing conclusions from data.
1 Chapter 2: Exploring Data with Graphs and Numerical Summaries Section 2.1: What Are the Types of Data?
Applied Quantitative Analysis and Practices
Week111 The t distribution Suppose that a SRS of size n is drawn from a N(μ, σ) population. Then the one sample t statistic has a t distribution with n.
Lecture 2 Frequency Distribution, Cross-Tabulation, and Hypothesis Testing.
Chapter Eight: Using Statistics to Answer Questions.
Statistical Analysis using SPSS Dr.Shaikh Shaffi Ahamed Asst. Professor Dept. of Family & Community Medicine.
Mr. Magdi Morsi Statistician Department of Research and Studies, MOH
UNIT #1 CHAPTERS BY JEREMY GREEN, ADAM PAQUETTEY, AND MATT STAUB.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Descriptive Statistics Unit 6. Variable Any characteristic (data) recorded for the subjects of a study ex. blood pressure, nesting orientation, phytoplankton.
1 Take a challenge with time; never let time idles away aimlessly.
Chapter 5: Organizing and Displaying Data. Learning Objectives Demonstrate techniques for showing data in graphical presentation formats Choose the best.
Graphs with SPSS Aravinda Guntupalli. Bar charts  Bar Charts are used for graphical representation of Nominal and Ordinal data  Height of the bar is.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Descriptive Statistics
Prof. Eric A. Suess Chapter 3
Exploratory Data Analysis
Chapter 2: Methods for Describing Data Sets
Looking at data Visualization tools.
Laugh, and the world laughs with you. Weep and you weep alone
CHAPTER 1 Exploring Data
Hypothesis Testing and Comparing Two Proportions
Statistical Analysis using SPSS
Chapter Nine: Using Statistics to Answer Questions
Chapter 18: The Chi-Square Statistic
Presentation transcript:

SPSS Workshop Day 2 – Data Analysis

Outline Descriptive Statistics Types of data Graphical Summaries –For Categorical Variables –For Quantitative Variables Contingency Tables Hypothesis Testing –One Sample t-test –Two Sample t-test Sample Size/Power Analysis

Descriptive Statistics 5-number summary –Minimum- minimum value in your dataset –Q1- 25th percentile (25% of the data is below this value) –Median- middle value of your data (50th percentile: 50% of the data is below this value) –Q3- 75th percentile (75% of the data is below this value) –Maximum- maximum value in your dataset Mean- average value of your all your data points Standard deviation- the average distance each observation falls from the mean Variance- average of the squared deviations; explains the variation of the data about the mean

To SPSS: Open gssnet.sav ->Analyze->Descriptive Statistics ->Descriptives ->Analyze->Descriptive Statistics->Frequencies (you can get more descriptive statistics here also)

Types of Data Variable- any characteristic that is recorded for subjects in a study –Categorical- if each observation belongs to one of a set of categories –Quantitative- if observations on it take numerical values that represent different magnitudes of the variable Discrete- if its possible values form a set of separate numbers, such as 0, 1, 2, … Continuous- if its possible values form an interval

Other Valuable Terminology Parameter- a numerical summary of the population Statistic- a numerical summary of a sample taken from the population Frequency table- a listing of possible values for a variable, together with the number of observations for each value –Relative frequency- proportions and percentages

Graphical Summaries for Categorical Variables Pie chart- a circle having a “slice of the pie” for each category. The size of a slice corresponds to the percentage of observations in the category Bar chart- displays a vertical bar for each category. The height of the bar is the percentage of observations in the category

To SPSS: Still in gssnet.sav For the pie chart: ->Graphs->Pie->Summaries of groups of cases->Define slices by netcat->Click OK For labels: ->Double click on the chart ->Elements->Show data labels->choose labels

SPSS continued For the bar chart: ->Graphs->Bar->Simple ->Category axis: netcat Again, we can choose which labels to appear on the chart by double clicking.

Graphical Summaries for Quantitative Variables Dot plot- shows a dot for each observation, placed just above the value on the number line for that observation. Stem-and-Leaf Plot- each observation is represented by a stem and a leaf. Usually the stem consists of all digits except the final one, which is the leaf. Histogram- a graph that uses bars to portray the frequencies or the relative frequencies of the possible outcomes. Scatterplot- display for two variables. It uses the horizontal axis for the explanatory variable (x) and the vertical axis for the response variable (y).

To SPSS: Open marathon.sav Histogram: ->Analyze->Descriptive Statistics->Frequencies ->Charts->Histogram (you can also put a normal curve on the histogram to see how the shape of your data compares to the normal distribution)

SPSS continued: Scatterplots: ->Graphs->Scatter/dot.. ->Simple Scatter->Define ->Choose (continuous) variables

Other Useful Plots Time plot- charts each observation, on the vertical scale, against the time it was measured, on the horizontal scale Box plot- constructed from the 5-number summary

To SPSS: Box plots: ->Graphs->Boxplot->Simple ->variable (continuous) ->category axis (categorical) (You can also use boxplots in order to visually compare different groups on a quantitative variable, i.e. age by gender)

Contingency Tables/Cross Tabs A contingency table is a display for two categorical variables. Its rows list the categories of one variable and its columns list the categories of the other variable. Each entry in the table is the frequency of cases in the sample with certain outcomes of the two variables The process of taking a data file and finding the frequencies for the cells of a contingency table is referred to as cross-tabulation of the data

Example 2 x 2 contingency table: Binge Drinking by Gender Binge Drinker Non- binge Drinker Total Male Female Total

Chi-squared Test for Independence The chi-squared test is a hypothesis test to see whether two categorical variables are independent of one another. We will look to see if the p-value <.05 (Reject the null hypothesis) If so, then our variables are not independent of one another

To SPSS: ->Analyze->Descriptive Statistics->Crosstabs You can also request a chi- squared test for independence: ->Click on Statistics ->Check Chi-square

Interpreting P-values We compare the calculated p-value to a pre-specified value (usually.05), if the calculated p- value is less than.05 then there is significant evidence to reject the null hypothesis.

One-sample t-test Does the population mean differ from hypothesized value? –Different alternative hypotheses (SPSS only does two-sided hypothesis test)

Examples Does anorexia therapy induce a positive mean weight change ? Is the amount of Coke dispensed into a can 12 oz.? Do radio advertisements increase the average daily sales of hamburgers?

To SPSS: Is the mean age of marathon runners greater than 30? ->Analyze->Compare means ->One sample t-test ->test value = 30

Interpreting the p-value With a p-value less than.05, there is a significant difference between the mean age of our sample and the specified test value of 30.

Two-sample t-test (Independent samples) Does one population mean differ from another population mean? –Different alternative hypotheses

Examples Do women tend to spend more time on housework than men? Do men and women watch the same amount of television in a day?

To SPSS: Are the male runners older than the female runners? ->Analyze->Independent Samples t-test ->test variable (continuous) ->grouping variable (categorical)

Interpreting the p-value With a p-value less than.05, there is a significant difference between the mean completion time for males and females.

Paired t-test (matched pairs/dependent samples) Does the population mean change for two different treatments (before & after)? –Different alternative hypotheses

Examples Does the use of a cell phone impact driver reaction time? (matched pairs) Does exercise help blood pressure? (before & after)

To SPSS: Open endorph.sav Do the beta endorphin levels differ before and after running a half- marathon? ->Analyze->Compare means ->Paired samples t-test ->Paired variables (before & after)

Interpreting the p-value With a p-value less than.05, there is a significant difference between beta endorphin levels before and after running a half- marathon.

Determining Sample Size Power- the ability to reject the null hypothesis when it is false –If a certain level of power is desired, use power analysis to determine the required sample size