1 SPSS 202: Linear and Logistic Regression Using SPSS (Workshop) Dr. Daisy Dai Department of Medical Research.

Slides:



Advertisements
Similar presentations
Introduction to VistaPHw Charting Function
Advertisements

1 SPSS Tutorial 101: Import, Merge and Save Data Sets Dr. Daisy Dai Department of Medical Research.
Advance to next slide1 Interactive Introduction to SPSS Statistical Software Elizabeth Bigham, Ph.D. California State University San Marcos May
Refresher Instruction Guide Strategic Planning and Assessment Module
Correlation and Linear Regression.
Fundamental Features of Graphs All graphs have two, clearly-labeled axes that are drawn at a right angle. –The horizontal axis is the abscissa, or X-axis.
1 Using SPSS: Descriptive Statistics Department of Operations Weatherhead School of Management.
Copyright © Allyn & Bacon (2007) Using SPSS for Windows Graziano and Raulin Research Methods This multimedia product and its contents are protected under.
WINKS SDA Statistical Data Analysis (Windows Kwikstat) Getting Started Guide.
Project #3 by Daiva Kuncaite Problem 31 (p. 190)
Examining Relationship of Variables  Response (dependent) variable - measures the outcome of a study.  Explanatory (Independent) variable - explains.
A Simple Guide to Using SPSS© for Windows
Nemours Biomedical Research Statistics April 2, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Additional HW Exercise (a) The breaking strength in pounds of force is being studied for two metal alloys, labeled S and T, and for different temperatures.
RESEARCH STATISTICS Jobayer Hossain Larry Holmes, Jr November 6, 2008 Examining Relationship of Variables.
SW318 Social Work Statistics Slide 1 Using SPSS for Graphic Presentation  Various Graphics in SPSS  Pie chart  Bar chart  Histogram  Area chart 
SPSS 1: An Introduction to the Statistical Package SPSS Suzie Cro MRC Clinical Trials Unit.
SPSS Statistical Package for the Social Sciences is a statistical analysis and data management software package. SPSS can take data from almost any type.
Correlation in SPSS. Correlation in SPSS is very simple and efficient  Step 1. Switch to Data View to make sure the data isn’t corrupted or changed.
Introduction to SPSS Short Courses Last created (Feb, 2008) Kentaka Aruga.
SPSS 202: Data Management by SPSS (Workshop) Dr. Daisy Dai Department of Medical Research 1.
Srinivasulu Rajendran Centre for the Study of Regional Development (CSRD) Jawaharlal Nehru University (JNU) New Delhi India
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Mann-Whitney U Test PowerPoint Prepared by Alfred.
Guide to Using Excel For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 6th Ed. Chapter 14: Multiple Regression.
Two-Way Analysis of Variance STAT E-150 Statistical Methods.
FEBRUARY, 2013 BY: ABDUL-RAUF A TRAINING WORKSHOP ON STATISTICAL AND PRESENTATIONAL SYSTEM SOFTWARE (SPSS) 18.0 WINDOWS.
Introduction to SPSS (For SPSS Version 16.0)
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Slide 1 SOLVING THE HOMEWORK PROBLEMS Simple linear regression is an appropriate model of the relationship between two quantitative variables provided.
Practical statistics for Neuroscience miniprojects Steven Kiddle Slides & data :
DISCLAIMER This guide is meant to walk you through the physical process of graphing and regression in Excel…. not to describe when and why you might want.
WINKS 7 Tutorial 5 Tutorial 5 – Creating a data set and entering data (Comparing Two Means, t-test) Permission granted for use for instruction and for.
How to Analyze Data? Aravinda Guntupalli. SPSS windows process Data window Variable view window Output window Chart editor window.
Fundamental Statistics in Applied Linguistics Research Spring 2010 Weekend MA Program on Applied English Dr. Da-Fu Huang.
Advance to next slide1 Set Up Module Section 1. Advance to next slide2 Interactive Introduction to SPSS Statistical Software Elizabeth Bigham, Ph.D. California.
LINDSEY BREWER CSSCR (CENTER FOR SOCIAL SCIENCE COMPUTATION AND RESEARCH) UNIVERSITY OF WASHINGTON September 17, 2009 Introduction to SPSS (Version 16)
SPSS Presented by Chabalala Chabalala Lebohang Kompi Balone Ndaba.
1 1 Slide Simple Linear Regression Part A n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n.
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Using SPSS for Windows Part II Jie Chen Ph.D. Phone: /6/20151.
McGraw-Hill/Irwin The Interactive Computing Series © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Microsoft Excel 2002 Lesson 1 Introduction.
BOLD 2.0 Navigation Help Guide Note: BOLD will be inaccessible from 9:00 pm ET on Friday, June 1, to 7:00 am ET on Monday, June 4, so that the upgrade.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Entering Data Manually PowerPoint Prepared by.
SPSS Basics and Applications Workshop: Introduction to Statistics Using SPSS.
Introduction to SPSS. Object of the class About the windows in SPSS The basics of managing data files The basic analysis in SPSS.
WINKS 7 Tutorial 7 – Advanced Topic: Labels and Formats Permission granted for use for instruction and for personal use. © Alan C. Elliott,
11/4/2015Slide 1 SOLVING THE PROBLEM Simple linear regression is an appropriate model of the relationship between two quantitative variables provided the.
1 An Introduction to SPSS for Windows Jie Chen Ph.D. 6/4/20161.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Within Subjects Analysis of Variance PowerPoint.
Dr. Engr. Sami ur Rahman Research Methods in Computer Science Lecture: Data Analysis (Introduction to SPSS)
SPSS Instructions for Introduction to Biostatistics Larry Winner Department of Statistics University of Florida.
Basics of Biostatistics for Health Research Session 1 – February 7 th, 2013 Dr. Scott Patten, Professor of Epidemiology Department of Community Health.
Statistical Analysis using SPSS Dr.Shaikh Shaffi Ahamed Asst. Professor Dept. of Family & Community Medicine.
Mr. Magdi Morsi Statistician Department of Research and Studies, MOH
Conduct Simple Correlations Section 7. Correlation –A Pearson correlation analyzes relationships between parametric, linear (interval or ratio which are.
Outline of Today’s Discussion 1.Practice in SPSS: Scatter Plots 2.Practice in SPSS: Correlations 3.Spearman’s Rho.
1 PEER Session 02/04/15. 2  Multiple good data management software options exist – quantitative (e.g., SPSS), qualitative (e.g, atlas.ti), mixed (e.g.,
Analyzing Data. Learning Objectives You will learn to: – Import from excel – Add, move, recode, label, and compute variables – Perform descriptive analyses.
IENG-385 Statistical Methods for Engineers SPSS (Statistical package for social science) LAB # 1 (An Introduction to SPSS)
Introduction to SPSS Review of Concepts (stats and scales) Data entry (the workspace and labels) – By hand – Import Excel Running an analysis-
Logging Into Windows XP for first time (labs only!)
SPSS: Using statistical software — a primer
Introduction to SPSS.
By Dr. Madhukar H. Dalvi Nagindas Khandwala college
DEPARTMENT OF COMPUTER SCIENCE
Social Science Research Design and Statistics, 2/e Alfred P
LINDSEY BREWER CSSCR (CENTER FOR SOCIAL SCIENCE COMPUTATION AND RESEARCH) UNIVERSITY OF WASHINGTON September 17, 2009 Introduction to SPSS (Version 16)
Statistical Analysis using SPSS
ACE Secure Data Portal - Accounts Tab - Statements
Performing the Runs Test Using SPSS
Presentation transcript:

1 SPSS 202: Linear and Logistic Regression Using SPSS (Workshop) Dr. Daisy Dai Department of Medical Research

2 Contents Correlation (Pearson, Spearman, r-square) Scatter Plot and Trending Simple regression Multiple regression Logistic regression

3 Introduction to SPSS

4 What is SPSS? Statistical software. CMH has 10 server licenses. SPSS 18.

5 SPSS Data Entry SPSS data can be entered manually. –The format is ready for analysis. SAS, Excel, txt, etc. data can be easily imported to SPSS. SPSS data files are saved as “SPSS data document (.sav)”. SPSS output files are saved as “SPSS viewer document (.spv)”.

6 SPSS Data Entry SPSS has a few unique features in data entry. –Categorical variables need to be coded. For instance, code male as 1 and female as 0 or vice versa. –When you have two treatments, test and control, please use 1 for test and 0 for control. –Categorical variables that are not coded in other sourced data files will not be imported or analyzed properly in SPSS. –Continuous variables don’t need coding. –Missing values needs to be defined in “variable view” page.

7 Log in SPSS CMH offers server version SPSS 18. Any employee can log in SPSS from your employee account. Go to Start ->Program ->Accessories -> Remote Desktop Connection

8 Log in SPSS In the prompted connection window, enter cmhterm. Click Connect.

9 Log in SPSS In the Log On Window, enter your cmh user name and password. Choose log on to CMH Click OK.

10 Scatter Plot

11 Data Set 1: Anemia in Women A survey was conduct to a sample of 20 anemia women, randomly selected from a pre-defined geographical area. The participants had a blood sample taken and their hemoglobin (Hb) level and packed cell volume (PCV) measured. They were also asked their age, and whether or not they had experienced the menopause. The goals of the study were to determine whether Hb affects PCV or the other way around or whether Hb was associated with age.

12 Data Subject NumberHb (g/dl)PCV (%)Age (years)Menopause0 = No, 1 = Yes

13 Tasks Import data View and modify data Scatter plot and trending Save the results.

14 Task 1: Import Data Double click spss 18 icon on the screen. Click File -> Open -> Data. Click OK.

15 Task 1: Import Anemia Data Select the folder where data saved. (Note: since SPSS is in the server, we need to save files in the net work drive. I usually save files to my account in U drive. ) Enter file name. Select file type. (SPSS data file is in.sav format. SPSS can open excel or many other files, please make sure you choose the right file type). Click Open.

16 Task 2: View and Modify Data Now the data is open. There are two tabs on the screen, data view tab and variable view tab. We can read the data in “data View” tab.

17 Task 2: View and Modify Data We can define data structure including variable name, label, etc. in “Variable View” tab. Note the categorical variable, menopause, needs to be coded in the values column. Enter 0 for No and 1 for Yes.

18 Task 3: Scatter Plot Generate scatter plot between Hb and PCV. Please list the dependent (outcome) variable in y-axis and the independent (explanatory) variable in x-axis.

19 Task 3: Scatter Plot Click Graphs -> Legacy Dialogs -> Scatter/Dot -> Simple Scatter -> Define

20 Task 3: Scatter Plot All variables in the Anemia data set is listed in the left panel Select the variables for y axis and x-axis by first clicking the variable in the left panel and then clicking the arrow and the corresponding spot in the right panel. The marker variable is optional. If you want to label the subjects, then choose the corresponding variable as the marker. For instance, you can label the subjects by ID or by menopause. Here we choose menopause. Click OK.

21 Task 3: Scatter Plot One can double click the figure to prompt Chart Editor. Click on the fitted line icon located in the middle of last line of tool bar. We can also edit font or add in text box. Close Chart Editor.

22 Task 3: Scatter Plot One can fit linear, quadratic or cubic trend along with confidence interval to the data. Loess (local regression) is an useful tool to fit non- linear and irregular data.

23 Task 4: Save output One can save SPSS output file by clicking file - > save as to generate a viewer file in. apv format. This file can be edited by SPSS in the future. Or one can export the figure by right click and export to a word or pdf document. This file is permanent without revision to figures.

24 Practice Generate scatter plot between Hb and age. Fit an appropriate trend. Use Chart Editor to edit the font and add text when needed. Save the figure. Interpret the scatter plot and r-square.

25 Correlation

26 Tasks Continue with anemia data. Determine Pearson and Spearman correlation among the continuous variables Interpret results

27 Correlation On the data page, click Analyze-> Correlat-> Bivariate

28 Correlation We have three continuous variables in the data sets: Hb, PCV and Age. Select these three variables and check Pearson (parametric) and Spearman (non-parametric). Check two-tailed for conservative analysis And flag significant correlations. Click OK.

29 Pearson and Spearman Correlation Nonparametric correlations are based on ranks of data and it can be applied when data does not follow normality assumption (skewed) or outlier exists.

30 Simple Regression

31 Tasks Continue using anemia data. Perform simple linear regression. Determine the fitted regression model.

32 Simple Regression Click Analyze-> Regression -> Linear

33 Regression Select Hb as Dependent variable (y axis). Select PCV as Independent variable (x axis). In this case, PCV is the explanatory variable and Hb is the outcome variable. In other words, we investigate how PCV will impact Hb. Click Ok.

34 Hb= *PCV When PCV=35, the predicted Hb= *35=12.8 PCV is significantly associated with Hb with p-value=0.001

35 Multiple Regression

36 Tasks Continue with anemia data. Consider PCV and age as two risk factors associated with Hb.

37 Multiple Regression Click Analyze-> Regression -> Linear

38 Multiple Regression Select Hb as Dependent variable. Select PCV and age as Independent variable. Click Ok.

39 Multiple Regression One can choose more functions in the statistics tab. Click Continue

40

41 Hb= *PCV+0.11*age When PCV=35 and age=20, the predicted Hb= * *20=10.8 The observed Hb is PCV is significantly associated with Hb with p-value=0.008 after taking age into account.

42 Logistic Regression

43 Case Study 2: Relapse Rate in AML One hundred and two patients with acute myelogenous leukemia (AML) in remission were enrolled in a study of a new antisense oligonucleotide (asODN). The patients were randomly assigned to receive a 10-day infusion of asODN or no treatment (Control), and the effects were followed for 90 days. The time of remission from diagnosis or prior relapse (X, in months) at study enrollment was considered an important covariate in predicating response. The response data are shown in next page with Y=1 indicating relapse, death, or major intervention, such as bone marrow transplant before Day 90. Is there any evidence that administration of asODN is associated with a decreased relapse rate?

p. 323

45 Task: Logistic Regression Click Analyze- >Regression -> Binary Logistic

46 Task: Logistic Regression Select Relapse as dependent variable. Select time and treatment as covariate. One can also add in time by treatment interaction. Since treatment is binary, click categorical tab and select treatment as categorical covariates.

47

48 Questions?

49 Thank You For more information, visit my website aspx?id=9740 Or go to Scope ->Research -> Medical Research -> Statistics

50 References Medical Statistics by Campbell et al. Introductory Statistics by Neil Weiss Common Statistical Methods for Clinical Research by Walker