Public Health 5415 Biostatistical Methods II Spring 2005 Greg Grandits 612-626-9033 Class Times Monday10:10am-12:05pm Wednesday10:10am-11:00am.

Slides:



Advertisements
Similar presentations
Statistical Methods Lynne Stokes Department of Statistical Science Lecture 7: Introduction to SAS Programming Language.
Advertisements

Slide C.1 SAS MathematicalMarketing Appendix C: SAS Software Uses of SAS  CRM  datamining  data warehousing  linear programming  forecasting  econometrics.
Statistics in Science  Introducing SAS ® software Acknowlegements to David Williams Caroline Brophy.
I OWA S TATE U NIVERSITY Department of Animal Science Getting Started Using SAS Software Animal Science 500 Lecture No. 2.
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Today: Run SAS programs on Saturn (UNIX tutorial) Runs SAS programs on the PC.
Biostatistical Methods II PubH 6415 Spring PubH 6415 – Biostatistics I Instructor: Susan Telke (office hours: lecture.
Module 2: Data analysis key concepts. Module 2: Learning objectives  Understand the definition and purpose of data analysis  Define statistical and.
Categorical Data Analysis using SAS. 2 List the components of a SAS program. Open an existing SAS program and run it. Discuss the Chi Square Test of Independence.
SAS ® Regression Essentials. 2 List the components of a SAS program. Open an existing SAS program and run it. Objectives.
Welcome to the Exciting World of ! Lessons to familiarize yourself with.
1 SAS SAS is a statistics software package developed by SAS Institute Inc. in U.S.A. SAS products include SAS/STAT, SAS/IML, SAS/OR, etc. The most.
SAS for Categorical Data Copyright © 2004 Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright.
EPLS Lab Software Orientation SAS. Orientation Overview Background Getting Started SAS Windows.
SAS Workshop INTRODUCTORY ASPECTS SPRING 2012 January 20121K. F. O'Brien.
Understanding SAS Data Step Processing Alan C. Elliott stattutorials.com.
Introduction to SAS Essentials Mastering SAS for Data Analytics Alan Elliott and Wayne Woodward SAS Essentials - Elliott & Woodward1.
SAS ® ANOVA Essentials. 2 List the components of a SAS program. Open an existing SAS program and run it. Objectives.
Introduction To Correlation with SAS Sam Gordji Weir 107.
Data Preparation for Analytics Using SAS Gerhard Svolba, Ph.D. Reviewed by Madera Ebby, Ph.D.
Welcome to SAS…Session..!. What is SAS..! A Complete programming language with report formatting with statistical and mathematical capabilities.
Topic 1: Class Logistics. Outline Class Web site Class policies Overview References Software Background Reading.
SAS Workshop Lecture 1 Lecturer: Annie N. Simpson, MSc.
Chapter 9 Producing Descriptive Statistics PROC MEANS; Summarize descriptive statistics for continuous numeric variables. PROC FREQ; Summarize frequency.
Introduction to SAS Essentials Mastering SAS for Data Analytics Alan Elliott and Wayne Woodward SAS ESSENTIALS -- Elliott & Woodward1.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Introduction to SAS BIO 226 – Spring Outline Windows and common rules Getting the data –The PRINT and CONTENT Procedures Manipulating the data.
1 Experimental Statistics - week 4 Chapter 8: 1-factor ANOVA models Using SAS.
April 11 Logistic Regression –Modeling interactions –Analysis of case-control studies –Data presentation.
Introduction to SAS. What is SAS? SAS originally stood for “Statistical Analysis System”. SAS is a computer software system that provides all the tools.
1 Experimental Statistics - week 2 Review: 2-sample t-tests paired t-tests Thursday: Meet in 15 Clements!! Bring Cody and Smith book.
Lesson 5 - Topics Formatting Output Working with Dates Reading: LSB:3:8-9; 4:1,5-7; 5:1-4.
Math 3400 Computer Applications of Statistics Lecture 1 Introduction and SAS Overview.
SAS Macro: Some Tips for Debugging Stat St. Paul’s Hospital April 2, 2007.
SAS 介绍和举例 Presented by 经济实验教学中心 商务数据挖掘中心. Raw Data Read in Data Process Data (Create new variables) Output Data (Create SAS Dataset) Analyze Data Using.
BMTRY 789 Introduction to SAS Programming Lecturer: Annie N. Simpson, MSc.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
2 December 2004PubH8420: Parametric Regression Models Slide 1 Applications - SAS Parametric Regression in SAS –PROC LIFEREG –PROC GENMOD –PROC LOGISTIC.
Lesson 2 Topic - Reading in data Chapter 2 (Little SAS Book)
ISU Basic SAS commands Laboratory No. 1 Computer Techniques for Biological Research Animal Science 500 Ken Stalder, Professor Department of Animal Science.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Today - Messages Additional shared lab hours in A-269 –M, W, F 2:30-4:25 –T, Th 4:00-5:15 First priority is for PH5452. No TA or instructor Handouts –
1 EPIB 698E Lecture 1 Notes Instructor: Raul Cruz 7/9/13.
Lesson 6 - Topics Reading SAS datasets Subsetting SAS datasets Merging SAS datasets.
Lecture 3 Topic - Descriptive Procedures Programs 3-4 LSB 4:1-4.4; 4:9:4:11; 8:1-8:5; 5:1-5.2.
Basics of Biostatistics for Health Research Session 1 – February 7 th, 2013 Dr. Scott Patten, Professor of Epidemiology Department of Community Health.
Chapter 1: Overview of SAS System Basic Concepts of SAS System.
Lecture 4 Ways to get data into SAS Some practice programming
Computing with SAS Software A SAS program consists of SAS statements. 1. The DATA step consists of SAS statements that define your data and create a SAS.
FORMAT statements can be used to change the look of your output –if FORMAT is in the DATA step, then the formats are permanent and stored with the dataset.
Lesson 2 Topic - Reading in data Programs 1 and 2 in course notes –Chapter 2 (Little SAS Book)
1 EPIB 698C Lecture 1 Instructor: Raul Cruz-Cano
SAS Programming Training Instructor:Greg Grandits TA: Textbooks:The Little SAS Book, 5th Edition Applied Statistics and the SAS Programming Language, 5.
Four way analysis Nursing home residence Gender Age Death.
Based on Learning SAS by Example: A Programmer’s Guide Chapters 1 & 2
Online Programming| Online Training| Real Time Projects | Certifications |Online Classes| Corporate Training |Jobs| CONTACT US: STANSYS SOFTWARE SOLUTIONS.
The Urban Institute - SAS Training6/9/20161 SAS Training This SAS Training Course was designed to introduce users at The Urban Institute to SAS programming.
SAS ® 101 Based on Learning SAS by Example: A Programmer’s Guide Chapters 16 & 17 By Tasha Chapman, Oregon Health Authority.
SAS ® 101 Based on Learning SAS by Example: A Programmer’s Guide Chapters 5 & 6 By Ravi Mandal.
SAS ® 101 Based on Learning SAS by Example: A Programmer’s Guide Chapters 3 & 4 By Tasha Chapman, Oregon Health Authority.
Applied Business Forecasting and Regression Analysis
April 18 Intro to survival analysis Le 11.1 – 11.2
PubH 6420 Introduction to SAS Programming
SAS Programming Training
SAS Programming Training
Introduction to DATA Step Programming: SAS Basics II
SAS Programming Training
STAT 515 Statistical Methods I Lecture 1 August 22, 2019 Originally prepared by Brian Habing Department of Statistics University of South Carolina.
Presentation transcript:

Public Health 5415 Biostatistical Methods II Spring 2005 Greg Grandits Class Times Monday10:10am-12:05pm Wednesday10:10am-11:00am

Course objectives: Write and run simple SAS programs to perform common analyses. Analyze health science data using basic statistical and inferential techniques. Understand statistical methods as commonly presented in public health literature

Topics Covered Linear regression Logistic regression Life-table analyses Cox regression Relative risk, odds ratio, hazard ratio estimation SAS programming to do above analyses

SAS Usage SAS is the worlds largest privately held software company 40,000 customer sites worldwide 3.5 million users worldwide 90% of Fortune 500 companies use SAS Nearly all analyses of publications in medical research use SAS SAS invests extensive resources to R & D.

Why SAS? It is widely used –Industry, government, and academia It is very powerful –programming language –sophisticated analyses (better than Excel)

JAMA January 12, 2005 Meat Consumption and Risk of Colorectal Cancer, Chao Colon and rectal cancer incidence rate ratios (RRs) and 95% CIs by meat intake were estimated using Cox proportional hazards regression modeling. P values for linear trend were estimated by modeling meat intake (g/wk) using the median value within quintiles; these results were similar when modeled as continuous variables. All P values were 2-sided and considered significant at P<.05. All analyses were conducted using SAS version 9.0 (SAS Institute Inc, Cary, NC). Consumption of Veg/Fruits and Risk of Breast Cancer All analyses were performed using SAS version 8 (SAS Institute Inc, Cary, NC). All tests were 2-sided with an {alpha} of.05.

JAMA January 12, 2005 Fasting Serum Glucose Level and Cancer Risk in Korean Men and Women Age-adjusted death and cancer incidence rates were calculated for each category of fasting serum glucose level and directly standardized to the age distribution of the 1995 Korean national population. All analyses were stratified by sex. All analyses were conducted using SAS statistical software, version 8.0 (SAS Institute Inc, Cary, NC).

Details –Homework, readings, programs, data files –Class slides Lab/Office hours 4 hours per week (TA or instructor)

Details Text books: Applied Statistics and the SAS Programming Language, RP Cody and JK Smith (Read Chapter 1 for next week) Introductory Biostatistics, CT Le The Little SAS Book, LD Delwiche and SJ Slaughter (Chapter 1 available on website)

Grading Homework - 30% (half credit for late homework, can turn in no later than 2 weeks after due date) Two tests - 30% each Short project - 10% No final exam

Using SAS SAS is available several ways: In the Mayo A-269 (TRC) lab Other PCs with SAS From biostatistics UNIX computer via telnet Purchase from the University 152 Shepherd Labs (ADCS) $150 per year

What is SAS ? SAS is a programming language that reads, processes, and performs statistical analyses of data. A SAS program is made up of programming statements which SAS interprets to do the above functions.

Raw Data Read in Data Process Data (Create new variables) Output Data (Create SAS Dataset) Analyze Data Using Statistical Procedures Data Step PROCs

Structure of Data Made up of rows and columns Rows in SAS are called observations Columns in SAS are called variables An observation is all the information for one entity (patient, patient visit, clinical center, county) SAS processes data one observation at a time

Example of Data 12 observations and 5 variables F 23 S 15 MN F 21 S 15 WI F 22 S 09 MN F 35 M 02 MN F 22 M 13 MN F 25 S 13 WI M 20 S 13 MN M 26 M 15 WI M 27 S 05 MN M 23 S 14 IA M 21 S 14 MN M 29 M 15 MN Gender Age Marital status Number of credits State of residence

* This is a short example program to demonstrate what a SAS program looks like. This is a comment statement because it begins with a * and ends with a semi-colon ; DATA demo; INPUT gender $ age marstat $ credits state $ ; if credits > 12 then fulltime = 'Y'; else fulltime = 'N'; if state = 'MN' then resid = 'Y'; else resid = 'N'; DATALINES; F 23 S 15 MN F 21 S 15 WI F 22 S 09 MN F 35 M 02 MN F 22 M 13 MN F 25 S 13 WI M 20 S 13 MN M 26 M 15 WI M 27 S 05 MN M 23 S 14 IA M 21 S 14 MN M 29 M 15 MN ; RUN; TITLE 'Running the Example Program'; PROC PRINT DATA=DEMO ; VAR gender age marstat credits fulltime state ; RUN;

Rules for SAS Statements and Variables SAS statements end with a semicolon (;) SAS statements can be entered in lower or uppercase Multiple SAS statements can appear on one line A SAS statement can use multiple lines Variable names can be from 1-32 characters and begin with A-Z or an underscore (_)

1 DATA demo; Create a SAS dataset called demo 2 INPUT gender $ What are the variables age marstat $ credits state $ ; 3 if credits > 12 then fulltime = 'Y'; else fulltime = 'N'; 4 if state = 'MN' then resid = 'Y'; else resid = 'N'; Statements 3 and 4 create 2 new variables

5 DATALINES; Tells SAS the data is coming F 23 S 15 MN F 21 S 15 WI F 22 S 09 MN F 35 M 02 MN F 22 M 13 MN F 25 S 13 WI M 20 S 13 MN M 26 M 15 WI M 27 S 05 MN M 23 S 14 IA M 21 S 14 MN M 29 M 15 MN ; Tells SAS the data is ending 6 RUN; Tells SAS to run the statements

Types of Data Numeric (e.g. age, blood pressure) Character (patient name, ID, diagnosis) Each type treated differently by SAS

TITLE 'Running the Example Program'; PROC PRINT DATA=demo ; VAR gender age marstat credits fulltime state ; RUN; * You can run additional procedures; PROC MEANS DATA=demo ; VAR age credits ; RUN; PROC FREQ DATA=demo ; TABLES gender ; RUN;

Files Generated When SAS Program is Submitted Log file – a text file listing program statements processed and giving notes, warnings and errors. (in UNIX the file will be named fname.log) Always look at the log file ! Tells how SAS understood your program Output file – a text file giving the output generated from the PROCs (in UNIX the file will be named fname.lst)

Messages in SAS Log Notes – messages that may or may not be important Warnings – messages that are usually important Errors – fatal in that program will abort (notes and warnings will not abort your program)

LOG FILE NOTE: Copyright (c) by SAS Institute Inc., Cary, NC, USA. NOTE: SAS (r) Proprietary Software Release 8.2 (TS2M0) Licensed to UNIVERSITY OF MINNESOTA, Site NOTE: This session is executing on the WIN_NT platform. NOTE: SAS initialization used: real time 7.51 seconds cpu time 0.89 seconds 1 * This is a short example program to demonstrate what a 2 SAS program looks like. This is a comment statement because 3 it begins with a * and ends with a semi-colon ; 4 5 DATA demo; 6 INFILE DATALINES; 7 INPUT gender $ age marstat $ credits state $ ; 8 9 if credits > 12 then fulltime = 'Y'; else fulltime = 'N'; 10 if state = 'MN' then resid = 'Y'; else resid = 'N'; 11 DATALINES; NOTE: The data set WORK.DEMO has 12 observations and 7 variables. NOTE: DATA statement used: real time 0.38 seconds cpu time 0.06 seconds

25 RUN; 26 TITLE 'Running the Example Program'; 27 PROC PRINT DATA=demo ; 28 VAR gender age marstat credits fulltime state ; 29 RUN; NOTE: There were 12 observations read from the data set WORK.DEMO. NOTE: PROCEDURE PRINT used: real time 0.19 seconds cpu time 0.02 seconds 30 PROC MEANS DATA=demo N SUM MEAN; 31 VAR age credits ; 32 RUN; NOTE: There were 12 observations read from the data set WORK.DEMO. NOTE: PROCEDURE MEANS used: real time 0.25 seconds cpu time 0.03 seconds 33 PROC FREQ DATA=demo; TABLES gender; 34 RUN; NOTE: There were 12 observations read from the data set WORK.DEMO. NOTE: PROCEDURE FREQ used: real time 0.15 seconds cpu time 0.03 seconds

LST FILE Running the Example Program Obs gender age marstat credits fulltime state 1 F 23 S 15 Y MN 2 F 21 S 15 Y WI 3 F 22 S 9 N MN 4 F 35 M 2 N MN 5 F 22 M 13 Y MN 6 F 25 S 13 Y WI 7 M 20 S 13 Y MN 8 M 26 M 15 Y WI 9 M 27 S 5 N MN 10 M 23 S 14 Y IA 11 M 21 S 14 Y MN 12 M 29 M 15 Y MN The MEANS Procedure Variable N Sum Mean age credits The FREQ Procedure Cumulative Cumulative gender Frequency Percent Frequency Percent F M