Introduction to Statistical Computing in Clinical Research Biostatistics 212.

Slides:



Advertisements
Similar presentations
1 SESSION 5 Graphs for data analysis. 2 Objectives To be able to use STATA to produce exploratory and presentation graphs In particular Bar Charts Histograms.
Advertisements

Do files, log files, and workflow in Stata Biostatistics 212 Lecture 2.
Using Excel Biostatistics 212 Lecture 4. Housekeeping Questions about Lab 3? –replace vs. recode Final Project Dataset! –“Housekeeping” commands vs. data.
Stata and logit recap. Topics Introduction to Stata – Files / directories – Stata syntax – Useful commands / functions Logistic regression analysis with.
Variables 9/10/2013. Readings Chapter 3 Proposing Explanations, Framing Hypotheses, and Making Comparisons (Pollock) (pp.48-58) Chapter 1 Introduction.
Generating new variables and manipulating data with STATA Biostatistics 212 Lecture 3.
Teaching Statistics Using Stata Software Susan Hailpern BSN MPH MS Department of Epidemiology and Population Health Albert Einstein College of Medicine.
1 Using SPSS: Descriptive Statistics Department of Operations Weatherhead School of Management.
Introduction to SPSS Allen Risley Academic Technology Services, CSUSM
INTRODUCTION TO STATA Võ Tuấn Khoa Trần Thế Trung.
Introduction to Statistical Computing in Clinical Research Biostatistics 212 Lecture 1.
Srinivasulu Rajendran Centre for the Study of Regional Development (CSRD) School of Social Sciences (SSS) Jawaharlal Nehru University (JNU) New Delhi -
Getting Started with STATA By: Katie Droll. Embrace Stata! Stata is your statistical buddy! If you put in a bit of effort to learn the basics, you should.
Today: Run SAS programs on Saturn (UNIX tutorial) Runs SAS programs on the PC.
Ann Arbor ASA ‘Up and Running’ Series: SPSS Prepared by volunteers of the Ann Arbor Chapter of the American Statistical Association, in cooperation with.
Generating new variables and manipulating data with STATA Biostatistics 212 Lecture 3.
Introduction to Statistical Computing in Clinical Research Biostatistics 212 Course director: Mark Pletcher Teaching Assistant: Lee Zane.
SADC Course in Statistics Adding a statistics package Module I3, Session 13.
A Simple Guide to Using SPSS© for Windows
Stata Introduction Sociology 229A, Class 2 Copyright © 2008 by Evan Schofer Do not copy or distribute without permission.
15a.Accessing Data: Frequencies in SPSS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
Generating new variables and manipulating data with STATA Biostatistics 212 Session 2.
Everything I wish I had known about research design and data analysis… Statlab Workshop Fall 2006 Kyle Hood and Frank Farach.
SPSS 1: An Introduction to the Statistical Package SPSS Suzie Cro MRC Clinical Trials Unit.
SPSS Statistical Package for the Social Sciences is a statistical analysis and data management software package. SPSS can take data from almost any type.
RESEARCH HUB AT THE UNIVERSITY LIBRARIES PENN STATE UNIVERSITY TOUR OF STATISTICAL PACKAGES.
Introduction to SPSS Short Courses Last created (Feb, 2008) Kentaka Aruga.
FEBRUARY, 2013 BY: ABDUL-RAUF A TRAINING WORKSHOP ON STATISTICAL AND PRESENTATIONAL SYSTEM SOFTWARE (SPSS) 18.0 WINDOWS.
Introduction to SPSS (For SPSS Version 16.0)
Statistics for the Social Sciences Psychology 340 Fall 2013 Thursday, November 21 Review for Exam #4.
Biostatistics, statistical software II. A brief survey of statistical program systems Krisztina Boda PhD Department of Medical Informatics, University.
L1: INTRODUCTION Getting started with Stata Angela Ambroz May 2015.
 Overview of SPSS  Interface  Getting Started  Managing Data  Descriptive Statistics  Basic Analysis  Additional Resources.
Stata Workshop #1 Chiu-Hsieh (Paul) Hsu Associate Professor College of Public Health
Scottish Social Survey Network: Master Class 1 Data Analysis with Stata Dr Vernon Gayle and Dr Paul Lambert 23 rd January 2008, University of Stirling.
Harvard-MIT Data Center (HMDC)
18b. PROC SURVEY Procedures in SAS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
STATA Mini Course Fall 2015 Jane Leber Herr Littauer 113 1Stata Mini Course – Spring 2015.
BMTRY 789 Introduction to SAS Programming Lecturer: Annie N. Simpson, MSc.
Organizing a project, making a table Biostatistics 212 Session 5.
L3: BIG STATA CONCEPTS Getting started with Stata Angela Ambroz May 2015.
SPSS Overview. The opening screen 2 The SPSS windows 3.
Introduction to MATLAB 7 Engineering 161 Engineering Practices II Joe Mixsell Spring 2010.
Generating new variables and manipulating data with STATA Biostatistics 212 Lecture 3.
Organizing a project, making a table Biostatistics 212 Lecture 7.
What is SPSS  SPSS is a program software used for statistical analysis.  Statistical Package for Social Sciences.
1 An Introduction to SPSS for Windows Jie Chen Ph.D. 6/4/20161.
Getting Started With Stata Session 1 Jim Anthony John Troost Department of Epidemiology Michigan State University.
STAT 3130 Statistical Methods I Lecture 1 Introduction.
Introduction to Statistical Computing in Clinical Research
11/25/2015Slide 1 Scripts are short programs that repeat sequences of SPSS commands. SPSS includes a computer language called Sax Basic for the creation.
Introduction to Statistical Computing in Clinical Research Biostatistics 212 Lecture 1.
STATA for S-052 M. Shane Tutwiler Your Friendly S-040 Lecturer William Johnston IT Services Harvard Graduate School of Education.
Basics of Biostatistics for Health Research Session 1 – February 7 th, 2013 Dr. Scott Patten, Professor of Epidemiology Department of Community Health.
Mr. Magdi Morsi Statistician Department of Research and Studies, MOH
Introduction to MATLAB 7 Engineering 161 Engineering Practices II Joe Mixsell Spring 2012.
1.Introduction to SPSS By: MHM. Nafas At HARDY ATI For HNDT Agriculture.
PSC 47410: Data Analysis Workshop  What’s the purpose of this exercise?  The workshop’s research questions:  Who supports war in America?  How consistent.
DTC Quantitative Methods Summary of some SPSS commands Weeks 1 & 2, January 2012.
16a. Accessing Data: Means in SPSS ®. 16a. Accessing Data: Means in SSPS ® 1 Prerequisites Recommended modules to complete before viewing this module.
Using SPSS Note: The use of another statistical package such as Minitab is similar to using SPSS.
1 EPIB 698C Lecture 1 Instructor: Raul Cruz-Cano
SAS Programming Training Instructor:Greg Grandits TA: Textbooks:The Little SAS Book, 5th Edition Applied Statistics and the SAS Programming Language, 5.
Before the class starts: 1) login to a computer 2) start Stata 13.
Using SPSS Note: The use of another statistical package such as Minitab is similar to using SPSS.
DEPARTMENT OF COMPUTER SCIENCE
Week 1 Gates Introduction to Information Technology cosc 010 Week 1 Gates
Introduction Introduction to Stata 2016.
Statistical Analysis with
Presentation transcript:

Introduction to Statistical Computing in Clinical Research Biostatistics 212

Today... Course overview –Course objectives –Course details: grading, homework, etc –Schedule, lecture overview Where does Stata fit in? Basic data analysis with Stata Stata demos

Course Objectives Learn how to use STATA Learn practical application of basic epidemiological and statistical concepts using STATA Learn how to turn raw data into presentable tables and figures

Course details Introduction to Statistical Computing - 1 unit Summer schedule – 2 lectures, 1 lab - Parnassus 9/6 lecture 1-2:30 9/13 lecture 1-2:30, lab 2:45-4:45 Fall schedule – Every other Tuesday – China Basin 9/20, 10/4, 10/18, 11/1, 11/15, 11/29 Lecture 3-4, Lab 4-5 Final Project due 12/6/05

Course details Introduction to Statistical Computing - 1 unit Grading: Satisfactory/Unsatisfactory Requirements: -Hand in all four Labs (even if late) -Satisfactory Final Project -80% of total points Reading: Optional

Course details, cont Faculty Mark Pletcher, MD, MPH Lee Zane, MD, MAS Scott Biggins, MD Course for homework:

Overview of lecture topics 1- Introduction to STATA 2- Do files, log files, and workflow in STATA 3- Generating variables and manipulating data with STATA 4- Basic epidemiology with STATA I 5- Basic epidemiology with STATA II 6- Using Excel 7- Organizing a project, making a table 8- Making a figure with STATA or Excel First 2 lectures here at Parnassus, the rest in China Basin

Overview of labs Lab 1 – Load a dataset and analyze it, learn about do and log files. Lab 2 – Import data from excel, generate new variables and manipulate data, document everything with do and log files. Lab 3 – Epidemiologic analysis using Stata Lab 4 – Using and creating Excel spreadsheets Labs 2 and 3 will be spread across several lab sessions Course is front-loaded – last 2 lab sessions dedicated to Final Project

Overview of labs, cont First lab will be at Parnassus next week 2:45-4:45, the rest at China Basin 4:00-5:00 after lecture Scott Biggins will lead 1 section, I will lead the other China Basin Computer Lab –No computers in it! –Must bring own laptop with Stata loaded

Overview of labs, cont Labs are generally due 1 week after the last lab session dedicated to them Labs 2-4 and the Final Project should be ed to the course address – Answers posted 1 day after Lab is due If you don’t turn the lab in on time, you STILL must turn it in to pass the class, even though you won’t get points credit for it (per TICR policy). PLEASE CORRECT YOUR COURSE OVERVIEW FORM

Final Project Create a Table and a Figure using your own data, document analysis using Stata. Due 1 week after last lab session, 20 points docked for each 1 day late.

Getting started with STATA Session 1

Types of software packages used in clinical research Statistical analysis packages Spreadsheets Database programs Custom applications –Cost-effectiveness analysis (TreeAge, etc) –Survey analysis (SUDAAN, etc)

Software packages for analyzing data STATA SAS S-plus, and “R” SPS-S SUDAAN Epi-Info JMP MatLab StatExact

Why use STATA? Quick start, user friendly Immediate results, response You can look at the data Menu-driven option Good graphics Log and do files Good manuals, help menu

Why NOT use STATA? SAS is used more often SAS does some things STATA does not Programming easier with S-plus Complicated data structure and manipulation easier with SAS Epi-info is even easier than STATA?

STATA – Basic functionality Hold data for you –Stata holds 1 “flat” file dataset only (.dta file) Listen to what you want –Type a command, press enter Do stuff –Statistics, data manipulation, etc Show you the results –Results window

Demo #1 Open the program Load some data Look at it Run a command

STATA - Windows Two basic windows –Command –Results Optional windows –Variable list –History of commands Other functions –Data browser/editor –Do file editor –Viewer (for log, help files, etc)

STATA - Buttons The usual – open, save, print Log-file open/suspend/close Do-file editor Browse and Edit Break

STATA - Menus Almost every command can be accessed via menu

Demo #2 Enter in some data Look at it Run a couple of commands

Menu vs. Command line Menu advantages –Look for commands you don’t know about –See the options for each command –Complex commands easier – learn syntax Command line advantages –Faster (if you know the command!) –“Closer” to the program –Only way to write “do” files Document and repeat analyses

STATA commands Describing your data describe [varlist] –Displays variable names, types, labels list [varlist] –Displays the values of all observations codebook [varlist] –Displays labels and codes for all variables

STATA commands Descriptive statistics – continuous data summarize [varlist] [, detail] –# obs, mean, SD, range –“, detail” gets you more detail (median, etc) histogram varname –Simple histogram of your variable ci [varlist] –Mean, standard error, and confidence intervals –Actually works for dichotomous variables, too.

STATA commands Descriptive statistics – categorical data tabulate [var] –Counts and percentages –(see also, table - this is very different!)

STATA commands Analytic statistics – 2 categorical variables

tabulate [var1] [var2] –“Cross-tab” –Descriptive options, row(row percentages), col(column percentages) –Statistics options, chi2(chi2 test), exact(fisher’s exact test)

Getting help Try to find the command on the pull-down menus Help menu –If you don’t know the command - Search... –If you know the command - Stata command... Try the manuals –more detail, theoretical underpinnings, etc

STATA commands Analytic statistics – 1 categorical, 1 continuous

bysort catvar: sum [contvar] –mean, SD, range of one in subgroup ttest [contvar], by([catvar]) –t-test oneway [contvar] [catvar] –ANOVA table [catvar] [, contents(mean [contvar]…) –Table of statistics

STATA commands Analytic statistics – 2 continuous

scatter [var1] [var2] –Scatterplot of the two variables pwcorr [varlist] [, sig] –Pairwise correlations between variables –“sig” option gives p-values

Demo #3 Load a STATA dataset Explore the data Describe the data Answer some simple research questions

Next week Do files, log files, and workflow in Stata In lab next week: –Familiarize yourself with Stata –Practice today’s material (loading and analyzing data) –Start learning how to use do and log files You can leave lab early if you finish!