Basics of Biostatistics for Health Research Session 4 – February 28, 2013 Dr. Scott Patten, Professor of Epidemiology Department of Community Health Sciences.

Slides:



Advertisements
Similar presentations
Basics of Biostatistics for Health Research Session 2 – February 14 th, 2013 Dr. Scott Patten, Professor of Epidemiology Department of Community Health.
Advertisements

Statistics for the Social Sciences
Example: In a heart study the systolic blood pressure was measured for 24 men aged 25 and for 30 men aged 40. Do these data show sufficient evidence to.
Chapter 10 Analysis of Variance (ANOVA) Part III: Additional Hypothesis Tests Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social.
One-sample T-Test of a Population Mean
5/15/2015Slide 1 SOLVING THE PROBLEM The one sample t-test compares two values for the population mean of a single variable. The two-sample test of a population.
Statistical Tests Karen H. Hagglund, M.S.
WINKS SDA Statistical Data Analysis (Windows Kwikstat) Getting Started Guide.
Part I – MULTIVARIATE ANALYSIS
Test statistic: Group Comparison Jobayer Hossain Larry Holmes, Jr Research Statistics, Lecture 5 October 30,2008.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview Parameters and Statistics Probabilities The Binomial Probability Test.
Stat 512 – Lecture 13 Chi-Square Analysis (Ch. 8).
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
BS704 Class 7 Hypothesis Testing Procedures
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Today Concepts underlying inferential statistics
Total Population of Age (Years) of People that Smoke
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
8/20/2015Slide 1 SOLVING THE PROBLEM The two-sample t-test compare the means for two groups on a single variable. the The paired t-test compares the means.
Practical statistics for Neuroscience miniprojects Steven Kiddle Slides & data :
Inferential Statistics: SPSS
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Kolmogorov-Smirnov Test PowerPoint Prepared by.
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
Ch 10 Comparing Two Proportions Target Goal: I can determine the significance of a two sample proportion. 10.1b h.w: pg 623: 15, 17, 21, 23.
Simple Linear Regression
SPSS Series 1: ANOVA and Factorial ANOVA
A random sample of 300 doctoral degree
By Hui Bian Office for Faculty Excellence 1. K-group between-subjects MANOVA with SPSS Factorial between-subjects MANOVA with SPSS How to interpret SPSS.
SW388R6 Data Analysis and Computers I Slide 1 Central Tendency and Variability Sample Homework Problem Solving the Problem with SPSS Logic for Central.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved OPIM 303-Lecture #9 Jose M. Cruz Assistant Professor.
Assessing Survival: Cox Proportional Hazards Model
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
Project 6 Using The Analysis ToolPak To Analyze Sales Transactions Jason C. H. Chen, Ph.D. Professor of Management Information Systems School of Business.
Research Project Statistical Analysis. What type of statistical analysis will I use to analyze my data? SEM (does not tell you level of significance)
Multivariate Analysis. One-way ANOVA Tests the difference in the means of 2 or more nominal groups Tests the difference in the means of 2 or more nominal.
Lab 5 instruction.  a collection of statistical methods to compare several groups according to their means on a quantitative response variable  Two-Way.
LOGISTIC REGRESSION A statistical procedure to relate the probability of an event to explanatory variables Used in epidemiology to describe and evaluate.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
Analyses using SPSS version 19
Review Hints for Final. Descriptive Statistics: Describing a data set.
Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics ANalysis Of VAriance: ANOVA.
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Conducting and interpreting multivariate analyses.
Non – Parametric Test Dr.L.Jeyaseelan Dept. of Biostatistics Christian Medical College Vellore, India.
Basics of Biostatistics for Health Research Session 3 – February 21, 2013 Dr. Scott Patten, Professor of Epidemiology Department of Community Health Sciences.
Basics of Biostatistics for Health Research Session 1 – February 7 th, 2013 Dr. Scott Patten, Professor of Epidemiology Department of Community Health.
Chapter 12: Correlation and Linear Regression 1.
ONE-WAY BETWEEN-GROUPS ANOVA Psyc 301-SPSS Spring 2014.
WINKS 7 Tutorial 3 Analyzing Summary Data (Using Student’s t-test) Permission granted for use for instruction and for personal use. ©
Logistic Regression. Linear regression – numerical response Logistic regression – binary categorical response eg. has the disease, or unaffected by the.
Handout Twelve: Design & Analysis of Covariance
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Approaches to quantitative data analysis Lara Traeger, PhD Methods in Supportive Oncology Research.
Beginners statistics Assoc Prof Terry Haines. 5 simple steps 1.Understand the type of measurement you are dealing with 2.Understand the type of question.
Analyzing Data. Learning Objectives You will learn to: – Import from excel – Add, move, recode, label, and compute variables – Perform descriptive analyses.
Exercise 1 Content –Covers chapters 1-4 Chapter 1 (read) Chapter 2 (important for the exercise, 2.6 comes later) Chapter 3 (especially 3.1, 3.2, 3.5) Chapter.
IENG-385 Statistical Methods for Engineers SPSS (Statistical package for social science) LAB # 1 (An Introduction to SPSS)
Correlation & Simple Linear Regression Chung-Yi Li, PhD Dept. of Public Health, College of Med. NCKU 1.
An Introduction to Epi Info 6/7
Advanced Quantitative Techniques
Multiple Regression.
Basic Statistics Overview
Do Statistical Analysis with Stata
Part 2 - Compare average in different groups
Program This course will be dived into 3 parts: Part 1 Descriptive statistics and introduction to continuous outcome variables Part 2 Continuous outcome.
Exercise 1: Open the file ‘Birthweight_reduced’
Exercise 1 Use Transform  Compute variable to calculate weight lost by each person Calculate the overall mean weight lost Calculate the means and standard.
Presentation transcript:

Basics of Biostatistics for Health Research Session 4 – February 28, 2013 Dr. Scott Patten, Professor of Epidemiology Department of Community Health Sciences & Department of Psychiatry

Generate Commands Using Logic generate obese2 =. recode obese2.=0 if bmi <= 30 recode obese2.=1 if bmi > 30 tab obese obese2 prtest obese2, by(sex) Missing as obese, which is strange.

Missing Values and Logical Operators management/logical-expressions-and- missing-values/ management/logical-expressions-and- missing-values/

Generate Commands Using Logic generate obese2 =. recode obese2.=0 if bmi <= 30 recode obese2.=1 if bmi > 30 & bmi !=. tab obese obese2, missing prtest obese2, by(sex) This code works.

Statistical Errors

Sample Size Simulation

Sample Size Calculation in STATA 3 2 1

Sample Size Dialogue Boxes

Let’s do a calculation! You are planning a parallel group RCT – with treatment and control groups. Normally, 20% of people die with disease X, but you expect to cut this in half with a new treatment. How many do you need in each group to achieve 95% power at alpha = 5%?

Output (sampsi)

Another Calculation A QoL scale in a particular disease has a mean score of 20 and a standard deviation of 5. You are conducting a placebo controlled trial to evaluate a treatment that is expected to improve the QoL by 2 points on this scale. You recruit n=50 into each group – what power will you achieve?

Output (sampsi)

Go to “ Scroll to the bottom. Right click to download the files described as being “for PGME Students” –One is a dataset –One is a data dictionary Save them on your desktop

Review: Comparing Proportions We’ve looked at several procedures for comparing proportions (e.g. for obesity in men vs. women): generate obese =. recode obese.=0 if bmi <= 30 recode obese.=1 if bmi > 30 & bmi !=. tab obese obese, missing prtest obese, by(sex)

Epitab Commands 1 3 2

Review: Comparing Proportions We’ve looked at several procedures for comparing proportions (e.g. for obesity in men vs. women): recode sex 2=1 1=0 cs obese sex

The output…

A “non-significant” association generate highgluc =. recode highgluc.=0 if glucose <= 140 recode highgluc.=1 if glucose > 140 & glucose !=. generate female=sex recode female (1=0) (2=1) tab highgluc female, exact

How does this look with cs?

Review: Try the cci command to obtain the OR Check your work with the cc command.

Comparing Proportions? Yes No Fisher’s Exact TestParametric Assumptions? Yes No Multiple Groups? Yes No YesNo ANOVA t-test Kruskall-Wallis Wilcoxon’s-Rank Sum

Two situations we haven’t covered… Severely skewed distributions Two continuous variables

Severely Skewed Variables

Solution: Make Some Categories For example: –Non-smokers –Light smokers (<20) –Moderate –Heavy > 40 Your task: Make a variable with these categories and do a statistical test to compare men to women.

E.g. for the recoding… generate smoke =. recode smoke.=1 if cigpday==0 recode smoke.=2 if cigpday > 0 & cigpday < 20 recode smoke.=3 if cigpday >=20 & cigpday <= 40 recode smoke.=4 if cigpday > 40 & cigpday !=. tab smoke, missing

Some output…

Two continuous variables E.g. diastolic blood pressure and BMI The place to start is always a scatter plot STATA calls this a “two way” graph

Start with Create

Select the two variables Submit

The command produced… Produced by our dialogue box… twoway (scatter diabp sysbp) The same dialogue box can fit a line… twoway (lfit diabp sysbp) This time select “line”

You can combine the two.. Try it! twoway (scatter diabp sysbp) (lfit diabp sysbp) To assess significance, use the regress command (can you find the menu option?) regress diabp sysbp

Note: the linear output Line: y = mx + b diabp = (sysbp)

(In Class) Assignment for Today Assess whether there is an association between systolic blood pressure and death (you need to decide how) We’ll define elevated systolic blood pressure as being > 140 mm of Hg. –What is the risk ratio for death for people with elevated systolic blood pressure? –Is the risk ratio statistically significant?