HRP223 2008 Copyright © 1999-2008 Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and.

Slides:



Advertisements
Similar presentations
Inference for Regression
Advertisements

Final Thoughts HRP 223 – 2013 December 4 th, 2013 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation.
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
1 SAS Formats and SAS Macro Language HRP223 – 2011 November 9 th, 2011 Copyright © Leland Stanford Junior University. All rights reserved. Warning:
1 Merging with SQL HRP223 – 2011 October 31, 2011 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation.
1 Lab 2 HRP223 – 2010 October 18, 2010 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected.
1 Processing Grouped Data HRP223 – 2011 November 14 th, 2011 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This.
1 Combining (with SQL) HRP223 – 2010 October 27, 2009 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation.
Multiple regression analysis
1 Creating and Tweaking Data HRP223 – 2010 October 24, 2011 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This.
1 Database Theory and Normalization HRP223 – 2010 November 14 th, 2011 Copyright © Leland Stanford Junior University. All rights reserved. Warning:
Lecture 13 – Tues, Oct 21 Comparisons Among Several Groups – Introduction (Case Study 5.1.1) Comparing Any Two of the Several Means (Chapter 5.2) The One-Way.
Copyright (c) Bani Mallick1 STAT 651 Lecture 10. Copyright (c) Bani Mallick2 Topics in Lecture #10 Comparing two population means using rank tests Comparing.
A Simple Guide to Using SPSS© for Windows
Chapter 2 Simple Comparative Experiments
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Lecture 13: Tues., Feb. 24 Comparisons Among Several Groups – Introduction (Case Study 5.1.1) Comparing Any Two of the Several Means (Chapter 5.2) The.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
SAS for Categorical Data Copyright © 2004 Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright.
Inference for regression - Simple linear regression
Chapter 13: Inference in Regression
Hypothesis Testing in Linear Regression Analysis
Introduction to SAS Essentials Mastering SAS for Data Analytics
One-Factor Experiments Andy Wang CIS 5930 Computer Systems Performance Analysis.
Statistics for clinical research An introductory course.
Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and international.
STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Health and Disease in Populations 2001 Sources of variation (2) Jane Hutton (Paul Burton)
Topic 5 Statistical inference: point and interval estimate
HPR Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and.
1 Experimental Statistics - week 2 Review: 2-sample t-tests paired t-tests Thursday: Meet in 15 Clements!! Bring Cody and Smith book.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
1 Lab 2 and Merging Data (with SQL) HRP223 – 2009 October 19, 2009 Copyright © Leland Stanford Junior University. All rights reserved. Warning:
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and international.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
1 Data Manipulation (with SQL) HRP223 – 2010 October 13, 2010 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This.
STATISTICAL ANALYSIS FOR THE MATHEMATICALLY-CHALLENGED Associate Professor Phua Kai Lit School of Medicine & Health Sciences Monash University (Sunway.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.3 Two-Way ANOVA.
HRP Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and.
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Experimental Statistics - week 3
T Test for Two Independent Samples. t test for two independent samples Basic Assumptions Independent samples are not paired with other observations Null.
Analysis of Variance STAT E-150 Statistical Methods.
HRP Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and.
1 Probability and Statistics Confidence Intervals.
HRP 223 – 2007 lm.ppt - Linear Models Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected.
1 Data Manipulation (with SQL) HRP223 – 2009 October 12, 2009 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This.
Beginning Data Manipulation HRP Topic 4 Oct 14 th 2012 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This.
HRP Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and.
BPS - 5th Ed. Chapter 231 Inference for Regression.
Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and international.
Chapter 2 Simple Comparative Experiments
SAS Output Delivery System
CHAPTER 29: Multiple Regression*
Statistics Statistics are like bikinis.  What they reveal is suggestive, but what they conceal is vital.  ~Aaron Levenstein.
Lab 3 and HRP259 Lab and Combining (with SQL)
Lab 2 and Merging Data (with SQL)
Lab 2 HRP223 – 2010 October 18, 2010 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected.
File Sharing and Processing Grouped Data
Processing Grouped Data
Global PaedSurg Research Training Fellowship
Presentation transcript:

HRP Copyright © Leland Stanford Junior University. All rights reserved. Warning: This presentation is protected by copyright law and international treaties. Unauthorized reproduction of this presentation, or any portion of it, may result in severe civil and criminal penalties and will be prosecuted to maximum extent possible under the law. HRP Topic 8 – Analysis of Means

HRP Normally DistributedNot Normally Distributed One sample vs. populationOne sample t-testWilcoxon Signed Rank Two paired samplesPaired t-testDifference then Signed Rank Two unpaired samplesT-testWilcoxon Rank-sum Three or more unpaired samplesANOVAKruskal-Wallis Three or more paired samplesMixed effectsTransform then mixed model Normally DistributedNot Normally Distributed One sample vs. populationDescribe > Distribution Two paired samplesAnalyze >ANOVA>t-testDescribe > Distribution Two unpaired samplesAnalyze >ANOVA>t-testDescribe > Distribution Three or more unpaired samplesAnalyze >ANOVA>LinearAnalyze >ANOVA>Nonpar. Three or more paired samplesAnalyze >ANOVA>Mixed One Categorical Predictor

HRP Multiple Categorical Predictors  Unpaired samples – ANOVA  Paired samples – Mixed Effects Models  If data is not normally distributed – There are spcialized statistics (Friedman’s test for 2 predictors). – Try to transform into normally distributed.

Mean vs. Expected BMI It would be nice to see the actual Excel file.

Adds a link to source Adds a link to source and runs import wizard This gives instant access to the current state of the spreadsheet but it is bugged if you mix character and numeric data.

HRP Take a Look at the Data

Prior to analysis, do all 3 plots. Histograms and box plots show outliers and bimodal data but are not ideal for assessing normality. The formal tests for normality are not great. They will not find problems with small samples and will declare problems with large samples.

1. normal distribution 2. skewed-to-the-right distribution 3. skewed-to-the-left distribution 4. heavy-tailed distribution 5. light-tailed distribution Image from: Statistics I: Introduction to ANOVA, Regression, and Logistic Regression: Course Notes. SAS Press 2008.

HRP Inference 101  You only have one sample but you want to make inferences to the world.  Given what you see in this sample, you can guess what the distribution of samples looks like around the null distribution.

If the population you are sampling from has a mean of 4, you will not observe a score of 4. How do you compare this sample vs. another with a mean of 5? Make a histogram of the means

.75/sqrt(1) =.75.75/sqrt(5) =.34.75/sqrt(25) =.15 Distributions of the Means

HRP Precision  Think of the “+/- something” imprecision in the estimates of the political polls.  You typically end up saying you are 95% sure you chose an interval that has the true value inside the range bracketed by the confidence limits (CLs). Either the population value is or is not in the interval between the lower and upper confidence limit, and if you repeated the process on many samples, 95% of such intervals would include the population value.  The 99% CI is wider (more accurate) than the more precise 95% CL.

Confidence Intervals from 10 Samples Axis with units showing your outcome You want to set the width of the interval so that in 95% of the experiments, the confidence interval includes the true value. In theory, you tweak the interval and increase or decrease the width. The unobservable truth

HRP Benefits of CLs  You have information about the estimate's precision.  The width of the CI tells you about the degree of random error which is set by the confidence interval.  Wide intervals indicate poor precision. Plausible values could be across a broad range.

HRP Estimation vs. Hypothesis Testing  P-value <.05 corresponds to a 95% CL that does not include the null hypothesis value.  CLs show uncertainty, or lack of precision, in the estimate of interest and thus convey more useful information than the p-value.

CLs vs. p-values null value Upper CL Lower CL Confidence interval P >.05 and the null value is inside of the confidence limits (CLs) null value Upper CL Lower CL Confidence interval P <.05 and the null value is not inside of the confidence limits 0 difference between groups or odds ratio of 1

null value zone of clinical indifference Not statistically significant and not clinically interesting Not statistically significant, possibly clinically interesting Statistically significant but not clinically interesting Statistically significant and clinically interesting

HRP Compare Two Teachers  Import the data  Describe the data – Assign the method as a classification variable  Do an unpaired T-test  Do a one-way ANOVA with the predictor having only two levels

SS total is the sum of the distance between each point and the overall mean line squared. SS error is the sum of the total squared distances between each point and the group mean lines.

HRP

Psoriasis  Scores are arbitrary numbers 0 = < 0% response, 5 = 26-50% response, etc.

HRP If and only if you work on a fast machine!

HRP Weight Gain

Postpartum Depression

After the Formats

Dementia

HRP Wide to Long  You may have noticed that the data for these analyses are all set up as long, skinny files where there is a record for every observation on a patient. Some people store data as wide records with many variables with a single record for each person.  To convert from wide to long: – Do data step processing with arrays. – Use the Transpose option on the Data menu. – Combine proc transpose and data step code. – Use a macro I wrote. (It is brand new, so check it.)

Tolerance.sas7bdat is dataset from the book Save a copy of the macro in a file after the fill in the blanks are done.

The stuff in the blah.sas file: The stuff in the macro file:

tol: all variables starting with the letters tol

HRP Narrow to Wide  Of course you can transpose back to wide from narrow.  If you download the keyboard macros today you will see that proc transpose now gives you a code template.