Computing for Research I Spring 2012 Exploratory Data Analysis and Hypothesis Testing February 21 Primary Instructor: Elizabeth Garrett-MAyer.

Slides:



Advertisements
Similar presentations
Non-parametric test Introduction, Wilcoxon rank sum test and Man-Whitney U test Reporter: Shao-Li Han.
Advertisements

EPI 809 / Spring 2008 Chapter 9 Nonparametric Statistics.
 Last time we discussed t-tests: how to use sample means of quantitative variables to make inferences about parameters.  Today we’ll use the very same.
Nonparametric tests and ANOVAs: What you need to know.
Test statistic: Group Comparison Jobayer Hossain Larry Holmes, Jr Research Statistics, Lecture 5 October 30,2008.
Final Review Session.
Wilcoxon Tests What is the Purpose of Wilcoxon Tests? What are the Assumptions? How does the Wilcoxon Rank-Sum Test Work? How does the Wilcoxon Matched-
DEPENDENT SAMPLES t-TEST What is the Purpose?What Are the Assumptions?How Does it Work?
Nemours Biomedical Research Statistics March 26, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Student’s t statistic Use Test for equality of two means
15-1 Introduction Most of the hypothesis-testing and confidence interval procedures discussed in previous chapters are based on the assumption that.
Population Proportion The fraction of values in a population which have a specific attribute p = Population proportion X = Number of items having the attribute.
5-3 Inference on the Means of Two Populations, Variances Unknown
Nonparametric and Resampling Statistics. Wilcoxon Rank-Sum Test To compare two independent samples Null is that the two populations are identical The.
Mann-Whitney and Wilcoxon Tests.
Statistics Idiots Guide! Dr. Hamda Qotba, B.Med.Sc, M.D, ABCM.
Nonparametrics and goodness of fit Petter Mostad
Statistical Methods II
Leedy and Ormrod Ch. 11 Gray Ch. 14
One Sample  M ean μ, Variance σ 2, Proportion π Two Samples  M eans, Variances, Proportions μ1 vs. μ2 σ12 vs. σ22 π1 vs. π Multiple.
Biostat 200 Lecture 8 1. Hypothesis testing recap Hypothesis testing – Choose a null hypothesis, one-sided or two sided test – Set , significance level,
Teaching with Stata Peter A. Lachenbruch & Alan C. Acock Oregon State University
Amsterdam Rehabilitation Research Center | Reade Testing significance - categorical data Martin van der Esch, PhD.
Education 793 Class Notes T-tests 29 October 2003.
Stata Workshop #1 Chiu-Hsieh (Paul) Hsu Associate Professor College of Public Health
NONPARAMETRIC STATISTICS
Nonparametrics and goodness-of-fit
Statistics & Biology Shelly’s Super Happy Fun Times February 7, 2012 Will Herrick.
Parametric & Non-parametric Parametric Non-Parametric  A parameter to compare Mean, S.D.  Normal Distribution & Homogeneity  No parameter is compared.
Biostat 200 Lecture 7 1. Hypothesis tests so far T-test of one mean: Null hypothesis µ=µ 0 Test of one proportion: Null hypothesis p=p 0 Paired t-test:
9 Mar 2007 EMBnet Course – Introduction to Statistics for Biologists Nonparametric tests, Bootstrapping
Nonparametric Statistical Methods: Overview and Examples ETM 568 ISE 468 Spring 2015 Dr. Joan Burtner.
Testing means, part II The paired t-test. Outline of lecture Options in statistics –sometimes there is more than one option One-sample t-test: review.
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
Two Sample t test Chapter 9.
Lesson 15 - R Chapter 15 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
Biostat 200 Lecture 8 1. The test statistics follow a theoretical distribution (t stat follows the t distribution, F statistic follows the F distribution,
STATISTICAL ANALYSIS FOR THE MATHEMATICALLY-CHALLENGED Associate Professor Phua Kai Lit School of Medicine & Health Sciences Monash University (Sunway.
Academic Research Academic Research Dr Kishor Bhanushali M
Biostat 200 Lecture 8 1. Where are we Types of variables Descriptive statistics and graphs Probability Confidence intervals for means and proportions.
Biostat 200 Lecture 7 1. Outline for today Hypothesis tests so far – One mean, one proportion, 2 means, 2 proportions Comparison of means of multiple.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Basics of Biostatistics for Health Research Session 3 – February 21, 2013 Dr. Scott Patten, Professor of Epidemiology Department of Community Health Sciences.
Computing for Research I Spring 2014 Primary Instructor: Elizabeth Garrett-Mayer Introduction to Stata February 19.
Analisis Non-Parametrik Antonius NW Pratama MK Metodologi Penelitian Bagian Farmasi Klinik dan Komunitas Fakultas Farmasi Universitas Jember.
8.1 Estimating µ with large samples Large sample: n > 30 Error of estimate – the magnitude of the difference between the point estimate and the true parameter.
Related Sample t-test A related sample t-test is used when participants in the two groups are not independent of participants in the other groups. – Repeated.
Review of BUSA3322 Mary M. Whiteside. Methodologies Two sample tests Analysis of variance Chi square tests Simple regression Multiple regression Time.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Nonparametric Statistics
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
Nonparametric Statistics - Dependent Samples How do we test differences from matched pairs of measurement data? If the differences are normally distributed,
Nonparametric statistics. Four levels of measurement Nominal Ordinal Interval Ratio  Nominal: the lowest level  Ordinal  Interval  Ratio: the highest.
1 Underlying population distribution is continuous. No other assumptions. Data need not be quantitative, but may be categorical or rank data. Very quick.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 16 : Summary Marshall University Genomics Core Facility.
Chapter 4. Inference about Process Quality
What we’ll cover today Transformations Inferential statistics
SA3202 Statistical Methods for Social Sciences
STAT120C: Final Review.
Do Statistical Analysis with Stata
Presentation, data and programs at:
Do you know population SD? Use Z Test Are there only 2 groups to
Oregon State University
Proportions and Sample Size
Confidence Intervals and Hypothesis Tests
CHAPTER 6 Statistical Inference & Hypothesis Testing
Summary of Tests Confidence Limits
Producing good data through sampling and experimentation
Presentation transcript:

Computing for Research I Spring 2012 Exploratory Data Analysis and Hypothesis Testing February 21 Primary Instructor: Elizabeth Garrett-MAyer

Exploratory Data Analysis We’ve already discussed some basic stuff – sum and sum, detail – tab What other sorts of exploration might we do? Confidence intervals – for continuous variables – for categorical variables

Immediate command for CIs Continuous: cii N xbar s Binary: cii N phat or cii N x

Confidence intervals For a continuous variable: mean varlist Example: * estimate means of ceramide variables mean c18ceramide mean totalc - s1pc1

Additional options tab initialre initial mean c18ceramide, over(initialre) mean c18ceramide, vce(bootstr) mean c18ceramide, vce(bootstr) over(initialre) mean c18ceramide, over(initialre) mean c18ceramide, level(90)

Confidence intervals for proportion proportion varlist Examples proportion failure proportion failure death initialre proportion failure, vce(bootstr) proportion failure, cluster(patient) proportion failure, level(90)

Hypothesis Testing A number of different approaches Options – nonparametric vs. parametric – continuous vs. categorical (vs. other?) – one vs. two vs. more than two groups

One sample t-tests ttesti N mean sd null ttest varname == null ttest var1 == var2 *paired Examples: ttesti ttest c18c == 10 ttest frombaselines1p==100 ttest frombaselinec18==100

Two sample t-tests ttesti N1 mean1 sd1 N2 mean2 sd2 ttest varname1 == varname2, unpaired ttest varname, by(groupvar) Examples: ttest c18, by(sex) ttest c18, by(sex) unequal

Nonparametric? ranksum : two group comparison kwallis : >= two group comparison signrank : matched pairs signed ranks test signtest : sign test of matched pairs

Nonparametric? *nonparametric tests ranksum c18, by(sex) kwallis c18, by(sex) use ceramide.alldata, clear keep if cycle==3 gen c18dif = frombaselinec signrank c18dif=0 signrank frombaselinec18=100 signtest c18dif=0 signtest frombaselinec18=100

Anova anova y x (note that x is assumed to be categorical) anova y x1 x2 Examples: anova c18c initialre

One sample binomial tests prtest and bitest Difference? – prtest uses large sample approximations – bitest uses exact test bitest varname==p0 bitesti N x p0

One sample binomial tests use "SCBC2004.v9.dta", clear replace ercat=. if ercat==9 gen ercatn=cond(ercat==2,0,1) replace ercatn=. if ercat==. tab ercat ercatn bitest ercatn=0.50 bitest ercatn=0.65 prtest ercatn=0.65

Two (or more) sample binomial tests tab y x, exact tab y x, chi tab ercatn grade tab ercatn stage