Presentation, data and programs at:

Slides:



Advertisements
Similar presentations
Population vs. Sample Population: A large group of people to which we are interested in generalizing. parameter Sample: A smaller group drawn from a population.
Advertisements

Chapter 3 Properties of Random Variables
A PowerPoint®-based guide to assist in choosing the suitable statistical test. NOTE: This presentation has the main purpose to assist researchers and students.
AP Statistics Course Review.
STATISTICAL ANALYSIS. Your introduction to statistics should not be like drinking water from a fire hose!!
Apr-15H.S.1 Stata: Linear Regression Stata 3, linear regression Hein Stigum Presentation, data and programs at: courses.
Statistical Tests Karen H. Hagglund, M.S.
QUANTITATIVE DATA ANALYSIS
Final Review Session.
Descriptive Statistics
1 Econ 240A Power Outline Review Projects 3 Review: Big Picture 1 #1 Descriptive Statistics –Numerical central tendency: mean, median, mode dispersion:
Analysis of Research Data
Jul-15H.S.1 Short overview of statistical methods Hein Stigum Presentation, data and programs at: courses.
Jul-15H.S.1 Linear Regression Hein Stigum Presentation, data and programs at:
Summary of Quantitative Analysis Neuman and Robson Ch. 11
1 Introduction to biostatistics Lecture plan 1. Basics 2. Variable types 3. Descriptive statistics: Categorical data Categorical data Numerical data Numerical.
Quantitative Methods: Choosing a statistical test Summer School June 2015 Dr. Tracie Afifi.
Two Sample Tests Ho Ho Ha Ha TEST FOR EQUAL VARIANCES
PPA 501 – A NALYTICAL M ETHODS IN A DMINISTRATION Lecture 3b – Fundamentals of Quantitative Research.
Things that I think are important Chapter 1 Bar graphs, histograms Outliers Mean, median, mode, quartiles of data Variance and standard deviation of.
Statistics for clinical research An introductory course.
Descriptive Statistics e.g.,frequencies, percentiles, mean, median, mode, ranges, inter-quartile ranges, sds, Zs Describe data Inferential Statistics e.g.,
Statistics & Biology Shelly’s Super Happy Fun Times February 7, 2012 Will Herrick.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
For 95 out of 100 (large) samples, the interval will contain the true population mean. But we don’t know  ?!
Biostat 200 Lecture 7 1. Hypothesis tests so far T-test of one mean: Null hypothesis µ=µ 0 Test of one proportion: Null hypothesis p=p 0 Paired t-test:
The normal distribution Binomial distribution is discrete events, (infected, not infected) The normal distribution is a probability density function for.
RESULTS & DATA ANALYSIS. Descriptive Statistics  Descriptive (describe)  Frequencies  Percents  Measures of Central Tendency mean median mode.
Central Tendency Introduction to Statistics Chapter 3 Sep 1, 2009 Class #3.
Determination of Sample Size: A Review of Statistical Theory
TYPES There are several TYPES of variables that reflect characteristics of the data Ratio Interval Ordinal Nominal.
Descriptive statistics Petter Mostad Goal: Reduce data amount, keep ”information” Two uses: Data exploration: What you do for yourself when.
Linear Correlation. PSYC 6130, PROF. J. ELDER 2 Perfect Correlation 2 variables x and y are perfectly correlated if they are related by an affine transform.
Review Lecture 51 Tue, Dec 13, Chapter 1 Sections 1.1 – 1.4. Sections 1.1 – 1.4. Be familiar with the language and principles of hypothesis testing.
Statistics for Neurosurgeons A David Mendelow Barbara A Gregson Newcastle upon Tyne England, UK.
Marginal Distribution Conditional Distribution. Side by Side Bar Graph Segmented Bar Graph Dotplot Stemplot Histogram.
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
Non-parametric Tests Research II MSW PT Class 8. Key Terms Power of a test refers to the probability of rejecting a false null hypothesis (or detect a.
A QUANTITATIVE RESEARCH PROJECT -
Quiz.
A radical view on plots in analysis
Course Objectives Define the concepts of Biostatistics, and common terminologies used Describe the different types of Scales of measurements Populations,
Chapter 12 Simple Linear Regression and Correlation
Review 1. Describing variables.
business analytics II ▌assignment one - solutions autoparts 
Stata Intro Mixed Models
APPROACHES TO QUANTITATIVE DATA ANALYSIS
CHOOSING A STATISTICAL TEST
Description of Data (Summary and Variability measures)
Y - Tests Type Based on Response and Measure Variable Data
A statistical package for epidemiologists
SA3202 Statistical Methods for Social Sciences
Nonparametric Statistical Methods: Overview and Examples
Introduction to analysis DAGitty
Introduction to Statistics
Basic Statistical Terms
Nonparametric Statistical Methods: Overview and Examples
Chapter 12 Simple Linear Regression and Correlation
Nonparametric Statistical Methods: Overview and Examples
Nonparametric Statistical Methods: Overview and Examples
SPSS Intro and Analysis
Presentation, data and programs at:
Regression diagnostics
Mean, Median, Mode The Mean is the simple average of the data values. Most appropriate for symmetric data. The Median is the middle value. It’s best.
Descriptive Statistics
Learning outcomes By the end of this session you should know about:
Georgi Iskrov, MBA, MPH, PhD Department of Social Medicine
Biostatistics Lecture (2).
Introductory Statistics
Central Tendency & Variability
Presentation transcript:

Presentation, data and programs at: Stata 2, Bivariate Hein Stigum Presentation, data and programs at: http://folk.uio.no/heins/ Timing: Intro and continuous symmetrical: 60 min Skewed, categorical and regression (not survival): 60 min 8:30-9:30 Groups 15+60 min 9:30-10:45 Plenary 45 min 10:45.11:30 Dec-18 Dec-18 H.S. H.S. 1

Datatypes Categorical data Numerical data Nominal: married/ single/ divorced Ordinal: small/ medium/ large Numerical data Discrete: number of children Continuous: weight Coding 1, 2, 3, is 2 twice as much as 1 1. Set of methods for categorical data proportion married 1. Set of methods for numerical data average weight Dec-18 Dec-18 H.S. H.S. 2

Data type dictates type of analysis Start with continuous data Dec-18 Dec-18 H.S. H.S. 3

Continuous symmetric outcome Example: Birth weight Dec-18 Dec-18 H.S. H.S. 4

Distribution kdensity weight drop if weight<2000 kdensity weight Dec-18 Dec-18 H.S. H.S. 5

Central tendency and dispersion Mean and standard deviation: Mean with confidence interval: Std Dev for Data Std Err for Estimate Dec-18 Dec-18 H.S. H.S. 6

Compare groups, equal variance? Not equal Compare boys and girls Fokus om means or fokus on low tail gives opposite results!! Dec-18 Dec-18 H.S. H.S. 7

2 independent samples Are birth weights the same for boys and girls? Density plot Scatterplot Scatter to see linear/no-linear effect, look for outliers Density to see equal variance Dec-18 Dec-18 H.S. H.S. 8

2 independent samples test Dec-18 Dec-18 H.S. H.S. 9

K independent samples Is birth weight the same over parity? Density plot Scatterplot Scatter to see linear/no-linear effect, look for outliers Density to see equal variance Equal means? Linear effect? Outliers? Equal variances? Dec-18 Dec-18 H.S. H.S. 10

K independent samples test equal means? Equal variances? Dec-18 Dec-18 H.S. H.S. 11

Continuous by continuous Does birth weight depend on gestational age? Scatterplot Scatterplot, outlier dropped Dec-18 Dec-18 H.S. H.S. 12

Continuous by continuous tests Cut gestational age up in groups, then use T-test or ANOVA or Use linear regression with 1 covariate Dec-18 Dec-18 H.S. H.S. 13

Test situations 2 independent samples K independent samples ttest weight, by(sex) K independent samples oneway weight parity By continuous regress weight gestAge 2 dependent samples (Paired) ttest weight_last_year = weight_today 1: ttest weight=10 4: ttest weight0=weight1 (assumes paired test) Equal/unequal Dec-18 Dec-18 H.S. H.S. 14

Continuous skewed outcome Example: Number of sexual partners Dec-18 Dec-18 H.S. H.S. 15

Distribution kdensity partners if partners<=50 Dec-18 Dec-18 H.S. Lower 75% fractile here than on next page because partner>50 are dropped here Dec-18 Dec-18 H.S. H.S. 16

Central tendency and dispersion Median and percentiles: cci binomial exact; conservative confidence interval normal normal, based on observed centiles meansd normal, based on mean and standard deviation Dec-18 Dec-18 H.S. H.S. 17

2 independent samples Do males and females have the same number of partners? Scatterplot Density plot Scatter to see linear/no-linear effect, look for outliers Density to see equal variance Unequal variance! Test somewhat problematic Dec-18 Dec-18 H.S. H.S. 18

2 independent samples test equal medians? Could also use T-test since the “difference in means” is probably normal from 400 observations, even thou the underlying distribution are quite skewed. T-test gives p=0.0000 Dec-18 Dec-18 H.S. H.S. 19

K independent samples Do partners vary with age? Scatterplot Scatterplot (partners<20) Density plot (partners<20) Scatter to see linear/no-linear effect, look for outliers Problems with unequal variance Dec-18 Dec-18 H.S. H.S. 20

K independent samples test equal medians? Probably a cohort effect rather than an age effect Oneway anova gives p=0.48, and Bartlett’s test for equal var gives p=0.000, that is clearly unequal variances. Group sizes also somewhat different. Both tests (K-Wallis and anova) shaky. Regroup to 2 groups, or remove outlier Dec-18 Dec-18 H.S. H.S. 21

Table of tests Categorical ordered: use nonparametric tests Dec-18 Mann-Whithey U=Wilcoxon rank sum Categorical ordered: use nonparametric tests Dec-18 Dec-18 H.S. H.S. 22

Example: Being bullied Categorical data Example: Being bullied Shown flowchart p3 Boys more or less than girls (2 sided test) Dec-18 Dec-18 H.S. H.S. 23

Frequency and proportion Proportion with CI: Proportion: May standardize, adjust for clusters, use bootstrap or jacknife est May weigth if stratified sample Dec-18 Dec-18 H.S. H.S. 24

Proportion, confidence interval x=”disease” n=total number proportion: standard error: confidence interval: How much increase n to get half the standard error? Dec-18 Dec-18 H.S. H.S. 25

Crosstables Are boys bullied as much as girls? equal proportions? Dec-18 Dec-18 H.S. H.S. 26

Ordered categories, trend Does bullied vary with age? twoway (fpfitci bullied agegr) /// (lfit bullied agegr) Could also have used age as countinuous. Have not shown the data as two rugs. Dec-18 H.S.

Ordered categories, trend equal proportions? Dec-18 Dec-18 H.S. H.S. 28

Table of tests Categorical ordered: use nonparametric tests Dec-18 Mann-Whithey U=Wilcoxon rank sum For matched CC data: Mc-Nemar for 2*2 tables symmetry for K*K tables (outcome with more than 2 categories) Categorical ordered: use nonparametric tests Dec-18 Dec-18 H.S. H.S. 29