Differential Gene Expression Dennis Kostka, Christine Steinhoff Slides adapted from Rainer Spang.

Slides:



Advertisements
Similar presentations
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Advertisements

Personal Response System (PRS). Revision session Dr David Field Do not turn your handset on yet!
Hypothesis Testing making decisions using sample data.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
From the homework: Distribution of DNA fragments generated by Micrococcal nuclease digestion mean(nucs) = bp median(nucs) = 110 bp sd(nucs+ = 17.3.
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
Statistical Decision Making
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Detecting Differentially Expressed Genes Pengyu Hong 09/13/2005.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
10 Hypothesis Testing. 10 Hypothesis Testing Statistical hypothesis testing The expression level of a gene in a given condition is measured several.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Differentially expressed genes
ANOVA Determining Which Means Differ in Single Factor Models Determining Which Means Differ in Single Factor Models.
Statistical Analysis of Microarray Data
Analysis of Differential Expression T-test ANOVA Non-parametric methods Correlation Regression.
Lecture 9: One Way ANOVA Between Subjects
Lecture 10 Review Rank Sum test (Chapter 4.2) Welch t-test for comparing two normal populations with unequal spreads (Chapter 4.3.2) Practical and statistical.
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
8-2 Basics of Hypothesis Testing
On Comparing Classifiers: Pitfalls to Avoid and Recommended Approach Published by Steven L. Salzberg Presented by Prakash Tilwani MACS 598 April 25 th.
Different Expression Multiple Hypothesis Testing STAT115 Spring 2012.
Example 10.1 Experimenting with a New Pizza Style at the Pepperoni Pizza Restaurant Concepts in Hypothesis Testing.
Probability Distributions and Test of Hypothesis Ka-Lok Ng Dept. of Bioinformatics Asia University.
Lecture Slides Elementary Statistics Twelfth Edition
Multiple testing in high- throughput biology Petter Mostad.
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Essential Statistics in Biology: Getting the Numbers Right
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Inferential Statistics 2 Maarten Buis January 11, 2006.
User Study Evaluation Human-Computer Interaction.
No criminal on the run The concept of test of significance FETP India.
Microarray data analysis David A. McClellan, Ph.D. Introduction to Bioinformatics Brigham Young University Dept. Integrative Biology.
Bioinformatics Expression profiling and functional genomics Part II: Differential expression Ad 27/11/2006.
A A R H U S U N I V E R S I T E T Faculty of Agricultural Sciences Introduction to analysis of microarray data David Edwards.
Statistical analysis of expression data: Normalization, differential expression and multiple testing Jelle Goeman.
Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Chapter 221 What Is a Test of Significance?. Chapter 222 Thought Question 1 The defendant in a court case is either guilty or innocent. Which of these.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Statistical Methods for Identifying Differentially Expressed Genes in Replicated cDNA Microarray Experiments Presented by Nan Lin 13 October 2002.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
MRNA Expression Experiment Measurement Unit Array Probe Gene Sequence n n n Clinical Sample Anatomy Ontology n 1 Patient 1 n Disease n n ProjectPlatform.
Statistics for Differential Expression Naomi Altman Oct. 06.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Application of Class Discovery and Class Prediction Methods to Microarray Data Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
Statistical Analysis of Microarray Data By H. Bjørn Nielsen.
Comp. Genomics Recitation 10 4/7/09 Differential expression detection.
The Broad Institute of MIT and Harvard Differential Analysis.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 6 –Multiple hypothesis testing Marshall University Genomics.
Distinguishing active from non active genes: Main principle: DNA hybridization -DNA hybridizes due to base pairing using H-bonds -A/T and C/G and A/U possible.
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Analysis of (cDNA) Microarray.
Hypothesis Testing and Statistical Significance
Microarray Data Analysis Xuming He Department of Statistics University of Illinois at Urbana-Champaign.
Micro array Data Analysis. Differential Gene Expression Analysis The Experiment Micro-array experiment measures gene expression in Rats (>5000 genes).
Estimating the False Discovery Rate in Genome-wide Studies BMI/CS 576 Colin Dewey Fall 2008.
Hypothesis Testing: One Sample Cases
Differential Gene Expression
Statistical inference: distribution, hypothesis testing
Hypothesis Testing: Hypotheses
Georgi Iskrov, MBA, MPH, PhD Department of Social Medicine
Comparing Groups.
Chapter 9 Hypothesis Testing.
Presentation transcript:

Differential Gene Expression Dennis Kostka, Christine Steinhoff Slides adapted from Rainer Spang

Two cell/tissue /disease types: wild-type / mutant control / treated disease A / disease B responding / non responding etc. etc.... For every sample (cell line/patient) we have the expression levels of thousands of genes and the information whether it is A or B Genes Patients, Samples, Timepoints...

Differential gene expression: Which genes are differentially expressed in the two tissue type populations?

A B A cost efficient (cheap) experiment: We observe a gene with a two-fold higher expression in profile A than in profile B. Is two-fold trust worthy? Well, by how much can this gene change in group A and in group B? By no more than 10% than the answer is yes, by up to 500% then the answer is no.

A B A cost efficient (cheap) experiment II: Is a three-fold induced gene more trust worthy than a two-fold induced gene? Actually this depends on the within class variability of the two genes again, it can be the other way round.

Product intensity (logscale) Logratio Is a three-fold induced gene more trust worthy than a two-fold induced gene?

AB Conclusion: In addition to the differences in gene expression you also have a vital interest in its variability... This information is needed to obtain meaningful lists of genes Therefore: Invest money in repeated experiments! AB

Standard Deviation and Standard Error Standard Deviation (SD): Variability of the measurement Standard Error (SE): Variability of the mean of several measurements n Replications Normal Distributed Data:

Conclusion: Repetitions lead to a more precise measurement of gene expression. Single expression measurements are very noisy, average expression across several repetitions is much less noisy Therefore: Invest money in repeated experiments! AB

The additive scale: You will want to use the wealth of statistical theory to analyze your data - Most statistics works on an additive scale (Significance of differences etc...) - Gene expression works on a multiplicative scale (fold changes...) Conclusion: Transform your data to the additive scale -Simple way: take logs Reminder:

Questions: Which genes are differentially expressed? -> Ranking Are these results „significant“ -> Statistical Analysis

Ranking: Problem: Produce an ordered list of differentially expressed genes starting with the most up regulated gene and ending with the most down regulated gene Ranking means finding the right genes … drawing our attention to them In many applications it is the most important step

Ranking is not Testing Ranking: Finding the right genes Testing: Deciding whether genes are significant There is more then one way to rank There is more then one way to test The criteria for which ranking is best is different from the criteria which test is best … power is often no argument

Gene, candidate 1 Gene, candidate 2 Gene, candidate 3 Gene, candidate 4 Gene, candidate 5 Gene, candidate 6 Gene, candidate 7 Gene, candidate 8 Gene, candidate 9 Gene,.... Order due to some score, Intuitively: Fold change 1st: most differential, 2nd: second most diff... Ranking: Order Genes due to amount of fold change/Score -> maybe some that are not differential in reality (False Positive)

Gene, candidate 1 Gene, candidate 2 Gene, candidate 3 Gene, candidate 4 Gene, candidate 5 Gene, candidate 6 Gene, candidate 7 Gene, candidate 8 Gene, candidate 9 Gene,.... Order due to some score, Frequently T Score 1st: most differential, 2nd: second most diff... Testing: Find Genes due to amount of fold change/Score which are significant s.t. there are less than 5% False Positives -> maybe you miss some (False Negatives)

Which gene is more differentially expressed?

You need to score differential gene expression Different scores lead to different rankings What scores are there? Ranking is Scoring

Fold Change & Log Ratios You have transformed your data to additive scale! Factors become differences: If you want to rank by fold change you compute the average expression in both groups and subtract them.

T-Score Idea: Take variances into account Change: low Change: high Change: high Variance: high Variance: low Variance:high

T ~ 0 Change: SMALL Variance:HIGH T huge Change: HIGH Variance:SMALL

Change: SMALL Variance:SMALL T ? Change: HIGH Variance:HIGH

T-Test PROBLEMS There are many genes (-> tests) but only few repetitions is „using s“ a estimate good? if measured variance is small, T becomes easily very large Therefore: for microarray it is reasonable to use a modified version of the Ttest

Fudge Factors: You need to estimate the variance from data You might underestimate an already small variance (constantly expressed genes) The denominator in T becomes really small Constantly expressed genes show up on top of the list Correction: Add a constant fudge factor s 0  Regularized T-score ->Limma ->SAM ->Twilight

More Scores: - Wilcoxon Score (robust) - PAUc Score (separation) - paired t-Score (paired Data) - F-Score (more then 2 conditions) - Correlation to a reference gene - etc etc

Different scores give different rankings ALL vs AML (Golub et al.)

Which Score is the best one? That depends on your problem...

Ok, I chose a score and found a set of candidate genes Can I trust the observed expression differences?  Statistical Analysis Next Question:

Coffee

P-Values Everyone knows that the p-value must be below is a holy number both in medicine and biology... what else should you know about p- values

Rumors If the gene is not differentially expressed the p-value is high If the gene is differentially expressed the p-values is low Both these statements are wrong!

Reminder: Type I and Type II ERROR H0 Null Hypothesis: Gene NOT differential H1 Alternative Hypothesis: NOT H0 Positive: rejected H0 (differential gene) Negative: accepted H0

Reminder: Type I and Type II ERROR H0 H1

We observe a score S =1.27 Can this be just a random fluctuation? Assume: It is a random fluctuation = The gene is not differentially expressed = The null hypothesis holds Theory gives us the distribution of the score under this assumption P-Value: Probability that a random score is equal or higher to S =1.27 in absolute value (two sided test) The basic Idea behind p-values:

Permutations and empirical p-values

If a gene is not differentially expressed: The p-value is a random number between 0 and 1! It is unlikely that such a number is below 0.05 (5% probability)

If a gene is differentially expressed: The p-value has no meaning, since it was computed under the assumption that the gene is not differentially expressed.

Testing only one gene: If the gene is not differentially expressed a small p-value is unlikely, hence we should be surprised by this observation. If we make it a rule that we discard the gene if the p-values is above 0.05, it is unlikely that a random score will pass this filter

1 gene 10 genes 30,000 genes Multiple testing with only non-induced genes

The Multiple Testing Problem P-values are random numbers between 0 and 1. For only one such number it is unlikely to fall in this small interval, but if we have such numbers many will be in there.

We test m hypotheses true hypotheses rejected hypotheses TRUE FALSE Rejected Acctepted Error = false positive Error = false negative Error = false positive Error = false negative H0 H1

FWER=Family-wise error rate: Probability of at least one Type1-error (False Positive) among the accepted (significant) genes TRUE FALSE Rejected Accepted H0 H1

FDR = False Discovery Rate Expected number of Type 1 – errors (False Positives) among rejected hypotheses TRUE FALSE Rejected Accepted H0 H1 with if

Controlling the family wise error rate (FWER) If we want to avoid random numbers in this interval we need to make it smaller. The more numbers, the smaller. For numbers very small. This strategy is called: Controlling the family wise error rate

How to control the FWER? Note, that adjusting the interval border can also be done by adjusting the p-values and leaving the cut off at There are many ways to adjust p-values for multiple testing: Bonferroni: Better: Westfall and Young

In microarray studies controlling the FWER is not a good idea... It is too conservative. A different type of error measure became more popular The False Discovery Rate What is the idea?

The FDR 1.Score genes and rank them 2.Choose a cutoff 3.Loosely speaking: The FDR is the best guess for the number of false positive genes that score above the cutoff

The confusing literature: There are many different definitions of the false discovery rate in the literature: - Original: Benjamini-Hochberg - Positive FDR - Conditional FDR - Local FDR There is also a fundamental difference between controlling and estimating a FDR

In microarray analysis it became popular to use estimated FDRs Differences to p-values: The FDR refers to a list of genes. The p-value refers to a single gene. The p-value is based on the assumption that the gene is not differentially expressed, the FDR makes no such assumption. P-values need to be corrected for multiplicity, FDRs not!

Another difference in concept: If a 4x change has a small p-value, this means that 4x change is too high to be random fluctuation Conclusion: 4x change is significant If a list of 150 genes with 4x change or more has a small estimated FDR this means that we have more genes on this level than would be expected by chance. Conclusion: 4x change can be noise, but 150 genes on that level are too many to be explained just by random fluctuation. In FWER Analysis the fold change 4x is significant, in FDR Analysis it is the number 150 that is significant.

Histograms of the p-values of all genes on the array

FWER: Vertical cutoff FDR: Horizontal cutoff

The typical plots Expected random score vs observed scores: Deviations from the main diagonal are evidence for differentially expressed genes

What you typically observe No differential gene expression A lot of differential gene expression Global changes in gene expression

Summary Replications are useful, not only for statistical reasons (5-8 per leg) Low FWER p-values will lead to many missed genes FDR (SAM) seems more appropriate Often there are many induced genes There are many open questions related to this type of intensive multiple tests

Data... Golub et al Affymetrix Chip 38 tumors: 27 acute lymphoblastic leukemia (ALL) 11 acute myeloid leukemia (AML) 6817 Genes... Filtering? Genes ALLAML

Genes ALLAML

Questions ?

Coffee